Pierce Tones

Description

This animation was produced in conjunction with Richard Duda of the Department of Electrical Engineering at San Jose State University during the Summer of 1989. Thanks to Richard Duda for both the audio examples and the explanation that follows and to John Pierce for calling this experiment to our attention.

Researchers in psychoacoustics have long looked to cochlear models to explain the perception of musical pitch [Small70]. Many experiments have made it clear that the auditory system has more than one mechanism for pitch estimation. In one of these experiments, Flanagan and Guttman used short-duration impulse trains to investigate two different mechanisms for matching periodic sounds, one based on spectrum and one based on pulse rate [Flanagan60]. They used two different impulse trains, one having one pulse per period of the fundamental, the other having four pulses per period, every fourth pulse being negative . These signals have the interesting property that they have the same power spectrum, which seems to suggest that they should have the same pitch. The standard conclusion, however, was that below 150 pulses per second the trains "matched" if they had the same pulse rate; they "matched" on spectrum only when the fundamental frequency was above about 200 Hz.

[Pierce89] modified this experiment by replacing the pulses by tone burstsóshort periods of a 4,800-Hz sine wave modulated by a raised-cosine Hamming window. In essence, he used Flanagan and Guttman's pulses to amplitude modulate a steady high-frequency carrier. His purpose in doing this was to narrow the spectrum, keeping the large response of the basilar membrane near one place (the 4,800-Hz place), regardless of pulse rate.

To be more specific, Pierce used the three signal "patterns" shown below. All have the same burst duration, which is one-eighth of a pattern period. Pattern a has four bursts in a pattern period. Pattern b has the same burst rate or pulse rate, but every fourth burst is inverted in phase. Thus, the fundamental frequency of b is a factor of four or two octaves lower than that of a. Pattern c has only one burst per pattern period, and thus has the same period as b; in fact, it can be shown that b and c have the same power spectrum. Thus, a and b sound alike at low pulse rates where pulse-rate is dominant, and b and c sound alike at high pulse rates where spectrum is dominant. Pierce observed that the ear matches a and b for pattern frequencies below 75 Hz, and matches b and c for pattern frequencies above 300 Hz. He found the interval between 75 and 300 Hz to be ambiguous, the b pattern being described as sounding inharmonic.

Pierce's tone bursts. Patterns a and b have the same pulse rate frequency, while b and c have the same power spectrum. Here the test sounds are shown with one cycle per burst.

To see if and how these results are reflected in correlograms, a similar set of tone burst signals were generated. The only difference between our signals and Pierce's signals was due to differences in the digital sampling rate used. To get a Fourier spectrum with minimum spectral splatter, Pierce imposed two requirements:

  1. The tone-burst frequency fb was set at half the Nyquist rate. Where Pierce's 19,200-Hz sampling rate led to fb = 4,800 Hz, our 16,000-Hz sampling rate forced fb down to 4,000 Hz.
  2. Each burst had to contain an exact integral number n of cycles. This number, n, is a major parameter for the experiments, ranging from 1 to 128. If the pattern period is T, then to obtain exactly n cycles of frequency fb in time T/8 requires that fb T/8 = n, so that T = 8n/fb .

Thus, to obtain the same spectral characteristics, we had to use different numerical values for the tone-burst frequency fb and the corresponding pattern period T. The table shown below is our version of Table I in Pierce's paper.

Group # PatternPattern Frequency 1/TFundamental Frequency (Hz) Bursts per SecondCycles per Burst (n)Pattern Period T(msec)
1a3.9062515.62515.625128256
b3.906253.9062515.625128256
c3.906253.906253.90625128256
2a7.812531.2531.2564128
b7.81257.812531.2564128
c7.81257.81257.812564128
3a 15.62562.562.53264
b15.62515.62562.53264
c15.62515.62515.6253264
4a31.251251251632
b31.2531.251251632
c31.2531.2531.251632
5a62.5250250816
b62.562.5250816
c62.562.562.5816
6a12550050048
b12512550048
c12512512548
7a2501000100024
b250250100024
c25025025024
8a5002000200012
b500500200012
c50050050012

A set of eight test signals was generated according to this scheme. Each test signal consists of a sequence of the a, b and c patterns, each pattern lasting 1.024 seconds. This time interval was chosen to get an exact integer number of bursts, ranging from 4 for Case 1c to 2000 for Cases 8a and 8b.

The following sounds are presented in this video clip:
1)Pattern 1a
2)Pattern 8a (one cycle per burst)
3)Pattern 8b (one cycle per burst, every fourth pulse inverted)
4)Pattern 8c (one cycle per burst, 1/4 spacing of 8a)
5)Group Pattern 8 (1 cycle/burst)
6)Group Pattern 7 (2 cycles/burst)
7)Group Pattern 6 (4 cycles/burst)
8)Group Pattern 5 (8 cycles/burst)
9)Group Pattern 4 (16 cycles/burst)
10)Group Pattern 3 (32 cycles/burst)
11)Group Pattern 2 (64 cycles/burst)
12)Group Pattern 1 (128 cycles/burst)

Our conclusions (both auditory and visual) were very similar to those that Pierce reports, though the ranges were a bit different. We found very clear matching on pulse rate (a and b) for pattern frequencies of 62.5 Hz or less, and very clear matching on fundamental frequency (b and c) for pattern frequencies of 250 Hz or more. The 125-Hz case was ambiguous, but could certainly have been called a pulse-rate match. The following table summarizes our admittedly subjective impressions.

Cycles/burst nPattern freq 1/TMatching PatternsComments
1283.90625a ba & b sound identical,"pulsy"; c much slower, same "tone" as a, b
647.8125a ba & b almost identical, "chopped"; c "pulsy," same "tone" as a, b
3215.625a ba & b very close, diff tone color; c "pulsy," "tone" resembles a, b
1631.25a ba & b close, harsh sounding; c "pulsy," "tone" uncertain
862.5a ba & b related, b more inharmonic; c much lower in pitch, buzzy
4125.a~ba & b related, a a bit higher pitch; c lower in pitch, buzzy
2250.b cb & c very close, c buzzier; a two octaves higher in pitch
1500.b cb & c identical

These tone-burst signals were processed by the cochlear model, and their correlograms were compared. While interpretation of the images is at least as subjective as interpretation of the sounds, there seemed to be a remarkable correspondence between the appearance of the correlograms and the results shown above.

For high pattern frequencies, the b and c correlograms were very similar, their common fundamental frequencies being clearly evident; they differed primarily in the c pattern having a cleaner high-frequency spectrum. This is roughly in accord with the fact that b and c have the same power spectrum. At the "critical" 125-Hz pattern frequency, the correlograms for b suddenly resembles a much more than c, a resemblance which becomes complete at the lower pattern frequencies. In none of these cases does one see much energy at the low fundamental frequencies, and the higher-frequency time structure of b dominates the match.



Frequency
Time Delay

Transcript

The following sounds are presented in this video clip:
  1. Pattern 1a
  2. Pattern 8a (one cycle per burst)
  3. Pattern 8b (one cycle per burst, every fourth pulse inverted)
  4. Pattern 8c (one cycle per burst, 1/4 spacing of 8a)
  5. Group Pattern 8 (1 cycle/burst)
  6. Group Pattern 7 (2 cycles/burst)
  7. Group Pattern 6 (4 cycles/burst)
  8. Group Pattern 5 (8 cycles/burst)
  9. Group Pattern 4 (16 cycles/burst)
  10. Group Pattern 3 (32 cycles/burst)
  11. Group Pattern 2 (64 cycles/burst)
  12. Group Pattern 1 (128 cycles/burst)