D C Miller's analytic experiments showed that the sounding of vowels depended on a certain frequency, rather that a certain order of harmonic, with the results shown below:
Harmonic | Frequency | Percentage of Energy |
Soprano singing 'Ahh' | ||
1 | 308 | 9 |
2 | 616 | 6 |
3 | 924 | 69 |
4 | 1232 | 8 |
5 | 1540 | 5 |
6 | 1848 | 1 |
7 | 2156 | 0 |
8 | 2464 | 0 |
9 | 2772 | 0 |
Bass singing 'Ahh' | ||
1 | 154 | 1 |
2 | 308 | 3 |
3 | 462 | 1 |
4 | 616 | 1 |
5 | 772 | 12 |
6 | 924 | 66 |
7 | 1078 | 7 |
8 | 1232 | 7 |
9 | 1386 | 1 |
Clearly the frequency is the factor that influences the sound, not the harmonic.
So the vowels are characterised by the formant values, and these are used in their synthesis. These are shown for an adult male:
Vowel | F1 | F2 | F3 |
bead | 270 | 2300 | 3000 |
bid | 400 | 2000 | 2550 |
bed | 530 | 1850 | 2500 |
bad | 660 | 1700 | 2400 |
bard | 730 | 1100 | 2450 |
bored | 570 | 850 | 2400 |
budha | 440 | 1000 | 2250 |
boot | 330 | 850 | 2250 |
bud | 640 | 1200 | 2400 |
bird | 500 | 1500 | 2500 |
And for an adult female:
Vowel | F1 | F2 | F3 |
bead | 300 | 2800 | 3300 |
bid | 430 | 2500 | 3100 |
bed | 600 | 2350 | 3000 |
bad | 860 | 2050 | 2850 |
bard | 850 | 1200 | 2800 |
bored | 590 | 900 | 2700 |
budha | 470 | 1150 | 2700 |
boot | 370 | 950 | 2650 |
bud | 760 | 1400 | 2800 |
bird | 520 | 1650 | 1950 |
One repercussion of the frequency of the formants being important is that if a singer tries to sing a vowel at a pitch above that of the first couple of formants, it becomes very difficult. If a singer tries to sing at a frequency that is above the most prominent first couple of formants, producing the correct vowel sound is very hard.
An example of this was given to me by my music teacher when I was a treble in the choir at school: The descant for O Come All Ye Faithful ends on a 'top A' on the words 'Christ the Lord'. We were told not to sing the i in Christ not as eye but as the i in bid. A look at the table above explains why. Top A is 1760Hz. Eye is a diphthong, the first part of which (and the bit most of the note is sounded on) is similar to the a in bard. This has its second formant at only 1200Hz, whereas bid has its second formant at 2500Hz. So i is possible to get a better sound from since the formant is not below the singing pitch (and it's not that noticeable to people listening!).