StudySmarter - The all-in-one study app.
4.8 • +11k Ratings
More than 3 Million Downloads
Free
Americas
Europe
What makes vowels acoustically distinct? For example, what makes the vowel in street different from the vowel in strut? Obviously, you position your mouth differently when you produce each vowel. But how does a listener "hear" that difference in mouth posture? What allows a person to perceive different vowels in speech?
Explore our app and discover over 50 million learning materials for free.
Lerne mit deinen Freunden und bleibe auf dem richtigen Kurs mit deinen persönlichen Lernstatistiken
Jetzt kostenlos anmeldenWhat makes vowels acoustically distinct? For example, what makes the vowel in street different from the vowel in strut? Obviously, you position your mouth differently when you produce each vowel. But how does a listener "hear" that difference in mouth posture? What allows a person to perceive different vowels in speech?
The acoustic signals that distinguish vowels in speech are called formants. Formants are amplified regions of Glottal source harmonics that change in frequency depending on mouth posture. Formants are relevant in linguistics because they provide a structured acoustic signal for analyzing speech.
Formants are the bread and butter of vowel analysis in linguistics.
Formants: amplified frequency ranges within a vowel that distinguish one vowel from another.
You can see formants by looking at a spectrogram. Vowel formants show up as dark horizontal bands along the spectrogram of a speech sample. The higher the horizontal band on the spectrogram, the higher the frequency of the formant. When viewing a spectrogram, you should be able to see three or four formants.
Fig. 1 - The dark horizontal bands represent formants. There are four visible formants on this spectrogram.
The darkness of each formant band represents amplitude or how loud the formant is.
Three formants in particular help distinguish vowels from one another. These are the first, second, and third formants, also known as F1, F2, and F3. These formants signal a vowel's relative height and backness.
Formants are closely related to another series of frequencies: harmonics.
Harmonics: the resonance frequencies present in the Glottal source wave.
Glottal source wave: the pressure wave created when air passes through the glottis.
When you produce a vowel, you push air from your lungs through your glottis, causing your vocal folds to vibrate. The vibration produces a pressure wave; this wave is the sound of your voice.
The glottal source wave is a large wave made up of several smaller (higher-frequency) waves. Each of these higher-frequency waves is a harmonic of the glottal source wave.
Harmonics are visualized on a vowel's spectrum. The spectrum displays frequency on the x-axis and amplitude on the y-axis. It looks like a spiky row of frequencies that gradually decrease in amplitude. Each "spike" on the spectrum is a harmonic.
The formants, on the other hand, are the loudest groups of harmonics. If you look at the spectrum of a vowel, you'll see that the tops of the harmonic "spikes" form a wavy shape. The peak of each "wave" is a formant. In Fig. 2, you can see four distinct wave shapes. These are the same formants visible in Fig. 1.
Fig. 2 - Each "spike" on the spectrum represents a harmonic, and each "wave" represents a formant.
To summarize simply, harmonics are elements of the glottal source wave, and formants are elements of the vowel shape.
Are you wondering how the pitch of the voice plays into this? The perceived pitch of the voice is the fundamental frequency or F0. On a spectrum, the lowest-frequency harmonic is the fundamental frequency. A vowel's F0 is not affected by the formants and vice versa. That's why you can sing the same vowel on different notes or different vowels on the same note!
The position of your mouth when you produce a vowel determines the structure of the vowel's formant frequencies. Here's a basic overview of how this works:
F1 is the lowest formant frequency in a vowel. It corresponds to vowel height.
High vowels are produced with a high tongue position. If you say the words Peter and potter to yourself, you'll notice that you place your tongue higher in your mouth for the [i] in Peter than for the [a] in potter.
F1 correlates inversely with height. The higher the vowel, the lower the F1 value. Because it involves a higher tongue position, the [i] in Peter has a lower F1 value than the [a] in potter.
The next formant is F2, which corresponds to vowel backness.
When you produce back vowels, you push your tongue further back into your mouth. Say the words bead and booed to yourself. You'll notice that your tongue shifts back and forth in your mouth as you alternate between the front [i] and the back [u].
F2 correlates inversely with backness. The further back the vowel, the lower the F2 value. Because it requires the tongue to be further back in the mouth, the [u] in booed has a lower F2 value than the [i] in bead.
F3 doesn't change based on height and backness as drastically as F1 and F2, but it does change in response to r-colored vowels.
R-colored vowels are produced with the bunched-up tongue characteristic of the American r sound. For example, the vowel in a general American English pronunciation of bird is an r-colored vowel. R-colored vowels have a very low F3 value compared to other vowels.
Now for some examples of relative formant values. The Vowel Chart of the International Phonetic Alphabet (IPA) plots the different vowel categories by their F1 and F2 values. It shows F1 on the y-axis in inverse order: the vowels with the lowest F1 are closest to the top of the vowel chart. The chart shows F2 on the x-axis, also in inverse order. The vowels with the highest F2 are closest to the left of the chart.
Fig. 3 - The IPA Vowel Chart shows F2 (high to low) on the x-axis and F1 (high to low) on the y-axis.
By looking at the chart, you can identify a vowel's F1 and F2 values relative to the other vowels. For example:
Looking at the chart allowed for some predictions about the formant values of different vowels in natural speech. So, how accurate were those predictions?
Fig. 4 - You can see on this spectrogram how F1 and F2 change with each vowel.
In Fig. 4, you can see the spectrogram for a sequence of synthesized vowels [i, e, a, o, u]. The formants are marked with red dots.
The lowest band is F1. You can see that the vowels that are highest on the chart, [i] and [u], have the lowest F1 band. The lowest vowel, [a], has the highest F1 value, and the mid vowels have F1 frequencies between those of the high and low vowels.
The next band after F1 is F2. The vowels in Fig. 4 are shown in order from the left to the right side of the vowel chart or from the front to the back tongue position. You can see that F2 is highest for [i] and gradually steps down as the vowels move further back.
Since none of these vowels are r-colored, F3 stays fairly constant on the spectrogram.
There you have it! Vowel height and backness are reliable predictors of F1 and F2 frequencies. Likewise, F1 and F2 frequencies reliably signal vowel positions.
Formants are amplified frequency ranges within a vowel that distinguish one vowel from another.
The position of your mouth when you produce a vowel determines the vowel's formant frequencies. Vowel height determines F1, vowel backness determines F2, and F3 lowers in response to lip rounding and r-colored vowels.
The perceived pitch of the voice is the fundamental frequency, or F0. On a spectrum, the lowest-frequency harmonic is the fundamental frequency. A vowel's F0 is not affected by the formants, and vice versa. That's why you can sing the same vowel on different notes or different vowels on the same note!
Formants are amplified frequency ranges within a vowel that distinguish one vowel from another. In speech, formants allow people to perceive vowel height and backness.
A vowel's relative formant values are determined by the vowel's height and backness. A vowel with a high tongue position like [i] has a lower F1 value than a vowel with a low tongue position like [a]. Similarly, a back vowel like [u] has a lower F2 value than a front vowel like [i].
Flashcards in Formant15
Start learningWhat are formants?
Formants are amplified frequency ranges within a vowel that distinguish one vowel from another.
Formants show up on a spectrogram as:
dark horizontal bands
The darkness of each formant band on a spectrogram represents its _____.
amplitude, or loudness
The first _____ formants are relevant for linguistic analysis.
3
What are harmonics?
Harmonics are the resonance frequencies present in the glottal source wave.
What is the difference between harmonics and formants?
Harmonics are elements of the glottal source wave, and formants are elements of the vowel shape. In other words, harmonics tell you about the quality of the voice, and formants tell you about the quality of the vowel.
Already have an account? Log in
The first learning app that truly has everything you need to ace your exams in one place
Sign up to highlight and take notes. It’s 100% free.
Save explanations to your personalised space and access them anytime, anywhere!
Sign up with Email Sign up with AppleBy signing up, you agree to the Terms and Conditions and the Privacy Policy of StudySmarter.
Already have an account? Log in