Where Do Dialog Intelligibility Problems Originate?
What are the challenges when creating content with clear and intelligible dialog? In this article, we examine the different stages of the audio production chain that can influence dialog intelligibility.
From a metering perspective, some factors can be measured and flagged, while others remain outside the reach of objective audio measurement.
Capturing
Let us begin at the source: the point where the audio is captured.
Performer
The performer (actor, artist, commentator or speaker) can be a primary source of dialog intelligibility issues. Typical examples include:
Poor articulation
Mumbling
Strong dialects or accents
Whispering
These factors originate from the performance itself and may be difficult to correct later in the production chain.
It is important to note that some of these examples may be the result of creative intent. In certain cases, reduced dialog intelligibility is a deliberate artistic choice.
Technical issues
On the technical side, several factors can negatively affect intelligibility:
Sub-standard microphones or recording equipment
Poor microphone placement
Both can reduce clarity and limit the ability to capture speech accurately.
Location
The recording environment also plays an important role. Whether dialog is captured outdoors or in a studio, the location itself can introduce challenges such as:
Poor room acoustics
Background noise on location
Dialog intelligibility issues at the point of capture can be monitored using a dialog intelligibility meter operating in real time during the recording process.
Post-Production
During post-production, intelligibility problems may either be improved or unintentionally made worse.
Voice-over-voice masking (multiple voices competing for clarity)
Excessive compression reducing the natural dynamics of speech
Extensive use of effects that can mask the dialog, such as reverb
Poor balance between dialog, music, and effects
Distribution
The path from post-production to the end listener can also introduce intelligibility issues.
Data compression
Encoding and decoding processes
Changes in channel layouts or downmixing
Each of these processes may alter the spectral balance or dynamics of speech, potentially affecting intelligibility.
Reproduction
Once the audio reaches the listener’s environment, additional challenges may arise.
Many flat-screen televisions, for example, use very small speakers that lack the physical dimensions required for accurate sound reproduction.
Factors include:
Poor reproduction equipment
Poor room acoustics
Background noise in the listening environment
The Listener
Finally, the characteristics of the listener themselves can influence how speech is perceived. Several factors come into play:
Hearing impairment
Hearing damages
Listener attention and focus
Age
The diagram below illustrates how hearing sensitivity changes with age. While individuals between roughly 15 and 30 years of age typically experience minimal limitations, the impact becomes increasingly significant at older ages.


