User Guide

Chapter 15 EVOC 20 PolySynth 225
Editing the Analysis and Synthesis Signals
The following section outlines how you can edit the analysis and synthesis signals to
achieve better speech intelligibility.
Compressing the Analysis Signal
The less the level changes, the better the intelligibility of the vocoder. You should
therefore compress the analysis signal in most cases.
Enhancing High Frequency Energy
The vocoder, in a way, always generates the intersection point of the analysis and
synthesis signals. To explain: If there’s no treble portion in the analysis signal, the
resulting vocoder output will also lack treble. This is also the case when the synthesis
signal features a lot of high frequency content. This is true of each frequency band. As
such, the vocoder demands a stable level in all frequency bands from both input
signals, in order to obtain the best results.
Due to the way human beings hear, the intelligibility of speech is highly dependent on
the presence of high frequency content. To aid in keeping speech clear, it may be
worthwhile using equalization to boost or cut particular frequencies in analysis signals
before processing them with the vocoders.
If the analysis signal consists of vocals or speech, a simple shelving filter should be
sufficient. It doesn’t require much processing power, and efficiently boosts the high-
mid and treble range, which is so important for speech intelligibility.
If the synthesis signal lacks treble energy, it can be generated with a distortion effect.
The Overdrive effect is perfect for this purpose (see “Overdrive on page 56).
Avoiding Sonic Artifacts
A common problem with vocoder sounds are sudden signal interruptions (ripping,
breaking sounds) and rapidly triggered noises, during speech pauses.
Release Parameter in the Analysis Section
The Release parameter defines the speed that a given synthesis frequency band can
decrease in level, if the signal level of the respective analysis band decreases abruptly.
The sound is smoother when the band levels decrease slowly. To achieve this smoother
character, use higher Release values in the analysis section of the interface. Longer
release times result in a washy sound.
Short Attack values are no problem. They may, in fact, even be desirable when a fast
reaction to impulse signals by the vocoder is required.