Effective techniques to guide ElevenLabs AI in adding pauses, conveying emotions, and pacing the speech.
<break time="1.5s" />
. This will create an exact and natural pause in the speech. It is not just added silence between words, but the AI has an actual understanding of this syntax and will add a natural pause.
An example could look like this:
-
or the em-dash —
. You can even add multiple dashes such as -- --
for a longer pause.
...
can sometimes also work to add a pause between words but usually also adds some “hesitation” or “nervousness” to the voice that might not always fit.
<phoneme alphabet="ipa" ph="your-IPA-Pronunciation-here">word</phoneme>
tag for IPA, or <phoneme alphabet="cmu-arpabet" ph="your-CMU-pronunciation-here">word</phoneme>
tag for CMU Arpabet. Replace "your-IPA-Pronunciation-here"
or "your-CMU-pronunciation-here"
with the desired IPA or CMU Arpabet pronunciation.
An example for IPA:
he said, confused
, or he shouted angrily
. These types of prompts will help the AI understand the desired emotional tone and try to generate a voiceover that accurately reflects it. Since the AI reads all text literally, including the dialogue tags, you’ll need to trim these narrative portions from the audio in post-processing.