Articles in this section

Responsiveness & Interruption Sensitivity

Published:

Overview

This guide explains how to configure the speech detection settings for an AI Voice Agent, including how quickly the agent responds after the caller stops speaking, how sensitive it is to interruptions, and how it detects turn-taking in conversation.


Steps to Configure Speech Detection

  1. Navigate to Call Flows and locate the agent you want to edit.
  2. Ensure you are in the Advanced View.
  3. Go to Voice and click Speech Detection.

Responsiveness

  1. Adjust the Responsiveness slider — this controls how quickly the agent starts speaking after the caller stops talking. A higher value means the agent responds faster.

Interruption Sensitivity

  1. Adjust the Interruption Sensitivity slider — this controls how many words the caller needs to say before the agent stops talking and yields the floor.
    • A higher setting means the agent will stop talking even for brief acknowledgements like “yes” or “okay”
    • A lower setting means the agent will continue speaking through short interjections and only yield when the caller says something more substantial — this reduces conversational overlap

Turn Detection Model

  1. Select the Turn Detection Model:
    • STT-Based (Speech-to-Text) – uses speech-to-text processing to determine when the caller has finished speaking
    • AI Detection – uses AI to predict turn-taking

Preemptive Generation

  1. Toggle Preemptive Generation on or off. When enabled, the agent begins generating its response before the caller has finished speaking. This can reduce latency but may result in incorrect responses if the caller’s full intent was not yet captured. It is generally recommended to leave this off.

  2. Click Save to Draft or Publish to Production.


Key Notes

Item Details
Responsiveness Controls the delay between the caller stopping and the agent starting to speak
Interruption Sensitivity Controls how reactive the agent is to short caller inputs — lower values reduce conversational overlap
STT-Based (Speech-to-Text) Turn Detection Recommended default — more reliable for most use cases
Preemptive Generation Can reduce latency but risks generating incorrect responses — leave off unless specifically needed
Advanced View Must be enabled to access Speech Detection settings

🎥 Watch the demo video below



Access denied
Access denied