Overview

Bolna AI incorporates an advanced interruption handling mechanism that ensures accurate and contextually relevant transcripts during voice agent interactions.

This feature is currently in beta. Please use it with caution.

Enable precise generation of transcripts

When a user interrupts the AI agent mid-conversation, rather than logging the full transcript generated by Large Language Models (LLMs), Bolna intelligently computes the actual transcript by filtering out incomplete or overridden responses. This enhances clarity, ensuring that only the final, meaningful exchange is stored, processed and used for the conversations.

How It Works

Bolna AI’s interruption handling system functions through a three-step process:

  • Detection of Interruptions: The system continuously monitors speech input to detect when the user starts speaking while the Voice agent is still speaking.

  • Contextual Computation: Whenever an interruption is detected, Bolna AI determines whether the user’s input should overrides the Voice agent’s response.

  • Final Transcript Adjustment: Bolna then reconstructs the conversation transcript to exclude everything after the interruption, ensuring that only the final & meaningful parts of the dialogue are retained, processed and used for further processing.

Example

Without precise transcript generationUsing precise transcript generation
Assistant: “Hello, Thank you for calling Wayne Enterprises. How can we help you today?”Assistant: “Hello, Thank you for calling Wayne Enterprises. How can we help you today?”
User: “hello”User: “hello”
Assistant: “Hello! How can I assist you today?”Assistant: “Hello! How can I assist you today?
User: “yeah where are you calling from”User: “yeah where are you calling from”
Assistant: “I’m here to support you regarding your recent order from Wayne Enterprises. How can I assist you?”Assistant: “I’m here to support you regarding your recent order from Wayne Enterprises. How can I assist you?
User: “yeah i’m facing an issue with the item i purchased”User: “yeah i’m facing an issue with the item i purchased”

In the above example, the strikethrough text is only for representation purposes. In practice, you’ll see only the transcripts till the interruptions if the precise transcript generation is enabled.

Conclusion

Bolna AI’s interruption handling feature ensures that conversation transcripts reflect actual user intent rather than an unfiltered log of AI responses. By dynamically computing the actual transcript, this feature enhances the efficiency of voice AI applications, making conversations more human-like and structured.