Capturing precise transcripts in Bolna Voice AI
Bolna Voice AI captures the actual transcript when a conversation involves interruptions, improving call accuracy and experience.
Overview
Bolna AI incorporates an advanced interruption handling mechanism that ensures accurate and contextually relevant transcripts during voice agent interactions.
Enable precise generation of transcripts
When a user interrupts the AI agent mid-conversation, Bolna does not log the full response generated by the Large Language Model (LLM). Instead, it computes the actual transcript by filtering out incomplete or overridden responses. This enhances clarity, ensuring that only the final, meaningful exchange is stored, processed and used in the rest of the conversation.
How It Works
Bolna AI’s interruption handling system functions through a three-step process:
- Detection of Interruptions: The system continuously monitors speech input to detect when the user starts speaking while the Voice agent is still speaking.
- Contextual Computation: Whenever an interruption is detected, Bolna AI determines whether the user’s input should override the Voice agent’s response.
- Final Transcript Adjustment: Bolna then reconstructs the conversation transcript to exclude the part of the Voice agent’s response that came after the interruption, ensuring that only the final, meaningful parts of the dialogue are retained and used for further processing.
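The three steps above can be illustrated with a minimal Python sketch. This is not Bolna's implementation: the `Turn` type, the function names, and the timing fields are all hypothetical, and the sketch assumes that an agent's words are spread evenly over the duration of its turn. A production system would more likely rely on word-level timestamps from the speech synthesizer.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Turn:
    """One turn of the call (hypothetical structure, for illustration only)."""
    speaker: str   # "assistant" or "user"
    text: str      # for the assistant: the full text generated by the LLM
    start: float   # seconds since call start when speech began
    end: float     # seconds when speech ends (or would end if uninterrupted)


def detect_interruption(agent: Turn, user: Turn) -> Optional[float]:
    """Step 1: return the time the user cut in, or None if there was no overlap."""
    if agent.start < user.start < agent.end:
        return user.start
    return None


def spoken_prefix(agent: Turn, cut_at: float) -> str:
    """Steps 2 and 3: keep only the words assumed to be spoken before cut_at.

    Assumption: words are evenly distributed over the turn's duration.
    """
    words = agent.text.split()
    spoken = int(len(words) * (cut_at - agent.start) / (agent.end - agent.start))
    return " ".join(words[:spoken])


def precise_transcript(turns: List[Turn]) -> List[str]:
    """Rebuild the transcript, truncating assistant turns that were interrupted."""
    lines = []
    for i, turn in enumerate(turns):
        text = turn.text
        nxt = turns[i + 1] if i + 1 < len(turns) else None
        if turn.speaker == "assistant" and nxt is not None and nxt.speaker == "user":
            cut_at = detect_interruption(turn, nxt)
            if cut_at is not None:
                text = spoken_prefix(turn, cut_at)
        lines.append(f"{turn.speaker.capitalize()}: {text}")
    return lines


turns = [
    Turn("assistant", "Hello! How can I assist you today?", 0.0, 3.5),
    Turn("user", "yeah where are you calling from", 2.2, 4.0),
]
for line in precise_transcript(turns):
    print(line)
# Assistant: Hello! How can I
# User: yeah where are you calling from
```

In this sketch the user starts speaking 2.2 seconds into a 3.5 second agent turn, so only the first four of the agent's seven words are kept. A turn that finishes before the user speaks is left unchanged.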
Example
Without precise transcript generation | Using precise transcript generation |
---|---|
Assistant: “Hello, Thank you for calling Wayne Enterprises. How can we help you today?” | Assistant: “Hello, Thank you for calling Wayne Enterprises. How can we help you today?” |
User: “hello” | User: “hello” |
Assistant: “Hello! How can I assist you today?” | Assistant: “Hello! How can I ~~assist you today?~~” |
User: “yeah where are you calling from” | User: “yeah where are you calling from” |
Assistant: “I’m here to support you regarding your recent order from Wayne Enterprises. How can I assist you?” | Assistant: “I’m here to support you regarding your recent order ~~from Wayne Enterprises. How can I assist you?~~” |
User: “yeah i’m facing an issue with the item i purchased” | User: “yeah i’m facing an issue with the item i purchased” |
… | … |
In the above example, the strikethrough text is shown only for illustration. In practice, when precise transcript generation is enabled, you’ll see each agent response only up to the point of interruption.
Conclusion
Bolna AI’s interruption handling feature ensures that conversation transcripts reflect what was actually said and what the user intended, rather than an unfiltered log of AI responses. By dynamically computing the actual transcript, this feature keeps stored conversations accurate and structured, and helps voice AI applications feel more human-like.