Discussions

Ask a Question
Back to All

how is hyegen audio processing so fast? is it using openai realtime api ?

Hi

I am using whisper and grok to convert user audio to text and then getting text response back from LLM and then passing it as user speak. But this takes 3-4 seconds, whereas in the heygen demo its alomost instant, wanted to check how is this implemented?

do i need to use websockets?