Discussions
how is hyegen audio processing so fast? is it using openai realtime api ?
17 days ago by super
Hi
I am using whisper and grok to convert user audio to text and then getting text response back from LLM and then passing it as user speak. But this takes 3-4 seconds, whereas in the heygen demo its alomost instant, wanted to check how is this implemented?
do i need to use websockets?