Full-duplex, low-latency streaming: Stream audio in both directions over a single WebSocket or WebRTC connection.
Intelligent turn taking: Context-aware turn detection with adjustable eagerness.
Function calling: Register tools mid-session. The assistant calls your functions without interrupting the audio stream.
Provider-agnostic: Route to the model that fits your latency, cost, or quality requirements, and swap it out at any time.
Dynamic context management: Create, retrieve, delete, or truncate conversation items mid-session to control context length and token cost.
Conversational intelligence: Use acoustic and metadata signals to condition what is said, when it is said, and how it is expressed.
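
To make the dynamic context management feature above more concrete, here is a minimal Python sketch of the four operations on conversation items. It is an illustration only, not the product's API: the `Item` and `ConversationContext` classes, their method names, and the per-item `tokens` estimate are all hypothetical. Truncation is assumed here to mean dropping the oldest items until the conversation fits a token budget.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Item:
    """One conversation item (hypothetical shape); `tokens` is a caller-supplied estimate."""
    item_id: str
    role: str
    text: str
    tokens: int


@dataclass
class ConversationContext:
    """Hypothetical in-memory item store; not the actual service API."""
    items: List[Item] = field(default_factory=list)

    def create(self, item: Item) -> None:
        # Append a new item to the end of the conversation.
        self.items.append(item)

    def retrieve(self, item_id: str) -> Optional[Item]:
        # Return the item with this id, or None if it is not present.
        return next((i for i in self.items if i.item_id == item_id), None)

    def delete(self, item_id: str) -> bool:
        # Remove the item with this id; report whether anything was removed.
        before = len(self.items)
        self.items = [i for i in self.items if i.item_id != item_id]
        return len(self.items) < before

    def total_tokens(self) -> int:
        # Sum of the token estimates of all items currently in context.
        return sum(i.tokens for i in self.items)

    def truncate(self, max_tokens: int) -> List[Item]:
        # Assumed policy: drop oldest items first until the total fits the budget.
        dropped: List[Item] = []
        while self.items and self.total_tokens() > max_tokens:
            dropped.append(self.items.pop(0))
        return dropped
```

In a live session, a caller might run something like `truncate` whenever the running token total approaches a budget, so that long conversations keep their most recent turns while cost stays bounded.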