Live Sessions
Everything in Aura Live revolves around a Live Session. A session is a temporary bridge between your audio source and the transcription engine.
Prerequisites
Section titled “Prerequisites”Before starting a session, you should identify:
- Language: What language is being spoken? Use the
iso_639_1code (e.g.,en,fr). - Model: Which ASR model should process the audio? (e.g.,
tiny,medium,large-v3).
Use these endpoints to discover available options:
- List Languages:
GET /languages/ - List Models:
GET /models/ - List Translations:
GET /translations/
Starting a Session
Section titled “Starting a Session”To start a session, send a POST request to create a live session.
Session Life Cycle
Section titled “Session Life Cycle”- Pending: The session is created but no audio is being received.
- Active: Audio is being streamed and transcribed.
- Finished: The connection has been closed by either the client or the server.
- Error: Something went wrong with the transcription engine.
WebSocket Connection
Section titled “WebSocket Connection”Once a session is created, connect to the WebSocket endpoint to stream audio and receive transcription results in real time.
[!TIP] Use the models’
vramfield to understand the resource requirements. Smaller models liketinyare faster but less accurate, whereaslarge-v3provides the highest accuracy but requires more compute.