Introduction to Aura Live
Aura Live is a high-performance, real-time transcription platform based on Automatic Speech Recognition (ASR). It is built with FastAPI and designed for low-latency processing of live audio.
Core Capabilities
- Real-time Transcription: Process audio streams and receive text results with minimal latency.
- Multilingual Support: Transcribe audio in a growing list of languages and dialects.
- WebSocket Protocol: A robust, binary-friendly protocol for streaming audio data.
- REST API: Comprehensive management of languages, models, and sessions.
- Extensible Architecture: Built to integrate with various ASR engine backends.
How it Works
The typical lifecycle of an Aura Live interaction follows this flow:
- Authentication: The client authenticates using an API token.
- Discovery: The client queries the REST API to discover available languages and ASR models.
- Session Initiation: The client creates a “Live Session” via a REST endpoint.
- Streaming: The client connects to a provided WebSocket URI and streams raw audio data.
- Result Retrieval: Aura Live sends transcription results back through the same WebSocket connection.
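The flow above can be sketched from the client side. This is a minimal illustration under stated assumptions, not the actual Aura Live API: the base URL, the `/languages` and `/sessions` paths, the Bearer token scheme, the session payload fields, and the audio format (16 kHz, 16-bit mono PCM) are all hypothetical placeholders. The helpers only build requests and split audio into frames; nothing is sent over the network.

```python
import json
import urllib.request
from typing import Iterator, Optional

# Hypothetical values: replace with those from your Aura Live deployment.
BASE_URL = "https://aura.example.com"
API_TOKEN = "YOUR_API_TOKEN"


def build_request(path: str, token: str,
                  payload: Optional[dict] = None) -> urllib.request.Request:
    """Build (but do not send) an authenticated REST request.

    A request with a payload becomes a JSON POST; one without becomes a GET.
    The Bearer scheme is an assumption about how the API token is passed.
    """
    data = json.dumps(payload).encode("utf-8") if payload is not None else None
    req = urllib.request.Request(
        BASE_URL + path,
        data=data,
        method="POST" if payload is not None else "GET",
    )
    req.add_header("Authorization", f"Bearer {token}")
    if data is not None:
        req.add_header("Content-Type", "application/json")
    return req


def audio_chunks(pcm: bytes, chunk_size: int = 3200) -> Iterator[bytes]:
    """Split raw audio into fixed-size binary frames for streaming.

    3200 bytes is 100 ms of 16 kHz, 16-bit mono PCM (an assumed format).
    The final chunk may be shorter than chunk_size.
    """
    for start in range(0, len(pcm), chunk_size):
        yield pcm[start:start + chunk_size]


# Steps 1-2: authenticated discovery request (hypothetical path).
discover = build_request("/languages", API_TOKEN)

# Step 3: session creation request (hypothetical path and fields).
create_session = build_request(
    "/sessions", API_TOKEN, {"language": "en", "model": "default"}
)

# Step 4: frames that would be sent as binary WebSocket messages.
frames = list(audio_chunks(b"\x00" * 7000))
```

Sending `discover` and `create_session` (for example with `urllib.request.urlopen`) and opening the WebSocket connection returned by the session endpoint are left out here. The Python standard library has no WebSocket client, so steps 4 and 5 would need a third-party library, and the format of the transcription messages is defined by the Aura Live protocol rather than by this sketch.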
Target Audience
This documentation is intended for:
- Developers looking to integrate real-time transcription into their own applications.
- System Integrators connecting Aura Live to existing audio infrastructure.
- Users who want to understand the underlying mechanics of the transcription service.
[!NOTE] For administrative tasks (user management, system configuration), please refer to the Admin Dashboard documentation (internal).