Prerequisites
| Service | What You Need |
|---|---|
| Plivo | Auth ID, Auth Token, Voice-enabled phone number |
| Sarvam AI | API key from console.sarvam.ai |
| OpenAI | API key from platform.openai.com |
Installation
Environment Variables
Pipeline Configuration
Service Details
Sarvam STT
Real-time speech recognition optimized for Indian languages via WebSocket streaming.| Feature | Description |
|---|---|
| Indian language support | Hindi, Tamil, Telugu, Kannada, Malayalam, and more |
| Voice Activity Detection | Automatic speech endpoint detection |
| Multiple audio formats | Flexible input handling |
| Models | Saarika (STT), Saaras (STT with translation) |
OpenAI LLM
Chat completion with GPT-4o supporting streaming responses and function calling.| Model | Description |
|---|---|
gpt-4o | Most capable, multimodal |
gpt-4o-mini | Faster, cost-effective |
Sarvam TTS
Text-to-speech synthesis specialized for Indian languages and voices.| Feature | Description |
|---|---|
| Indian voices | Natural-sounding voices for multiple Indian languages |
| Voice customization | Pitch, pace, and loudness controls |
| Mixed-language support | Handle content combining multiple languages |
Supported Indian Languages
Sarvam supports multiple Indian languages including:- Hindi
- Tamil
- Telugu
- Kannada
- Malayalam
- Bengali
- Marathi
- Gujarati
- And more
Quick Start
Inbound Calls
Outbound Calls
When to Use Sarvam
Choose Sarvam when:- Your users speak Indian languages
- You need high accuracy for Indian accents and dialects
- You want natural-sounding Indian voices
- Your content mixes English with Indian languages
- Your users primarily speak English or European languages
- You need broader language coverage
- You’re not targeting Indian markets
Related
- Pipecat Overview - Architecture and setup
- Sarvam STT Docs - STT configuration
- Sarvam TTS Docs - TTS configuration
- OpenAI Docs - LLM configuration
- Sarvam AI - Official Sarvam documentation