🏆 1st Place ElevenLabs Hackathon – $20,000🚀 EBRD Star Venture Program🥈 2nd Place Sevan Startup Summit🚀 Google Cloud $25K Grant
Kallina AI
RO
Kallina Voice AI

Audio Codecs

Selectarea codec-ului optim pentru calitate audio și eficiență bandwidth.

Codec = Quality + Efficiency

Codec-urile comprimă și decomprimă audio. Alegerea corectă balansează calitatea vocii cu utilizarea bandwidth-ului și latența.

Codec Comparison

CodecBitrateSample RateMOSLatencyBest For
G.711 μ-law64 kbps8 kHz4.1Very LowPSTN compatibility
G.711 A-law64 kbps8 kHz4.1Very LowEU PSTN
Opus6-510 kbps8-48 kHz4.5+LowWebRTC, quality
G.7298 kbps8 kHz3.9MediumBandwidth savings
G.72264 kbps16 kHz4.3LowHD Voice
SILK6-40 kbps8-24 kHz4.2LowVariable networks

Opus: Recommended for Voice AI

Why Opus?

  • ✓ Adaptive bitrate based on network
  • ✓ Wideband audio (16 kHz+) for clear speech
  • ✓ Low latency (2.5-60ms frames)
  • ✓ Excellent packet loss resilience
  • ✓ Open and royalty-free
  • ✓ Native WebRTC support

Opus Configuration

{
  "codec": "opus",
  "bitrate": 24000,      // 24 kbps
  "sampleRate": 16000,   // 16 kHz
  "channels": 1,         // Mono
  "frameSize": 20,       // 20ms frames
  "fec": true,           // Forward error correction
  "dtx": true            // Discontinuous transmission
}

G.711: PSTN Standard

μ-law (PCMU)

Used in North America and Japan. Optimized for voice frequencies.

Sample Rate:8000 Hz
Bit Depth:8 bits
Payload Type:0

A-law (PCMA)

Used in Europe and rest of world. Slightly better SNR.

Sample Rate:8000 Hz
Bit Depth:8 bits
Payload Type:8

Codec Selection Strategy

PSTN Calls (Inbound/Outbound)

Use G.711 (PCMU/PCMA) - universal compatibility, no transcoding needed.

Offer: PCMU, PCMA → Accept: carrier default

WebRTC Browser Calls

Use Opus - best quality, adaptive to network conditions.

Offer: opus/48000/2 → Best audio quality

Low Bandwidth Scenarios

Use G.729 or Opus at low bitrate - efficient compression.

Offer: G729, opus@8kbps → Bandwidth efficient

Quality vs Bandwidth

G.711
Bandwidth: 87.2 kbps
Quality: 82%
Opus 24kbps
Bandwidth: 40 kbps
Quality: 90%
Opus 16kbps
Bandwidth: 32 kbps
Quality: 85%
G.722
Bandwidth: 87.2 kbps
Quality: 86%
G.729
Bandwidth: 24 kbps
Quality: 78%

* Including RTP/UDP/IP overhead. Quality measured as MOS equivalent percentage.

Transcoding Considerations

Avoid When Possible

Transcoding adds latency și poate degrada calitatea.

  • • +5-20ms latency per transcode
  • • Quality loss (especially lossy→lossy)
  • • CPU resources consumed

When Necessary

Use dedicated transcoding resources.

  • • WebRTC ↔ PSTN calls
  • • Different codec endpoints
  • • Recording in specific format

Voice AI Codec Requirements

ComponentPreferred FormatReason
STT (Speech-to-Text)16 kHz, 16-bit PCMOptimal for speech recognition
TTS (Text-to-Speech)24 kHz, 16-bit PCMHigh quality synthesis output
Recording StorageOpus or MP3Storage efficiency
Live PlaybackMatch caller codecAvoid transcoding

Crystal Clear Audio

Codec optimization pentru calitate vocală superioară.

Vezi Demo →
Începe Astăzi

Transformă Comunicarea cu Clienții

Agenți vocali AI care răspund 24/7 în română și rusă. Implementare în 2 săptămâni, fără infrastructură specială.

Setup în 24 oreSuport dedicatGDPR compliant

Rămâi la curent

Obține cele mai recente știri despre tehnologia de apelare AI și actualizările platformei

Made with ♡ by Kallina AI Team — 2025