Introducing Speech Engine
ElevenLabs Speech Engine adds real-time voice to your own chat agent or LLM. ElevenLabs handles speech-to-text, turn-taking, text-to-speech and browser playback while your server owns the agent logic and streams response text over a Speech Engine WebSocket. Use it when you want voice on a custom runtime rather than a fully hosted ElevenAgents configuration.
Quickstart
Build a voice-enabled agent with the Speech Engine SDK in minutes.
Agent skill
Guidance for AI coding assistants integrating Speech Engine.
API reference
Create, update and manage Speech Engine resources over the REST API.
ElevenAgents
- Text behavior overrides: Added
text_behavior_overrides, a per-ConversationInitiationSourcemap ofBehaviorOverrideobjects with optionalverbosity,output_formatandinteraction_budgetfields for channel-specific agent behavior. - Integration sources: Added Intercom, Telegram and Freshdesk.
- OTLP conversation traces: Get conversation details now accepts an optional
formatquery parameter. Setformat=otlp_tracesto return OTLP-compatible trace data alongside the standard conversation payload. - ASR keyword overrides: Added
ASRConversationalConfigOverrideandASRConversationalConfigOverrideConfigschemas with optionalkeywordsarrays, wired into conversation config client override models. - Webhook auth metadata: Webhook tool configuration schemas now expose optional
auth_resolved_params(string array) documenting URL placeholders resolved from the auth connection.
Music
- Generation mode: Added
MusicGenerationMode(track,loop,ambience) and an optionalgeneration_modefield on music prompt request bodies. - Video to music model: Video to music (
POST /v1/music/video-to-music) now accepts optionalmodel_id(string, defaultmusic_v1).
ElevenCreative Studio
- Conversion credits: Chapter and voice conversion statistics response models now include optional
credits_needed_to_convert(integer) indicating credits required before conversion.
Workspaces
- Resource types: Added
studio_projectstoWorkspaceResourceType.
SDK Releases
Python SDK
- v2.50.0 - Regenerated the SDK for the May 25, 2026 API schema.
- v2.49.1 - Updated Speech Engine API calls to return the full response object.
- v2.49.0 - Regenerated the SDK for the May 18–25, 2026 API schema, including ElevenAgents text behavior overrides, music
generation_mode, and workspacestudio_projects.
JavaScript SDK
- v2.50.0 - Added missing Music API methods and tests, and regenerated the SDK for the May 25, 2026 API schema.
- v2.49.1 - Updated Speech Engine API calls to return the full response object.
- v2.49.0 - Regenerated the SDK for the May 18–25, 2026 API schema, including ElevenAgents text behavior overrides, music
generation_mode, and workspacestudio_projects.
Packages
- @elevenlabs/client@1.8.1 - Fixed iOS Safari dropping the first agent message on WebSocket voice sessions by unlocking an
AudioContexton the first user gesture and priming the playback graph after the audio worklet loads. - @elevenlabs/convai-widget-core@0.12.8 and @elevenlabs/convai-widget-embed@0.12.8 - Style emotion and audio tags in voice transcripts when
strip_audio_tagsis off, and treat null top-levelterms_htmlorterms_textas a kill switch for the terms and conditions modal. - @elevenlabs/convai-widget-core@0.12.7 and @elevenlabs/convai-widget-embed@0.12.7 - Updated widget dependencies to
@elevenlabs/client@1.8.1.
API
View API changes
Updated Endpoints and Schemas
ElevenAgents
- Get conversation details -
GET /v1/convai/conversations/{conversation_id}- Added optional
formatquery parameter;otlp_tracesreturns OTLP-compatible trace data
- Added optional
- Agent configuration schemas
- Added
text_behavior_overridesmap keyed byConversationInitiationSourcewithBehaviorOverridevalues (verbosity,output_format,interaction_budget)
- Added
ConversationInitiationSourceenum- Added
intercom_integration,telegram_integrationandfreshdesk_integration
- Added
- ASR override schemas
- Added
ASRConversationalConfigOverrideandASRConversationalConfigOverrideConfigwith optionalkeywords(array of strings)
- Added
- Webhook tool schemas
- Added optional
auth_resolved_params(string array) on webhook configuration models
- Added optional
Music
- Video to music -
POST /v1/music/video-to-music- Added optional
model_id(string, defaultmusic_v1)
- Added optional
- Music prompt request schemas
- Added optional
generation_modereferencingMusicGenerationMode(track,loop,ambience)
- Added optional
ElevenCreative Studio
- Chapter and voice conversion statistics schemas
- Added optional
credits_needed_to_convert(integer)
- Added optional
Workspaces
WorkspaceResourceTypeenum- Added
studio_projects
- Added