Senior Software Engineer, Voice Agent
Decagon
Location
San Francisco
Employment Type
Full time
Department
Engineering, Product & Design
Compensation
- Base Salary $250K – $375K • Offers Equity
About Decagon
Decagon is the leading conversational AI platform empowering every brand to deliver a concierge customer experience. Our AI agents provide intelligent, human-like responses across chat, email, and voice, resolving millions of customer inquiries in every language, at any time.
Since coming out of stealth, Decagon has experienced rapid growth. We partner with industry leaders like Hertz, Eventbrite, Duolingo, Oura, Bilt, Curology, and Samsara to redefine customer experience at scale. We've raised over $200M from Bain Capital Ventures, Accel, a16z, BOND Capital, A*, Elad Gil, and notable angels such as the founders of Box, Airtable, Rippling, Okta, Lattice, and Klaviyo.
We’re an in-office company, driven by a shared commitment to excellence and velocity. Our values—customers are everything, relentless momentum, winner’s mindset, and stronger together—shape how we work and grow as a team.
About the Team
The Voice Agent team builds the real-time systems that allow Decagon agents to carry natural conversations across all of our channels (phone, web, mobile, etc.). We work across speech understanding, audio streaming, synthesis quality, and the voice-specific execution logic that enables timing, pacing, and responsive dialogue.
Our systems must remain accurate, responsive, and stable at scale. Voice is one of the most technically challenging surfaces in conversational AI, and our small team owns this entire capability end to end.
About the Role
As a Senior Software Engineer focused on Voice Agent, you will design and improve the systems that power Decagon’s live voice agents. You will work across streaming audio, transcription, synthesis, timing, and real-time orchestration.
You will collaborate closely with Research to bring new models to production, with Infra to optimize performance, and with Product to unlock new conversational capabilities.
In this role, you will
Build the real-time voice runtime that powers natural customer conversations
Improve speech understanding and synthesis quality while keeping latency low
Design systems that manage timing, interruptions, and streaming audio reliably
Create tools that make voice interactions easy to debug, test, and observe at scale
Your background looks something like this
5+ years of experience in software engineering
Proficiency with TypeScript, Python, and asynchronous programming
Experience with asynchronous or streaming systems
Strong debugging skills across audio, networking, or real-time pipelines
An interest in speech, audio, or multimodal AI
Even better
Experience with speech recognition or synthesis systems
Experience with voice activity detection (VAD), streaming protocols, or other real-time audio systems
Experience designing or maintaining LLM-driven applications
Experience optimizing performance for low-latency use cases
Benefits
Medical, dental, and vision benefits
Take-what-you-need vacation policy
Daily lunches, dinners, and snacks in the office to keep you at your best
Compensation
$250K – $330K + Offers Equity
Compensation Range: $250K - $375K