Applied AI Engineer
Create a free account to apply in seconds
Sendbird is building AI agents for customer experience. Our platform already powers billions of conversations every month across chat, voice, video, and messaging APIs. We are now using that foundation to build agents that understand customer context, reason over business data, and take reliable action in production.
We are looking for an Applied AI Engineer to research, build, and productionize new capabilities for those agents. This role sits at the intersection of agent product development, applied AI research, and production engineering. You will work on systems that enterprise customers depend on every day, not demos or isolated prototypes.
About Sendbird and delight.ai
Sendbird has spent more than a decade building communication infrastructure for in-app chat, voice, video, and messaging APIs. More than 4,000 brands use our platform, including DoorDash, Match Group, Noom, Yahoo Sports, and Rakuten. Our systems support more than 7 billion messages every month.
In 2024, we made a strategic shift toward AI-first customer experience. In 2025, we launched our enterprise AI agent product, delight.ai. Delight.ai helps businesses deliver customer support and engagement that is faster, more contextual, and more personal. Unlike simple FAQ bots, our agents are built to remember customer context, use tools, retrieve relevant knowledge, connect across channels, and handle real customer workflows with accuracy and control.
The Role
As an Applied AI Engineer, you will design, build, evaluate, and ship new capabilities for our AI agents. You will work across agent architecture, retrieval, memory, planning, tool use, workflow automation, voice, evaluation, data pipelines, model adaptation, inference, and production integration.
This is a hands-on engineering role for someone who can turn AI research and product ideas into reliable customer-facing features. Some problems will require training, fine-tuning, or adapting models. Others will require better retrieval, better context handling, better tools, stronger evaluation, or a more thoughtful product design. The right person knows how to choose the right lever and ship the result.
What you will work on
• Research, prototype, evaluate, and productionize new agent capabilities, including memory, planning, tool use, workflow execution, reasoning, and context management.
• Improve the quality, reliability, and usefulness of customer-facing AI agents through better retrieval, prompting, evaluation, model adaptation, product behavior, and system design.
• Build the core intelligence layer for our agents, including retrieval systems, memory, tool-calling pipelines, workflow orchestration, and agentic reasoning.
• Train, fine-tune, and adapt LLMs and related models when model-level work is the right way to improve agent quality or product capability.
• Build data pipelines for model training, agent evaluation, and product improvement, including labeling workflows, dataset construction, quality checks, and feedback loops.
• Design evaluation systems that measure task completion, accuracy, latency, cost, reliability, safety, customer experience quality, and production regressions.
• Optimize inference systems for production, including model selection, serving architecture, batching, caching, latency, throughput, and cost.
• Integrate voice AI, internal tools, third-party APIs, and workflow systems so agents can take useful actions across real customer journeys.
• Partner with product managers, engineers, and customer-facing teams to turn ambiguous AI product requirements into shippable agent features.
What we are looking for
• 5+ years of professional experience in applied AI engineering, machine learning, data science, or backend engineering with substantial AI/ML ownership.
• Experience building AI-powered product features for real users, preferably involving agents, conversational AI, workflow automation, retrieval, or LLM systems.
• Hands-on experience training, fine-tuning, or adapting LLMs or other deep learning models for production use cases.
• Strong practical knowledge of model training workflows, including dataset preparation, experiment tracking, evaluation, model selection, and deployment.
• Experience building production LLM systems, such as RAG, tool calling, agents, model orchestration, prompt systems, or evaluation pipelines.
• Strong Python skills and experience building production-grade services.
• Working knowledge of inference and serving tradeoffs, including latency, throughput, GPU utilization, model size, batching, caching, and cost.
• Experience with ML infrastructure or MLOps tools used for data pipelines, training jobs, model deployment, monitoring, or evaluation.
• Strong debugging instincts for agent systems, including failure analysis, hallucination reduction, retrieval quality, model behavior, tool-use failures, and regressions.
• Clear communication skills, especially when explaining technical tradeoffs to product managers, engineers, and leadership.
You may be a strong fit if
• You read AI research or model release notes and immediately think about what is testable, useful, and production-worthy.
• You are skeptical of AI demos until you see evaluation results, latency numbers, and failure modes.
• You know the difference between a good prompt, a good product experience, and a reliable production system.
• You like building agent capabilities that make a product meaningfully better, not just adding AI because the model can do it.
• You are comfortable working across data, models, retrieval, inference, backend services, product behavior, and monitoring.
Nice to have
• Experience building natural language processing, conversational AI, or voice AI products.
• Experience with large-scale data processing, labeling systems, human feedback workflows, or synthetic data generation.
• Experience with GPU-based training or inference infrastructure.
• Experience with cloud platforms such as AWS, GCP, or Azure.
• Bilingual fluency in Korean and English.
Our KR benefits include (but are not limited to)
• Silicon Valley's equity program (1-year cliff)
• Hybrid work policy, flexible work hours
• Be Your Best Self: 3.9 million won (prorated by start date) for expenses ranging from professional development classes and training, to personality assessments, to gym memberships, to books, to fitness classes, to mental health services, to massages
• Learn a Language benefit - up to 3.6 million won per year towards language lessons
• Weekly team lunch cost
• Monthly team-building cost
• Partial support for commuting costs
• Free parking at the Seolleung office
• Group insurance support that covers employees, spouses, and children
• Medical checkup support, including MRI and genetic testing - for the employee and one family member
• Seven additional paid holidays in addition to annual leave (Boost leave, Birthday leave, etc.)
• Support for the latest work devices, such as MacBook Pro (special support options for each job position)
• Unlimited snack bar filled with snacks, beverages, and instant noodles every day
• 12-week paid parental leave support (available for both mothers and fathers)
• Other support programs for congratulations and condolences
Flexible Work Policy
We offer a flexible work schedule at Sendbird. We also value collaboration and relationship building. With those values in mind, we require all employees to gather with their team in the office three days per week as a minimum. Some of our roles require a more frequent in-office schedule. Please work with your manager to understand the office time requirements for your position.
What diversity and inclusion mean to us
There is no such thing as a perfect candidate and the best employees come from a wide range of backgrounds, experiences, and skill sets. Sendbird is a place where everyone can learn and grow. We respect, promote, and encourage diversity for equal employment opportunities and encourage you to apply if this role excites you.
About Sendbird
Combining omnichannel AI and battle-tested, award-winning communication APIs, Sendbird enables businesses to build AI agents and meaningful customer connections at scale. Trusted by 4,000+ leading apps—including DoorDash, Match Group, Noom, and Yahoo Sports—Sendbird powers over 7 billion conversations every month, offering exceptional reliability, security, and compliance that meet enterprise-level demands. Headquartered in California, Sendbird is backed by ICONIQ, SoftBank, Tiger Global, Y Combinator, and other reputable investors.
Why Sendbird
We spent a decade building the conversation infrastructure that the world runs on. Now we're putting intelligence on top of it. You won't be joining a company still figuring out its AI strategy. You'll be joining one that has already made the bet, shipped the product, and is moving fast to lead the category. If you want to build something that matters, with a team that knows how, this is it.