AssemblyAI

Senior Research Engineer, JAX

Reposted 2 Days Ago

Remote

Hiring Remotely in USA

190K-217K Annually

Senior level

The Senior Research Engineer will enhance JAX frameworks, optimize JAX inference for speech models, and bridge research and engineering teams to improve performance and scalability of AI systems.

The summary above was generated by AI

About AssemblyAI

At AssemblyAI, we’re building at the forefront of Speech AI, creating powerful models for speech-to-text and speech understanding available through a straightforward API. With more than 200,000 developers building on our API and over 5,000 paying customers, AssemblyAI is helping unlock and support the next generation of powerful, meaningful products built with AI.

Progress in AI is moving at an unprecedented pace, and our team of AI research experts is focused on keeping our customers on the cutting edge with production-ready models that we continually update to improve accuracy, latency, and what’s possible with Speech AI. Our models consistently rank highest in industry benchmarks for accuracy, outperforming models from Google and Amazon and producing up to 30% fewer hallucinations than OpenAI’s Whisper. Our models power more than 2 billion end-user experiences each day, helping companies better understand customer feedback, run more productive meetings with automated meeting notes, and improve childhood literacy via ed tech tools.

We’ve raised funding from leading investors including Accel, Insight Partners, Y Combinator’s AI Fund, Patrick and John Collison, Nat Friedman, and Daniel Gross. We’re a remote team working to build one of the next great AI companies, and we’re looking for driven, talented people to help us get there!

About the Role

We are seeking a highly skilled Senior Research Engineer to collaborate closely with both Research and Engineering teams. The role involves diagnosing and resolving bottlenecks across large-scale distributed training, data processing, and inference systems, while also driving optimizations for existing high-performance pipelines.

The ideal candidate possesses a deep understanding of modern deep learning systems, combined with strong engineering expertise in areas such as layer-level optimization, large-scale distributed training, streaming, low-latency and asynchronous inference, inference compilers, and advanced parallelization techniques.

This is a cross-functional role requiring strong technical rigor, attention to detail, intellectual curiosity, and excellent communication skills. The position is embedded within the Research team and is responsible for developing and refining the technical foundation that enables cutting-edge research and translates its outcomes into production, bridging research and production engineering.

What You’ll Do

Maintain and evolve our JAX training framework, ensuring scalability and efficiency for large-scale distributed training runs.
Optimize production JAX inference systems for speech-to-text models using advanced techniques like continuous batching, model sharding, paged attention, and quantization.
Refactor and modernize model architectures and infrastructure, translating research prototypes into production-ready systems.
Investigate and resolve performance bottlenecks across the stack—from low-level kernels (XLA, Pallas) to high-level system design.
Design and deploy scalable, distributed workloads optimized for TPU and GPU architectures.
Bridge Research and Engineering teams, ensuring seamless knowledge transfer and alignment on technical priorities.

What You’ll Need

Expert-level proficiency with JAX and its ecosystem (Flax, Optax, XLA compilation pipeline).
Strong experience optimizing inference systems for production, ideally with LLMs or speech models.
Hands-on experience with TPU programming and optimization; GPU/CUDA expertise is also valuable.
Passion for refactoring and improving existing systems—you thrive on making code faster, cleaner, and more maintainable.
Familiarity with modern inference optimization techniques: continuous batching, KV-cache management, sharding strategies, quantization.
Domain knowledge in Speech-to-Text (ASR architectures, audio processing, streaming inference) is a plus.
Strong Python skills; C++ or Rust experience for kernel-level work is a plus.
Deep understanding of distributed training at scale and ML infrastructure best practices.
Excellent communication skills and a collaborative mindset—you can clearly explain complex tradeoffs and prioritize high-impact work.

Pay Transparency:

AssemblyAI strives to recruit and retain exceptional talent from diverse backgrounds while ensuring pay equity for our team. Our salary ranges are based on paying competitively for our size, stage, and industry, and are one part of many compensation, benefit, and other reward opportunities we provide.

There are many factors that go into salary determinations, including relevant experience, skill level, qualifications assessed during the interview process, and maintaining internal equity with peers on the team. The range shared below is a general expectation for the function as posted, but we are also open to considering candidates who may be more or less experienced than outlined in the job description. In this case, we will communicate any updates in the expected salary range.

This is a remote role open to candidates across Europe. The provided range is listed in Swiss francs (CHF) as the position is posted in Zurich. Compensation will be adjusted to reflect local market rates and paid in the appropriate local currency for each candidate’s location. Any variations from the listed range will be clearly communicated during the interview process.

Salary range: CHF190,050.00 - CHF280,000.00

Working at AssemblyAI

We are a small but mighty group of startup veterans and experienced AI researchers with over 20 years of expertise in Machine Learning, Speech Recognition, and NLP. As a fully remote team, we’re looking for people to join our team who are ambitious, curious, and lead with integrity. We’re still in the early days of AI and of AssemblyAI’s journey, and are looking for teammates who won’t just fit in, but will help us define and build our company culture.

We’re committed to creating a space where our employees can bring their full selves to work and have equal opportunity to succeed. No matter your race, gender identity or expression, sexual orientation, religion, origin, ability, age, or veteran status, if joining this mission speaks to you, we encourage you to apply!

Using AI to Interview:

If you’re selected for an interview, please review this resource to better understand how AssemblyAI approaches the use of AI in our interview process.

Keep Exploring AssemblyAI:

Check us out on YouTube!

Learn more about AI models for speech recognition

Core Transcription | Audio Intelligence | LeMUR | Try the Playground

Our $50M Series C fundraise

Top Skills

C++, CUDA, Flax, GPU, JAX, Optax, Python, Rust, TPU, XLA
