Groq

Senior Staff Software Engineer, High Performance Inference System

Reposted 4 Days Ago
In-Office or Remote
2 Locations
249K-336K
Senior level
The role involves designing and implementing low-latency, scalable distributed systems for Groq's real-time inference system, optimizing performance, and ensuring reliability across hardware.
The summary above was generated by AI

About Groq

Groq delivers fast, efficient AI inference. Our LPU-based system powers GroqCloud™, giving businesses and developers the speed and scale they need. Headquartered in Silicon Valley, we are on a mission to make high performance AI compute more accessible and affordable. When real-time AI is within reach, anything is possible. Build fast.

Senior Staff Software Engineer – High Performance Inference System

Mission: 

Join the team that builds and operates Groq’s real-time, distributed inference system, delivering large-scale inference for LLMs and next-gen AI applications at ultra-low latency. Your work will optimize for heterogeneous hardware, dynamic global workloads, and extreme performance, all while running code at the edge of physics.

Responsibilities & opportunities in this role:

  • Distributed Systems Engineering: Design and implement scalable, low-latency runtime systems that coordinate thousands of GroqChips across a software-scheduled interconnect.
  • Low-Level Optimization: Develop deterministic, hardware-aware abstractions that prioritize execution speed, fault tolerance, and reliability.
  • Performance & Diagnostics: Build tools and infrastructure to support real-time system observability, diagnostics, and SLO improvements.
  • Future-Proofing: Evolve Groq’s system stack to support emerging silicon, topologies, and heterogeneous accelerators (e.g., FPGAs).
  • Cross-Functional Collaboration: Partner with teams across compiler, infra, cloud, hardware, and data centers to align architecture and drive shared progress.

Ideal candidates:

  • Consistently ship high-impact, production-ready systems code.
  • Have deep knowledge of computer architecture, operating systems, algorithms, and hardware-software interfaces.
  • Are fluent in low-level systems languages such as C++ or Rust, and comfortable with hardware-aware programming.
  • Rigorously profile and optimize for latency, throughput, and resource efficiency—every cycle counts.
  • Believe in automation and CI/CD best practices—you don’t ship untested code.
  • Thrive across the stack—from kernel internals to hardware integration to cloud load balancers.
  • Communicate clearly, make pragmatic technical decisions, and write maintainable code for the long term.
  • Ensure code stays fast and scales well, and take ownership of outcomes.

Nice to have:

  • Operating large-scale distributed systems for real-time, high-traffic services.
  • Deploying and optimizing ML or HPC workloads in production environments.
  • Hands-on experience with GPUs, FPGAs, or ASICs in performance-critical systems.
  • Familiarity with ML frameworks (e.g., PyTorch) or compiler tools (e.g., MLIR).
  • Experience delivering complex projects in fast-paced, high-impact environments.

Attributes of a Groqster:

  • Humility - Egos are checked at the door
  • Collaborative & Team Savvy - We make up the smartest person in the room, together
  • Growth & Giver Mindset - Learn it all versus know it all, we share knowledge generously
  • Curious & Innovative - Take a creative approach to projects, problems, and design
  • Passion, Grit, & Boldness - No-limit thinking, fueling informed risk-taking

If this sounds like you, we’d love to hear from you!

Compensation: At Groq, a competitive base salary is part of our comprehensive compensation package, which includes equity and benefits. For this role, the base salary range is $248,710 - $336,490, determined by your skills, qualifications, experience and internal benchmarks.

Location: Some roles may require being located near or on our primary sites, as indicated in the job description.  

At Groq: Our goal is to hire and promote an exceptional workforce as diverse as the global populations we serve. Groq is an equal opportunity employer committed to diversity, inclusion, and belonging in all aspects of our organization. We value and celebrate diversity in thought, beliefs, talent, expression, and backgrounds. We know that our individual differences make us better.


Groq is an Equal Opportunity Employer that is committed to inclusion and diversity. Qualified applicants will receive consideration for employment without regard to race, color, religion, national origin, gender, sexual orientation, gender identity, disability or protected veteran status.  We also take affirmative action to offer employment opportunities to minorities, women, individuals with disabilities, and protected veterans.

Groq is committed to working with qualified individuals with physical or mental disabilities. Applicants who would like to contact us regarding the accessibility of our website or who need special assistance or a reasonable accommodation for any part of the application or hiring process may contact us at:  [email protected].  This contact information is for accommodation requests only.  Evaluation of requests for reasonable accommodations will be determined on a case-by-case basis.

Top Skills

C++
MLIR
PyTorch
Rust
