The Lead Site Reliability Engineer will build, deploy, and manage microservices in Kubernetes, optimize cloud applications, and integrate emerging technologies in AI and GenAI, ensuring high reliability and scalability.
Coupa makes margins multiply through its community-generated AI and industry-leading total spend management platform for businesses large and small. Coupa AI is informed by trillions of dollars of direct and indirect spend data across a global network of 10M+ buyers and suppliers. We empower you with the ability to predict, prescribe, and automate smarter, more profitable business decisions to improve operating margins.
Why join Coupa?
🔹 Pioneering Technology: At Coupa, we're at the forefront of innovation, leveraging the latest technology to empower our customers with greater efficiency and visibility in their spend.
🔹 Collaborative Culture: We value collaboration and teamwork, and our culture is driven by transparency, openness, and a shared commitment to excellence.
🔹 Global Impact: Join a company where your work has a global, measurable impact on our clients, the business, and each other.
Learn more on Life at Coupa blog and hear from our employees about their experiences working at Coupa.
The Impact of a Cloud Software Engineer / Lead DevOps Engineer at Coupa:
If you are passionate about new technologies, have a strong technical background and you are looking for an environment where you can continuously expand your knowledge, you are the right fit for this role. At Coupa, the “Cloud team” is looking for a Lead engineer who is ready to constantly question the status quo with a mixture of system design, code development, deployment, automation, networking, and experience in managing Machine Learning/GenAI / Agentic AI platforms.
What You'll Do:
- Build, deploy, and troubleshoot microservices in Kubernetes and Amazon EKS, ensuring scalability and reliability.
- Design secure, highly available web applications with a focus on capacity planning and performance optimization.
- Deploy and manage the lifecycle of LLMs and embedding models, defining KPIs to measure and improve AI application performance.
- Evaluate and integrate emerging technologies such as RAG systems, MCP servers, AI Agents, and agentic workflows into our platform.
- Manage AWS core and GenAI services (S3, IAM, EKS, Bedrock, etc.) using infrastructure-as-code tools like Terraform and Chef, while maintaining observability through tools like New Relic or PagerDuty.
- Collaborate across product, platform, and engineering teams on architecture design, security patching, incident response, and release management to ensure the reliability of our ML and GenAI infrastructure
What You Will Bring to Coupa:
- Bachelor’s degree and 8+ years of experience managing large-scale cloud applications with a strong background in Linux administration and troubleshooting. Excellent communication skills, a collaborative mindset, and the confidence to take ownership, drive solutions, and deliver results independently while thinking globally.
- Over 5 years of hands-on experience managing cloud infrastructure across AWS, GCP, and Azure environments.
- A solid understanding of today’s generative AI ecosystem, with practical experience using LLMs and embedding models (OpenAI, AWS Bedrock, SageMaker); familiarity with vector databases like LanceDB is a plus.
- Strong scripting skills in Bash or Python, and experience with container orchestration platforms like Amazon EKS or Azure AKS.
- Proficiency with DevOps and automation tools such as Chef, GitHub Actions, Rundeck, and IaC frameworks like Terraform, Spacelift, and Helm.
- Working knowledge of DNS, load balancers, and MySQL, along with a good grasp of source control and branching strategies in Git.
Coupa complies with relevant laws and regulations regarding equal opportunity and offers a welcoming and inclusive work environment. Decisions related to hiring, compensation, training, or evaluating performance are made fairly, and we provide equal employment opportunities to all qualified candidates and employees.
Please be advised that inquiries or resumes from recruiters will not be accepted.
By submitting your application, you acknowledge that you have read Coupa’s Privacy Policy and understand that Coupa receives/collects your application, including your personal data, for the purposes of managing Coupa's ongoing recruitment and placement activities, including for employment purposes in the event of a successful application and for notification of future job opportunities if you did not succeed the first time. You will find more details about how your application is processed, the purposes of processing, and how long we retain your application in our Privacy Policy.
Similar Jobs at Coupa
Artificial Intelligence • Fintech • Information Technology • Logistics • Payments • Business Intelligence • Generative AI
The Product Marketing Specialist manages go-to-market plans, executes product releases, and creates marketing content while collaborating across teams and leveraging AI for efficiency.
Top Skills:
AsanaGenerative AiHighspotJIRASalesforceSeismicTableau
Artificial Intelligence • Fintech • Information Technology • Logistics • Payments • Business Intelligence • Generative AI
Lead engineering teams building AI-powered platform products (Navi, Agent Studio) and agent platforms. Drive architecture, productionalize LLM/agent systems, partner with product, establish engineering practices, define metrics, and hire and mentor a global team.
Top Skills:
Agent StudioAgentic AiAi Platform ServicesAi/LlmDistributed SystemsEnterprise SaasNavi
Artificial Intelligence • Fintech • Information Technology • Logistics • Payments • Business Intelligence • Generative AI
As a Product Management Director, you will lead the strategy for Coupa's Contract Management solution, solving customer problems, aligning with engineering, and driving market adoption.
Top Skills:
AIData AnalyticsSaaS
What you need to know about the Los Angeles Tech Scene
Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.
Key Facts About Los Angeles Tech
- Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
- Key Industries: Artificial intelligence, adtech, media, software, game development
- Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
- Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
- Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

