Postman Logo

Postman

Member of Technical Staff, AI Reliability & Monitoring Engineering Lead

Posted Yesterday
Be an Early Applicant
Hybrid
San Francisco, CA
256K-276K
Senior level
Hybrid
San Francisco, CA
256K-276K
Senior level
Seeking an experienced AI Systems Reliability Engineer to ensure the reliability and performance of AI services, focusing on metrics, monitoring, and incident response.
The summary above was generated by AI
Who Are We?

Postman is the world’s leading API platform, used by more than 40 million developers and 500,000 organizations, including 98% of the Fortune 500. Postman is helping developers and professionals across the globe build the API-first world by simplifying each step of the API lifecycle and streamlining collaboration—enabling users to create better APIs, faster.

The company is headquartered in San Francisco and has offices in Boston, New York, and Bangalore - where Postman was founded. Postman is privately held, with funding from Battery Ventures, BOND, Coatue, CRV, Insight Partners, and Nexus Venture Partners. Learn more at postman.com or connect with Postman on X via @getpostman.

P.S: We highly recommend reading The "API-First World" graphic novel to understand the bigger picture and our vision at Postman.

The Opportunity

Postman is seeking an experienced AI Systems Reliability Engineer to help define, build, and maintain the infrastructure and processes that ensure the reliability, scalability, and performance of Postman’s AI-powered API and agentic systems in production. This role focuses on monitoring, availability, incident response, and automation to support AI services and tools trusted by millions of developers globally.

What You’ll Do
  • Develop and manage reliability metrics (SLOs) for AI-driven API services and agentic AI platform features

  • Implement comprehensive observability and monitoring systems for real-time performance and fault detection

  • Design and drive automated failover, recovery, and incident response strategies for high-availability AI infrastructure

  • Optimize resource utilization, particularly GPU/accelerator efficiency, ensuring cost-effective AI system operation

  • Collaborate closely with engineering, platform, and product teams to align reliability efforts with broader organizational goals

  • Lead efforts to build internal tooling and automation focused on AI system stability and operational excellence

  • Drive continuous improvement in deployment practices, monitoring approaches, and incident management processes

About You
  • Have a strong background in AI reliability engineering, SRE, or DevOps for distributed systems

  • Understand the unique challenges of maintaining large-scale AI systems and integrating AI-specific metrics into reliability frameworks

  • Are experienced with cloud platforms, monitoring tools, and incident response automation

  • Are comfortable collaborating across teams to influence best practices for AI system reliability and operational health

  • Thrive in dynamic, fast-paced environments focusing on delivering reliable, safe AI-powered services

Bonus Skills and Experiences

  • Hands-on experience with AI/ML infrastructure, including GPU/xPU optimization and scaling

  • Familiarity with API platform operations and large-scale distributed services

  • Prior experience building or operating observability tools tailored for AI and agentic systems

  • Contribution to open-source projects or reliability engineering thought leadership

The reasonably estimated base salary for this role ranges from $256,000 to $276,000, plus a competitive equity package. Actual compensation is based on the candidate's skills, qualifications, and experience. 

What Else?

In addition to Postman's pay-on-performance philosophy, and a flexible schedule working with a fun, collaborative team, Postman offers a comprehensive set of benefits, including full medical coverage, flexible PTO, wellness reimbursement, and a monthly lunch stipend. Along with that, our wellness programs will help you stay in the best of your physical and mental health. Our frequent and fascinating team-building events will keep you connected, while our donation-matching program can support the causes you care about. We’re building a long-term company with an inclusive culture where everyone can be the best version of themselves. 

At Postman, we embrace a hybrid work model. For all roles based out of San Francisco Bay Area, Boston, Bangalore, Hyderabad, and New York, employees are expected to come into the office 3-days a week. We were thoughtful in our approach which is based on balancing flexibility and collaboration and grounded in feedback from our workforce, leadership team, and peers. The benefits of our hybrid office model will be shared knowledge, brainstorming sessions, communication, and building trust in-person that cannot be replicated via zoom.

Our Values

At Postman, we create with the same curiosity that we see in our users. We value transparency and honest communication about not only successes, but also failures. In our work, we focus on specific goals that add up to a larger vision. Our inclusive work culture ensures that everyone is valued equally as important pieces of our final product. We are dedicated to delivering the best products we can.

Equal opportunity

Postman is an Equal Employment Opportunity and Affirmative Action Employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender perception or identity, national origin, age, marital status, protected veteran status, or disability status. Headhunters and recruitment agencies may not submit resumes/CVs through this website or directly to managers. Postman does not accept unsolicited headhunter and agency resumes. Postman will not pay fees to any third-party agency or company that does not have a signed agreement with Postman.

Top Skills

AI
Cloud Platforms
DevOps
Gpu
Monitoring Tools
Sre

Similar Jobs at Postman

Yesterday
Hybrid
San Francisco, CA, USA
256K-276K
Senior level
256K-276K
Senior level
Software
As a Member of Technical Staff, you will develop AI infrastructure, optimize performance for cloud environments, and collaborate with teams on architecture and reliability.
Top Skills: Ai InfrastructureCloud ComputingDistributed SystemsGoGpu/Xpu AcceleratorsPython
Yesterday
Hybrid
San Francisco, CA, USA
210K-240K
Senior level
210K-240K
Senior level
Software
The role focuses on accelerating enterprise adoption of Postman by creating scalable technical demos, engaging with customers, and supporting field teams. The advocate must have deep technical experience along with strong communication and storytelling skills to connect with developers and executives.
Top Skills: Api DesignCi/Cd PipelinesEnterprise Integration WorkflowsTest Automation
Yesterday
Hybrid
San Francisco, CA, USA
256K-276K
Senior level
256K-276K
Senior level
Software
Lead the design and deployment of AI agents, collaborating with teams to ensure AI safety and mentor technical staff.
Top Skills: JaxPythonPyTorchTensorFlow

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account