Red Hat

Senior Performance and Scale Engineer, OpenShift AI Platform

Reposted 16 Days Ago
Remote
Hiring Remotely in ON
Senior level
Lead performance and scalability strategy for RHOAI components, build automated benchmarking and analysis tools, identify and resolve performance bottlenecks, collaborate with product and engineering, triage customer performance issues, mentor engineers, and represent Red Hat externally.
The summary above was generated by AI

About the Job

Red Hat’s Performance and Scale Engineering team is looking for a highly motivated Senior Software Engineer to join the PSAP (Performance and Scale for AI Platforms) team. In this high-impact role, you will serve as a technical leader, driving the performance, scalability, and efficiency of Red Hat OpenShift AI (RHOAI).

RHOAI is a cornerstone of Red Hat’s AI portfolio, providing a robust platform for managing the full lifecycle of predictive and generative AI (GenAI) models at scale across the hybrid cloud. As a senior member of this team, you will ensure that RHOAI remains the industry’s premier choice for enterprise-grade AI by tuning components ranging from GenAI API servers to distributed training frameworks.

This is a dynamic role for a Senior Software Engineer with a growth mindset who adapts to rapid change, has a strong commitment to open-source values, and is willing to learn and apply new technologies. You will join a vibrant open source culture and help promote performance and innovation in this Red Hat engineering team. The broader mission of the Performance and Scale team is to establish performance and scale leadership across the Red Hat product and cloud services portfolio. The scope includes component-level, system, and solution analysis and targeted enhancements. The team collaborates with engineering, product management, product marketing, and customer support, as well as Red Hat’s hardware and software ecosystem partners.

What you’ll do

  • Define and lead the performance and scalability strategy for RHOAI components, including but not limited to GenAI API servers, vector databases, MCP Gateways, and Model Registry.

  • Design and maintain tools and automated frameworks to streamline performance data collection and analysis.

  • Identify bottlenecks in component performance and collaborate with core RHOAI engineering teams to drive performance improvements.

  • Triage and resolve complex performance-related customer cases in collaboration with customer-facing teams.

  • Collaborate with Product Management and Core Engineering to influence the product roadmap based on performance data and industry trends.

  • Provide technical guidance and mentorship to junior engineers. Champion a culture of performance-centric development within the broader PSAP team.

  • Represent Red Hat in industry consortia and at global conferences. Author high-impact technical blogs and white papers to establish Red Hat’s thought leadership.

What you’ll bring

  • Bachelor’s degree in Computer Science or related field 

  • 5+ years of software engineering experience

  • Experience in systems-level performance analysis, profiling, and tuning (CPU, Memory, I/O, and Network).

  • Experience with Kubernetes or OpenShift (containers, pods, and orchestration).

  • Strong Python proficiency in designing complex, maintainable automation software and data analysis pipelines.

  • Experience working in a Linux environment with an understanding of system resources (CPU, Memory, I/O).

  • Understanding of AI concepts (classical ML, GenAI, and agentic AI) and knowledge of the AI lifecycle and MLOps workflows.

  • Experience preparing and managing high-quality datasets for accurate benchmarking of data-intensive systems.

  • Proven ability to translate complex performance metrics into clear, actionable insights that bridge the gap between technical engineering and strategic business objectives for stakeholders at all levels.

The following is considered a plus

  • Master’s or PhD in Computer Science or a related quantitative field.

  • 3+ years of relevant industry experience in performance engineering or distributed/operating systems.

  • Advanced experience using AI-assisted coding and productivity tools to optimize team workflows and accelerate complex debugging.

  • Experience with SQL and NoSQL databases and their performance tuning.

  • Significant contributions to open-source projects, particularly in the Kubernetes, MLOps, or AI domains.

  • Experience applying statistical methods to massive datasets, including trend forecasting and anomaly detection.

About Red Hat

Red Hat is the world’s leading provider of enterprise open source software solutions, using a community-powered approach to deliver high-performing Linux, cloud, container, and Kubernetes technologies. Spread across 40+ countries, our associates work flexibly across work environments, from in-office, to office-flex, to fully remote, depending on the requirements of their role. Red Hatters are encouraged to bring their best ideas, no matter their title or tenure. We're a leader in open source because of our open and inclusive environment. We hire creative, passionate people ready to contribute their ideas, help solve complex problems, and make an impact.

Inclusion at Red Hat
Red Hat’s culture is built on the open source principles of transparency, collaboration, and inclusion, where the best ideas can come from anywhere and anyone. When this is realized, it empowers people from different backgrounds, perspectives, and experiences to come together to share ideas, challenge the status quo, and drive innovation. Our aspiration is that everyone experiences this culture with equal opportunity and access, and that all voices are not only heard but also celebrated. We hope you will join our celebration, and we welcome and encourage applicants from all the beautiful dimensions that compose our global village.

Equal Opportunity Policy (EEO)
Red Hat is proud to be an equal opportunity workplace and an affirmative action employer. We review applications for employment without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, citizenship, age, veteran status, genetic information, physical or mental disability, medical condition, marital status, or any other basis prohibited by law.


Red Hat does not seek or accept unsolicited resumes or CVs from recruitment agencies. We are not responsible for, and will not pay, any fees, commissions, or any other payment related to unsolicited resumes or CVs except as required in a written contract between Red Hat and the recruitment agency or party requesting payment of a fee.

Red Hat supports individuals with disabilities and provides reasonable accommodations to job applicants. If you need assistance completing our online job application, email [email protected]. General inquiries, such as those regarding the status of a job application, will not receive a reply.

Top Skills

Distributed Training Frameworks
Kubernetes
Linux
MLOps
NoSQL
OpenShift
Python
SQL
Vector Databases

Red Hat Los Angeles, California, USA Office

811 Wilshire Blvd, Los Angeles, CA, United States, 90017
