Arm Logo

Arm

Director, Product Management - AI Inference Platform

Posted 2 Days Ago
Be an Early Applicant
Hybrid
San Jose, CA
260K-352K Annually
Senior level
Hybrid
San Jose, CA
260K-352K Annually
Senior level
Lead strategy and execution for a next-generation AI inference platform: define roadmap, drive inference architecture and orchestration, enable hardware-software co-design, own control plane services and KPIs, and align cross-functional teams to improve latency, throughput, and cost efficiency.
The summary above was generated by AI
The future isn't just about technology - it's about people and the limitless possibilities AI can unlock for us all. As AI transforms how we live, work, and connect, Arm is at its very foundation, powering the innovations that are shaping our world.
Arm's Cloud AI team is building next-generation platforms to power the AI workloads of tomorrow, and we're growing our product management organization to help guide this journey. This is an opportunity to help shape the product direction of a major strategic investment while also helping build a new team and culture.
About the Role
We are seeking a Director / Principal Product Manager to lead the strategy and execution of a next-generation AI Inference Platform. This role sits at the intersection of hardware and software, defining how modern AI models are executed, served, and optimized at scale across distributed compute environments.
You will own the core platform stack-driving innovation in performance, efficiency, and scalability while partnering closely with engineering, research, and infrastructure teams to deliver world-class AI systems.
Responsibilities:
  • Define and drive the product vision, strategy, and roadmap for large-scale AI inference systems
  • Own product direction across compute execution, inference serving, and control plane systems
  • Drive inference architecture and orchestration strategy, including distributed serving, routing, batching, and scheduling
  • Define capabilities for KV cache management, memory optimization, and token lifecycle efficiency (prefill vs decode)
  • Partner with engineering to enable hardware-software co-design, improving performance across accelerators and interconnects
  • Shape the development of inference platform capabilities that deliver measurable gains in latency, throughput, and cost efficiency
  • Own and evolve control plane services (APIs, policy engines, context/state management, usage/accounting)
  • Translate complex technical systems into clear product value and customer impact
  • Align multi-functional collaborators across infrastructure, research, and product organizations
  • Define and track key performance metrics (P99 latency, TTFT, throughput, cost per inference/token, availability)
  • Influence platform direction across constantly evolving AI workloads (LLMs, multimodal, agentic systems)

Required Skills and Experience :
  • Proven experience in product management within AI/ML infrastructure, cloud platforms, or distributed systems
  • Deep understanding of AI inference systems and large-scale serving architectures
  • Solid understanding of LLM inference concepts (prefill vs decode, KV cache, token streaming)
  • Demonstrated ability to deliver scalable, high-performance platform products
  • Strong technical expertise and ability to collaborate directly with engineering teams

"Nice To Have" Skills and Experience :
  • Experience with accelerator-based systems and performance optimization
  • Familiarity with modern inference frameworks (e.g., TensorRT-LLM, vLLM)
  • Experience working on production-scale AI systems

Why Join Arm?
You will play a critical role in shaping the future of AI infrastructure-defining how next-generation models are deployed, optimized, and scaled efficiently. This is an opportunity to work at the forefront of AI systems, solving complex challenges at the intersection of performance, scalability, and usability.
Salary Range:
$260,000-$351,800 per year
We value people as individuals and our dedication is to reward people competitively and equitably for the work they do and the skills and experience they bring to Arm. Salary is only one component of Arm's offering. The total reward package will be shared with candidates during the recruitment and selection process.
Accommodations at Arm
At Arm, we want to build extraordinary teams. If you need an adjustment or an accommodation during the recruitment process, please email [email protected] . To note, by sending us the requested information, you consent to its use by Arm to arrange for appropriate accommodations. All accommodation or adjustment requests will be treated with confidentiality, and information concerning these requests will only be disclosed as necessary to provide the accommodation. Although this is not an exhaustive list, examples of support include breaks between interviews, having documents read aloud, or office accessibility. Please email us about anything we can do to accommodate you during the recruitment process.
Hybrid Working at Arm
Arm's approach to hybrid working is designed to create a working environment that supports both high performance and personal wellbeing. We believe in bringing people together face to face to enable us to work at pace, whilst recognizing the value of flexibility. Within that framework, we empower groups/teams to determine their own hybrid working patterns, depending on the work and the team's needs. Details of what this means for each role will be shared upon application. In some cases, the flexibility we can offer is limited by local legal, regulatory, tax, or other considerations, and where this is the case, we will collaborate with you to find the best solution. Please talk to us to find out more about what this could look like for you.
Equal Opportunities at Arm
Arm is an equal opportunity employer, committed to providing an environment of mutual respect where equal opportunities are available to all applicants and colleagues. We are a diverse organization of dedicated and innovative individuals, and don't discriminate on the basis of race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.

Similar Jobs at Arm

13 Hours Ago
Hybrid
171K-231K Annually
Senior level
171K-231K Annually
Senior level
Artificial Intelligence • Internet of Things • Semiconductor
The Senior Software Engineer will enhance Arm's MCP server, develop technical content, improve evaluation frameworks, and drive developer enablement through AI agents integration.
Top Skills: Ai-Assisted Development ToolsC++LinuxPython
2 Days Ago
Hybrid
185K-250K Annually
Senior level
185K-250K Annually
Senior level
Artificial Intelligence • Internet of Things • Semiconductor
The Director of Strategic Sales leads business development and partnerships across Cloud AI and Datacenter accounts, focusing on strategic sales growth and ecosystem collaboration.
Top Skills: Ai TrainingCloud AiCustom SiliconDatacenter TechnologiesHeterogeneous ComputeInference Platforms
2 Days Ago
Hybrid
323K-437K Annually
Expert/Leader
323K-437K Annually
Expert/Leader
Artificial Intelligence • Internet of Things • Semiconductor
Lead and manage a global team of Silicon FAEs, ensure customer satisfaction from evaluation to deployment, establish operational frameworks and metrics, and engage with executives on strategic accounts.
Top Skills: DeploymentEvaluationServer SiliconSilicon EngineeringSupport Models

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account