Point72 Logo

Point72

Machine Learning Infrastructure Engineer, GenAI Technology

Reposted 5 Days Ago
Remote
Hiring Remotely in United States
180K-300K Annually
Mid level
Remote
Hiring Remotely in United States
180K-300K Annually
Mid level
Design and implement infrastructure to support generative AI and machine learning, optimizing workflows and integrating new technologies while ensuring reliability and performance.
The summary above was generated by AI

A Career with Point72's Technology Team

As Point72 reimagines the future of investing, our Technology team is constantly evolving our firm’s IT infrastructure and engineering capabilities, positioning us at the forefront of a rapidly evolving technology landscape. We’re a team of experts who experiment and work to discover new ways to harness open-source solutions, modern cloud architectures, and sophisticated Artificial Intelligence (AI) solutions, while embracing enterprise agile methodologies. Our commitment to building and innovating in the AI space provides the framework intended to drive smarter decision making and enhance how we build and operate our platforms and applications.

As a member of Point72’s Technology team, we encourage and support your professional development from day one—helping you advance your technical skills, contribute innovative ideas, and satisfy your own intellectual curiosity—all while delivering real business impact for our multi-billion-dollar global business. 


WHAT YOU'LL DO

  • Design and implement high-performance infrastructure to support large-scale generative AI and machine learning workloads, enabling faster model iteration and real business impact
  • Design and operate distributed systems for model training, hyperparameter tuning, inference, and data preprocessing pipelines to deliver reliable end-to-end machine learning (ML) workflows
  • Collaborate with ML researchers and engineers to produce models, optimizing compute utilization, training throughput, and inference latency
  • Develop and automate deployment, orchestration, and CI/CD pipelines for models and data workflows using container orchestration and infrastructure-as-code (IaC)
  • Implement observability, monitoring, and cost-management strategies for GPU and accelerator compute environments to maintain predictable performance and spend
  • Evaluate, integrate, and benchmark emerging hardware and software technologies across cloud and on-prem environments to improve scalability and throughput
  • Drive security, compliance, and operational runbooks for GenAI infrastructure including access controls, secrets management, and incident response procedures
  • Troubleshoot, profile, and optimize performance across GPU and CPU compute stacks to remove bottlenecks and increase reliability
  • Document architecture, operational practices, and mentor engineers to expand team capability and accelerate adoption of production-ready GenAI infrastructure

WHAT'S REQUIRED

  • Bachelor's or master's degree in computer science, electrical engineering, or a related technical field
  • 3–7 years of experience building and maintaining scalable compute or machine learning infrastructure systems
  • Deep understanding of distributed systems, container orchestration (Kubernetes), and public cloud platforms such as AWS, Google Cloud Platform, or Azure
  • Hands-on experience with machine learning operations and infrastructure tools such as MLflow, Ray, Airflow, Kubeflow, and Terraform
  • Strong understanding of reinforcement learning concepts and their infrastructure implications
  • Proficiency in Python and systems-level programming in one or more languages such as Go, C++, or Rust
  • Strong debugging, performance profiling, and optimization skills across GPU and CPU compute stacks
  • Experience implementing monitoring, observability, and cost-optimization for GPU/accelerator-based compute environments
  • Excellent collaboration and communication skills with a systems-thinking mindset
  • Commitment to the highest ethical standards

WE TAKE CARE OF OUR PEOPLE

We invest in our people, their careers, their health, and their well-being. When you work here, we provide:

  • Fully-paid health care benefits
  • Generous parental and family leave policies
  • Volunteer opportunities
  • Support for employee-led affinity groups representing women, people of color and the LGBT+ community
  • Mental and physical wellness programs
  • Tuition assistance
  • A 401(k) savings program with an employer match and more

ABOUT POINT72

Point72 is a leading global alternative investment firm led by Steven A. Cohen. Building on more than 30 years of investing experience, Point72 seeks to deliver superior returns for its investors through fundamental and systematic investing strategies across asset classes and geographies. We aim to attract and retain the industry's brightest talent by cultivating an investor-led culture and committing to our people's long-term growth. For more information, visit https://point72.com/.

The annual base salary range for this role is $180,000-$300,000 (USD) , which does not include discretionary bonus compensation or our comprehensive benefits package. Actual compensation offered to the successful candidate may vary from posted hiring range based upon geographic location, work experience, education, and/or skill level, among other things.



Similar Jobs

35 Minutes Ago
Easy Apply
Remote or Hybrid
Easy Apply
165K-235K Annually
Expert/Leader
165K-235K Annually
Expert/Leader
Cloud • Information Technology • Security • Software • Cybersecurity
Lead design and implementation of highly available, scalable infrastructure across multi-cloud and bare-metal environments. Drive automation-first culture by writing code, build observability (Prometheus/Grafana/OpenTelemetry), define SLIs/SLOs, lead incident response and post-incident analysis, and partner with engineering teams to improve operability and reliability at global scale.
Top Skills: AnsibleAWSAzureBare-MetalBgpC/C++Chaos EngineeringDnsGCPGoGrafanaGreHaproxyHelmIpsecItilLinuxOpentelemetryPrometheusPythonRhelSlis/SlosTemporalTerraform
35 Minutes Ago
Easy Apply
Remote or Hybrid
Easy Apply
119K-170K Annually
Senior level
119K-170K Annually
Senior level
Cloud • Information Technology • Security • Software • Cybersecurity
Lead design and implementation of highly available, scalable infrastructure across cloud and bare-metal. Drive automation using Python/Go, improve observability (Prometheus, Grafana, OpenTelemetry), define SLIs/SLOs, lead incident response and post-incident analysis, and partner with engineering teams to improve operability and reduce mean time to mitigate.
Top Skills: AnsibleAWSAzureBgpC/C++DnsGCPGoGrafanaGreHaproxyHelmIpsecLinuxOpentelemetryPrometheusPythonRhelTemporalTerraform
40 Minutes Ago
Remote
United States
Senior level
Senior level
Edtech • Fintech • Payments • Social Impact • Financial Services • Big Data Analytics
Design, build, and manage scalable AWS infrastructure; optimize GitHub-centric CI/CD; implement observability stacks; automate compliance for GovRAMP/FedRAMP and SOC2; author audit-ready documentation and runbooks; collaborate with engineering and security teams to ensure high availability and reliability.
Top Skills: AWSAws CdkBlue/GreenCanaryCloudFormationCloudwatchDatadogDockerEc2EcsEksGitGithub ActionsGrafanaIamKubernetesLambdaNew RelicNode.jsPrometheusRdsRoute53S3TerraformVpc

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account