Genies

Machine Learning Engineer: ML Infra and Model Optimization

Posted 23 Days Ago

In-Office

Los Angeles, CA

215K-275K Annually

Mid level

Design and maintain ML infrastructure for image and 3D models, optimize pipelines, and support ML model deployment at scale.

Genies is an avatar technology company powering the next era of interactive digital identity through AI companions. With the Avatar Framework and intuitive creation tools, Genies enables developers, talent, and creators to generate and deploy game-ready AI companions. The company’s technology stack supports full customization, AI-generated fashion and props, and seamless integration of user-generated content (UGC). Backed by investors including Bob Iger, Silver Lake, BOND, and NEA, Genies’ mission is to become the visual and interactive layer for the LLM-powered internet.

Genies is looking for an ML Infra and Model Optimization Engineer to join our R&D team. Based in either our Los Angeles or San Francisco office (hybrid), you will work closely with a dedicated and talented team of technical artists, engineers, and artists. Together, you will explore new concepts and technologies to further Genies' mission of empowering users to develop their own avatar ecosystems. We're looking for someone who is passionate about creating high-quality visuals and has the technical foundation to help us build the next wave of digital identity.

What You’ll Be Doing:

Design, build, and maintain production-grade ML infrastructure for image and 3D generative models.
Develop and own backend services and APIs that support model inference at scale (high concurrency, low latency, high reliability).
Deploy, monitor, and operate ML models on cloud and large-scale platforms (e.g., SageMaker, Kubernetes, Ray Serve, custom GPU services).
Optimize inference pipelines using model acceleration techniques such as:

quantization, pruning, mixed precision
ONNX / TensorRT / torch.compile

Partner with ML researchers to productionize diffusion models, transformer-based models, and 3D generation systems.
Implement evaluation, logging, monitoring, and alerting to ensure system stability and performance.
Improve end-to-end system efficiency across data loading, inference, post-processing, and storage.
Support rapid experimentation while maintaining production safety and scalability.

What You Should Have:

Strong experience building backend and infrastructure systems in production environments.
Proficiency in Python and experience designing APIs/services (e.g., FastAPI, Flask, gRPC).
Hands-on experience deploying and operating ML models at scale, including:

GPU-based inference services
concurrency handling and request batching
latency and throughput optimization

Experience with cloud platforms and ML deployment stacks, such as:

AWS (SageMaker, EC2, EKS), GCP, or similar
Docker, containers, CI/CD pipelines

Solid understanding of systems performance, debugging, and reliability engineering.
Experience supporting real user traffic, not just offline research workflows.

Bonus Skills (Nice-to-Have)

Experience with generative models, especially:

diffusion models
transformer-based architectures
multimodal image / 3D pipelines

Familiarity with 3D generation or computer graphics pipelines (e.g., meshes, textures, multi-view data).
Hands-on experience with model optimization and acceleration, such as:

quantization, pruning, distillation
ONNX Runtime, TensorRT, FSDP, DeepSpeed

Experience with distributed systems or scalable inference frameworks (Ray, Triton, TorchServe).
Background in machine learning fundamentals (training, evaluation, model behavior), even if not research-focused.

Here's why you'll love working at Genies:

You'll work with a team that you’ll be able to learn from and grow with, including support for your own professional development
You'll be at the helm of your own career, shaping it with your own innovative contributions to a nascent team and product
You'll enjoy the culture and perks of a startup, with the stability of being well funded
Comprehensive health insurance for you and your family (Anthem + Kaiser Options Available), Dental and Vision Insurance
Flexible paid time off, sick time, and paid company holidays, in addition to paid parental leave, bereavement leave, and jury duty leave for full-time employees
Health & wellness support through programs such as monthly wellness reimbursement
Working in a brand new, bright, open-environment and fun office space - there’s even a slide!
Choice of MacBook or Windows laptop

Salary Range: $215K-$275K depending on experience

Genies is an equal opportunity employer committed to promoting an inclusive work environment free of discrimination and harassment. We value diversity, inclusion, and aim to provide a sense of belonging for everyone.

Top Skills

AWS, DeepSpeed, Docker, EC2, EKS, FastAPI, Flask, GCP, gRPC, ONNX, Python, Ray, SageMaker, TensorRT, TorchServe

4121 Redwood Ave, Los Angeles, CA, United States, 90066
