Genies Logo

Genies

Machine Learning Engineer: ML Infra and Model Optimization

Posted 23 Days Ago
Be an Early Applicant
Easy Apply
In-Office
Los Angeles, CA
215K-275K Annually
Mid level
Easy Apply
In-Office
Los Angeles, CA
215K-275K Annually
Mid level
Design and maintain ML infrastructure for image and 3D models, optimize pipelines, and support ML model deployment at scale.
The summary above was generated by AI

Genies is an avatar technology company powering the next era of interactive digital identity through AI companions. With the Avatar Framework and intuitive creation tools, Genies enables developers, talent, and creators to generate and deploy game-ready AI companions. The company’s technology stack supports full customization, AI-generated fashion and props, and seamless integration of user-generated content (UGC). Backed by investors including Bob Iger, Silver Lake, BOND, and NEA, Genies’ mission is to become the visual and interactive layer for the LLM-powered internet.

Genies is looking for a ML Infra and Model Optimization Engineer to join our R&D team. Based either in our Los Angeles or San Francisco offices (Hybrid), you will work closely with a dedicated and talented team of technical artists, engineers and artists. Together, you will explore new concepts and technologies to further Genies' mission of empowering users to develop their own avatar ecosystems. We're looking for someone who is passionate about creating high-quality visuals and has the technical foundation to help us build the next wave of digital identity. 

What You’ll be Doing: 

  • Design, build, and maintain production-grade ML infrastructure for image and 3D generative models.
  • Develop and own backend services and APIs that support model inference at scale (high concurrency, low latency, high reliability).
  • Deploy, monitor, and operate ML models on cloud and large-scale platforms (e.g., SageMaker, Kubernetes, Ray Serve, custom GPU services).
  • Optimize inference pipelines using model acceleration techniques such as:
    • quantization, pruning, mixed precision
    • ONNX / TensorRT / torch.compile
  • Partner with ML researchers to productionize diffusion models, transformer-based models, and 3D generation systems.
  • Implement evaluation, logging, monitoring, and alerting to ensure system stability and performance.
  • Improve end-to-end system efficiency across data loading, inference, post-processing, and storage.
  • Support rapid experimentation while maintaining production safety and scalability.

What You Should Have: 

  • Strong experience building backend and infrastructure systems in production environments.
  • Proficiency in Python and experience designing APIs/services (e.g., FastAPI, Flask, gRPC).
  • Hands-on experience deploying and operating ML models at scale, including:
    • GPU-based inference services
    • concurrency handling and request batching
    • latency and throughput optimization
  • Experience with cloud platforms and ML deployment stacks, such as:
    • AWS (SageMaker, EC2, EKS), GCP, or similar
    • Docker, containers, CI/CD pipelines
  • Solid understanding of systems performance, debugging, and reliability engineering.
  • Experience supporting real user traffic, not just offline research workflows.

Bonus Skills (Nice-to-Have)

  • Experience with generative models, especially:
    • diffusion models
    • transformer-based architectures
    • multimodal image / 3D pipelines
  • Familiarity with 3D generation or computer graphics pipelines (e.g., meshes, textures, multi-view data).
  • Hands-on experience with model optimization and acceleration, such as:
    • quantization, pruning, distillation
    • ONNX Runtime, TensorRT, FSDP, DeepSpeed
  • Experience with distributed systems or scalable inference frameworks (Ray, Triton, TorchServe).
  • Background in machine learning fundamentals (training, evaluation, model behavior), even if not research-focused.

Here's why you'll love working at Genies:

  • You'll work with a team that you’ll be able to learn from and grow with, including support for your own professional development
  • You'll be at the helm of your own career, shaping it with your own innovative contributions to a nascent team and product
  • You'll enjoy the culture and perks of a startup, with the stability of being well funded 
  • Comprehensive health insurance for you and your family (Anthem + Kaiser Options Available), Dental and Vision Insurance
  • Flexible paid time off, sick time, and paid company holidays, in addition to paid parental leave, bereavement leave, and jury duty leave for full-time employees
  • Health & wellness support through programs such as monthly wellness reimbursement 
  • Working in a brand new, bright, open-environment and fun office space - there’s even a slide! 
  • Choice of MacBook or windows laptop

Salary Range: $215K-$275K depending on experience

Genies is an equal opportunity employer committed to promoting an inclusive work environment free of discrimination and harassment. We value diversity, inclusion, and aim to provide a sense of belonging for everyone.

Top Skills

AWS
Deepspeed
Docker
Ec2
Eks
Fastapi
Flask
GCP
Grpc
Onnx
Python
Ray
Sagemaker
Tensorrt
Torchserve
HQ

Genies Los Angeles, California, USA Office

4121 Redwood Ave, Los Angeles, CA, United States, 90066

Similar Jobs

5 Minutes Ago
Remote or Hybrid
San Diego, CA, USA
107K-150K Annually
Senior level
107K-150K Annually
Senior level
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
The Sr Solution Consultant supports sales by providing technical expertise, leading workshops, answering product queries, and guiding strategic programs in top accounts, while achieving sales goals.
Top Skills: AICloud Software SolutionsServicenow
5 Minutes Ago
Remote or Hybrid
Santa Clara, CA, USA
166K-274K Annually
Senior level
166K-274K Annually
Senior level
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
The Executive Enterprise Architect engages with customers, identifies solutions using ServiceNow, leads architecture engagements, and facilitates communication with executives. They must understand both customer business strategies and the technical capabilities of ServiceNow's platform.
Top Skills: AIAi/MlAnalyticsApplication ArchitectureBianBig DataCloud-Based PlatformsDevOpsEnterprise ArchitectureIntegrationIt4ItMobilityPaasSaaSTmforumTogafZachman
9 Minutes Ago
Remote or Hybrid
Santa Clara, CA, USA
173K-303K Annually
Senior level
173K-303K Annually
Senior level
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Design and develop AI/ML solutions, focusing on scalable software and user experience, while collaborating across teams to integrate generative AI technologies.
Top Skills: AngularJavaKubernetesLlmsOpentelemetryReactVue

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account