Nubank

Staff Machine Learning Engineer (Infrastructure)

Reposted 2 Days Ago
Remote
Hiring Remotely in USA
Senior level
Design and optimize AI infrastructure, ensuring reliability and efficiency. Lead projects, create ETL pipelines, and support ML workload operations.
The summary above was generated by AI
About Nu

Nu is the world’s largest digital banking platform outside of Asia, serving over 105 million customers across Brazil, Mexico, and Colombia. The company has been leading an industry transformation by leveraging data and proprietary technology to develop innovative products and services. Guided by its mission to fight complexity and empower people, Nu caters to customers’ complete financial journey, promoting financial access and advancement with responsible lending and transparency. The company is powered by an efficient and scalable business model that combines low cost to serve with growing returns. Nu’s impact has been recognized in multiple awards, including Time 100 Companies, Fast Company’s Most Innovative Companies, and Forbes World’s Best Banks. Learn more: https://international.nubank.com.br/careers/


About the role

At Nubank, one of our engineering principles is "Leverage Through Platforms". We believe that platforms are a very efficient way of solving complex concerns shared across different products and teams.

The AI Infrastructure Squad within the AI Core BU builds and scales the foundational cloud, data, and AI infrastructure that powers machine learning workloads across the organization. We design and optimize high-performance training, inference, and data processing systems while ensuring reliability, scalability, and efficiency. Our team enables AI practitioners by providing robust compute, model serving, monitoring, and orchestration frameworks to drive innovation and operational excellence.


As a Software Engineer in the AI Core BU, you are expected to demonstrate:
  • Strong expertise in cloud infrastructure (AWS or GCP) and distributed computing.
  • Experience with Kubernetes, container orchestration, and infrastructure as code (Terraform, Pulumi).
  • Proficiency in programming languages. Experience with Python and Go is a plus.
  • Experience writing ETL pipelines (experience with Spark or BigQuery is preferred).
  • Experience with ML infrastructure, including model training, batch and online inference, and monitoring.
  • Strong knowledge of networking, storage, and security in large-scale systems.
  • Familiarity with workflow orchestration tools (e.g., Dagster, Airflow) and model-serving frameworks (e.g., Ray Serve, vLLM).
  • Experience optimizing the performance and cost efficiency of AI workloads in cloud and on-prem environments.
Project Experience:
  • Proven track record of leading complex infrastructure projects from design to production.
  • Comfortable working on ambiguous and evolving projects, quickly identifying key challenges and driving solutions.
  • Experience in designing high-availability, fault-tolerant systems for AI/ML workloads.
  • Experience with developer tooling, platform engineering, or ML infrastructure, ensuring AI teams can build and deploy efficiently.
  • Hands-on experience with monitoring, observability, and alerting for production systems.

We’re looking for individuals who thrive in horizontal, high-impact teams that build foundational infrastructure for multiple AI initiatives: people who enjoy solving deep technical challenges at the intersection of AI, cloud, and distributed systems, and who take ownership with a strong product mindset, ensuring infrastructure is reliable, scalable, and built around user needs. We value collaborators and mentors who help teammates grow while upholding high engineering standards. If you’re passionate about building scalable, efficient, and cost-effective AI infrastructure that drives meaningful, real-world impact, we’d love to meet you.

If these challenges interest you and you want to work on a highly engaged and talented team, this is the place for you!


What we have to offer
  • High-Impact, Cross-Functional Work – Our team sits at the core of AI operations, enabling ML engineers, researchers, and data scientists to build and deploy models at scale. You'll work across multiple teams and business units, directly shaping AI-driven products and decisions.
  • Cutting-Edge AI & Cloud Infrastructure – Be part of a team that designs and operates high-performance AI infrastructure, spanning cloud, data, and ML platforms. You'll tackle technical challenges in distributed systems, model serving, and large-scale data processing.
  • 0 to 1 & Large-Scale Initiatives – Work on both greenfield projects and mission-critical AI infrastructure, from building scalable training pipelines to optimizing real-time inference workloads. Your work will directly influence the efficiency and scalability of AI across the company.
  • Growth & Ownership Opportunities – As a senior engineer, you'll have the autonomy to drive technical direction, lead high-impact projects, and contribute to architectural decisions. You'll also have opportunities to mentor others, shape engineering best practices, and grow into a leadership role.
  • Culture of Excellence & Collaboration – Join a team that values deep technical expertise, curiosity, and a strong engineering culture. We operate in a fast-moving environment where innovation, reliability, and efficiency drive everything we build.

Our Benefits
  • Remote work, with quarterly trips to Sao Paulo to build relationships with coworkers. 
  • Top Tier Medical Insurance
  • Top Tier Dental and Vision Insurance
  • 20 days of time off, 14 company holidays, and a great culture that emphasizes work-life balance.
  • Life Insurance and AD&D
  • Extended maternity and paternity leaves 
  • Nucleo - Our learning platform of courses
  • NuLanguage - Our language learning program
  • NuCare - Our mental health and wellness assistance program
  • 401(k)
  • Savings Plans - Health Savings Account and Flexible Spending Account

Top Skills

Airflow
AWS
BigQuery
Dagster
GCP
Go
Kubernetes
ML Infrastructure
Pulumi
Python
Ray Serve
Spark
Terraform
vLLM
