d-Matrix

ML Compiler Software Engineering Technical Lead

In-Office
Santa Clara, CA
196K-300K Annually
Senior level
The technical lead oversees the design of an MLIR-based compiler framework for NLP models, collaborating across multiple teams for efficient implementation.

At d-Matrix, we are focused on unleashing the potential of generative AI to power the transformation of technology. We are at the forefront of software and hardware innovation, pushing the boundaries of what is possible. Our culture is one of respect and collaboration.

We value humility and believe in direct communication. Our team is inclusive, and our differing perspectives allow for better solutions. We are seeking individuals who are passionate about tackling challenges and driven by execution. Ready to come find your playground? Together, we can help shape the endless possibilities of AI.

Location:

Hybrid, working onsite at our Santa Clara, CA headquarters 3 days per week.

The role: MLIR Software Engineering Technical Lead

What you will do:

The Compiler Technical Lead drives the design and implementation of the MLIR-based compiler framework. In this role, you will oversee the development of the compiler that partitions and maps large-scale NLP models to our scalable, multi-chiplet, parallel processing architecture with hundreds of digital in-memory tensor processors, vector processors, data shaping processors and both on-chip and off-chip memory. The compiler will also coordinate the scheduling of parallel tasks onto the processors, data movements and inter-processor synchronization. The many-pass compiler architecture requires graph optimization passes, constant folding, data reshaping, padding, tiling and various other backend-specific operations. The software will support a split offline/online mapping process with just-in-time mapping to chiplets, processors and DDR memory channels.

This role requires collaborating with the HW and SW architecture team, the PyTorch front-end pre-processing team, the data science numerics team, the AI kernel team, the SW test group, the benchmark group and the teams developing the various simulator and emulation platforms. It is central to the overall efficiency of the solution. As such, we are seeking an AI compiler expert with experience in TVM, Glow or, preferably, the MLIR project. Familiarity with the LLVM project is also important. Experience mapping graph operations to many-core processors (or spatial fabrics) would be desirable.

This role does NOT require hardware design or verification experience. That said, an understanding of the trade-offs made by processor architects when implementing accelerators for DNNs, DCNNs, transformer models and attention mechanisms is useful - especially when it comes to mapping very large NLP models to such architectures.

What you will bring:

Minimum:

  • BS (MS preferred) in Computer Science or equivalent, with 10+ years of ML compiler experience.

  • Experience establishing, growing and/or developing engineering teams (and software teams in particular).

  • Experience leading agile development is preferred, including coordinating scrums, managing sprints and tracking project tasks with Kanban boards or similar.

  • Experience running code reviews and bug-tracking meetings, and familiarity with CI/CD flows.

  • Experience managing interdependencies with other teams in order to meet milestones and target levels of performance.

  • Excellent documentation and presentation skills.

This role includes technical leadership responsibilities, specifically team motivation, engagement, goal setting, performance tracking and performance management.

Equal Opportunity Employment Policy

d-Matrix is proud to be an equal opportunity workplace and affirmative action employer. We’re committed to fostering an inclusive environment where everyone feels welcomed and empowered to do their best work. We hire the best talent for our teams, regardless of race, religion, color, age, disability, sex, gender identity, sexual orientation, ancestry, genetic information, marital status, national origin, political affiliation, or veteran status. Our focus is on hiring teammates with humble expertise, kindness, dedication and a willingness to embrace challenges and learn together every day.

d-Matrix does not accept resumes or candidate submissions from external agencies. We appreciate the interest and effort of recruitment firms, but we kindly request that individuals interested in opportunities with d-Matrix apply directly through our official channels. This approach allows us to streamline our hiring processes and maintain a consistent and fair evaluation of all applicants. Thank you for your understanding and cooperation.

Top Skills

Agile Development
CI/CD
LLVM
MLIR
PyTorch
