CIQ Logo

CIQ

Senior/Principal Performance Engineer

Reposted Yesterday
Remote
Hiring Remotely in USA
Senior level
Remote
Hiring Remotely in USA
Senior level
The Senior/Principal Performance Engineer will lead system and application performance across products, optimize workloads, enhance CI/CD pipelines, and integrate AI tools while collaborating across teams to ensure top-tier performance outcomes.
The summary above was generated by AI

CIQ OVERVIEW

CIQ builds the enterprise infrastructure that powers the world's most demanding workloads. From the operating system layer through AI infrastructure, high-performance computing, and cloud-native orchestration, CIQ delivers the speed, security, scalability, and sovereignty that major enterprises, government agencies, and research institutions depend on.

CIQ is the founding support and services partner of Rocky Linux and the developer of the RLC Pro family of Enterprise Linux distributions, Fuzzball workload orchestration, Warewulf Pro cluster provisioning, and Ascender Pro automation. Our customers include some of the largest and most technically sophisticated organizations in the world, working across HPC, AI/ML, defense, and regulated industries.

We are a company of builders, operators, and open source practitioners. If you want to do work that matters, at a company that is genuinely changing how enterprise infrastructure gets built and run, we want to talk.

CIQ is seeking a highly experienced Senior or Principal Performance Engineer to own and drive system-level and application-level performance across our product portfolio. This is an AI-first role, both in methodology and focus area,  and the right candidate will bring deep expertise in operating system internals, kernel and userspace performance, and the performance demands of modern AI workloads, HPC environments, general-purpose computing, and production service workloads.

In this role, you will be the standard-bearer for performance at CIQ. Our performance-focused solutions must always be the fastest in the industry, and you will be responsible for ensuring that remains true. You will be intimately involved with Fuzzball, CIQ's cloud-native computing platform, learning its architecture end-to-end and integrating workloads - both user-facing and CI/testing pipelines - directly through it.

Position Summary

This role is leveled as Senior or Principal based on qualifications and demonstrated capabilities.

Benchmarking & Profiling

  • Design, develop, and maintain comprehensive benchmarking frameworks spanning OS, kernel, and application layers.
  • Profile workloads across CPU, memory, I/O, network, and accelerator (GPU/NPU) subsystems to identify bottlenecks and optimization opportunities.
  • Establish and own performance baselines across CIQ's product and solutions portfolio.
  • Leverage AI-assisted tooling and agentic workflows to accelerate profiling, analysis, and root cause identification.

Regression Detection & Resolution

  • Build and maintain automated performance regression-detection pipelines integrated into CI/CD workflows using Fuzzball.
  • Identify, triage, and resolve regressions across user space, kernel space, and application layers with urgency and rigor.
  • Collaborate across engineering teams to root-cause regressions introduced by upstream kernel changes, compiler updates, or library modifications.

Proactive Performance Engineering

  • Drive proactive performance improvements - not just reactive fixes - to keep CIQ solutions ahead of the competition across every layer of the stack.
  • Own core operating system performance: kernel subsystem tuning (scheduler, memory management, I/O, networking), system call overhead reduction, and user space library and runtime optimizations.
  • Identify and implement kernel-level enhancements, including patches, configuration changes, and upstream contributions that yield measurable performance gains for CIQ's customer workloads.
  • Optimize for AI inference and training workloads, including LLM serving, model parallelism, and accelerator utilization.
  • Tune performance for HPC workloads, including modeling, simulation, and tightly coupled parallel applications (MPI, OpenMP, etc.).
  • Optimize general computing and service workloads - web services, databases, messaging systems, and other production software that runs on CIQ's OS platform.
  • Work at all levels of the stack: compiler flags, kernel parameters, scheduler tuning, NUMA topology, memory allocation, and application-level algorithmic improvements.

AI-First Approach

  • Champion an AI-first engineering philosophy - use AI tools, agents, and automation to accelerate your own productivity and the quality of performance insights.
  • Identify and prioritize optimization opportunities that directly impact AI training throughput and inference latency/cost.
  • Stay current on state-of-the-art techniques in ML system performance, including quantization, batching strategies, kernel fusion, and hardware-software co-design.

Fuzzball Integration

  • Develop deep expertise in CIQ's Fuzzball platform - its architecture, scheduling, and workload execution model.
  • Integrate performance benchmarks, regression tests, and user-facing workloads into Fuzzball-based pipelines.
  • Contribute to the performance characterization of Fuzzball itself, ensuring the platform adds minimal overhead and scales efficiently.

Cross-Functional Collaboration

  • Develop broad familiarity with the full CIQ product portfolio — including Rocky Linux and RLC (and its variants), Fuzzball, Apptainer (formerly Singularity), and Warewulf - understanding how performance considerations span and interconnect across each.
  • Collaborate deeply with the engineering teams behind each product line to surface, prioritize, and deliver performance improvements that benefit customers across the entire CIQ ecosystem.
  • Partner with product and customer success teams to translate real-world performance pain points into engineering priorities and measurable outcomes.
  • Document and communicate findings clearly - from low-level profiling data to executive-level summaries.
  • Contribute to technical publications, conference presentations, and thought leadership that reinforces CIQ's reputation for performance excellence.

NEEDED TO SUCCEED

Successful candidates will have:

  • A deep, principled understanding of operating system internals -  Linux kernel scheduler, memory subsystem, I/O stack, and networking.
  • Proven experience identifying and resolving performance regressions across kernel and user space in production environments.
  • Hands-on expertise with profiling and tracing tools: perf, eBPF/bpftrace, Flamegraphs, VTune, Nsight, strace, ftrace, and similar.
  • Strong background in AI/ML workload performance - including inference optimization (TensorRT, ONNX, vLLM, or similar), training efficiency, and GPU/accelerator utilization.
  • Experience with HPC workloads: MPI, OpenMP, parallel filesystems, RDMA/InfiniBand, and job schedulers (Slurm, PBS, etc.).
  • Familiarity with modern AI-first development workflows and comfort using LLM-based tools to accelerate engineering work.
  • Experience building automated performance testing and regression detection pipelines in CI/CD environments.
  • Excellent analytical skills -  able to form hypotheses, design experiments, and draw actionable conclusions from complex data.
  • Strong written and verbal communication skills; able to present findings to both deeply technical audiences and business stakeholders.
  • A collaborative, humble, and always-learning mindset -  combined with the confidence to champion performance as a first-class engineering concern.

EDUCATION AND EXPERIENCE

  • PhD in Computer Science, Computer Engineering, or a related field strongly preferred; equivalent industry experience considered.
  • 15+ years of industry experience in systems performance engineering, OS development, or a closely related discipline.
  • Demonstrated track record of measurable, published, or production-deployed performance improvements at scale.
  • Experience working in or with open-source ecosystems (Linux kernel contributions, upstream community engagement) is a strong plus.
  • Background with cloud-native, containerized, and/or HPC computing environments preferred.

BENEFITS

  • Medical, dental, and vision insurance.

  • Flexible paid time off.

  • Employee stock options.

  • Remote work; no travel required for most positions.

 

Similar Jobs

13 Minutes Ago
Remote or Hybrid
77K-202K Annually
Senior level
77K-202K Annually
Senior level
Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
Senior Data Engineer on PwC's Managed Data, Analytics & Insights team to design, build and manage advanced data ecosystems. Responsibilities include designing data solutions and scalable pipelines, solving complex problems, mentoring junior staff, maintaining high delivery standards, and building client relationships while aligning solutions to business context.
Top Skills: DatabricksKafka
21 Minutes Ago
Remote or Hybrid
United States
81K-109K Annually
Junior
81K-109K Annually
Junior
Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
Field leadership role driving aftersales performance across a Nebraska district. Partner with dealership leadership to execute Customer Care & Aftersales initiatives, improve customer satisfaction, grow revenue, analyze performance, develop business plans, resolve warranty and service issues, and deliver on KPIs through data-driven insights and frequent onsite engagement.
Top Skills: Automotive Parts And Service SystemsDealer Operating ReportsFixed Ops Analysis ToolsExcelSales Reporting Tool (Srt)
21 Minutes Ago
Remote or Hybrid
United States
81K-109K Annually
Junior
81K-109K Annually
Junior
Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
Regional field leader for dealership aftersales in Columbus, driving parts and service revenue, customer retention, and satisfaction. Partners with dealer leadership to execute CCA programs, analyze performance, develop business plans, deliver KPI improvements, resolve aftersales issues, and provide consultative operational guidance. Requires frequent on-site visits and data-driven recommendations.
Top Skills: ExcelSales Reporting Tool (Srt)

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account