Cloudera Logo

Cloudera

Principal Software Engineer, Ozone / HDFS

Reposted 17 Days Ago
Be an Early Applicant
Remote
2 Locations
236K-295K Annually
Senior level
Remote
2 Locations
236K-295K Annually
Senior level
The Principal Software Engineer will design and implement features for Apache Ozone, mentor engineers, and support enterprise customer needs in large-scale distributed systems.
The summary above was generated by AI

Business Area:

Engineering

Seniority Level:

Director

Job Description: 

At Cloudera, we empower people to transform complex data into clear and actionable insights. With as much data under management as the hyperscalers, we're the preferred data partner for the top companies in almost every industry.  Powered by the relentless innovation of the open source community, Cloudera advances digital transformation for the world’s largest enterprises.

Cloudera is looking for an exceptional and passionate software engineer with a strong distributed systems background to join the Storage Engineering team focused on building Apache Ozone. The Storage team is responsible for primary storage and storage access layers, which are core to the platform. Apache Ozone (Apache Ozone) provides a massively scalable distributed object store with a distributed file system interface. Ozone is designed to scale to tens of billions of files and blocks, and overcome the limitations of Hadoop Distributed File System (HDFS), namely, millions of small files and managing a huge number of datanodes. 

Ozone is one of the fastest-growing products inside CDP in terms of customer adoption and expansion revenue. Opportunity to join the team that created and wrote most of the HDFS code and make a huge impact on the big data and cloud computing industry.

As a Principal Software Engineer, you will…

  • You will be directly involved in the design and implementation of the core feature set of Apache Ozone and Apache Ratis (open-source RAFT implementation) 

  • You will regularly contribute code and design docs to the Apache open-source community.

  • As part of storage engineering, you will support enterprise customers running 100s of petabytes-scale big data analytics and ML/AI pipelines. 

  • You will partner with Engineering leaders, product managers, and cross-functional teams as a part of the Cloudera Data platform ecosystem in understanding requirements and turning them into a solid design and implementation, and facilitating integration and adoption.

  • Additionally, in this role, you will be responsible for leading a talented group of engineers working on a feature and mentoring junior engineers.

  • This role is not eligible for immigration sponsorship

We are excited if you have…

  • Bachelor's +10, Master's +8 years of relevant industry experience required (5+ for PhD candidate)

  • Strong backend engineering skill set with expertise in Java, or strong C++ skills, with intermediate Java expertise

  • Passionate about programming. Clean coding habits, attention to detail, and focus on quality

  • Experience with large-scale, distributed systems design and development with a strong understanding of scaling, replication, consistency, and high availability

  • Solid experience with system software design and development with a strong understanding of computer architecture, storage, network, and IO subsystems, and distributed systems

  • Hands-on programmer with strong data structures and algorithms skillset

  • Strong oral and written communication skills

You may also have..

  • Strong background in a distributed storage system, including file systems, database storage internals, NoSQL storage, or distributed hash tables

  • Strong background in performance tuning, identifying performance bottlenecks, and implementing performance optimizations

  • Strong understanding of the Apache Big Data ecosystem and over 3+ years of experience in systems software, including file systems

  • Recognized contributions to open source projects

  • Experience using projects such as Hive, Pig, MapReduce, HBase, etc., is a big plus

  • Good Understanding of storage development, RAFT replication framework, or equivalent distributed consensus frameworks

This role is not eligible for immigration sponsorship

 The expected base salary range for this role in

  • California is $236,000-$295,000

Please note that the compensation details listed in our job  postings reflect the base salary only, and do not include commissions or bonus as applicable. The salary will vary depending on your job-related skills, experience and location 

What you can expect from us:

  • Generous PTO Policy 

  • Support work life balance with Unplugged Days

  • Flexible WFH Policy 

  • Mental & Physical Wellness programs 

  • Phone and Internet Reimbursement program 

  • Access to Continued Career Development 

  • Comprehensive Benefits and Competitive Packages 

  • Paid Volunteer Time

  • Employee Resource Groups

EEO/VEVRAA

#LI-HYBRID

#LI-SZ1

Top Skills

Apache Ozone
Apache Ratis
C++
Hadoop
Hbase
Hive
Java
Mapreduce
Pig

Similar Jobs

Yesterday
Easy Apply
Remote
USA
Easy Apply
173K-254K
Senior level
173K-254K
Senior level
Consumer Web • Healthtech • Professional Services • Social Impact • Software
As Counsel for Regulatory and Privacy, you will guide product and operations teams on healthcare regulations and privacy laws, ensuring compliance while enabling innovation.
Top Skills: Artificial IntelligenceHealth TechnologyHipaa
Yesterday
Remote or Hybrid
US
126K-185K Annually
Senior level
126K-185K Annually
Senior level
Artificial Intelligence • eCommerce • Information Technology • Internet of Things • Automation
Design, implement, and manage IAM and IGA solutions. Collaborate with teams for compliance and support identity management processes, ensuring security and optimization.
Top Skills: Azure Active DirectoryEntra Id GovernanceForgerockHrm Systems (WorkdayIdentity And Access Management (Iam)Identity Governance And Administration (Iga)MimOauthOidcOktaPeoplesoft)SailpointSAMLScim
Yesterday
Remote or Hybrid
USA
Junior
Junior
Artificial Intelligence • Cloud • Mobile • Security • Software
Design, code, and operate high performance backend services and data processing systems primarily using Scala and AWS technologies.
Top Skills: SparkAWSDynamoDBKafkaPostgresRedshiftScala

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account