Iron Mountain Logo

Iron Mountain

Site Reliability Engineer

Posted Yesterday
In-Office or Remote
2 Locations
Senior level
In-Office or Remote
2 Locations
Senior level
The Site Reliability Engineer at Iron Mountain will troubleshoot escalated tickets, manage Windows Server builds, perform security patching, and collaborate with customers and vendors to resolve issues and maintain systems.
The summary above was generated by AI

At Iron Mountain we know that work, when done well, makes a positive impact for our customers, our employees, and our planet. That’s why we need smart, committed people to join us. Whether you’re looking to start your career or make a change, talk to us and see how you can elevate the power of your work at Iron Mountain.

We provide expert, sustainable solutions in records and information management, digital transformation services, data centers, asset lifecycle management, and fine art storage, handling, and logistics. We proudly partner every day with our 225,000 customers around the world to preserve their invaluable artifacts, extract more from their inventory, and protect their data privacy in innovative and socially responsible ways. 

Are you curious about being part of our growth stor​y while evolving your skills in a culture that will welcome your unique contributions? If so, let's start the conversation.

 Job Summary:

We are looking for a talented Systems Engineer who has the experience necessary to help build out our existing infrastructure and troubleshoot problems as they arise. The ideal candidate for this position can prioritize mission critical tasks and coordinate expansion of our system so updates and other maintenance tasks do not get in the way of daily operations. In addition to solid technical, analytical, and troubleshooting skills, the candidate must have great soft and customer service skills allowing them to confidently interact with customers and explain highly technical concepts in simple, easy to understand terms.

What You’ll Do (Responsibilities)

● Troubleshoot escalated tickets

● Provide support to external and internal customers

● Join critical incident calls for priority 1 issues as the subject matter expert

● Perform on-call duties on a weekly rotation

● Windows Server Builds

● Maintaining backups

● Security patching and vulnerability management

● General break-fix and KLO

● Create and maintain system documentation and support processes

● Communicate verbally and in writing with customers

● Participate in the setup and configuration of new customers, environments, and proof-of-concept solutions

● Work with multiple vendors routinely for support and troubleshooting of the solution

● Work with internal cross-functional teams on troubleshooting system issues

● Responsible for working and prioritizing issues and project tasks according to SLA’s

● Responsible for managing their own tickets and maintaining them on a daily basis

● Participate in testing and post-deployment validation of the system

● Perform Active Directory Services administration and management to include design, cleanup and routine maintenance and configuration

● Other duties and projects as assigned

What You’ll Bring (Skills & Qualifications)

  • Requires a Bachelor’s degree (BS) from an accredited/recognized university.

  • Requires U.S. Citizenship.

  • Core Experience: 5+ years of hands-on technical architecture experience with a track record of delivering production-grade systems.

  • Infrastructure Depth: Expert knowledge in Cloud, Virtualization, Network, Compute, and Storage.

  • Operating Systems: Advanced experience with Windows Server and required proficiency in Linux.

  • Hardware & Virtualization: * Experience with Hyper-Converged Infrastructure (HCI) and Nutanix hardware (highly preferred).

    • Familiarity with Rubrik backup and recovery solutions.

  • Systems Management: * Proficiency in server-oriented architectures and web platform applications.

    • Knowledge of AD Architecture (LDAP, Group Policy, IAM, Schema changes).

    • Experience with Microsoft Endpoint Configuration Manager (MECM) or System Center.

    • Proficiency in PowerShell scripting.

  • ITIL Standards: 3+ years of experience in Incident and Request Management (ITIL v3).

  • Problem Solving: Excellent analytical and troubleshooting skills with a focus on technical resolution.

  • Vendor & Team Coordination: * Proven ability to work with geographically distributed teams.

    • Experience coordinating issue resolution with 3rd party vendors.

  • Communication: * Written and verbal proficiency in English.

    • Strong track record of engaging with external customers and accelerating technical solutions.

  • Soft Skills: A motivated self-starter who thrives in fast-paced environments and takes pride in building new products. 

Category: Operations Group

Top Skills

Cloud
Compute
Hyper-Converged Infrastructure
Linux
Microsoft Endpoint Configuration Manager
Network
Nutanix
Powershell
Rubrik
Storage
Virtualization
Windows Server

Similar Jobs

Yesterday
Remote or Hybrid
United States
165K-235K Annually
Mid level
165K-235K Annually
Mid level
Big Data • Cloud • Productivity • Software • Database • Analytics • Automation
The Site Reliability Engineer will automate tasks, enhance platform infrastructure, improve observability, and lead incident response efforts for optimal performance.
Top Skills: AWSGrafanaHoneycombLinuxPythonTerraform
Yesterday
Remote or Hybrid
130K-170K Annually
Senior level
130K-170K Annually
Senior level
AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development
Oversee operational support of SAP BTP CPI applications, manage incidents, lead support specialists, and collaborate on architecture and governance for finance processes.
Top Skills: Abap ProxiesAemCapmCloud ConnectorCloud FoundryEdge Integration CellIdocJSONMessage QueuesOauthOdataRestSAMLSap BtpSfapiSftpSoapXML
5 Days Ago
Remote or Hybrid
United States
160K-180K Annually
Senior level
160K-180K Annually
Senior level
Artificial Intelligence • Other • Security • Software • Analytics • Big Data Analytics
The Lead Site Reliability Engineer will oversee the Infrastructure SRE team, focusing on system reliability, automation, and mentoring while collaborating with product engineering.
Top Skills: Ci/CdDatadogDockerElk StackGitopsGoKubernetesLinux/UnixNew RelicNoSQLPrometheusPythonSQLStackdriverTerraform

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account