Manager, Cloud Infrastructure
In this leadership role, the Manager – Cloud Infrastructure will oversee daily operations of BlackLine's global data centers, including network infrastructure, storage, and servers. They will manage 24/7 engineering staff, operations processes, incident engagement, and disaster recovery activities. The candidate must possess solid critical thinking skills, have experience supporting large server farms and 24/7 high-availability, mission-critical, traffic-intensive web infrastructures, and be familiar with commonly used server, storage, and virtualization technologies.
Responsibilities:
- Lead a dedicated team of top engineers solving real-life problems in a high-performance, high-traffic environment.
- Ensure 99.99%+ availability of infrastructure that spans multiple global data centers in private and public clouds.
- Monitor and maintain health, performance, and security of all infrastructure components.
- Maintain and improve efficiency of the infrastructure processes. Automate wherever possible.
- Adhere to the change management and other established processes and procedures.
- Evaluate and analyze systems, performance, issues and metrics to provide recommendations for continuous improvement.
- Monitor and plan for capacity and growth.
- Maintain documentation and operational knowledge base.
- Respond to and troubleshoot incidents. Participate in root cause analyses.
- Be on call for incidents and maintenance windows as needed.
- Contribute to management of departmental budget.
- Support infrastructure assessments and audit activities.
- Establish and maintain vendor relationships, negotiate services, and manage service level agreements.
- Support negotiations and administration of vendor contracts and service agreements.
- Manage business continuity and disaster recovery. Conduct DR tests.
- Ensure safety and security best practices are always followed when working in data centers.
- Maintain inventory of physical and virtual assets.
Qualifications:
- 5+ years supporting a critical, revenue-generating SaaS or hosting environment.
- Substantial technical knowledge of enterprise hardware and software including Kaminario, NetApp, Palo Alto, F5, Cisco UCS, Brocade, and VMware.
- Proven data center management experience.
- Working knowledge of Windows Server, Active Directory, ASP.NET, MS SQL Server, networking, caching, and load-balancing technologies. UNIX/Linux experience desirable.
- Experience working in a strictly change-controlled, 24/7 environment.
- Skill in managing and prioritizing the troubleshooting of enterprise services with complex interactions between applications, operating systems, network protocols, and client configurations.
- Experience with compliance activities associated with SOX and PCI DSS.
- Strong problem-solving methodology and root cause analysis skills.
- Ability to work with individuals at all levels across the organization.
- Experience managing large, complex projects across multiple teams and disciplines.
- Empathy for working with support teams to identify and remedy pain points.
- Energized by a fast-paced, iterative approach.
- Hands-on problem-solving skills, technical leadership and mentoring qualities.