Systems Engineer
In this role, the Systems Engineer will develop, maintain and create new tools to help our internal teams. Collaborating with multiple teams within the company, your skills will help streamline processes, reduce manual tasks and work to develop new ways of completing daily work. S/he will participate in operations processes, incident engagement, and disaster recovery activities. The candidate must possess solid critical thinking skills and have experience supporting large server farms, transitioning manual tasks in to automated or scripted work. All while maintaining 24x7 High Availability mission-critical traffic-intensive web infrastructures, and be familiar with commonly used server, storage, and virtualization technologies.
Responsibilities:
- Ensure 100% availability of a SaaS platform that is built around Microsoft .NET tools and technologies.
- Work with development teams on transitioning to a continuous delivery process with automated build, test, and deployment tools such as Jenkins, Chef and PowerShell.
- Work with systems and network engineering teams on infrastructure automation with Chef.
- Debug and resolve internally developed scripts and jobs.
- Collaborate with development and other technology teams on requirements definition, capacity planning, and process refinement.
- Use data from a variety of performance and health management tools to deliver a continuous assessment of application performance and reliability.
- Adhere to existing operational processes and maintain up-to-date operational documentation.
- Monitor industry trends. Research, design, develop and implement solutions for fault tolerance, performance and capacity management.
- Contribute knowledge, skills, and personal qualities to a dedicated team of top engineers solving real-life problems in a bleeding-edge, high-performance, and high-traffic environment.
- Participate in 24/7 on-call rotation and support major incidents as they occur
- Contribute knowledge, skills, and personal qualities to a dedicated team of top engineers solving real-life problems in a bleeding-edge, high-performance, and high-traffic environment.
- Monitor and manage the ticketing queue (JIRA), using the daily work to develop projects to continually improve our daily work.
- Maintain infrastructure documentation and operational knowledge base. Constantly creating new documentation for training purposes.
Qualifications:
- Intermediate level understanding of build/test/deploy automation tools and concepts.
- Very familiar with principles of Continuous Integration and Continuous Delivery.
- Experience with common Microsoft .NET build, test, packaging, and deployment tools and techniques.
- Exposure to common pipeline automation tools such as Jenkins, TeamCity, Microsoft TFS, Selenium.
- Experience with scripted provisioning of servers, applications, and/or infrastructure in a production environment.
- Familiarity with automated configuration management tool such as Chef, Puppet, or CFengine.
- Solid foundation in programming fundamentals (variables, control structures, boolean logic, OOP concepts).
- Experience in at least one of the following programming languages: C#, PowerShell, Ruby, Java.
- Experience with modern software development workflows, including code reviews, revision control, and test-driven development.
- Experience with virtual host configuration and management, either in private (VMWare, Xen) or public (AWS, Azure) cloud environments.
- Intermediate level understanding of design principles for high availability systems and software.
- Experience developing or operating production web applications and solid understanding of HTTP(S), HTML, JavaScript, and XML.
- Experience with troubleshooting distributed web applications.
- Intermediate level knowledge of Windows server and network administration, including some subset of IIS, Active Directory, ADFS, Failover Clusters, and/or SQL Server.
- Comfortable writing basic SQL queries.
- BS degree in Information Technology, Business or related field or equivalent experience.
- 3+ years related experience.
- Understanding of ITIL concepts. Certificate in ITIL Foundations or greater is preferred.
- A minimum of three years of experience in a 24x7 operations organization.
- Working knowledge of cloud platforms (AWS, GCP and/or Azure).
- Familiarity with container technologies (Kubernetes, Docker, Rancher, etc.).
- Experience with CI/CD pipelines