Engineer, DevOps

| El Segundo
Sorry, this job was removed at 11:27 a.m. (PST) on Wednesday, January 24, 2018
Find out who's hiring in El Segundo.
See all Developer + Engineer jobs in El Segundo
Apply
By clicking Apply Now you agree to share your profile information with the hiring company.

Ellie Mae (NYSE:ELLI) is a leading provider of innovative on-demand software solutions and services for the residential mortgage industry. Mortgage lenders of all sizes use Ellie Mae’s Encompass® all-in-one mortgage management solution, Mavent Compliance Service, and AllRegs research, reference and education resources to improve compliance, loan quality and efficiency across the entire mortgage lifecycle. 

Summary of Responsibilities

This is a fantastic opportunity to work and collaborate closely with our software engineering, architecture and operations teams at Ellie Mae. Our Site Reliability Engineers are responsible for ensuring Ellie Mae services are highly available, reliable, secure and scalable. The ideal candidates are fluent in systems programming and/or automation, and can leverage their experience to solve complex problems associated with running production environments at massive scale in multi-tenant environments.

Primary Responsibilities & Objectives

  • Employ deep troubleshooting and scripting skills to improve the availability, performance, and security of Ellie Mae Services.
  • Implementation of proactive monitoring, alerting, trend analysis and self-healing systems
  • Participate in on-call rotations, driving restoration and repair of service-impacting issues
  • Conduct Root Cause Analysis and drive repair of Problem Records in order to prevent recurrence through to closure including, but not limited to, resolution of product/service defects or design changes, infrastructure changes, or operational changes
  • Define non-functional requirements as part of the product lifecycle to influence the new designs, standards, and methods for scalable, highly available distributed systems
  • Author system support documents and update production application service run books where needed
  • Seasoned professional in critical incident triage & response; effective working under pressure
  • Contribute to product development / engineering as needed to ensure Quality of Service of Highly Available services

Qualifications, Skills and Education

  • 5+ years of Systems Engineering in 24x7 Production Services environments
  • BS in Computer Science, Computer Engineering, Math, or equivalent professional experience
  • Fluency with at least one current generation scripting language used by DevOps professionals (Python, Perl, PHP, Ruby) + Java Development and/or .NET
  • Excellent troubleshooter, utilizing a systematic problem-solving approach spanning code,
    systems, and network theory & protocols (TCP/IP, UDP, ICMP) ability to read a packet capture/tcpdump, etc.
  • Demonstrated experience in designing, analyzing, and diagnosing large-scale distributed systems + Windows Server and/or Linux systems internals (system libraries, file systems, client- server protocols)
  • Experience operating on AWS (both PaaS and IaaS offerings)
  • Experience in both Windows (2k8R2+) and Linux (centos) + Security triage & forensic analysis
  • Experience in supporting microservices a big plus
  • NoSQL/Docker/MongoDB/SQL experience a plus
  •  
Read Full Job Description
Apply Now
By clicking Apply Now you agree to share your profile information with the hiring company.

Location

222 N Sepulveda Blvd Ste 1800 , El Segundo, CA 90245

Similar Jobs

Apply Now
By clicking Apply Now you agree to share your profile information with the hiring company.
Learn more about VelocifyFind similar jobs