iHerb is seeking a strong Senior Site Reliability Engineer to join our SRE Practice group. iHerb’s ecommerce platform serves millions of customers across 150+ countries. Accordingly, we have a widely distributed global footprint with unique network, data and scalability challenges.
Our ideal candidate has a software engineering background and a DevOps mindset. You will also have a deep passion for automation and observability, with a keen understanding of how distributed systems play together. You will leverage this understanding to help build bridges between our engineering, infrastructure, SRE and NOC functions.
Working as part of the SRE Practice area, support development teams in understanding and leveraging SLI’s and SLO’s to improve service availability and performance.
Contribute to instrumentation and platform libraries in various languages such as Go and C#.
Collaborate with our NOC team to build out proper SOP’s around identifying root cause, escalation and remediation.
Work with other SRE’s to champion and establish KPIs for measurement of the SRE Practice’s success and improvement.
Lead SRE audit and sign-off meetings, ensuring services are ready for production.
Assist in building a blameless post-mortem culture that leads to fixed weaknesses and increased reliability.
Knowledge, Skills and Abilities:
Experience in open-source and commercial observability tooling such as Prometheus, Jaeger, EFK stack, Grafana, Datadog, NewRelic, etc.
Infrastructure/cloud automation tooling (e.g. Terraform, Packer, Helm)
Service Mesh/Discovery Tooling (e.g. Consul, Envoy, Istio, etcd)
Continuous Integration (e.g. Spinnaker, Jenkins)
Containers and Container Management (Docker, Kubernetes)
Configuration and Security Management (e.g. Puppet, Chef, Ansible, Salt, Vault, KMS)
Demonstrated experience leading/contributing significantly to an open-source infrastructure/application platform initiative a big plus (e.g. Kubernetes/Istio/etc. upstream commits)
Experience as a platform or back-end engineer in a prominent language such as Go, Python, Java or C# is a strong plus.
Generally, requires a minimum of five (5) to seven (7) plus years of experience as a Software Engineer, DevOps Engineer and/or Systems Engineer
Bachelor’s Degree in Information Technology/Computer Science or a related field preferred and/or equivalent years of experience