Site Reliability Manager

Sorry, this job was removed at 10:08 p.m. (PST) on Thursday, August 17, 2017

About the role:



Are you motivated by an incredible sense of purpose in doing work
that helps keep people safe and businesses running daily, with results
that regularly make headlines? Are you passionate about innovating on
the cutting edge to develop solid architecture principles, operability
guidelines, progressive scaling methodologies, and other sophisticated
techniques to reliably operate critical technology infrastructure at
scale? Do you take an actively engaged servant leadership approach to
empowering high-performance teams to become more than the sum of their
parts? Are you looking to grow your leadership experience while still
spending more time doing work than just talking about it? If so, this
position is a perfect opportunity for you to join the Everbridge Site
Reliability Engineering team in a hands-on tactical leadership role to
help drive the design, implementation, and operation of our global
platforms.



About the team:



As the Everbridge Site Reliability Engineering team, we are
responsible for ensuring overall service quality and availability of
Everbridge's solutions. The technology platforms that we support
automate the international delivery of critical information to help keep
people safe and businesses running. We are a 24x7x365 distributed team
that can do our job anytime, anywhere on the planet with an Internet
connection. Our holistic understanding of OSI layers 0 through 8 allows
us to effectively maintain a heterogeneous blend of worldwide public and
private cloud services where lives and livelihoods are at stake in the
event of failures. We are dedicated, passionate people who are committed
to internal/external customer service and doing the right thing.



What you’ll help us do:



* Keep people safe and businesses running.



* Lead a team owning operational availability, security, scalability,
efficiency, monitoring, instrumentation, and overall service
reliability of Everbridge's solutions.



* Coordinate collaboration across Agile teams with Architects,
Developers, Quality, Data, Security, and other Operations engineers on
designing and implementing highly reliable solutions.



* Establish Site Reliability Engineering principles of proactivity,
automation, cross-functional collaboration, data-driven decision making,
and fast+safe failing to continually improve our technology and
culture.



* Drive the team to enhance our infrastructure, tooling, and
processes to extend operability as a self-service function for other
groups in the engineering value stream.



* Participate in a rotating on-call schedule to troubleshoot and
resolve production escalations from our 24x7x365 NOC, leading incident
command when necessary.



* Have fun while we work hard to make a difference.



Your qualifications include:



* Previous technical leadership and/or management experience in a
production Site Reliability, DevOps, SaaS/Technical Operations, or NOC
environment



* Dedicated commitment to technical excellence and quality customer service



* Ability to write code in at least one programming language (e.g. Python, Perl, Java, Ruby, Go)



* Comfort using Git for practical configuration data and code management



* Expertise with cloud compute IaaS/abstracted PaaS solutions (AWS
Solutions Architect or equivalent) and hybrid/on-premises private
compute environments (VMware Certified Professional or equivalent)



* Deep knowledge in one of these disciplines forms the central pillar of your T-shaped skill set:



o Network architecture and operation with an emphasis on: application
load balancing at local and global scales (F5 BIG-IP LTM/GTM, ELB/Route
53), IPv4 routing and dynamic routing protocols (OSPF, BGP), IPsec VPN,
and network security best practices



o Automation framework orchestration, configuration management, and
software-defined infrastructure management techniques (SaltStack
preferred; others such as Puppet, Chef, or Ansible are also acceptable)



o Large scale production UNIX/Linux operating system, application,
and security maintenance in an online service provider environment
(Ubuntu and Debian GNU/Linux preferred)



* US Citizenship and ability to pass a Federal drug screening



Familiarity with any of the following technology areas is a plus:



* Infrastructure/application monitoring and alerting solutions
(Datadog, Elastic BELK/X-Pack, Prometheus, Nagios, Cacti,
Graphite/Grafana, InfluxDB, OpenTSDB, Splunk, Graylog, etc.)



* Application virtualization, containerization, and
service-oriented-architecture technologies (Nomad & rest of
HashiCorp suite, Docker, Kubernetes, Mesos, CoreOS/rkt)



* Email transport software and deliverability management concepts
(Postfix/Sendmail and derivative commercial MTAs, SPF, DomainKeys/DKIM,
DMARC, IP reputation)



* VoIP (FreeSWITCH or Asterisk w/ SIP) and/or TDM telephony infrastructure



* Cisco IOS/NX-OS, Juniper JUNOS, and related hardware device and
virtual appliance families (Cisco Catalyst/Nexus/ISR/ASR, Juniper
routing/switching/firewall platforms, Brocade Vyatta)



* RDBMS, NoSQL, and hybrid data tier platforms (MongoDB, Elasticsearch, Postgres, MySQL, Riak, Cassandra, HBase, etc.)



* SIEM, HIDS/NIDS, and related infrastructure tooling required to maintain positive control over security



* Practical knowledge of BGP traffic engineering, DDoS mitigation, and active threat defense techniques



* Continuous integration and deployment/delivery pipelines in a release engineering context



* Performance measurement and tuning methodology for capacity planning and bottleneck hunting



About the company:



Everbridge is a global enterprise software company that provides
applications which automate the delivery of critical information to help
keep people safe and businesses running.
During mission-critical business events or man-made or natural
disasters, over 2,700 global customers rely on the Everbridge platform
to quickly and reliably construct and deliver contextual notifications
to millions of people at one time. The company’s platform sent over 1 billion messages in 2015,
and offers the ability to reach more than 200 countries and territories
with secure delivery to over 100 different communication devices. With
headquarters in both Boston and Los Angeles, Everbridge serves 8 of the
10 largest U.S. cities, 7 of the 10 largest U.S.-based investment banks,
24 of the 25 busiest North American airports, and 6 of the 10 largest
global automakers. As a company with a culture that is committed to
“Making a Difference,” Everbridge was recently named a “Best Place to
Work” by both the Boston and Los Angeles Business Journals!



Everbridge is an Equal Opportunity/Affirmative Action employer. All
qualified applicants will receive consideration for employment without
regard to race, creed, color, religion, sex (including sexual orientation
and gender identity), national origin, disability, protected veteran status,
or any other characteristic protected by applicable federal, state, or
local law.


Location

Located very close to a busy part of Pasadena filled with restaurants, bars, and retail.
