The Senior Observability Engineer designs and maintains observability systems, optimizes workflows, automates processes, and provides technical leadership and support for managed services.
AHEAD builds platforms for digital business. By weaving together advances in cloud infrastructure, automation and analytics, and software delivery, we help enterprises deliver on the promise of digital transformation.
At AHEAD, we prioritize creating a culture of belonging, where all perspectives and voices are represented, valued, respected, and heard. We create spaces to empower everyone to speak up, make change, and drive the culture at AHEAD.
We are an equal opportunity employer, and do not discriminate based on an individual's race, national origin, color, gender, gender identity, gender expression, sexual orientation, religion, age, disability, marital status, or any other protected characteristic under applicable law, whether actual or perceived.
We embrace all candidates that will contribute to the diversification and enrichment of ideas and perspectives at AHEAD.
As a Senior Observability Engineer, you will leverage your extensive expertise in Information Technology to enhance and streamline AHEAD's Managed Services platforms and tooling architecture at a strategic level. This role requires a deep technical understanding and hands-on experience with observability tools, monitoring platforms, and integrations with third-party technologies. You will play a pivotal role in driving the design, development, and maintenance of observability systems, ensuring they are robust, scalable, and aligned with organizational goals.
As a senior member of the team, you will collaborate closely with customer service owners, process owners, and various service delivery groups, providing technical leadership and insights during meetings and discussions. You will oversee documentation and quality assurance for development efforts, ensuring solutions are delivered with excellence. Your role will also focus on automating operational tasks, optimizing workflows, and expanding the operational toolset to drive efficiencies across the Managed Services team.
Responsibilities:
- Configure and tune monitoring tools to allow Managed Services to proactively manage customer environments
- Document processes and standard operating procedures across the managed services team
- Support P1 Platform outages and lead troubleshooting/problem solving, as needed
- Provide third-level support and troubleshooting assistance
- Automate processes and standard operating procedures across the managed services team. These processes could involve working with a variety of technological stacks.
- Engage effectively with customers, vendors, and other team members
- Obtain and/or maintain technical skills required to meet the obligations of our customers
- Document operational processes / procedures to optimize support and management of systems
- Provide proactive solutions and optimizations with our platform.
- Provide emergency after-hours support as part of a scheduled on-call rotation
- Provide periodic after-hours support for scheduled maintenance activities
- Provide Guidance to other Team Members on proper implementation of solutions.
Expectations:
- Recognized subject matter expert in professional discipline
- Contribute to development of innovative and high impact solutions for complex challenges
- Demonstrate deep expertise across multiple automation/tooling technologies
- Able to support the deployment of moderately complex solutions
- Communicate with internal customers and relevant stakeholders
- Communicate with external customers and stakeholders.
- Lead discussions with vendors around new functionality and product needs.
- Provide measurable input into new products, processes, standards, and / or plans.
- Taking a consultative approach with internal and external stakeholders
Desired Skills/Expertise:
- BS/BA Degree in Computer Science or equivalent industry experience
- Recognized subject matter expert in Observability subject matter areas
- 3+ years administrating an enterprise environment with 24x7x365 uptime requirements
- Demonstrated experience with monitoring and event management technologies
- Scripting and automation skills with PowerShell Perl or Python
- Experience interacting with SOAP and Rest APIs
- Excellent oral and written communication skills
- Experience with LogicMonitor Platform
- Experience with Elastic Observability Platform
- Experience building monitoring integrations and connecting with various technologies.
- Experience across multiple observability solutions in complex environments.
- Experience with MSP environments and solutioning standards for multiple customers.
- Experience with the Planning and design of monitoring solutions.
Desired Skills/Expertise:
- Experience with ServiceNow
- Industry technical certifications such as MCSA, MCSE, ITIL, CCNA, NPP etc
- Experience working for a SaaS provider or MSP
- Multiple certifications in LogicMonitor: LMCA, LMCP, LMCI, & LMCD
- Elastic certified Engineer, Observability Engineer, or Analyst
- Solution design and architecture experience
Work Schedule:
- M-F Standard Business Hours
- Periodic after-hours scheduled maintenance activities
- Rotating 24x7 Weekly On-Call Schedule for P1 Incident support
Why AHEAD:
Through our daily work and internal groups like Moving Women AHEAD and RISE AHEAD, we value and benefit from diversity of people, ideas, experience, and everything in between.
We fuel growth by stacking our office with top-notch technologies in a multi-million-dollar lab, by encouraging cross department training and development, sponsoring certifications and credentials for continued learning.
USA Employment Benefits include:
- Medical, Dental, and Vision Insurance
- 401(k)
- Paid company holidays
- Paid time off
- Paid parental and caregiver leave
- Plus more! See benefits https://www.aheadbenefits.com/ for additional details.
The compensation range indicated in this posting reflects the On-Target Earnings (“OTE”) for this role, which includes a base salary and any applicable target bonus amount. This OTE range may vary based on the candidate’s relevant experience, qualifications, and geographic location.
Top Skills
Elastic Observability Platform
Logicmonitor Platform
Monitoring Platforms
Observability Tools
Perl
Powershell
Python
Rest Apis
Servicenow
Soap
Similar Jobs
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Lead the development of ServiceNow's Observability and Data platform, design system architectures, mentor engineers, and drive best practices in coding and design.
Top Skills:
AIJavaKubernetesLog DatabasesPostgresTimeseries DatabasesUnix Shell
Healthtech
The Senior Software Engineer will design and optimize observability solutions, integrate AI/ML models, and collaborate with cross-functional teams for observability capabilities.
Top Skills:
DynatraceGoGrafanaJavaOpentelemetryPrometheusPythonSplunk
Software
Join Temporal's Infrastructure Team as a Senior Software Engineer focusing on building and supporting observability tools for internal and customer-facing use, enhancing productivity and reliability of the developer experience.
Top Skills:
AWSClickhouseGCPGoGrafanaKubernetesLokiPrometheusSQLTemporalThanos
What you need to know about the Los Angeles Tech Scene
Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.
Key Facts About Los Angeles Tech
- Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
- Key Industries: Artificial intelligence, adtech, media, software, game development
- Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
- Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
- Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering