Sparksoft Corporation Logo

Sparksoft Corporation

Site Reliability Engineer (SRE)

Posted Yesterday
Remote
2 Locations
8-9
Senior level
Remote
2 Locations
8-9
Senior level
Seeking a proactive Site Reliability Engineer to ensure the reliability and performance of cloud-based systems. Responsibilities include designing AWS infrastructure, conducting performance tests, managing CI/CD pipelines, and improving system observability.
The summary above was generated by AI

Join us at Sparksoft, where we're not just another tech company—we're a catalyst for change. Our mission isn't just to offer IT solutions; it's to revolutionize the way you work. Here, passion isn't just a buzzword; it's the fuel behind groundbreaking ideas and transformative technologies. We serve a wide range of government clients, delivering impact that's felt across the nation.

Our true strength lies in our people. They're the problem-solvers and innovators consistently delivering extraordinary outcomes. With Sparksoft, you're not stepping into a routine job; you're joining a team committed to innovation and excellence. Our innovation extends beyond just delivering projects. Through our specialized Innovation Centers, we continuously refine our methods, ensuring we remain industry leaders.

We are Sparksoft!

ROLE & RESPONSIBILITIES:

We are seeking a skilled and proactive Site Reliability Engineer (SRE) with strong expertise in AWS infrastructure and performance testing. The ideal candidate will be responsible for ensuring the reliability, scalability, and performance of our cloud-based systems and applications. This role involves close collaboration with development, operations, and QA teams to build robust systems and improve service uptime

  • Design, implement, and maintain scalable and reliable infrastructure on AWS.
  • Monitor system performance and availability using tools like New Relic and Splunk and AWS CloudWatch.
  • Conduct performance testing using tools such as JMeter, Performance Center to identify bottlenecks and optimize system performance.
  • Develop and maintain CI/CD pipelines to support rapid and reliable software delivery.
  • Implement and manage incident response processes, including root cause analysis and postmortems.
  • Created dashboards and configured alerts in Splunk, New Relic, and AWS CloudWatch to monitor system performance, availability, and application health.
  • Collaborate with development teams to improve system reliability and performance.
  • Ensure security and compliance standards are met across infrastructure and applications.
  • Continuously improve observability and alerting systems.

REQUIRED EXPERIENCE: 

  • 8-9+ years of experience as an SRE, DevOps Engineer, Performance engineer or similar role.
  • Strong hands-on experience with AWS services (EC2, EBS, Lambda, RDS, S3, ECS, EKS etc.).
  • Proficiency in performance testing tools like JMeter, Performance Center and methodologies.
  • Experience with containerization and orchestration (Docker, Kubernetes).
  • Experience with monitoring and logging tools like CloudWatch, New Relic and Splunk.
  • Strong scripting skills (Bash, Groovy, Java Script, Python).
  • Experience with performance diagnostics, tuning, and JVM/JBoss EAP on Linux and on AWS EKS.
  • Working knowledge of Agile development practices.
  • Excellent problem-solving and communication skills.
  • Candidates must be able to obtain and maintain a Public Trust clearance
  • Candidates must have lived in the United States 3 out of the past 5 years

PREFERRED EXPERIENCE:

  • Experience with NoSQL databases.
  • Exposure to other cloud platforms like Azure, Google Cloud, or VMware.
  • Understanding of high availability and disaster recovery architectures.
  • Knowledge of application networking, firewalls, and load balancing.
  • Experience with both Linux and Windows operating systems.

EDUCATION & CERTIFICATIONS:

  • Bachelor’s degree in computer science, Information Technology or equivalent
  • AWS Certifications are preferred

If you need accommodation seeking employment with Sparksoft Corporation, please email [email protected] or call 410-424-7700. Accommodations are made on a case-by-case basis.

At Sparksoft Corporation, we take security and protection of personal information very seriously. We will never ask you to send private personal information over email. Accordingly, we ask you to immediately contact our security team via email at [email protected] upon receiving a suspicious request.

Top Skills

AWS
Aws Cloudwatch
Bash
Docker
Groovy
Java Script
Jmeter
Kubernetes
New Relic
Performance Center
Python
Splunk

Similar Jobs

Yesterday
Remote or Hybrid
New York, NY, USA
135K-155K Annually
Senior level
135K-155K Annually
Senior level
AdTech • Big Data • Digital Media • Software
As a Senior Site Reliability Engineer, you will provide technical leadership, improve system reliability, and collaborate on infrastructure projects, ensuring operational integrity 24/7.
Top Skills: AnsibleArgo CdAws EcrGitGithub ActionsJenkinsKubernetesNexusPuppetTerraformYum
Yesterday
Remote or Hybrid
United States
Senior level
Senior level
Artificial Intelligence • Fintech • Information Technology • Software • Data Privacy
The Principal Site Reliability Engineer ensures SaaS products are fast and stable, optimizes performance, automates deployments, and champions best practices for system operations.
Top Skills: .NetAksAnsibleAppdynamicsAzureAzure DevopsBashC#Cloud NetworkingCosmosDatadogDynatraceEksFirewallHarnessIdera Sql Diagnostic ManagerJavaJenkinsKubernetesLoad BalancingNew RelicPowershellPythonRedgate Sql MonitorSolarwinds Database Performance AnalyzerSQLTerraform
2 Days Ago
Remote or Hybrid
New York, NY, USA
130K-180K Annually
Senior level
130K-180K Annually
Senior level
AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development
The Staff Software Engineer will manage the support of SAP BTP applications, leading a team to ensure system performance, availability, and implementation of integration patterns while collaborating with product teams on enhancements and troubleshooting.
Top Skills: Abap ProxiesCapmIdentity ManagementIdocJSONMessage QueuesOauthOdataRestSAMLSap AribaSap BtpSap C4CSap CallidusSap CpiSap Success FactorsSfapiSftpSoapWorkdayXML

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account