Whatnot Logo

Whatnot

Software Engineer, Infrastructure

Reposted 9 Hours Ago
Be an Early Applicant
In-Office or Remote
Hiring Remotely in Kraków, Małopolskie
Senior level
In-Office or Remote
Hiring Remotely in Kraków, Małopolskie
Senior level
As a Software Engineer in Infrastructure Reliability, you'll build distributed systems that enhance platform reliability, implement traffic control mechanisms, and develop monitoring tools, working closely with diverse teams.
The summary above was generated by AI
🚀 Join the Future of Commerce with Whatnot!

Whatnot is the largest live shopping platform in North America and Europe to buy, sell, and discover the things you love. We’re re-defining e-commerce by blending community, shopping, and entertainment into a community just for you. As a remote co-located team, we’re inspired by innovation and anchored in our values. With hubs in the US, UK, Germany, Ireland, Poland, and Australia, we’re building the future of online marketplaces –together.

From fashion, beauty, and electronics to collectibles like trading cards, comic books, and even live plants, our live auctions have something for everyone.

And we’re just getting started! As one of the fastest growing marketplaces, we’re looking for bold, forward-thinking problem solvers across all functional areas. Check out the latest Whatnot updates on our news and engineering blogs and join us as we enable anyone to turn their passion into a business, and bring people together through commerce.

💻 Role 

We're looking for software engineers to join our Infrastructure Reliability Engineering team. In this role, you will build the distributed systems, services, and frameworks that make reliability a built-in property of the Whatnot platform. As scale, traffic, and complexity continue to grow, you will ensure our systems stay ahead of that curve.

Whatnot's backend is built primarily in Python and Elixir, with Go used for performance-sensitive infrastructure components. You'll work across these languages depending on the problem, and we value engineers who are comfortable learning new stacks over those who are deep in only one.

This is systems engineering work. You'll design and build components that sit in the critical path of Whatnot's traffic, testing infrastructure that validates system behavior before and during peak events, and developer-facing frameworks that raise the reliability floor across the entire platform. You will partner closely with product, platform, and infrastructure teams to embed reliability into system design, development workflows, and runtime behavior.

You will work on problems like:
  • Designing and building distributed systems that support reliability, resiliency, and safe operation at scale

  • Designing and operating traffic control mechanisms: circuit breakers, rate limiting, admission control, backpressure, and graceful degradation

  • Building and evolving load testing frameworks that validate system behavior under sustained, burst, and peak event traffic patterns

  • Building chaos and resilience testing infrastructure to proactively surface failure modes and validate recovery behavior

  • Building systems that enable teams to define and implement SLOs, SLIs, and error budgets to guide reliability tradeoffs

  • Developing tooling that improves incident detection, response, and automated mitigation

  • Reviewing service architectures with a focus on failure modes, scalability limits, and operational safety

  • Participating in incident response and driving systemic fixes that reduce repeated failure patterns

This is a highly visible role. The Reliability team provides foundational systems and frameworks that allow Whatnot to scale rapidly while remaining stable and trustworthy for buyers and sellers.

👋 About You
  • 5+ years of experience designing and building large-scale distributed systems. Experience in Python, Elixir, or Go is preferred, but strong engineers from other backend stacks who are eager to learn are welcome.

  • You identify as a software engineer first. You want to build systems and write code, not just configure infrastructure or respond to pages.

  • Strong fundamentals in designing, building, and operating shared production services and frameworks.

  • Experience with one or more of the following:

    • Traffic control mechanisms such as circuit breakers and rate limiting

    • Building or operating load testing and chaos testing frameworks

    • Hands-on observability, monitoring, and debugging of production systems

    • SLOs, error budgets, and incident response processes

  • Comfortable in cloud-native environments such as AWS or GCP with Kubernetes and infrastructure as code.

  • Strong collaborator with clear written and verbal communication skills.

  • Bonus: experience with high-traffic, real-time, or event-driven systems.

  • Bonus: experience building developer-facing tools, frameworks, or platform libraries consumed by other engineering teams.

💛 EOE

Whatnot is proud to be an Equal Opportunity Employer. We value diversity, and we do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, parental status, disability status, or any other status protected by local law. We believe that our work is better and our company culture is improved when we encourage, support, and respect the different skills and experiences represented within our workforce.

Top Skills

AWS
Chaos Testing Frameworks
GCP
Incident Response Tools
Kubernetes
Load Testing Frameworks
Monitoring Tools
Observability Tools
Traffic Control Mechanisms
HQ

Whatnot Culver City, California, USA Office

Whatnot Whatnot Office

Culver City, California, United States

Similar Jobs at Whatnot

5 Days Ago
In-Office or Remote
Kraków, Małopolskie, POL
Mid level
Mid level
eCommerce • Mobile
As a Software Engineer on the Payments team, you will build scalable payment systems, manage transactions, and mentor peers while ensuring user satisfaction.
Top Skills: ElixirJavaScriptPython
9 Days Ago
In-Office or Remote
Kraków, Małopolskie, POL
Senior level
Senior level
eCommerce • Mobile
Lead a Reliability Engineering team, develop reliability strategies, oversee load testing, incident response, and ensure systems remain reliable during growth.
Top Skills: Chaos TestingIncident ToolingLoad TestingMonitoringPlatform EngineeringSlo DesignTraffic Control Mechanisms

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account