About ThriveCart:
ThriveCart is the leading no-code sales platform for digital course creators, coaches, entrepreneurs, and online businesses looking to boost revenue, drive conversions, and scale audiences. ThriveCart powers over 65,000 businesses and 12 million enrolled students, generating over $2 billion in annual sales. The platform provides all the tools businesses need to create high-converting checkout experiences, manage powerful affiliate campaigns, and deliver seamless student experiences with its built-in learning management system, Learn/Learn+.
Location:
Remote in Europe or in Asia
Must have full authorization to work in country of residence being in Europe or in Asia.
The RoleWe're looking for an able hands-on DevOps engineer who loves being in the trenches. You'll spend your days building CloudWatch dashboards, writing Terraform code, investigating production issues, and making our infrastructure more observable and reliable.
This is NOT a planning or coordination role - it's about doing the technical work that keeps a revenue-generating platform running smoothly.
What you'll do:
- Build monitoring systems - Create CloudWatch dashboards and alarms that catch real problems (payment failures, database issues, API errors) before customers notice
- Write infrastructure code - Use Terraform to manage AWS resources (EC2, RDS, CloudWatch, IAM) with code review and automation
- Improve deployments - Make releases faster and safer through CI/CD improvements and automated rollbacks
- Handle production issues - Investigate anomalies, respond to alerts, find root causes, and prevent future occurrences
- Analyze performance - Identify bottlenecks in payment processing, database queries, and API responses
You must have:
- ⭐ Terraform production experience - managing real infrastructure with IaC
- ⭐ AWS CloudWatch experience - building dashboards, configuring alarms, writing custom metrics
- Production operations - supporting high-traffic web applications where downtime costs money
- Database operations - Database system monitoring, performance tuning, backups
- Docker & containers - Docker Compose, multi-container applications
- Linux system administration - comfortable with shell scripting and troubleshooting
- Web servers - Nginx or Apache configuration
- E-commerce or payment processing infrastructure experience
- PCI compliance knowledge
- Multi-database architecture experience
- On-call experience for revenue-critical systems
- CI/CD pipeline construction
- Proactive - You spot problems before they become incidents
- Systematic - You troubleshoot methodically with data, not guesses
- Clear communicator - You explain complex issues in simple terms
- Detail-oriented - You catch edge cases and think about failure modes
- Calm under pressure - Production incidents don't rattle you
- Documentation-focused - You write runbooks because you know future-you will thank present-you
Infrastructure you'll work with:
- AWS CloudWatch
- Terraform for infrastructure as code
- Docker for development environments
- MariaDB 10.11 (multi-database architecture)
- PHP & Node/typescript application stack
- Nginx web server
- Memcached caching
- AWS: EC2, RDS, Route53, IAM
You'll be responsible for:
- Production monitoring and alerting
- Infrastructure as code (Terraform)
- CI/CD pipeline improvements
- Database operations and optimization
- Container orchestration
- Security and compliance support
- High impact - Your observability will directly affect how fast we find and fix issues
- Real stakes - When payments fail, merchants lose money. Your work matters.
- Hands-on - You'll write code and build systems, not just coordinate or plan
- Complex system - Multi-database, legacy + modern, payment processing - interesting technical challenges
- Visibility - Your work is immediately visible to the entire engineering team
In 30 days:
- You've responded to your first production alert
- You understand the architecture and data flows
- You've identified gaps in our current monitoring
In 90 days:
- You've built 2+ dashboards the team relies on
- You've implemented 5+ alarms for previously blind spots
- You handle production incidents independently
In 6 months:
- You've proactively prevented multiple production issues
- The team depends on your monitoring systems
- You've led an infrastructure improvement project
- You're comfortable on the on-call rotation
- Shared rotation (about 1 week per month)
- Focus on critical alerts (payment processing, database issues)
- Most scenarios have documented runbooks
- Team backup available for complex issues
- Compensated with time off or additional pay
✅ Direct impact - Your work directly affects revenue and reliability
✅ Modern practices - Terraform, IaC, CloudWatch, Docker
✅ Interesting problems - Complex distributed systems with real-world challenges
✅ Learning opportunities - Diverse tech stack (legacy + modern)
✅ Collaborative team - Work with experienced engineers
✅ Remote flexibility - FULL Remote role
Our Values are:
Commit to Excellence
Find a Way
Help Each Other
Keep Growing
Data First
Top Skills
Similar Jobs
What you need to know about the Los Angeles Tech Scene
Key Facts About Los Angeles Tech
- Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
- Key Industries: Artificial intelligence, adtech, media, software, game development
- Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
- Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
- Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

_0.png)