Job Description:
• Own the security, reliability, and observability of the platform
• Establish and maintain monitoring and alerting systems
• Conduct security audits and reviews across the platform
• Build automation and tooling that reduces operational toil
• Design alerting strategies and incident response systems
• Implement security controls and manage CI/CD pipeline security
Requirements:
• 8+ years of experience building, securing, and operating complex distributed systems at scale
• Deep expertise in security, reliability, and observability
• Proficiency in Go microservices
• Familiarity with Google Cloud Platform (Cloud Run, Cloud Build, Pub/Sub, Cloud Storage)
• Knowledge of PostgreSQL, Redis, Terraform
• Experience with incident response processes, runbooks, and postmortem practices
• Ability to conduct security audits and threat modeling for distributed systems
Benefits:
• Comprehensive health insurance packages with dependent coverage
• Opportunities for career advancement and development
• Enjoy the flexibility of a fully remote work environment
• Access to employee wellness programs designed to support your overall well-being