About the position
We are seeking a highly experienced Infrastructure Architect with deep expertise in Azure Cloud, Observability Platforms, and Enterprise Architecture. This is a senior-level architectural role requiring hands-on technical leadership in designing scalable, secure, and high-performance cloud environments.
The ideal candidate will bring strong experience in Azure infrastructure, observability stacks (New Relic, Prometheus, LGTM), Kubernetes, infrastructure-as-code, and CI/CD integration, along with the ability to translate business requirements into future-ready architecture solutions.
Responsibilities
• Design and implement scalable, secure, and high-performance enterprise architecture solutions
• Lead observability strategy, including dashboards, alerts, logs, and reliability metrics
• Architect and develop LGTM stack dashboards for real-time monitoring and visualization
• Define and implement SLOs, SLIs, alerting mechanisms, and reliability KPIs
• Integrate multiple APIs and data sources to enhance monitoring and analytics capabilities
• Drive Azure cloud architecture, ensuring scalability, resilience, and security
• Integrate code scanning tools into CI/CD pipelines (GitHub Actions, Azure DevOps, Jenkins)
• Architect and support containerized environments using Docker and Kubernetes (AKS)
• Lead infrastructure automation using Terraform, Ansible, and other IaC tools
• Document architecture designs, migration workflows, and best practices
• Mentor engineering teams and provide technical guidance and code reviews
• Evaluate and recommend tools, frameworks, and technologies aligned with business goals
• Collaborate with cross-functional stakeholders to align architecture with strategic objectives
• Support migration initiatives (including observability platform transitions such as New Relic to LGTM, where applicable)
Requirements
• 5+ years in an Architectural role
• 10+ years of Azure Infrastructure experience
• 6+ years working with Azure CLI and PowerShell
• 8+ years of experience with observability tools such as New Relic, LGTM stack, or Prometheus
• 3+ years defining and implementing SLO/SLI metrics and alerting frameworks
• 6+ years of infrastructure-as-code (Terraform, Ansible)
• 6+ years of containerization and orchestration (Docker/Kubernetes, including AKS)
• 6+ years in Agile development environments
• 6+ years defining solutions, designing architecture, and deploying applications on Azure
• 3+ years integrating APIs and multiple data sources for analytics dashboards
• Experience with New Relic APM, NRQL queries, dashboards, alerts, and logs
• Experience integrating static code analysis tools (e.g., CodeQL) into CI/CD pipelines
• Strong understanding of microservices architecture
• Strong communication, presentation, and documentation skills
• Ability to align technical decisions with long-term business strategy
• Confident decision-maker with strong understanding of trade-offs and scalability
• Strong stakeholder management and collaboration skills
• Analytical mindset with proactive risk identification and resolution
• Ability to mentor teams and provide constructive technical feedback
• Strong organizational and time management skills
Nice-to-haves
• Hands-on experience in .NET or other programming languages
• Experience executing migration from New Relic to LGTM stack