
Job title: Senior DevOps Engineer – Edwin AI
Company: LogicMonitor
Job description: About Us:We love going to work and think you should too. Our team is dedicated to trust, customer obsession, agility, and striving to be better everyday. These values serve as the foundation of our culture, guiding our actions and driving us towards excellence. We foster a culture of performance and recognition, allowing us to transform growth as we enable our employees to do the best work of their careers.We are seeking a highly skilled Senior DevOps Engineer having 4+ years of experience to drive innovation, reliability, and security across our cloud infrastructure on the Edwin AI team at LogicMonitor. The ideal candidate has hands-on experience managing multi-cloud environments, automating infrastructure, and implementing modern DevOps practices that improve system performance, scalability, and cost efficiency.Here’s a closer look at this key role:
- Multi-Cloud Enablement: Expand and manage application hosting across AWS, Azure, and Google Cloud, ensuring performance, flexibility, and resilience.
 - Infrastructure as Code (IaC): Develop and maintain Terraform or similar installers for Azure and GCP to fully automate infrastructure deployments.
 - Cost Optimization: Design and implement AWS cost optimization strategies, including reserved instances, right-sizing, and resource efficiency initiatives.
 - Cloud Security: Strengthen infrastructure security with robust access controls, encryption, monitoring, and alerting frameworks.
 - Observability: Build and enhance monitoring platforms with Grafana dashboards and Prometheus alerts for real-time performance insights and proactive issue resolution.
 - Kubernetes Management: Implement Role-Based Access Control (RBAC) and optimize Ingress controllers (Traefik or similar) for enhanced security and delivery resilience.
 
What You’ll Need:
- 4+ years of experience in DevOps or similar roles
 - Proven experience with AWS, Azure, and GCP in production environments.
 - Strong expertise in Infrastructure as Code practices.
 - Solid knowledge of Kubernetes (EKS), container orchestration, and cluster security.
 - Hands-on experience with Grafana, Prometheus, and alerting/monitoring systems.
 - Understanding of network connectivity over the private link endpoint, VPC, cross-account vpc connectivity, how to make things accessible internally, externally, etc.
 - Experience in deploying automated Canary and Integration testing pipelines,CI/CD pipeline etc..
 - Exposing internal self-hosted services like LangFuse via WebUI for internal users using Traefik or Ingress controller or any other tool
 - Experience in deployment of LLM related solutions that require MCP, LangFuse, Airflow, GraphDB, VectorDB, Redis etc…
 - Experience working with developers on on-demand JIT access to Prod clusters to troubleshoot/debug issues with tools like Teleport or some other.
 - Strong background in cloud security, access management, and encryption.
 - Proficiency in Python and Bash scripting for automation.
 
 Residents of California, click to view our California Applicant Privacy Notice.Anticipated Application Close Date: 10/08/25LogicMonitor is an Equal Opportunity Employer
       At LogicMonitor, we believe that innovation thrives when every voice is heard and each individual is empowered to bring their unique perspective. We’re committed to creating a workplace where diversity is celebrated, and all employees feel inspired and supported to contribute their best.For us, equal opportunity means fostering a truly inclusive culture where everyone has the chance to grow and succeed. We don’t just open doors; we invite you to step through and be part of something bigger. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.#LI-JP1 #LI-Hybrid #BI-Hybrid
Expected salary:
Location: San Francisco, CA
Job date: Sun, 12 Oct 2025 03:16:38 GMT
Apply for the job now!
