All jobs

[Remote] Senior AI Engineer - Grafana Ops, AI/ML | USA | Remote

100% Remote Full-time Open now

Note: The job is a remote job and is open to candidates in USA. Grafana Labs, the company behind the open observability cloud, is seeking a Senior AI Engineer to develop AI-driven features that enhance observability tools. The role involves collaborating with cross-functional teams to build and deliver AI solutions that improve incident response and automate analysis tasks.

Responsibilities

  • Build and deliver AI solutions: Take ownership of developing high-performance AI features to help users detect, triage, and resolve incidents using observability data and tools
  • Rapid experimentation and iteration: Implement a highly iterative process where you quickly prototype, test, and validate with real users, including shipping and evolving LLM- or agent-powered workflows for incident lifecycle management and automated analysis tasks
  • Collaborate cross-functionally: Work with data analysts, product managers, and designers to shape AI-driven product features, including integration of agentic components with internal tools, alerting systems, runbooks, and developer workflows
  • Utilize AI tools effectively: Use AI and automation tools to enhance both product functionality and your own development workflows
  • Effective communication: You’ll be working in a highly dynamic and collaborative environment, so we need someone who can communicate effectively and contribute across teams
  • Ownership and impact: Take full ownership of the AI solutions you develop, ensuring they are not only innovative but also scalable, maintainable, and aligned with real user workflows

Skills

  • Experience with LLMs, prompt engineering, and building applications powered by GenAI
  • Proven track record of delivering software that made it into production and is actively used by users
  • Exposure to working in cloud-native environments (e.g., AWS, GCP, Azure)
  • Experience using observability tools to understand and troubleshoot system behavior
  • Experience building or working with agent frameworks or multi‑agent workflows
  • Experience with infrastructure / devops related tooling: Kubernetes, Docker, Terraform or similar for deployments
  • Familiarity with model fine-tuning techniques
  • Experience building observability tooling

Benefits

  • Benefits include equity, bonus (if applicable) and other benefits listed [here](https://grafana.com/about/careers/#jobs).
  • All of our roles include Restricted Stock Units (RSUs), giving every team member ownership in Grafana Labs' success.
  • 100% Remote, Global Culture -As a remote-only company, we bring together talent from around the world, united by a culture of collaboration and shared purpose.
  • Career Growth Pathways – Defined opportunities to grow and develop your career.
  • In-Person onboarding - We want you to thrive from day 1 with your fellow new ‘Grafanistas’ to learn all about what we do and how we do it.
  • We operate a global annual leave policy of 30 days per annum. 3 days of your annual leave entitlement are reserved for Grafana Shutdown Days to allow the team to really disconnect. **We will comply with local legislation where applicable.

Company Overview

  • Grafana Labs is an open-source software platform built to support monitoring, visualization, and metric analytics. It was founded in 2014, and is headquartered in New York, New York, USA, with a workforce of 1001-5000 employees. Its website is http://grafana.com.
  • Company H1B Sponsorship

  • Grafana Labs has a track record of offering H1B sponsorships, with 2 in 2025, 1 in 2022, 1 in 2020. Please note that this does not guarantee sponsorship for this specific role.
  • Apply To This Job

    You might also like