Observability Engineer – Grafana, Prometheus, Thanos

Neshent Tech

Chandler, AZ

Posted On: Mar 25, 2026

Posted On: Mar 25, 2026

Job Overview

Job Type

Full-time

Experience

6 - 12 Years

Salary

Depends on Experience

Work Arrangement

On-Site

Travel Requirement

0%

Required Skills

  • observability
  • ML/LLM
  • Data Analysis/Visualization
  • Grafana
  • Tableau
  • SQL
  • PromQL
  • Prometheus
Job Description
Roles and Responsibilities
  • Design and implement observability frameworks for ML/LLM applications
  • Build telemetry for AI models including latency, token usage, throughput, error rates, and SLOs
  • Develop and maintain self-service observability dashboards
  • Monitor model performance, data drift, reliability, and cost metrics
  • Create dashboards using Grafana and Tableau for operational insights
  • Implement monitoring using Prometheus and Thanos for scalable metrics collection
  • Analyze time-series data and build actionable visualizations.
  • Partner with ML engineers and platform teams to improve system reliability
  • Define and track SLOs for AI endpoints and services
  • Enable end-to-end observability using metrics, logs, and traces
Required Skills
  • Strong experience in Data Analysis and Visualization
  • Hands-on experience with Grafana dashboard creation
  • Experience with Tableau, Grafana, Prometheus, and Thanos stack
  • Strong knowledge of SQL and time-series data
  • Experience with PromQL
  • Working knowledge of Linux environments
  • Expertise in building telemetry dashboards
  • Understanding of different visualization graphs and charts
  • Experience monitoring production systems and observability pipelines
Preferred Qualifications
  • Experience with ML/AI observability
  • Knowledge of LLM monitoring metrics (tokens, latency, hallucination tracking, etc.)
  • Experience defining SLOs/SLIs
  • Familiarity with distributed tracing and logging frameworks
  • Experience with large-scale observability platforms

Job ID: NT220824


Posted By

Abhishek

Resource Manager


Related Jobs

  • Company
  • COMPANY

    Neshent Tech

  • Company
  • experience

    6 - 12 Years

  • Travel Requirements
  • Work Arrangement

    On-Site

  • Wallet
  • SALARY

    Depends on Experience

  • Skills
  • SKILLS

    • observability
    • ML/LLM
    • Data Analysis/Visualization
    • Grafana
    • +4 more

Posted On: Mar 25, 2026

  • Contract - W2

  • Company
  • COMPANY

    Long Finch Technology

  • Company
  • experience

    9 - 14 Years

  • Travel Requirements
  • Work Arrangement

    On-Site

  • Wallet
  • SALARY

    Depends on Experience

  • Skills
  • SKILLS

    • Core Java
    • Spring/Spring Boot
    • Kubernetes
    • Kafka
    • +4 more

Posted On: Mar 10, 2026

  • Full-time

  • Company
  • COMPANY

    2T Consulting

  • Company
  • experience

    7 - 10 Years

  • Travel Requirements
  • Work Arrangement

    On-Site

  • Wallet
  • SALARY

    Depends on Experience

  • Skills
  • SKILLS

    • Kafka Operations
    • Grafana
    • Prometheus
    • Splunk
    • +1 more

Posted On: Feb 02, 2026

  • Contract - W2
  • Contract - Independent

  • Company
  • COMPANY

    Neshent Tech

  • Company
  • experience

    8 - 15 Years

  • Travel Requirements
  • Work Arrangement

    Hybrid

  • Wallet
  • SALARY

    Depends on Experience

  • Skills
  • SKILLS

    • DevOps
    • Grafana
    • Prometheus
    • OpenTelemetry
    • +3 more

Posted On: Nov 21, 2025

  • Full-time

  • Company
  • COMPANY

    Neshent Tech

  • Company
  • experience

    7 - 12 Years

  • Travel Requirements
  • Work Arrangement

    On-Site

  • Wallet
  • SALARY

    $100,000 - $120,000 Per Year

  • Skills
  • SKILLS

    • Grafana
    • Prometheus
    • LogQL
    • Unix/Linux
    • +2 more

Posted On: Oct 10, 2025