Spotify’s performance & control across large monitoring environments with VictoriaMetrics

When your active time series number in the billions and the data points you need to monitor run into the tens of trillions, you need a high-performance observability solution that is also simple to operate.

Streaming behemoth Spotify is one such case. Their observability team chose VictoriaMetrics as the fastest monitoring and observability solution on the market.

Spotify’s challenges

Spotify needed to replace its legacy in-house time series database (Heroic), which had become outdated, difficult to maintain, and inefficient at scale.

The goal was to implement a modern time-series database (TSDB) that could efficiently handle large-scale metric ingestion and querying, improve dashboard and alert performance, reduce operational overhead, and align with open observability standards and tooling such as Prometheus, OpenTelemetry (OTel), and Grafana.

Difficulties Spotify’s observability team faced:

  • Stability and performance limitations in its previous in-house TSDB, leading to query delays and timeouts
  • Limited feature parity with modern observability systems
  • A bespoke, closed-source architecture that restricted community support and maintainability
  • Growing maintenance overhead as team familiarity with the legacy system decreased
  • Latency issues with the existing alert engine
  • Inconsistent metric models and difficulty handling high-cardinality data
  • Limited compatibility with Prometheus and related open standards

Spotify evaluated multiple vendors and technologies before selecting VictoriaMetrics.

The alternative systems they tested during the evaluation phase showed limitations in scalability, compatibility with existing tooling, and flexibility of deployment models.

VictoriaMetrics is “a robust, efficient, and flexible platform aligned with Spotify’s operational and architectural requirements”

Lauren Roshore, Engineering Manager, Observability

Spotify’s observability team had several evaluation criteria:

  • Performance (data ingestion and query speed)
  • Scalability for large, distributed workloads
  • Cost efficiency (storage, licensing)
  • Flexibility between self-managed and managed deployment models
  • Compatibility with open-source standards
  • Alerting infrastructure compatibility
  • Operational maintainability

Among the many observability solutions on the market, VictoriaMetrics came out on top as the best fit for Spotify’s scalability and performance goals.

Outcome of VictoriaMetrics adoption

“Spotify’s transition to VictoriaMetrics has resulted in significant performance improvements across its monitoring stack, greater efficiency in engineering operations, and enhanced scalability to support future growth.”

Lauren Roshore, Engineering Manager, Observability

The solution provided a robust, efficient, and flexible platform aligned with the team’s operational and architectural requirements.

Some of the key benefits VictoriaMetrics now brings to Spotify’s observability:

  • Significant improvements in data ingestion and query performance
  • Prometheus-compatible APIs and query language
  • Simplified architecture for easier deployment and management
  • Enhanced data retention and cost efficiency through downsampling and control features
  • Support for both cloud and self-hosted deployments, offering high operational visibility
  • Scalable, performant alerting infrastructure
  • A predictable and transparent licensing model
  • Noticeable improvements in dashboard responsiveness and alert evaluation times
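
To make the "Prometheus-compatible APIs and query language" point concrete, here is a minimal sketch of what an instant query against the Prometheus-style `/api/v1/query` endpoint could look like. It is not Spotify's setup: the base URL assumes a local single-node VictoriaMetrics instance on its default port, and the metric name, the `job` label value, and the sample response are hypothetical. The sketch only builds the request URL and parses a canned response, so it runs without a server.

```python
import json
from urllib.parse import urlencode

# Assumption: a local single-node instance on the default port 8428.
BASE_URL = "http://localhost:8428"


def build_instant_query_url(base_url: str, promql: str) -> str:
    """Build a URL for the Prometheus-compatible instant query endpoint."""
    return f"{base_url}/api/v1/query?{urlencode({'query': promql})}"


def extract_values(response_body: str) -> dict:
    """Map each series' label set to its value in a Prometheus-style vector response."""
    payload = json.loads(response_body)
    if payload.get("status") != "success":
        raise ValueError(f"query failed: {payload.get('error', 'unknown error')}")
    values = {}
    for series in payload["data"]["result"]:
        # Flatten the label set into a stable, readable key.
        key = ",".join(f"{k}={v}" for k, v in sorted(series["metric"].items()))
        _timestamp, raw_value = series["value"]
        values[key] = float(raw_value)  # sample values arrive as strings
    return values


# Hypothetical query and canned response, for illustration only.
QUERY = "sum(rate(http_requests_total[5m])) by (job)"
SAMPLE_RESPONSE = json.dumps({
    "status": "success",
    "data": {
        "resultType": "vector",
        "result": [
            {"metric": {"job": "hypothetical-service"}, "value": [1700000000, "42"]}
        ],
    },
})

url = build_instant_query_url(BASE_URL, QUERY)
values = extract_values(SAMPLE_RESPONSE)
print(url)
print(values)
```

Because the endpoint and response shape follow the Prometheus HTTP API, existing dashboards and alerting rules written in PromQL can generally be pointed at such a backend with little or no change, which is the compatibility benefit the list describes.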

Spotify is not stopping there. Its plans for VictoriaMetrics and observability in the coming months and years include UX and alert-annotation enhancements for a better on-call experience, anomaly detection in time-series data for advanced analytics, adoption of OTel, and stronger integration between reliability tooling (SLOs) and VictoriaMetrics/Grafana.

If you want to learn more about Spotify’s observability journey, join us for our quarterly meetup on December 18, 2025. At the meetup, Spotify’s Observability Engineering Manager, Lauren Roshore, will explain “How & why we use VictoriaMetrics”.

