Case Study: Dig Security
Dig Security is a cloud data security startup of 50+ employees that provides real-time visibility, control, and protection of data assets.
Main Benefits of Using VictoriaMetrics
- Easy to use & maintain
- The ability to handle billions of time series events at any point of time
- Multiple K8s clusters to monitor
- Consistent monitoring infrastructure for each cluster across multiple regions and clouds
- Secure communication and data storage
- Easy Retention
Challenge
We started with a Prometheus server on EKS. That worked until it didn't.
We then spent time scaling it, maintaining it, throwing more $ at it, until we stumbled across VictoriaMetrics.
What we looked for:
- Reducing costs by not using a managed solution of one of the big clouds
- Support HA / High Availability & fast recovery
- No downtime
- Having our main Prometheus using too much RAM and causing too many restarts
Solution
With VictoriaMetrics we found the following solution:
- The API is compatible with Prometheus & all standard PromQL queries work well out of the box
- Handles storage well
- Available to use in Grafana easily
- Single & small executable
- Easy & fast backups
- Better benchmarks than all the competitors
- Open source & maintained with good community
Why VictoriaMetrics Was Chosen Over Other Solutions
- Usage in High Scale for HA
- Fast Performance
- Highly Efficient
- Cost effective, Highly recommended!
- Can serve us in the future in even bigger scales
- It’s seamless and doesn't cause override-complications for our Infrastructure Team
Next up: Collect more and more metrics, we will grow lots more still to-do
Technical Stats
sum(median_over_time(process_resident_memory_bytes[24h]))
sum(rate(process_cpu_seconds_total[24h]))
sum(max_over_time(vm_cache_entries{type="storage/hour_metric_ids"}[24h]))
sum(increase(vm_new_timeseries_created_total[24h]))
sum(rate(vm_rows_inserted_total[24h]))
sum(vm_rows{type=~"storage/.+"})
sum(vm_rows{type="indexdb"}))
sum(vm_data_size_bytes{type=~"storage/.+"})
sum(vm_data_size_bytes{type="indexdb"})
sum(vm_data_size_bytes) / sum(vm_rows{type=~"storage/.+"})
sum(rate(vm_http_requests_total{path=~".*/api/v1/query_range"}[24h]))