github-mirrors/VictoriaMetrics

mirror of https://github.com/VictoriaMetrics/VictoriaMetrics.git synced 2024-11-21 14:44:00 +00:00

Author	SHA1	Message	Date
Andrii Chubatiuk	2da45a8368	vmagent: updated dashboard and alert for stream aggregation (#6427 ) ### Describe Your Changes Added streaming aggregation section to vmagent dashboards Added alert for streaming aggregation and deduplication flush timeouts Removed deprecated compose versions from compose files Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2024-06-10 11:49:00 +02:00
hagen1778	9dd9b4442f	dashboards: use `$__interval` variable for offsets and look-behind windows in annotations This should improve precision of `restarts` and `version change` annotations when zooming-in/zooming-out on the dashboards. The change also makes `restarts` dashboard visible on the panels, so user can disable it from displaying if needed. This could be useful when restarts overlap with version change events. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-05-22 16:32:51 +02:00
hagen1778	c746ba154d	deployment/dashboards: fix `AnnotationQueryRunner` error in Grafana The error appears when executing annotations query against Prometheus backend because the query itself hasn't specified look-behind window (which is allowed in VictoriaMetrics query engine). https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6309 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-05-21 11:39:02 +02:00
hagen1778	9256df17fa	deployment: bump Grafana version to 10.4.2 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-04-29 12:10:24 +02:00
Aliaksandr Valialkin	4927e64700	all: replace remaining https://docs.victoriametrics.com/vmagent.html urls with the new one - https://docs.victoriametrics.com/vmagent/	2024-04-18 01:36:13 +02:00
hagen1778	0ab1069363	dashboards: update links in various panels * use docs.victoriametrics.com instead of github docs * add links to common terms used in VictoriaMetrics Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-03-04 15:43:31 +01:00
hagen1778	487a94565b	dashboards/all: add new panel `CPU spent on GC` It should help identifying cases when too much CPU is spent on garbage collection, and advice users on how this can be addressed. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-02-02 16:21:21 +01:00
hagen1778	29a9b31584	dashboards: add `Targets scraped/s` A new stat panel shows the number of targets scraped by the vmagent per-second. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-02-02 15:48:26 +01:00
hagen1778	db11b94e30	dashboards: update to grafana/grafana:10.3.1 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-02-02 15:41:08 +01:00
Hui Wang	97373b7786	vmagent: add `vm_promscrape_scrape_pool_targets` for scrape jobs like… (#5335 ) * vmagent: export `vm_promscrape_scrape_pool_targets` metric to track the number of targets that each scrape_job discovers * add extra panel for new metric	2023-12-06 15:44:39 +08:00
Aliaksandr Valialkin	aefd744abb	dashboards: remove `path!="/favicon.ico"` filter from `requests rate` graphs The `path!="/favicon.ico"` filter has little sense, since there are many other special paths, which may be filtered out - /metrics, /flags, /health, /ping, /robots.txt, /-/healthy, /-/ready, /reload, etc. See /lib/httpserver/httpserver.go for more details. It will be hard or impossible to maintain filters for all these paths, so it is better to drop this filter in order to simplify queries and improve the consistency of these queries.	2023-11-16 19:28:49 +01:00
hagen1778	d3ae2b2f62	dashboards: update description for RSS and anonymous memory panels to be consistent for single-node, cluster and vmagent dashboards. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-11-14 09:50:06 +01:00
Roman Khavronenko	a4bd73ec7e	lib/promscrape: make concurrency control optional (#5073 ) * lib/promscrape: make concurrency control optional Before, `-maxConcurrentInserts` was limiting all calls to `promscrape.Parse` function: during ingestion and scraping. This behavior is incorrect. Cmd-line flag `-maxConcurrentInserts` should have effect onl on ingestion. Since both pipelines use the same `promscrape.Parse` function, we extend it to make concurrency limiter optional. So caller can decide whether concurrency should be limited or not. This commit makes `c53b5788b4` obsolete. Signed-off-by: hagen1778 <roman@victoriametrics.com> * Revert "dashboards: move `Concurrent inserts` panel to Troubleshooting section" This reverts commit `c53b5788b4`. --------- Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-02 21:32:11 +02:00
Aliaksandr Valialkin	859977d591	Revert "lib/promscrape: add metric `vm_promscrape_scrapes_skipped_total` (#5074 )" This reverts commit `74301cdbf5`. Reason for revert: vmagent already provides better approach for detecting slow scrape targets via the following query: scrape_duration_seconds / scrape_timeout_seconds > 1 This query depends on automatically generated per-target metrics. See https://docs.victoriametrics.com/vmagent.html#automatically-generated-metrics for more details. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5074	2023-10-02 20:59:56 +02:00
Roman Khavronenko	74301cdbf5	lib/promscrape: add metric `vm_promscrape_scrapes_skipped_total` (#5074 ) * lib/promscrape: add metric `vm_promscrape_scrapes_skipped_total` add metric `vm_promscrape_scrapes_skipped_total`to show whether vmagent skips the scrapes. This could happen if vmagent is overloaded or target is responding too slow for configured `scrape_interval`. The follow-up commit should add a corresponding alerting rule and panel to vmagent dashboard. Signed-off-by: hagen1778 <roman@victoriametrics.com> * deployment/docker: add `TooManyScrapeSkips` alerting rule for vmagent Signed-off-by: hagen1778 <roman@victoriametrics.com> * dashboards: add panels `Scrape duration 0.99 quantile` and `Skipped scrapes` to vmagent dashboard Signed-off-by: hagen1778 <roman@victoriametrics.com> --------- Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-02 17:12:12 +02:00
hagen1778	c53b5788b4	dashboards: move `Concurrent inserts` panel to Troubleshooting section Moved because this panel is related to both: scraped and ingested data. Before, it could have give a misleading impression that it is related to ingested metrics only. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-09-26 14:26:40 +02:00
hagen1778	481a2c70fd	dashboard: fix display of ingested rows rate Fix display of ingested rows rate for `Samples ingested/s` and `Samples rate` panels for vmagent's dasbhoard. Previously, not all ingested protocols were accounted in these panels. An extra panel `Rows rate` was added to `Ingestion` section to display the split for rows ingested rate by protocol. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-08-15 08:45:10 +02:00
hagen1778	e311a7bf80	dashboards: add `Concurrent inserts` panel to vmagent's dasbhoard The new panel supposed to show whether the number of concurrent inserts processed by vmagent isn't reaching the limit. The panel contains recommendation what to do if limit is reached. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-08-03 10:46:25 +02:00
Roman Khavronenko	3eebe52a06	Dashboards upd (#3942 ) * dashboards/cluser: use `quantile` since `median` isn't supported by PromQL Signed-off-by: hagen1778 <roman@victoriametrics.com> * dashboards/: add `restarts` annotation to show when there were restarts The cluster's annotation query is aggregated `by job`, while vmagent/vmalert are aggregated `by job, instance`. This is because cluster dashboard can contains too many instances and annotation could become too noisy. Signed-off-by: hagen1778 <roman@victoriametrics.com> dashboards/*: support instance filter in Version annotation Signed-off-by: hagen1778 <roman@victoriametrics.com> --------- Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-03-10 17:13:19 +01:00
Roman Khavronenko	9f1403db38	dashboards: add non-default flags panel for vmagent (#3453 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-12-07 12:22:20 +01:00
Roman Khavronenko	7dfb01bd7b	dashboards: update vmagent dash (#3411 ) The change list is the following: * bump Grafana version to 9.2.6; * add version change annotations; * switch to per-job panels instead of per-instance; * add drilldown option for resource usage panels. Signed-off-by: hagen1778 <roman@victoriametrics.com> Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-11-29 19:22:13 +01:00
Timur Bakeyev	9ad578214e	Update `datasource` entries consistently contain type `prometheus` and uid `$ds`. (#3393 ) Co-authored-by: Timour I. Bakeev <tbakeev@ripe.net>	2022-11-28 08:37:39 +01:00
Roman Khavronenko	b4410b1c63	Dashboards (#3120 ) * dashboards/cluster: few updates * apply consistent formatting across panels; * make resource usage panels per component more detailed; * add extra panels to vmselect for displaying `vm_rows_read_per_query`, `vm_rows_scanned_per_query`, `vm_rows_read_per_series` and `vm_series_read_per_query` metrics. Signed-off-by: hagen1778 <roman@victoriametrics.com> * dashboards/single: few updates * apply consistent formatting across panels; * add extra panels to Performance for displaying `vm_rows_read_per_query`, `vm_rows_scanned_per_query`, `vm_rows_read_per_series` and `vm_series_read_per_query` metrics. Signed-off-by: hagen1778 <roman@victoriametrics.com> * dashboards/vmagent: few updates * apply consistent formatting across panels; * add panels for showing number of samples ingested or scraped; * adapt resource usage panels for multiple selected jobs/instances; * add adhoc variable; * display vmagent's version in Stats. Signed-off-by: hagen1778 <roman@victoriametrics.com> * dashboards/vmalert: few updates * apply consistent formatting across panels; * adapt resource usage panels for multiple selected jobs/instances; * show vmalert version in Stats section. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-09-16 21:24:32 +02:00
Roman Khavronenko	27f1c65074	vmagent: expose metric `vmagent_remotewrite_queues` (#2871 ) The new metric `vmagent_remotewrite_queues` exports a static value of number of configured remote write queus. This metric is useful to calculate total saturation per each configured URL with given number of queues. See corresponding changes to vmagent alerts and dashboard. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-07-18 14:31:35 +03:00
Roman Khavronenko	3960fecac2	dashboards: small visual tweaks for vmagent's dashboard (#2828 ) * remove lines filling * filter series with zero values * update descriptions Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-07-05 11:05:35 +02:00
Roman Khavronenko	4c1fbcd6b0	Single dashboards (#2492 ) * dashboards: remove index filter from stats panel for DiskUsage The diskUsage stats panel was showing disk usage without including size of the index, which is not correct. The filter was removed to reflect the total disk usage. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2368 Signed-off-by: hagen1778 <roman@victoriametrics.com> * dashboards: add adhoc filter to dasbhoard variables The adhoc filter allows to quickly apply global filters without modifying the panels. Signed-off-by: hagen1778 <roman@victoriametrics.com> * dashboards: add new panel `IndexDB items rate` The new panel supposed to reflect the pressure on indexDB caused by churn rate or new series registration. Signed-off-by: hagen1778 <roman@victoriametrics.com> * dashboards: rm "Deferred merges" panel since it could be misleading See more context here https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1682#issuecomment-938608067 Signed-off-by: hagen1778 <roman@victoriametrics.com> * dashboards: replace fixed interval of `5m` for `rate` expressions Before we used fixed `5m` interval for expressions with `rate` func. Unfortunately, this interval wasn't a fit for all the cases. So we switch to `$__rate_interval` instead. Signed-off-by: hagen1778 <roman@victoriametrics.com> * dashboards: bump version requirement Signed-off-by: hagen1778 <roman@victoriametrics.com> * dashboards: rm `vm_indexdb_items_added_size_bytes_total` expression Rate over `vm_indexdb_items_added_size_bytes_total` doesn't seem to be useful on the dasbhoard panel. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-04-24 23:27:56 +03:00
Roman Khavronenko	e29b2b8444	Monitoring single (#2190 ) * dashboards: plot cpu limits for vmagent, vmalert and vm-single dashboards Signed-off-by: hagen1778 <roman@victoriametrics.com> * alerts: add `TooHighCPUUsage` alert for all VM components Signed-off-by: hagen1778 <roman@victoriametrics.com> * dashboards: bump components version requirements Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-02-15 11:54:28 +02:00
Roman Khavronenko	871528fedb	dashboards/vmagent: fix cached datasource uid (#1984 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2021-12-20 17:32:41 +02:00
Roman Khavronenko	bc79bdf68a	Dashboards vmagent updates (#1973 ) * dashboards/vmagent: shuffle panels for better visibility More important error/dropped panels were moved higher on the main row. Network usage panel moved to Resource usage row. Signed-off-by: hagen1778 <roman@victoriametrics.com> * dashboards/vmagent: add Troubleshooting row to show top 5 instances/jobs by churn rate New panels are supposed to show top 5 jobs or targets which generate the most of the churn rate. They were placed into a new row "Troubleshooting". Signed-off-by: hagen1778 <roman@victoriametrics.com> * dashboards/vmagent: add panels for showing persistent queue saturation New panels were added to Torubleshooting row to show the persistent queue saturation. The corresponding alerts were added and linked to these panels as well. Signed-off-by: hagen1778 <roman@victoriametrics.com> * dashboards/vmagent: add alert "RejectedRemoteWriteDataBlocksAreDropped" New alert suppose to send a notification when vmagent starts to drop data blocks rejected by configured remote write destiantion. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2021-12-20 12:16:53 +02:00
Aliaksandr Valialkin	802f05f73f	dashboards: consistently use regexp filters for template vars (#1798 ) Template vars may contain regexp when `all` is selected (.*) or when multiple values are selected (foo\|bar). So they must be passed to regexp filters.	2021-11-09 16:50:21 +02:00
Roman Khavronenko	ea8f625b53	dashboards: add cardnilaity limiter panels for vmagent (#1720 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2021-10-18 19:15:33 +03:00
Roman Khavronenko	867e426070	dashboards: bump vmagent version requirement	2021-09-01 14:20:50 +03:00
Roman Khavronenko	0f4bcc00b2	Single dashboards upd (#1593 ) * dasbhoard: replace `null` datasources null datasource value may confuse Grafana and make it drop panel query in some versions. * docker: bump grafana image version * dashboards: add URL variable selector to vmagent dashboard * dashboards: add new panel `Remote write connection saturation` to vmagent dashboard * alerts: add new alert for `Remote write connection saturation` panel of vmagent dashboard * dashboards: add "Logging rate" panel to vmagent dashboard	2021-09-01 11:46:22 +03:00
Roman Khavronenko	a90012ef26	dashboard: bump version requirements (#1378 )	2021-06-14 13:31:59 +03:00
Roman Khavronenko	b8526e88d3	Dashboard single (#1374 ) * dashboard: update single version dash The update contains the following changes: * display anonymous memory usage metric. This metric suppose to reflect memory usage of the process which can't be freed by OS; * add legends to all panels. This is important for cases when users share the screenshots; * modify panels for Grafana v8.0.0 * dashboard: update single version dash tags * dashboard: update vmagent dash The update contains the following changes: * display anonymous memory usage metric. This metric suppose to reflect memory usage of the process which can't be freed by OS; * add legends to all panels. This is important for cases when users share the screenshots; * modify panels for Grafana v8.0.0	2021-06-14 13:03:23 +03:00
Aliaksandr Valialkin	6bc52fe41a	all: rename https://victoriametrics.github.io to https://docs.victoriametrics.com	2021-04-20 20:16:17 +03:00
Roman Khavronenko	b1e49bab52	Dashboards update (#1153 ) * dashboard: update single node dashboard * add number of new series created over last 24h; * bump version requirements. * dashboard: update vmagent dashboard * add panel for open file descriptors; * add panel for disk I/O; * add panel for `vmagent_remotewrite_packets_dropped_total` metric; * bump version requirements.	2021-03-29 12:37:17 +03:00
Roman Khavronenko	ed899ca9e8	Single dashboards update (#736 ) * dashboard: rename var `datasource` to `ds` for consistency reason Dasbhoards for cluster version or vmagent operate with datasource variable named `ds`. For consistency sake we rename this variable in single node version as well. * dashboard: add instance variable picker See dashboard reviews here https://grafana.com/grafana/dashboards/10229/reviews * dashboard: limit number of buckets in histogram to 12 for vmagent dashboard * dashboard: bump version requirement in description for single version * dashboard: drop extra series override for single version * dashboard: set Y-min to zero for most of panels in vmagent dashboard	2020-09-02 15:16:40 +03:00
Roman Khavronenko	dfa156e6aa	vmagent: update grafana dashboard (#634 ) * reference datasource variable instead of datasource name; * change unit from `bytes` to `bits/s` for Network panel.	2020-07-17 02:11:20 +03:00
Roman Khavronenko	9eb71dda3d	vmagent: add grafana dashboard (#629 ) `vmagent` Grafana dashboard suppose to provide basic observability over multiple `vmagent` instances. Dashboard is saved in Grafana export format so it can be easily imported. It was also integrated into docker-compose environment.	2020-07-15 13:56:06 +03:00

40 commits