github-mirrors/VictoriaMetrics

mirror of https://github.com/VictoriaMetrics/VictoriaMetrics.git synced 2024-11-21 14:44:00 +00:00

Author	SHA1	Message	Date
hagen1778	3380043424	dashboards: follow-up `4369bc1df2` * add more details to changelog * simplify panels description * remove capacity planning recommendation, as it proves it incompetent Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-02-08 09:51:43 +01:00
Hui Wang	4369bc1df2	deployment/dashboards: fix `Storage full ETA` panels (#5747 ) During background downsampling, rate(vm_deduplicated_samples_total{type="merge"}) could be much bigger than rate(vm_rows_added_to_storage_total) and it could last quite some time, which causes negative values of Storage full ETA and confuses users, see playground. Instead of trying to get more accurate results during downsampling, I think it's ok to ignore vm_deduplicated_samples_total at all, it's more reasonable to see Storage full ETA increase after downsampling. --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-02-08 09:43:39 +01:00
hagen1778	487a94565b	dashboards/all: add new panel `CPU spent on GC` It should help identifying cases when too much CPU is spent on garbage collection, and advice users on how this can be addressed. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-02-02 16:21:21 +01:00
hagen1778	29a9b31584	dashboards: add `Targets scraped/s` A new stat panel shows the number of targets scraped by the vmagent per-second. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-02-02 15:48:26 +01:00
hagen1778	db11b94e30	dashboards: update to grafana/grafana:10.3.1 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-02-02 15:41:08 +01:00
hagen1778	02492bc1a4	dashboards/single: fix typo in query for `version` annotation The typo falsely produced many version change events. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-01-31 09:13:46 +01:00
hagen1778	c23e8bee89	dashboards: specify where to see details about dropped labels Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-01-29 07:37:51 +01:00
hagen1778	b25ef138ce	dashboards: reflect dashboard rename in copy script This is a follow-up for `ff33e60a3d` Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-01-22 16:51:24 +01:00
hagen1778	b0287867fe	deployment/dashboards: change title `VictoriaMetrics` to `VictoriaMetrics - single-node` The new title should provide better understanding of this dashboard purpose. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-01-16 20:39:52 +01:00
hagen1778	463455665b	dashboards: update cluster dashboard * add panels for detailed visualization of traffic usage between vmstorage, vminsert, vmselect components and their clients. New panels are available in the rows dedicated to specific components. * update "Slow Queries" panel to show percentage of the slow queries to the total number of read queries served by vmselect. The percentage value should make it more clear for users whether there is a service degradation. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-01-08 11:58:31 +01:00
Dmytro Kozlov	935bec447b	app/vmalert: replace error metrics for gauges with counter metrics (#5217 ) See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5160 Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2023-12-06 19:39:35 +01:00
Hui Wang	97373b7786	vmagent: add `vm_promscrape_scrape_pool_targets` for scrape jobs like… (#5335 ) * vmagent: export `vm_promscrape_scrape_pool_targets` metric to track the number of targets that each scrape_job discovers * add extra panel for new metric	2023-12-06 15:44:39 +08:00
Aliaksandr Valialkin	aefd744abb	dashboards: remove `path!="/favicon.ico"` filter from `requests rate` graphs The `path!="/favicon.ico"` filter has little sense, since there are many other special paths, which may be filtered out - /metrics, /flags, /health, /ping, /robots.txt, /-/healthy, /-/ready, /reload, etc. See /lib/httpserver/httpserver.go for more details. It will be hard or impossible to maintain filters for all these paths, so it is better to drop this filter in order to simplify queries and improve the consistency of these queries.	2023-11-16 19:28:49 +01:00
hagen1778	d389a4fcf3	dashboards: use `version` instead of `short_version` in annotations `version` label won't show the difference if various flavors of the same version were deployed. But `short_version` will. For example, on the sandbox env we test VM builds before new version release. Without this change, the version update won't be visible on dashboard. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-11-16 09:26:47 +01:00
hagen1778	d3ae2b2f62	dashboards: update description for RSS and anonymous memory panels to be consistent for single-node, cluster and vmagent dashboards. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-11-14 09:50:06 +01:00
hagen1778	d6ae082598	deployment/dashboards: respect `job` and `instance` filters for `alerts` annotation in cluster and single-node dashboards Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-11-14 09:38:15 +01:00
hagen1778	f6208965ce	dashboards/cluster: fix description about `max` threshold for `Concurrent selects` panel. Before, it was mistakenly implying that `max` is equal to the double of available CPUs. Addresses https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5214 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-31 16:05:33 +01:00
hagen1778	aaf9e3d526	dashboards/vmalert: add new panel `Missed evaluations` The new panel supposed to indicate alerting groups that miss their evaluations. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-31 10:35:19 +01:00
hagen1778	8874b525b7	dashboards: fix `Errors rate to Alertmanager` filter The panel `Errors rate to Alertmanager` had `group` label filter applied to the expression, while the metric `vmalert_alerts_send_errors_total` doesn't have that label. This resulted into always empty results. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-31 10:16:45 +01:00
hagen1778	c2d252c045	dashboards/vmalert: respect job and instance filters in `No data errors` Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-17 09:40:39 +02:00
hagen1778	edba9f6266	dashboards/vmalert: use `desc` sorting for tooltips on panels Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-17 09:31:09 +02:00
hagen1778	d43566605b	dasbhoards: fix vminsert/vmstorage/vmselect metrics filtering Fix vminsert/vmstorage/vmselect metrics filtering when dashboard is used to display data from many sub-clusters with unique job names. Before, only one specific job could have been accounted for component-specific panels, instead of all available jobs for the component. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-11 12:09:04 +02:00
Roman Khavronenko	a4bd73ec7e	lib/promscrape: make concurrency control optional (#5073 ) * lib/promscrape: make concurrency control optional Before, `-maxConcurrentInserts` was limiting all calls to `promscrape.Parse` function: during ingestion and scraping. This behavior is incorrect. Cmd-line flag `-maxConcurrentInserts` should have effect onl on ingestion. Since both pipelines use the same `promscrape.Parse` function, we extend it to make concurrency limiter optional. So caller can decide whether concurrency should be limited or not. This commit makes `c53b5788b4` obsolete. Signed-off-by: hagen1778 <roman@victoriametrics.com> * Revert "dashboards: move `Concurrent inserts` panel to Troubleshooting section" This reverts commit `c53b5788b4`. --------- Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-02 21:32:11 +02:00
Aliaksandr Valialkin	859977d591	Revert "lib/promscrape: add metric `vm_promscrape_scrapes_skipped_total` (#5074 )" This reverts commit `74301cdbf5`. Reason for revert: vmagent already provides better approach for detecting slow scrape targets via the following query: scrape_duration_seconds / scrape_timeout_seconds > 1 This query depends on automatically generated per-target metrics. See https://docs.victoriametrics.com/vmagent.html#automatically-generated-metrics for more details. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5074	2023-10-02 20:59:56 +02:00
Roman Khavronenko	74301cdbf5	lib/promscrape: add metric `vm_promscrape_scrapes_skipped_total` (#5074 ) * lib/promscrape: add metric `vm_promscrape_scrapes_skipped_total` add metric `vm_promscrape_scrapes_skipped_total`to show whether vmagent skips the scrapes. This could happen if vmagent is overloaded or target is responding too slow for configured `scrape_interval`. The follow-up commit should add a corresponding alerting rule and panel to vmagent dashboard. Signed-off-by: hagen1778 <roman@victoriametrics.com> * deployment/docker: add `TooManyScrapeSkips` alerting rule for vmagent Signed-off-by: hagen1778 <roman@victoriametrics.com> * dashboards: add panels `Scrape duration 0.99 quantile` and `Skipped scrapes` to vmagent dashboard Signed-off-by: hagen1778 <roman@victoriametrics.com> --------- Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-02 17:12:12 +02:00
hagen1778	c53b5788b4	dashboards: move `Concurrent inserts` panel to Troubleshooting section Moved because this panel is related to both: scraped and ingested data. Before, it could have give a misleading impression that it is related to ingested metrics only. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-09-26 14:26:40 +02:00
hagen1778	0c60228fea	dashboards/victoriametrics: account for instance filter in annotations Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-09-20 14:50:03 +02:00
Artem Navoiev	f04eb762c1	add annotation to VictoriaLogs dashboards - restarts and version change (#5008 ) Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2023-09-15 15:12:23 +02:00
Artem Navoiev	fef0c232e8	Update VL daashboard. Add Resource Section, add ds and job filters, a… (#4981 ) * Update VL daashboard. Add Resource Section, add ds and job filters, add metric collection in docker compose from victorialogs, fix networkigs usage in docker compose Signed-off-by: Artem Navoiev <tenmozes@gmail.com> * add vl dashboard to docker compose Signed-off-by: Artem Navoiev <tenmozes@gmail.com> * add vl dashboard to docker compose Signed-off-by: Artem Navoiev <tenmozes@gmail.com> --------- Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2023-09-10 15:04:07 +02:00
Roman Khavronenko	e8db78eaa4	dashboards: provide copies of Grafana dashboards alternated with Vict… (#4905 ) dashboards: provide copies of Grafana dashboards alternated with VictoriaMetrics datasource Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-08-29 11:06:55 +02:00
hagen1778	481a2c70fd	dashboard: fix display of ingested rows rate Fix display of ingested rows rate for `Samples ingested/s` and `Samples rate` panels for vmagent's dasbhoard. Previously, not all ingested protocols were accounted in these panels. An extra panel `Rows rate` was added to `Ingestion` section to display the split for rows ingested rate by protocol. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-08-15 08:45:10 +02:00
hagen1778	d890038a94	dashboards: correctly calculate `Bytes per point` value Correctly calculate `Bytes per point` value for single-server and cluster VM dashboards. Before, the calculation mistakenly accounted for the number of entries in indexdb in denominator, which could have shown lower values than expected. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-08-03 16:22:50 +02:00
hagen1778	c47138e1b0	dashboards: add panels for absoulte value of mem and cpu usage by vmalert See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4627 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-08-03 11:14:14 +02:00
hagen1778	e311a7bf80	dashboards: add `Concurrent inserts` panel to vmagent's dasbhoard The new panel supposed to show whether the number of concurrent inserts processed by vmagent isn't reaching the limit. The panel contains recommendation what to do if limit is reached. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-08-03 10:46:25 +02:00
Aliaksandr Valialkin	f35d27aa2b	app/vlstorage: expose vl_data_size_bytes metric at /metrics page for tracking the on-disk data size (both indexdb and the data itself)	2023-07-31 07:56:53 -07:00
Zakhar Bessarab	6f3fee197e	dashboards/cluster: fix using storage filter for cache usage panel (#4657 ) Using `job=~$job_storage` forces "Cache usage" panel to display only vmstorage caches, but there is a cache peresent at vmselect(`promql/rollupResult`). Updated selector to match generic `$job` so that all caches will be displayed with an option to display per-job caches. Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2023-07-18 11:40:40 +02:00
Artem Navoiev	b024e46284	Add docker compose examples: filebeat(docker, syslog), fluentbit(docker), logstash, vector(docker) Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2023-06-21 03:59:31 -07:00
Roman Khavronenko	ccaa9571ef	Dashboard upd (#4438 ) dashboards: update dashboard for single-node version * add anonymous mem usage panel; * add syscall rate panel; * add location to logs panel; * update legend for panels to reflect instance name; * update queries to aggregate per instance. dashboards: update dashboard for cluster version * add syscall rate panel; * add drilldown to logs panel. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-06-12 15:58:47 +02:00
Aliaksandr Valialkin	91533531f5	docs/Troubleshooting.md: document an additional case, which could result in slow inserts If `-cacheExpireDuration` is lower than the interval between ingested samples for the same time series, then vm_slow_row_inserts_total` metric is increased. See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3976#issuecomment-1476883183	2023-03-20 13:28:36 -07:00
Roman Khavronenko	3eebe52a06	Dashboards upd (#3942 ) * dashboards/cluser: use `quantile` since `median` isn't supported by PromQL Signed-off-by: hagen1778 <roman@victoriametrics.com> * dashboards/: add `restarts` annotation to show when there were restarts The cluster's annotation query is aggregated `by job`, while vmagent/vmalert are aggregated `by job, instance`. This is because cluster dashboard can contains too many instances and annotation could become too noisy. Signed-off-by: hagen1778 <roman@victoriametrics.com> dashboards/*: support instance filter in Version annotation Signed-off-by: hagen1778 <roman@victoriametrics.com> --------- Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-03-10 17:13:19 +01:00
Roman Khavronenko	2e153b68cd	dashboards: account for indexdb size in Bytes-per-Point panel (#3884 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-02-28 17:47:52 +01:00
Roman Khavronenko	b209d4ace0	dashboards: use `median` instead of `avg` (#3800 ) `avg` can be affected by just one outlier, which may lead to false conclusions. `median` is supposed to reflect reality better by leveling outliers out. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-02-11 10:01:30 -08:00
Aliaksandr Valialkin	88fed0232c	dashboards: typo fix `Datapoints scanned per series` -> `Datapoints scanned per query`	2023-02-03 19:12:33 -08:00
Roman Khavronenko	ec7c3f45ba	dashboards: bump operator dash to v9 of Grafana (#3642 ) Signed-off-by: hagen1778 <roman@victoriametrics.com> Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-01-12 16:31:26 +01:00
Roman Khavronenko	b3a70b8284	dasbhoards: fix the tooltip info for 1.86 (#3628 ) See `c63755c316 (diff-bba263a473e7fbc9d0fde075ebef6b3d4e32c322ee1210a3e07182292c7723aaR18)` Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-01-11 11:30:12 +01:00
Aliaksandr Valialkin	c63755c316	lib/writeconcurrencylimiter: improve the logic behind -maxConcurrentInserts limit Previously the -maxConcurrentInserts was limiting the number of established client connections, which write data to VictoriaMetrics. Some of these connections could be idle. Such connections do not consume big amounts of CPU and RAM, so there is a little sense in limiting the number of such connections. So now the -maxConcurrentInserts command-line option limits the number of concurrently executed insert requests, not including idle connections. It is recommended removing -maxConcurrentInserts command-line option, since the default value for this option should work good for most cases.	2023-01-06 22:20:19 -08:00
Thomas Danielsson	9d1104d812	dashboards: fix operator datasource variable (#3604 ) Got "Failed to upgrade legacy queries Datasource $ds was not found" in Grafana on operator dashboard. It's datasource variable was incorrectly named `datasource`. Also made the rest of the dashboards have homogeneous datasource-variable names and selections, matching vmagent dashboard.	2023-01-05 14:59:56 +01:00
Roman Khavronenko	9d0e1f8e68	dashboards: add backupmanager dashboard (#3599 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-01-04 17:26:15 +01:00
Roman Khavronenko	e40c7d6efa	dashboards: respect $job var in sub-vars for cluster dash (#3487 ) Previously, $job_select, $job_storage and $job_insert didn't respect the $job filter. This change updates the variable queries to account for set $job variable. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-12-16 09:53:32 +01:00
Roman Khavronenko	eb275be99d	dashboards: add VersionChange annotation (#3473 ) The new annotation is hidden by default and suppose to show component `short_version` label change on the panels. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-12-12 16:32:26 +01:00
Roman Khavronenko	0b6b6d52bf	dashboards: remove DataLinks from single version (#3456 ) Those data links were copy&paste artifact from cluster version and aren't needed on the dash. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-12-07 14:35:52 +01:00
Roman Khavronenko	9f1403db38	dashboards: add non-default flags panel for vmagent (#3453 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-12-07 12:22:20 +01:00
Aliaksandr Valialkin	f3e84b4dea	{dashboards,alerts}: subtitute `{type="indexdb"}` with `{type=~"indexdb.*"}` inside queries after `8189770c50` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3337	2022-12-05 16:00:22 -08:00
Roman Khavronenko	6801b37e53	dashboards: add `Disk space usage %` and `Disk space usage % by type` panels (#3436 ) The new panels have been added to the vmstorage and drilldown rows. `Disk space usage %` is supposed to show disk space usage percentage. This panel is now also referred by `DiskRunsOutOfSpace` alerting rule. This panel has Drilldown option to show absolute values. `Disk space usage % by type` shows the relation between datapoints and indexdb size. It supposed to help identify cases when indexdb starts to take too much disk space. This panel has Drilldown option to show absolute values. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-12-05 08:35:33 +01:00
Roman Khavronenko	f989c20dd7	dashboards: fix typo in data link (#3426 ) Fixes a missing `&` char in data link for ETA panel on cluster dashboards. Without `&` char it generates wrong link when click on Drilldown menu. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-12-01 13:21:14 +01:00
Roman Khavronenko	bdd0683c4a	dashboards: update VM single dash (#3400 ) The change list is the following: * bump Grafana version to 9.2.6; * replace old "Graph" panel with "TimeSeries" panel; * show % usage of Mem and CPU additionally to of absolute values; * `Caches` row was removed. All needed info for caches is now part of `Troubleshooting`; * add Annotations for Alert triggers. Not all alerts are supposed to be displayed on the dashboard, but only those with label `show_at: dashboard`. See `alerts.yml` change. Signed-off-by: hagen1778 <roman@victoriametrics.com> Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-11-29 19:28:22 +01:00
Roman Khavronenko	5d835a6d64	dashboards: update vmalert dash (#3404 ) The change list is the following: * bump Grafana version to 9.2.6; * replace old Graph panel with TimeSeries panel; * add RemoteWrite section; * allow configuring topK elements for some of the panels; * Preer grouping by job instead of grouping by instance. Signed-off-by: hagen1778 <roman@victoriametrics.com> Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-11-29 19:26:31 +01:00
Roman Khavronenko	7dfb01bd7b	dashboards: update vmagent dash (#3411 ) The change list is the following: * bump Grafana version to 9.2.6; * add version change annotations; * switch to per-job panels instead of per-instance; * add drilldown option for resource usage panels. Signed-off-by: hagen1778 <roman@victoriametrics.com> Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-11-29 19:22:13 +01:00
Roman Khavronenko	31ff26065b	dashboards: update VM cluster dash (#3401 ) The change list is the following: * bump Grafana version to 9.2.6; * remove artifacts in data links. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-11-28 14:13:00 +01:00
Timur Bakeyev	9ad578214e	Update `datasource` entries consistently contain type `prometheus` and uid `$ds`. (#3393 ) Co-authored-by: Timour I. Bakeev <tbakeev@ripe.net>	2022-11-28 08:37:39 +01:00
Roman Khavronenko	42e63fe0fd	dashboards: cleanup & remove artifacts (#3387 ) * some unexpected DS UIDs were removed; * replace `$instance.` filter with `$instance` since we respect the instance port anyway; remove predefined datasource for `clusterbytenant` in favour of datasource variable `ds`. Signed-off-by: hagen1778 <roman@victoriametrics.com> Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-11-25 09:28:14 +01:00
Roman Khavronenko	3407006cdb	dashboards: cluster dashboard update (#3380 ) The purpose of the update is to make the dash more usable for large installations with many instances. Panels which showed metrics per-instance (Mem, CPU) now are showing metrics per-job or min/max/avg aggregations in % instead. This supposed to help immediately to identify resource shortage and remain usable for small and big installations. For cases when detailed info is needed, to the bottom of the dashboard a new row `Drilldown` was added. Panels like Mem or CPU now contain a `data-link` named `Drilldown` (cis shown on line click) which takes user to more detailed panel. The change list is the following: * bump Grafana version to 9.1.0; * replace old "Graph" panel with "TimeSeries" panel; * improve Uptime panel to show number of instances per job; * show % usage of Mem and CPU instead of absolute values; * `Caches` row was removed. All needed info for caches is now part of `Troubleshooting`; * add `Drilldown` section for detailed resource usage; * add Annotations for Alert triggers. Not all alerts are supposed to be displayed on the dashboard, but only those with label `show_at: dashboard`. See `alerts-cluster.yml` change. Signed-off-by: hagen1778 <roman@victoriametrics.com> Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-11-23 18:03:25 -08:00
Roman Khavronenko	908fe6a623	dashboards: replace `Index size` panel with `Active series` (#3157 ) Panel `Index size` showed itself impractical for users. So replacing it with `Active series` panel. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/776#issuecomment-1255823734 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-09-25 21:49:18 +02:00
Roman Khavronenko	b4410b1c63	Dashboards (#3120 ) * dashboards/cluster: few updates * apply consistent formatting across panels; * make resource usage panels per component more detailed; * add extra panels to vmselect for displaying `vm_rows_read_per_query`, `vm_rows_scanned_per_query`, `vm_rows_read_per_series` and `vm_series_read_per_query` metrics. Signed-off-by: hagen1778 <roman@victoriametrics.com> * dashboards/single: few updates * apply consistent formatting across panels; * add extra panels to Performance for displaying `vm_rows_read_per_query`, `vm_rows_scanned_per_query`, `vm_rows_read_per_series` and `vm_series_read_per_query` metrics. Signed-off-by: hagen1778 <roman@victoriametrics.com> * dashboards/vmagent: few updates * apply consistent formatting across panels; * add panels for showing number of samples ingested or scraped; * adapt resource usage panels for multiple selected jobs/instances; * add adhoc variable; * display vmagent's version in Stats. Signed-off-by: hagen1778 <roman@victoriametrics.com> * dashboards/vmalert: few updates * apply consistent formatting across panels; * adapt resource usage panels for multiple selected jobs/instances; * show vmalert version in Stats section. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-09-16 21:24:32 +02:00
Max Golionko	7da9443686	moved cluster dashboard to master (#3074 ) dashboards: move cluster dashboard to master branch This change should simplify dashboards management.	2022-09-06 16:19:43 +02:00
Roman Khavronenko	289a4862ba	dashboards: add `Cache usage %` panel to Caches row (#2964 ) https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2941 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-08-08 19:37:34 +03:00
Roman Khavronenko	27f1c65074	vmagent: expose metric `vmagent_remotewrite_queues` (#2871 ) The new metric `vmagent_remotewrite_queues` exports a static value of number of configured remote write queus. This metric is useful to calculate total saturation per each configured URL with given number of queues. See corresponding changes to vmagent alerts and dashboard. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-07-18 14:31:35 +03:00
Roman Khavronenko	3960fecac2	dashboards: small visual tweaks for vmagent's dashboard (#2828 ) * remove lines filling * filter series with zero values * update descriptions Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-07-05 11:05:35 +02:00
Artem Navoiev	cd7fb05b7c	dashboards: update cluster by tenant dashboard (#2695 ) Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2022-06-09 10:39:30 +02:00
Nikolay	cbfc1b7eb8	dashboards: adds dashboard for operator (#2621 ) Apply suggestions from code review Co-authored-by: Roman Khavronenko <roman@victoriametrics.com> Adds proper interval to rate functions	2022-05-23 11:32:51 +03:00
Roman Khavronenko	4c1fbcd6b0	Single dashboards (#2492 ) * dashboards: remove index filter from stats panel for DiskUsage The diskUsage stats panel was showing disk usage without including size of the index, which is not correct. The filter was removed to reflect the total disk usage. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2368 Signed-off-by: hagen1778 <roman@victoriametrics.com> * dashboards: add adhoc filter to dasbhoard variables The adhoc filter allows to quickly apply global filters without modifying the panels. Signed-off-by: hagen1778 <roman@victoriametrics.com> * dashboards: add new panel `IndexDB items rate` The new panel supposed to reflect the pressure on indexDB caused by churn rate or new series registration. Signed-off-by: hagen1778 <roman@victoriametrics.com> * dashboards: rm "Deferred merges" panel since it could be misleading See more context here https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1682#issuecomment-938608067 Signed-off-by: hagen1778 <roman@victoriametrics.com> * dashboards: replace fixed interval of `5m` for `rate` expressions Before we used fixed `5m` interval for expressions with `rate` func. Unfortunately, this interval wasn't a fit for all the cases. So we switch to `$__rate_interval` instead. Signed-off-by: hagen1778 <roman@victoriametrics.com> * dashboards: bump version requirement Signed-off-by: hagen1778 <roman@victoriametrics.com> * dashboards: rm `vm_indexdb_items_added_size_bytes_total` expression Rate over `vm_indexdb_items_added_size_bytes_total` doesn't seem to be useful on the dasbhoard panel. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-04-24 23:27:56 +03:00
Roman Khavronenko	ea86716d06	dashboards: add row Caches to single node dasbhoard (#2208 ) The new row Caches adds more visibility for cache utilization by VM. It replaces the old `Cache size` panel. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-02-18 13:40:19 +02:00
Roman Khavronenko	445edcc6ac	dashboards: update the threshold for slow inserts % on the dashboard (#2197 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-02-15 21:56:53 +02:00
Roman Khavronenko	e29b2b8444	Monitoring single (#2190 ) * dashboards: plot cpu limits for vmagent, vmalert and vm-single dashboards Signed-off-by: hagen1778 <roman@victoriametrics.com> * alerts: add `TooHighCPUUsage` alert for all VM components Signed-off-by: hagen1778 <roman@victoriametrics.com> * dashboards: bump components version requirements Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-02-15 11:54:28 +02:00
Roman Khavronenko	871528fedb	dashboards/vmagent: fix cached datasource uid (#1984 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2021-12-20 17:32:41 +02:00
Roman Khavronenko	52a3b2d77e	Dashboards vmsingle (#1980 ) * dashboards/vmsingle: add "Merges deferred" panel The new panel supposed to show if there were deferred merges due to insufficient disk space. It goes within alerting rule which suppose to send a signal in such cases. Signed-off-by: hagen1778 <roman@victoriametrics.com> * dashboards/vmsingle: add "Cache usage" panel The new panel supposed to show the % of the used cache compared to allowed size by type. It should help to determine underutilized types of caches. Signed-off-by: hagen1778 <roman@victoriametrics.com> * dashboards/vmsingle: bump version requirement Signed-off-by: hagen1778 <roman@victoriametrics.com> * dashboards/vmsingle: rm alert for `vm_merge_need_free_disk_space` Signed-off-by: hagen1778 <roman@victoriametrics.com>	2021-12-20 17:28:35 +02:00
Roman Khavronenko	bc79bdf68a	Dashboards vmagent updates (#1973 ) * dashboards/vmagent: shuffle panels for better visibility More important error/dropped panels were moved higher on the main row. Network usage panel moved to Resource usage row. Signed-off-by: hagen1778 <roman@victoriametrics.com> * dashboards/vmagent: add Troubleshooting row to show top 5 instances/jobs by churn rate New panels are supposed to show top 5 jobs or targets which generate the most of the churn rate. They were placed into a new row "Troubleshooting". Signed-off-by: hagen1778 <roman@victoriametrics.com> * dashboards/vmagent: add panels for showing persistent queue saturation New panels were added to Torubleshooting row to show the persistent queue saturation. The corresponding alerts were added and linked to these panels as well. Signed-off-by: hagen1778 <roman@victoriametrics.com> * dashboards/vmagent: add alert "RejectedRemoteWriteDataBlocksAreDropped" New alert suppose to send a notification when vmagent starts to drop data blocks rejected by configured remote write destiantion. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2021-12-20 12:16:53 +02:00
Aliaksandr Valialkin	802f05f73f	dashboards: consistently use regexp filters for template vars (#1798 ) Template vars may contain regexp when `all` is selected (.*) or when multiple values are selected (foo\|bar). So they must be passed to regexp filters.	2021-11-09 16:50:21 +02:00
Roman Khavronenko	ea8f625b53	dashboards: add cardnilaity limiter panels for vmagent (#1720 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2021-10-18 19:15:33 +03:00
Roman Khavronenko	867e426070	dashboards: bump vmagent version requirement	2021-09-01 14:20:50 +03:00
Roman Khavronenko	0f4bcc00b2	Single dashboards upd (#1593 ) * dasbhoard: replace `null` datasources null datasource value may confuse Grafana and make it drop panel query in some versions. * docker: bump grafana image version * dashboards: add URL variable selector to vmagent dashboard * dashboards: add new panel `Remote write connection saturation` to vmagent dashboard * alerts: add new alert for `Remote write connection saturation` panel of vmagent dashboard * dashboards: add "Logging rate" panel to vmagent dashboard	2021-09-01 11:46:22 +03:00
Roman Khavronenko	eff940aa76	Vmalert metrics update (#1580 ) * vmalert: remove `vmalert_execution_duration_seconds` metric The summary for `vmalert_execution_duration_seconds` metric gives no additional value comparing to `vmalert_iteration_duration_seconds` metric. * vmalert: update config reload success metric properly Previously, if there was unsuccessfull attempt to reload config and then rollback to previous version - the metric remained set to 0. * vmalert: add Grafana dashboard to overview application metrics * docker: include vmalert target into list for scraping * vmalert: extend notifier metrics with addr label The change adds an `addr` label to metrics for alerts_sent and alerts_send_errors to identify which exact address is having issues. The according change was made to vmalert dashboard. * vmalert: update documentation and docker environment for vmalert's dashboard Mention Grafana's dashboard in vmalert's README in a new section #Monitoring. Update docker-compose env to automatically add vmalert's dashboard. Update docker-compose README with additional info about services.	2021-08-31 12:28:02 +03:00
Roman Khavronenko	a38a6fe8ad	dashboard: move panel `Disk writes/reads` to `Resource usage` row (#1417 ) * dashboard: move panel `Disk writes/reads` to `Resource usage` row * dashboard: make Stats panel consistent with Cluster dashboard	2021-07-01 05:46:26 +03:00
Roman Khavronenko	a90012ef26	dashboard: bump version requirements (#1378 )	2021-06-14 13:31:59 +03:00
Roman Khavronenko	b8526e88d3	Dashboard single (#1374 ) * dashboard: update single version dash The update contains the following changes: * display anonymous memory usage metric. This metric suppose to reflect memory usage of the process which can't be freed by OS; * add legends to all panels. This is important for cases when users share the screenshots; * modify panels for Grafana v8.0.0 * dashboard: update single version dash tags * dashboard: update vmagent dash The update contains the following changes: * display anonymous memory usage metric. This metric suppose to reflect memory usage of the process which can't be freed by OS; * add legends to all panels. This is important for cases when users share the screenshots; * modify panels for Grafana v8.0.0	2021-06-14 13:03:23 +03:00
Aliaksandr Valialkin	6bc52fe41a	all: rename https://victoriametrics.github.io to https://docs.victoriametrics.com	2021-04-20 20:16:17 +03:00
Roman Khavronenko	b955fe0038	dashboard: use unit `short` for `Labels limit exceeded` panel (#1227 )	2021-04-19 13:33:21 +03:00
Roman Khavronenko	f80156d9df	dashboard: fix avg GC duration expression (#1228 ) Previous expression was not correct.	2021-04-19 13:28:41 +03:00
Artem Navoiev	e9ee2122df	[draft] per tenant statistic (#121 ) * [draft] per tenant statistic * updates metric name update graph adds link and example config * quick fix * adds grafana dashboard adds example alert Co-authored-by: f41gh7 <nik@victoriametrics.com>	2021-04-14 11:23:07 +03:00
Aliaksandr Valialkin	edd1590ac7	dashboards/victoriametrics.json: typo fix: `chur rate` -> `churn rate`	2021-04-08 09:35:50 +03:00
Roman Khavronenko	b1e49bab52	Dashboards update (#1153 ) * dashboard: update single node dashboard * add number of new series created over last 24h; * bump version requirements. * dashboard: update vmagent dashboard * add panel for open file descriptors; * add panel for disk I/O; * add panel for `vmagent_remotewrite_packets_dropped_total` metric; * bump version requirements.	2021-03-29 12:37:17 +03:00
Roman Khavronenko	b457739f87	Single dashboard (#1126 ) * dashboard: update single node dashboard * add panel `Open FDs` for file descriptors metrics; * add panel `Disk writes/reads` to show the real read/write load on storage layer; * add `process_resident_memory_bytes` metric to memory usage panel; * add stats panel to show available CPUs, memory and disk space; * rm flags panel since it didn't prove its usefulness. * alerts: add alert for reaching FDs limit	2021-03-15 12:04:24 +02:00
Roman Khavronenko	9ca7d76b25	Add `Labels limit exceeded` panel to dashboard (#1072 ) New panel supposed to display events when VM drops extra label on exceeding `maxLabelsPerTimeseries` limit.	2021-02-16 23:38:20 +02:00
Roman Khavronenko	83c0c241a7	dashboard: release to grafana.com (#940 )	2020-12-06 13:34:19 +02:00
Roman Khavronenko	6f0038209c	dashboard: Prometheus compatibility fix for `Storage full ETA` panel (#938 )	2020-12-06 01:20:07 +02:00
John Belmonte	067188501f	dashboard: incorporate dedup rate into storage ETA (#920 ) * dashboard: incorporate dedup rate into storage ETA address #916 * exclude dedups during query and simplify	2020-11-26 10:27:54 +02:00
Roman Khavronenko	50d44d5932	dashboard: add `Storage full ETA` panel (#858 ) * dashboard: add `Storage full ETA` panel The new panel suppose to help to estimate the time needed to run out of free disk space. Thx to @belm0 @hekmon * disable legend for `Storage full ETA` panel	2020-11-01 23:37:31 +02:00
Roman Khavronenko	20311f6065	dashboard: clarify the purpose of `Concurrent flushes on disk` panel (#849 ) Current description led to confusion at https://victoriametrics.slack.com/archives/CGZF1H6L9/p1603270014273800	2020-10-28 18:10:46 +00:00
Roman Khavronenko	ed899ca9e8	Single dashboards update (#736 ) * dashboard: rename var `datasource` to `ds` for consistency reason Dasbhoards for cluster version or vmagent operate with datasource variable named `ds`. For consistency sake we rename this variable in single node version as well. * dashboard: add instance variable picker See dashboard reviews here https://grafana.com/grafana/dashboards/10229/reviews * dashboard: limit number of buckets in histogram to 12 for vmagent dashboard * dashboard: bump version requirement in description for single version * dashboard: drop extra series override for single version * dashboard: set Y-min to zero for most of panels in vmagent dashboard	2020-09-02 15:16:40 +03:00
John Belmonte	67277abecf	use Y-min 0 on Grafana dashboard graphs (#732 )	2020-09-01 19:56:56 +01:00

1 2 3 4

168 commits