github-mirrors/VictoriaMetrics

mirror of https://github.com/VictoriaMetrics/VictoriaMetrics.git synced 2024-11-21 14:44:00 +00:00

Author	SHA1	Message	Date
Roman Khavronenko	1cb7037fc8	Vmalert metrics update (#1580 ) * vmalert: remove `vmalert_execution_duration_seconds` metric The summary for `vmalert_execution_duration_seconds` metric gives no additional value comparing to `vmalert_iteration_duration_seconds` metric. * vmalert: update config reload success metric properly Previously, if there was unsuccessfull attempt to reload config and then rollback to previous version - the metric remained set to 0. * vmalert: add Grafana dashboard to overview application metrics * docker: include vmalert target into list for scraping * vmalert: extend notifier metrics with addr label The change adds an `addr` label to metrics for alerts_sent and alerts_send_errors to identify which exact address is having issues. The according change was made to vmalert dashboard. * vmalert: update documentation and docker environment for vmalert's dashboard Mention Grafana's dashboard in vmalert's README in a new section #Monitoring. Update docker-compose env to automatically add vmalert's dashboard. Update docker-compose README with additional info about services.	2021-09-01 12:19:34 +03:00
Aliaksandr Valialkin	3788d4f4eb	app/vmselect: show useful endpoints when requested `/select/<accountID>/` page	2021-08-29 12:05:14 +03:00
Aliaksandr Valialkin	39bb6bdd79	app/vmselect/promql: add `quantile("phiLabel", phi1, ..., phiN, q)` aggregate function to MetricsQL See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1573	2021-08-27 18:40:25 +03:00
Aliaksandr Valialkin	6e5864b014	docs/{vmgateway,vmbackupmanager}: mention that enterprise binaries are free for download and evaluation	2021-08-27 14:53:48 +03:00
Aliaksandr Valialkin	102ab795f8	docs/vmagent.md: document the ability to load scrape configs from multiple files See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1559	2021-08-26 09:13:54 +03:00
benclive	76816a0193	Remove trailing slash for URLPrefixes with specific path (#1554 )	2021-08-25 13:32:04 +03:00
Aliaksandr Valialkin	335de30083	app/vmselect/promql: `make fmt` after `0078486ea7`	2021-08-23 23:05:34 +03:00
Aliaksandr Valialkin	40b06e84f8	app/vmselect/promql: rename `sign()` function to `sgn()` in order to be consistent with Prometheus See https://github.com/prometheus/prometheus/pull/8457 for details.	2021-08-23 11:46:29 +03:00
Aliaksandr Valialkin	ff4c7c1a3d	docs/vmalert.md: run `make docs-sync` after `9ee3d0378f`	2021-08-21 20:25:26 +03:00
Roman Khavronenko	0c2284b95f	vmalert: add flag `disableAlertgroupLabel` for disabling extra label added to series (#1534 ) The new label added in https://github.com/VictoriaMetrics/VictoriaMetrics/issues/611 may negatively impact deduplication in Alertmanager. The new flag supposed to give an option to disable adding this label. To enable flag just add `-disableAlertgroupLabel` to binary execution command. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1532	2021-08-21 20:23:22 +03:00
Alexander Rickardsson	9e2e9d83a5	vmalert: accept http.StatusOK for remotewrite (#1550 )	2021-08-21 20:23:22 +03:00
Aliaksandr Valialkin	91534057a3	app/vmselect/prometheus: do not extend `[d]` to the detected interval between samples for `first_over_time(m[d])` This is for the sake of consistency with similar change for the last_over_time(m[d]) at `a724229b5d`	2021-08-21 19:56:56 +03:00
Roman Khavronenko	1ccb77904b	vmselect: update `vm_request_duration_seconds` value when request fails (#1537 ) Before, metric `vm_request_duration_seconds` was update only on successful attempts which could be misleading. For example, timeout errors on netstorage request may be not accounted in the metric and won't be visible on dashboards. Using `defer` statement to update the metric after query arguments validation may improve the situation.	2021-08-19 14:07:00 +03:00
Aliaksandr Valialkin	ee1f3414d1	app/vmselect/promql: do not override `[d]` at `last_over_time(m[d])` if `[d]` is smaller than `scrape_interval` Since most users do not expect the overriding of explicitly set `[d]`.	2021-08-19 10:33:10 +03:00
Aliaksandr Valialkin	5d92fafc40	app/vmselect: add `-search.noStaleMarkers` command-line flag for disabling stale markers handling in queries This option allows reducing CPU usage a bit when VictoriaMetrics is used for collecting and processing non-Prometheus data. For example, InfluxDB line protocol, Graphite, OpenTSDB, CSV, etc.	2021-08-18 13:58:06 +03:00
Aliaksandr Valialkin	f21fad53b4	lib/promscrape: add ability to disable sending Prometheus staleness markers with -promscrape.disableStaleMarkers command-line flag This option can be useful when vmagent consumes too much additional memory for staleness markers functionality and when staleness markers aren't needed.	2021-08-18 13:58:05 +03:00
Aliaksandr Valialkin	49886ecbc8	app/vmselect/promql: add bitmap_and(), bitmap_or() and bitmap_xor() functions to MetricsQL Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1541	2021-08-17 13:22:15 +03:00
Aliaksandr Valialkin	38065bec7b	app/vmselect/promql: move common condition to dropStaleNaNs in order to improve code maintainability	2021-08-17 11:00:58 +03:00
Aliaksandr Valialkin	fe8c462044	app/vmalert: mention -remoteWrite.disablePathAppend in the description for -remoteWrite.url	2021-08-16 15:23:39 +03:00
Aliaksandr Valialkin	21974cb571	app/vmalert: follow-up for `2400f85761`	2021-08-16 15:20:35 +03:00
Alexander Rickardsson	d27dc3721b	vmalert: enable configuring explicit path (#1536 ) * vmalert: allow to disable automatically added path to remote write address via disablePathAppend flag * docs: update docs to include remoteWrite.disablePathAppend	2021-08-16 14:58:05 +03:00
Aliaksandr Valialkin	48920bdef8	app/vmagent/remotewrite: expose vmagent_remotewrite_send_duration_seconds_total metric This metric can be used for determining high saturation of every connection to remote storage with an alerting query `rate(vmagent_remotewrite_send_duration_seconds_total) > 0.9s`. This query triggers when a connection is satureated by more than 90%	2021-08-15 13:34:07 +03:00
Aliaksandr Valialkin	5420c3d967	app/vmselect/promql: drop staleness marks before calling rollupConfig.Do This allows dropping staleness marks only once and then calculate multiple rollup functions on the result. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1526	2021-08-15 13:22:26 +03:00
Aliaksandr Valialkin	6c4c54eaad	Revert "app/vmselect/promql: properly handle Prometheus staleness marks in removeCounterResets functions" This reverts commit 94dfcb6747a3b29a11d14e71bea21a2312bb6346. It is better to remove staleness marks (decimal.StaleNaN) before calling rollupConfig.Do, e.g. in preFunc	2021-08-15 13:22:24 +03:00
Aliaksandr Valialkin	af4a306d7b	app/vmselect/promql: properly handle Prometheus staleness marks in removeCounterResets functions Prometheus stalenss marks shouldn't be changed in removeCounterResets. Otherwise they will be converted to an ordinary NaN values, which couldn't be removed in dropStaleNaNs() function later. This may result in incorrect calculations for rollup functions. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1526	2021-08-14 12:45:31 +03:00
Aliaksandr Valialkin	c1f81f08d4	all: add support for Prometheus staleness markers Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1526 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/748 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1509 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1530 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/845	2021-08-13 12:13:15 +03:00
Aliaksandr Valialkin	b35ae791f1	app/vmselect: `make vmui-update` after the commit 4ae14df864a7e327955f44941295a286175423b3	2021-08-11 13:42:53 +03:00
Aliaksandr Valialkin	f60ff85dbe	app/vmui: actualize Dockerfiles	2021-08-11 13:42:53 +03:00
Aliaksandr Valialkin	9eb828b2c2	app/vminsert: add vm_rpc_send_duration_seconds_total metric per each `vminsert->vmstorage` link This metric is useful for determining high link saturation with the following alerting rule: rate(vm_rpc_send_duration_seconds_total) > 0.9s	2021-08-11 11:42:33 +03:00
Aliaksandr Valialkin	90efb5831b	lib/envflag: add a link to docs for -envflag.enable	2021-08-11 10:32:40 +03:00
Yury Molodov	aca2cb245e	vmui: fix layout and add server url by default (#1519 ) * fix: change layout for correctly display big query * fix: set default server from url * fix: change get default server url	2021-08-06 12:16:53 +03:00
Roman Khavronenko	d5ba8248cc	vmalert: expose new metrics for tracking number of produced samples during last evaluation (#1518 ) * vmalert: expose new metrics for tracking number of produced samples during last evaluation Two new metrics were added to track the number of samples produced during the last evaluation: * vmalert_recording_rules_last_evaluation_samples * vmalert_alerting_rules_last_evaluation_samples The gauge type is used to remain consistent with Prometheus metric `prometheus_rule_group_last_evaluation_samples` which is on the group level. However, the counter type was considered as well. Two metrics instead of one are used to make it easier to separate recording and alerting rules. It is likely, number of samples produced by recording rules is more important so people will refer to it more frequently. The expected usage of the new metric is the following: ``` - alert: RecordingRuleReturnsEmptyResults expr: sum(vmalert_recording_rules_last_evaluation_samples) by(recording) < 1 annotations: summary: Recording rule {{$labels.recording}} returns empty results. Please verify expression correctness. ``` Addresses https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1494 * vmalert: rename `vmalert_alerts_error` to `vmalert_alerting_rules_error` to remain consistent with recording rules metrics	2021-08-05 10:02:35 +03:00
Aliaksandr Valialkin	13d438d808	app/vmagent: typo fix in the description for `-remoteWrite.queues`	2021-08-05 10:00:58 +03:00
Aliaksandr Valialkin	b877538622	app/vmagent: follow-up after `fe445f753b` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1491	2021-08-05 09:51:00 +03:00
Omar Ghader	fe445f753b	feature: Add multitenant for vmagent (#1505 ) * feature: Add multitenant for vmagent * Minor fix * Fix rcs index out of range * Minor fix * Fix multi Init * Fix multi Init * Fix multi Init * Add default multi * Adjust naming * Add TenantInserted metrics * Add TenantInserted metrics * fix: remove unused metrics for vmagent * fix: remove unused metrics for vmagent Co-authored-by: mghader <marc.ghader@ubisoft.com> Co-authored-by: Sebastian YEPES <syepes@gmail.com>	2021-08-05 09:44:29 +03:00
Qifei Wan	095bb90879	app/vmalert: update config state metrics if config parsed failed (#1507 )	2021-08-03 16:12:48 +03:00
Aliaksandr Valialkin	60cfa5f100	app/vmselect/promql: add `present_over_time(m[d])` function, which will be available starting from Prometheus 2.29.0 See https://github.com/prometheus/prometheus/releases/tag/v2.29.0-rc.0 and https://github.com/prometheus/prometheus/pull/9097	2021-08-03 12:21:53 +03:00
wusphinx	511e5c2e68	Update TimeSelector.tsx (#1515 ) delete garbled code	2021-08-03 11:14:56 +03:00
Nikolay	3f3ad13753	adds /rules and /alerts api for grafana (#1504 ) Co-authored-by: Aliaksandr Valialkin <valyala@gmail.com>	2021-08-02 17:29:49 +03:00
Aliaksandr Valialkin	99004a6a40	app/vmselect/netstorage: unpack time series data in mostly local big chunks This should improve performance on multi-CPU systems for queries selecting time series with big number of raw samples	2021-07-30 12:26:33 +03:00
Aliaksandr Valialkin	c473d8ffe1	li/storage: re-use the per-day inverted index search code for searching in global index This allows removing a big pile of outdated code for global index search. This may help https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1486	2021-07-30 10:28:20 +03:00
Aliaksandr Valialkin	cbb81c2ce9	app/vmselect/netstorage: do not query Go maps with unsafe string keys, since this breaks in Go 1.17	2021-07-30 10:28:19 +03:00
Aliaksandr Valialkin	b709fa387a	app/vmselect: follow-up for `ed95bc9531` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1493	2021-07-29 09:48:47 +03:00
arnoldyahad	ed95bc9531	Add case prometheus/rules for grafana 8 (#1502 )	2021-07-29 06:15:35 +03:00
assassins	6ab0001a1f	Performance optimization (#1481 ) There are redundant steps	2021-07-28 19:29:22 +03:00
Aliaksandr Valialkin	49bf3abf67	app/vmselect: follow-up for `626073bca8` * Rename -search.maxMetricsPointSearch to -search.maxSamplesPerQuery, so it is more consistent with the existing -search.maxSamplesPerSeries * Move the -search.maxSamplesPerQuery from vmstorage to vmselect, so it could effectively limit the number of raw samples obtained from all the vmstorage nodes * Document the -search.maxSamplesPerQuery in docs/CHANGELOG.md	2021-07-28 18:00:04 +03:00
匠心零度	626073bca8	protection vmselect ,avoid metrics point too much let vmselect cup load very, very high (#1478 ) * protection vmselect…… * protection vmselect…… * protection vmselect…… * All checks have failed,fix Co-authored-by: lirenzuo <lirenzuo@shein.com>	2021-07-28 14:39:35 +03:00
Aliaksandr Valialkin	5d255846ac	all: add `go:build` lines for Go1.17 See https://tip.golang.org/doc/go1.17#gofmt for more details	2021-07-26 15:50:46 +03:00
Aliaksandr Valialkin	3921d8afae	app/vmselect: prevent from possible deadlock when f callback blocks inside RunParallel	2021-07-26 15:50:45 +03:00
Aliaksandr Valialkin	c3e6ce1db9	app/vmselect: `make vmui-update` after `a91d41f12a`	2021-07-26 10:32:01 +03:00
Yury Molodov	401de2dca4	Vmui/query editor (#1472 ) * fix: move request button to server input * feat: add switch for query autocomplete * refactor: rename state for popover open * feat: add detect os by userAgent * fix: change hotkey to run query for mac * fix: change detect mac os * fix: change div to span inside Typography Co-authored-by: yury <yurymolodov@victoriametrics.com>	2021-07-23 21:08:58 +03:00
Aliaksandr Valialkin	b047feeb8b	app/vmselect/promql: properly handle `(a op b) default N` if `(a op b)` returns NaN series The result should be a series with `N` values and `a op b` labels. Previously such series has been removed from the result.	2021-07-16 01:44:24 +03:00
Aliaksandr Valialkin	b92702f6d5	app/vmselect/netstorage: use more scalable algorithm for ditributing the work among among multiple channels on systems with big number of CPU cores	2021-07-16 00:35:36 +03:00
Aliaksandr Valialkin	df117f85bd	app/vmselect: do not track queries with less than 1ms execution time at /api/v1/status/top_queries This should improve the readability and usefullness of the /api/v1/status/top_queries when debugging slow queries or queries that take too much cpu time.	2021-07-15 16:53:35 +03:00
Aliaksandr Valialkin	5830ce2706	app/vmselect/netstorage: add `-search.maxSamplesPerSeries` command-line option for limiting the number of samples a query can process per each series This should prevent from out of memory crashes like in https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1067	2021-07-15 16:53:35 +03:00
Aliaksandr Valialkin	6c42db87a8	app/vmselect/netstorage: improve scalability of series unpacking on multi-CPU systems	2021-07-15 15:40:53 +03:00
Aliaksandr Valialkin	3059e4feec	app/vmui/README.md: typo fix: naviate->navigate	2021-07-15 15:02:56 +03:00
Aliaksandr Valialkin	9add9d86a6	app/vmselect/promql: duration handling improvements in MetricsQL queries - Support durations anywhere in MetricsQL queries. E.g. sum_over_time(m[1h])/1h is equivalent to sum_over_time(m[1h])/3600 - Support durations without suffix. E.g. rate(m[300]) is equivalent to rate(m[5m])	2021-07-12 17:19:32 +03:00
Aliaksandr Valialkin	d98e22fe50	app/vmalert: accept Prometheus-like durations in `interval` config option inside `group` section	2021-07-12 12:36:22 +03:00
Aliaksandr Valialkin	f5fa177141	Revert "app/vmselect: expose vmui at /select/<accountID>/prometheus/vmui additionally to /select/<accountID>/vmui" This reverts commit 885a79def6799f288e14df05b35a12569659ab85. Reason for revert: Grafana doesn't allows accessing /select/<accountID>/prometheus/vmui :(	2021-07-12 09:08:43 +03:00
Aliaksandr Valialkin	ddaa12050d	app/vmselect: expose vmui at /select/<accountID>/prometheus/vmui additionally to /select/<accountID>/vmui The /select/<accountID>/prometheus/vmui is needed for accessing via server-side Prometheus datasource for Grafana. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1413	2021-07-10 12:52:25 +03:00
Aliaksandr Valialkin	0b98f6c7ff	app/vmselect: expose vmui at `/vmselect/<accountID>/vmui/` instead of `/vmselect/<accountID>/prometheus/vmui/` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1413	2021-07-10 12:32:21 +03:00
Aliaksandr Valialkin	98e049ba6d	app/vmui: move source code from https://github.com/VictoriaMetrics/vmui to app/vmui Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1413	2021-07-09 17:13:51 +03:00
Aliaksandr Valialkin	2c5e1cd893	app/vmselect: move web ui from /ui to /select/<accountID>/prometheus/ui This way the UI is available for every tenant (aka accountID) and the UI can automatically determine the needed per-tenant datasource path from page referer. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1413	2021-07-08 13:14:50 +03:00
Aliaksandr Valialkin	acb7a95c64	app/vmselect: follow-up after `aa11ef6d3b` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1413 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4	2021-07-07 17:45:09 +03:00
tony	aa11ef6d3b	add vmui for vmselect component (#1431 ) Co-authored-by: Aliaksandr Valialkin <valyala@gmail.com>	2021-07-07 17:04:23 +03:00
Aliaksandr Valialkin	9c19719ad6	app/{vminsert,vmselect}: export vminsert_request_duration_seconds and vmselect_request_duration_seconds histograms	2021-07-07 13:27:23 +03:00
Aliaksandr Valialkin	ceda2b1df4	lib/httpserver: print full requestURI in httpserver.Errorf This should simplify debugging.	2021-07-07 13:11:29 +03:00
Aliaksandr Valialkin	22c6e64bbc	lib/storage: consistency renaming: tagCache -> tagFiltersCache This improves code readability	2021-07-06 11:03:30 +03:00
Aliaksandr Valialkin	44855f0c9b	app/{vmselect,vmstorage}: clarify the description for `-dedup.minScrapeInterval` command-line flag Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1426	2021-07-02 15:06:41 +03:00
Aliaksandr Valialkin	4d8ab5d9fa	docs/vmagent.md: mention about docker_sd_config support	2021-06-25 20:53:09 +03:00
Aliaksandr Valialkin	856aecae05	app/vmselect/promql: return the last timestamp for the max / min value from `tmax_over_time()` and `tmin_over_time()` function as most users expect	2021-06-23 14:18:37 +03:00
Aliaksandr Valialkin	c18017a9c3	app/vminsert/netstorage: sort the `-storageNode` list passed to `vminsert` nodes This should reduce resource usage (CPU, RAM, disk IO) at vmstorage nodes if the addresses of vmstorage nodes are passed in random order to vminsert nodes.	2021-06-23 14:00:08 +03:00
Nikolay	e03a3d3a36	adds http_sd (#1399 ) * adds http_sd * adds X-Prometheus-Refresh-Interval-Seconds header * Update lib/promscrape/discovery/http/api.go Co-authored-by: Aliaksandr Valialkin <valyala@gmail.com>	2021-06-22 13:42:09 +03:00
Roman Khavronenko	79474baf99	vmctl: add more context to flags description in vm-native mode (#1395 )	2021-06-18 19:20:52 +03:00
Aliaksandr Valialkin	b92d110cad	app/vmselect: log slow requests to all the `/api/v1/*` handlers if their execution time exceeds `-search.logSlowQueryDuration`	2021-06-18 19:07:03 +03:00
Aliaksandr Valialkin	4acc4602b3	app/vmctl: limit JSON line size by 10K samples (#1394 ) This should reduce the maximum memory usage at VictoriaMetrics when importing time series with big number of samples.	2021-06-18 15:41:34 +03:00
Aliaksandr Valialkin	60bc35f550	docs/{vmgateway,vmbackupmanager}: explicitly mention that these components are a part of an enterprise package	2021-06-17 17:19:13 +03:00
Aliaksandr Valialkin	51fc469642	app/vmagent/remotewrite: `go fmt` after `0a796f7c3a`	2021-06-17 13:51:40 +03:00
Zongyang	cf506e300d	Change default value of '-remoteWrite.queues' to cgroup.AvailableCPUs * 2 (#1385 ) * Change default value of '-remoteWrite.queues' to cgroup.AvailableCPUS() * 2 to reduce scrape interval Default value of vmagent option '-remotewrite.queues' is 4 and default size of vmagent ScheudleUnmarshalWorkers is number of CPUs, when available CPUs is much greater than 4, e.g 32, worker are competing push queues which will increase scrape interval and may cause scrape timeout. * Update README and flag description Co-authored-by: xiaozy <xiaozy01@fenbi.com>	2021-06-16 12:37:55 +03:00
Roman Khavronenko	a15c947045	promql: fix `increase_pure` calculation for cases with stale series (#1381 ) Due to staleness handling, increase_pure were using incorrect previous value during calculation in cases where series disappears for period longer than staleness period and then returns back. The fix suppose to account for a real datapoint value before staleness takes place. The fix should remove unexpected spikes while using `increase_pure` for staled series.	2021-06-15 17:37:51 +03:00
Nikolay	e42da47608	adds digital ocean sd (#1376 ) * adds digital ocean sd config * adds digital ocean sd https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1367 * typo fix	2021-06-14 13:19:29 +03:00
Roman Khavronenko	c5f493db8e	Vmalert docs (#1372 ) * vmalert: mention what happens if `for` is set to 0 or omitted * vmalert: add more context to docs	2021-06-14 11:43:01 +03:00
Aliaksandr Valialkin	0672cfffa2	app/vmauth: properly handle http.ErrAbortHandler panic This panic can be raised by the reverseProxy on aborted request to the backend. So handle it (e.g. suppress) at reverseProxy.ServeHTTP call. Do not suppress the panic at lib/httpserver generic HTTP handler, since it may result in an inconsistent state left after the panicking handler. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1353	2021-06-11 12:54:37 +03:00
Roman Khavronenko	f3cb2158a3	vmalert: fix mistake with object reuse while parsing response (#1370 ) * vmalert: fix mistake with object reuse while parsing response During the refactoring, the wrong optimisations was applied in parse function which caused metric fields reset. The change removes optimisation. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1369 * vmalert: add test to cover multiple metrics in one response	2021-06-11 11:30:07 +03:00
John Belmonte	3e79f3994e	spelling fix: synonym (#1363 )	2021-06-11 10:58:48 +03:00
Aliaksandr Valialkin	e8e7f03394	app/vmselect/promql: typo fix in the comment	2021-06-09 18:34:57 +03:00
Aliaksandr Valialkin	247b2a5a08	app/vmauth: improve readability for a config with multiple `src_paths`	2021-06-09 15:38:09 +03:00
Aliaksandr Valialkin	520d62ade2	docs/vmagent.md: mention that vmagent supports scrape targets sharding	2021-06-09 12:30:54 +03:00
Aliaksandr Valialkin	f3749dedba	docs: document rules replay feature for vmalert Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/836 This is a follow-up for `2a259ef5e7`	2021-06-09 12:30:54 +03:00
Roman Khavronenko	5aa7846900	vmalert: support rules backfilling (aka `replay`) (#1358 ) * vmalert: support rules backfilling (aka `replay`) vmalert can `replay` configured rules in the past and backfill results via remote write protocol. It supports MetricsQL/PromQL storage as data source, and can backfill data to remote write compatible storage. Supports recording and alerting rules `replay`. See more details in README. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/836 * vmalert: review fixes * vmalert: readme fixes	2021-06-09 12:30:54 +03:00
Aliaksandr Valialkin	2c6b917749	app/vminsert/netstorage: update storageNode.lastRerouteTime before the rerouting This is needed for reliable detection of storage nodes with recent rerouting	2021-06-08 12:06:32 +03:00
Aliaksandr Valialkin	0d067eb112	app/vminsert/netstorage: tune re-routing algorithm Do not re-route data to unavailable storage node. Send it to the remaining storage nodes instead even if they cannot keep up with the load. This should spread the load more evenly among available storage nodes. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/791 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1054 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1165	2021-06-05 16:23:44 +03:00
Aliaksandr Valialkin	269e35d676	app/{vmagent,vminsert}: follow-up after `2fe045e2a4` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1343	2021-06-04 20:33:22 +03:00
jelmd	d8b46908db	new feature: debug relabeling (#1344 ) * new feature: relabel logging Use scrape_configs[x].relabel_debug = true to log metric names inkl. labels before and after relabeling. After relabeling related metrics get dropped, i.e. not submitted to servers. * vminsert wants relabel logging, too.	2021-06-04 20:33:21 +03:00
Aliaksandr Valialkin	1c09e71f5b	app/vminsert: add `-disableRerouting` command-line flag for disabling re-routing if some vmstorage nodes have lower performance than the others Refactor the rerouting mechanism and make it more resilient to cases when some of vmstorage nodes are temporarily unavailable. Reduce the probability of rerouting storm. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/791 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1054 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1165	2021-06-04 04:33:52 +03:00
Aliaksandr Valialkin	8cdecfc52c	app/vmauth: allow balancing the load among multiple backend nodes by specifying multiple urls in `url_prefix` config	2021-05-29 01:04:22 +03:00
Aliaksandr Valialkin	97de72054e	docs: document `f0c21b6300`	2021-05-27 15:04:13 +03:00
Roman Khavronenko	e183a5c532	vmalert: automatically reload configuration on file change (#1326 ) New flag `-rule.configCheckInterval` defines how often `vmalert` will re-read config file. If it detects any changes, the config will be reloaded. This behaviour is turned off by default. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/512	2021-05-26 12:24:27 +03:00
Aliaksandr Valialkin	a0b001bfec	app/vmselect/netstorage: remove duplicate limiter on concurrent queries It duplicates the `-search.maxConcurrentRequests` limiter.	2021-05-24 19:13:04 +03:00
Aliaksandr Valialkin	890e1bd826	app/vmagent/remotewrite: use WARN level instead of ERROR level for `couldnt send a block with size ... bytes to ...` log message This is really warning, since vmagent re-tries sending the data block until success.	2021-05-24 15:43:32 +03:00
Roman Khavronenko	beee24ecee	vmalert: support `extra_filter_labels` setting per-group (#1319 ) The new setting `extra_filter_labels` may be assigned to group. If it is, then all rules within a group will automatically filter for configured labels. The feature is well-described here https://docs.victoriametrics.com#prometheus-querying-api-enhancements New setting is compatible only with VM datasource.	2021-05-23 14:15:49 +03:00
Aliaksandr Valialkin	71ff7ee18d	lib/promauth: follow-up after `5b8176c68e`	2021-05-22 18:02:03 +03:00
Nikolay	2780d6dbcd	basic OAuth2 support for remoteWrite and scrape targets (#1316 ) * adds OAuth2 support for remoteWrite and scrapping * adds tests changes init	2021-05-22 18:02:01 +03:00
Nikolay	23a6c9c016	changes vmalert query function (#1307 ) * changes vmalert query function for prometheus rules compatibility its better to use labels as map. it simplifies template evaluation and allow to ignore can't evaluate field error because map will return default value. fixes https://github.com/VictoriaMetrics/operator/issues/243	2021-05-21 16:38:20 +03:00
Aliaksandr Valialkin	d77db9d813	all: do not skip SIGHUP signal during service initialization This can lead to stale or incomplete configs like in the https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1240	2021-05-21 16:38:20 +03:00
Aliaksandr Valialkin	6139f6ed6d	app/vmauth: add ability to protect `/-/reload` endpoint with authKey	2021-05-20 18:48:34 +03:00
Aliaksandr Valialkin	69e365cd48	Makefile: update golangci-lint from v1.29.0 to v1.40.1	2021-05-20 18:30:24 +03:00
Aliaksandr Valialkin	da0b32c31a	app/vmagent/remotewrite: expose metrics with the current number of active series per day and per hour These numbers are exposed via the following metrics: - vmagent_hourly_series_limit_current_series - vmagent_daily_series_limit_current_series Expose also the limits via the following metrics: - vmagent_hourly_series_limit_max_series - vmagent_daily_series_limit_max_series	2021-05-20 15:31:57 +03:00
Aliaksandr Valialkin	165a9f9200	app/vmstorage: add ability to limit series cardinality via `-storage.maxHourlySeries` and `-storage.maxDailySeries` command-line flags	2021-05-20 15:31:57 +03:00
Aliaksandr Valialkin	7aad5c3f76	app/vmagent: add ability to limit series cardinality on a per-hour and per-day basis	2021-05-20 15:31:57 +03:00
Roman Khavronenko	12d0c6b6e0	vmctl: explicitly set `::tag` type for labels selector in `influx` mode (#1310 ) The `::tag` type is needed in cases when field and tag names are equal, which results into unexpected results in InfluxQL. Setting the type explicitly helps InfluxDB to understand which exact column we apply filter to. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1299	2021-05-20 12:07:15 +03:00
Aliaksandr Valialkin	180829b8c2	app/vmselect/promql: add `timezone_offset(tz)` function Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1306	2021-05-20 11:54:06 +03:00
Aliaksandr Valialkin	dcac849c1f	app/vmagent/remotewrite: sort labels before sending the series to per-remoteWrite.url queues	2021-05-20 11:54:06 +03:00
Neo He	c5ab00ebee	app/{vmbackup,vmrestore},docs/vmrestore.md: typo fix: vbackup -> vmbackup (#1305 )	2021-05-18 16:38:15 +03:00
Aliaksandr Valialkin	74ef40034c	lib/httpserver: typo fix in `-http.shutdownDelay` command-line flag description: servier -> server	2021-05-18 16:25:27 +03:00
Aliaksandr Valialkin	1668280e67	docs/vmalert.md: document multitenant support https://github.com/VictoriaMetrics/VictoriaMetrics/issues/740	2021-05-18 16:25:21 +03:00
Aliaksandr Valialkin	7fe362deb1	app/vmauth: reload `-auth.config` on the request to `/-/reload` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1194	2021-05-18 02:24:37 +03:00
Aliaksandr Valialkin	25ca108642	docs/vmbackup.md: typo fix: snaphosts -> snapshots Thanks to @jelmd - see `1ab27582a3 (r50884395)`	2021-05-18 01:14:01 +03:00
Aliaksandr Valialkin	6ea191d196	docs: dealay -> delay	2021-05-18 01:07:32 +03:00
Roman Khavronenko	3428df6f15	vmalert: use stringified label keys for duplicates map in recroding rules (#1301 ) duplicates map helps to determine wheter extra labels has overriden labels which make time series unique. It was using a sorted hashed labels sequence as a key. But hashing algorithm could have collisions, so it is more convenient to not use hashing at all. Log message for recording rules duplicates was improved as well. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1293	2021-05-17 01:51:48 +03:00
Aliaksandr Valialkin	a6cb4f10a7	app/{vmalert,vmauth}: explicitly set MaxIdleConnsPerHost in net/http.Client.Transport By default MaxIdleConnsPerHost is set to 2. This limits the possibility to re-use http keep-alive connections. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1300	2021-05-14 18:13:34 +03:00
Aliaksandr Valialkin	23afbd5094	app/vmagent/remotewrite: clarify the comment explaining why vmagent drops blocks if remote storage returns 400 or 409 status code	2021-05-13 16:17:09 +03:00
Aliaksandr Valialkin	2839055513	lib/storage: substitute GetTSDBStatusForDate with GetTSDBStatusWithFiltersForDate with nil tfss	2021-05-13 09:01:05 +03:00
Nikolay	be87be34a4	Adds tsdb match filters (#1282 ) * init work on filters * init propose for status filters * fixes tsdb status adds test * fix bug * removes checks from test	2021-05-12 17:16:58 +03:00
Aliaksandr Valialkin	56b08390f6	app/vmselect/promql: allow to use 2x more memory for query processing in cluster mode compared to single-node mode `vmselect` has no `vmstorage`-related caches. So it can use more memory for query processing compared to single-node VictoriaMetrics.	2021-05-12 14:43:49 +03:00
Aliaksandr Valialkin	cca9670573	docs/CHANGELOG.md: document `-datasource.roundDigits` added at `5c448126dc`	2021-05-10 11:18:58 +03:00
Roman Khavronenko	a7f00101f5	vmalert: add support for `round_digits` param in datasource package (#1278 ) Starting from v1.56.0 VM supports `round_digits` which allows to limit the number of digits after the decimal point in response value. The feature can be used to reduce entropy of produced by recording rules values and significantly improve the compression. See more details in link below. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/525	2021-05-10 11:18:56 +03:00
Roman Khavronenko	35237fe1f5	vmalert: fix error when rule didn't start if restore failed (#1279 ) Previously, `startGroup` could exit on restore errors despite the `remoteRead.ignoreRestoreErrors` flag value. Now vmalert checks the flag value before deciding whether to return error or just log it.	2021-05-10 11:10:32 +03:00
Aliaksandr Valialkin	2dddd68feb	docs/vmagent.md: add `stream parsing mode` chapter	2021-05-08 23:14:47 +03:00
Aliaksandr Valialkin	9c505d27dd	lib/ingestserver: properly close incoming connections during graceful shutdown	2021-05-08 19:53:45 +03:00
Aliaksandr Valialkin	4a5f45c77e	app/vminsert: add support for data ingestion via other vminsert nodes	2021-05-08 19:53:45 +03:00
Aliaksandr Valialkin	07bc021f58	app/vmalert: add missing comment for ErrStateRestore	2021-05-08 19:53:45 +03:00
Aliaksandr Valialkin	e8478e1e97	app/vmbackup: make sure that `-snapshotName` isnt set if `-snapshot.createURL` is set	2021-05-07 08:44:44 +03:00
Roman Khavronenko	bb7e113dd4	vmalert: add flag to control behaviour on startup for state restore errors (#1265 ) Alerting rules now can return specific error type ErrStateRestore to indicate whether restore state procedure failed. Such errors were returned and logged before as well. But now user can specify whether to just log these errors (remoteRead.ignoreRestoreErrors=true) or to stop the process (remoteRead.ignoreRestoreErrors=false). The latter is important when VM isn't ready yet to serve queries from vmalert and it needs to wait. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1252	2021-05-05 12:24:32 +03:00
Aliaksandr Valialkin	0a2e746175	docs/vmalert.md: update docs after `afca7b430c`	2021-04-30 11:49:40 +03:00
Roman Khavronenko	7394967841	vmalert: fix the typo in ApplyParams func (#1259 )	2021-04-30 11:47:11 +03:00
Roman Khavronenko	6fbedd62b8	vmalert: use rule's `evaluationInterval` as `step` param by default (#1258 ) User still can override param by specifying `datasource.queryStep` flag.	2021-04-30 10:03:50 +03:00
Aliaksandr Valialkin	daf2778025	docs/CHANGELOG.md: document the change from `f3a048288e` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1232	2021-04-30 09:56:47 +03:00
Roman Khavronenko	b55677e93d	Vmalert: adjust `time` param for datasource queries according to `evaluationInterval` (#1257 ) * Simplify arguments list for fn `queryDataSource` to improve readbility * vmalert: adjust `time` param according to rule evaluation interval With this change, vmalert will start to use rule's evaluation interval for truncating the `time` param. This is mostly needed to produce consistent time series with timestamps unaffected by vmalert start time. Now, timestamp becomes predictable. Additionally, adjustment is similar to what Grafana does for plotting range graphs. Hence, recording rule series and recording rule expression plotted in grafana suppose to become similar in most of cases.	2021-04-30 09:56:46 +03:00
Aliaksandr Valialkin	8be1cb297b	app/vmagent: list user-visible endpoints at `http://vmagent:8429/` While at it, use common WriteAPIHelp function for the listing in vmagent, vmalert and victoria-metrics	2021-04-30 09:38:23 +03:00
Nikolay	2eb8ef7b2b	changes vmalert Querier with per rule querier (#1249 ) * changes vmalert Querier with per rule querier it allows to changes some parametrs based on rule setting for instance - alert type, tenant for cluster version or event endpoint url.	2021-04-29 11:31:07 +03:00
Roman Khavronenko	0ceb4f7565	vmalert: keep the returned timestamp when persisting recording rule (#1245 ) Previously, vmalert used `lastExecTime` timestamp when writing recording rules to the remote storage. This may be incorrect, if vmalert uses `datasource.lookback` flag, which means rule's expression will be executed at some moment in the past. To avoid such situations, vmalert now will use returned timestamp instead of `lastExecTime`.	2021-04-27 00:16:45 +03:00
Aliaksandr Valialkin	e309b5a83b	app/vmagent/remotewrite: increase the maximum possible number of inmemory blocks for systems with high amounts of RAM This should reduce the probability of using much slower file-based persistent queue when vmagent processes metrics at high rate (millions of metrics per second). Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1235	2021-04-23 22:05:00 +03:00
Aliaksandr Valialkin	f92db26a93	app/vmagent/remotewrite: count maxLabelsPerBlock as 10x of maxRowsPerBlock This should increase block sizes and subsequently increase the maximum possible bandwidth per each connection to remote storage. This, in turn, should reduce the probability of storing the data in local buffers. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1235	2021-04-23 22:05:00 +03:00
Aliaksandr Valialkin	aaee80d158	app/vmbackup: typo fix: snaphsot -> snapshot Follow-up for `9de0fa3649`	2021-04-22 11:18:13 +03:00
Aliaksandr Valialkin	e7c4fde756	app/vmauth: parse `url_prefix` only once during config load	2021-04-21 10:57:17 +03:00
Aliaksandr Valialkin	6dc5d3b357	all: rename https://victoriametrics.github.io to https://docs.victoriametrics.com	2021-04-20 20:20:01 +03:00
Aliaksandr Valialkin	64f1ddefe5	all: consistency renaming `Victoria Metrics` -> `VictoriaMetrics` VMInsert -> vminsert VMSelect -> vmselect VMStorage -> vmstorage	2021-04-20 11:45:02 +03:00
Aliaksandr Valialkin	8d869d112b	app/vmauth: follow-up for `6a81a89b3d`	2021-04-20 10:59:22 +03:00

1 2 3 4 5 ...

1253 commits