github-mirrors/VictoriaMetrics

mirror of https://github.com/VictoriaMetrics/VictoriaMetrics.git synced 2024-11-21 14:44:00 +00:00

Author	SHA1	Message	Date
Aliaksandr Valialkin	209c96fc42	docs/Cluster-VictoriaMetrics.md: document -disableReroutingOnUnavailable command-line flag This is a follow-up for `88f0d1572e` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5713	2024-02-05 15:18:01 +02:00
Aliaksandr Valialkin	5f836c8729	docs/CHANGELOG.md: add a link to all VictoriaMetrics dashboards for Grafana	2024-02-05 11:42:05 +02:00
Aliaksandr Valialkin	1684766152	docs/CHANGELOG.md: add missing links to the corresponding dashboards	2024-02-05 10:55:30 +02:00
hagen1778	487a94565b	dashboards/all: add new panel `CPU spent on GC` It should help identify cases when too much CPU is spent on garbage collection, and advise users on how this can be addressed. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-02-02 16:21:21 +01:00
hagen1778	29a9b31584	dashboards: add `Targets scraped/s` A new stat panel shows the number of targets scraped by the vmagent per-second. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-02-02 15:48:26 +01:00
Aliaksandr Valialkin	deed8ddfb8	docs/CHANGELOG.md: document v1.93.11 LTS release	2024-02-01 18:21:28 +02:00
Aliaksandr Valialkin	87bf1900e4	docs/CHANGELOG.md: document v1.87.14 LTS release	2024-02-01 17:08:56 +02:00
Aliaksandr Valialkin	31c53adbde	docs: mark v1.97.x as long-term support release	2024-02-01 15:16:39 +02:00
Aliaksandr Valialkin	bdfa4aee0d	docs/CHANGELOG.md: cut v1.97.1	2024-02-01 15:08:40 +02:00
Aliaksandr Valialkin	8aaa828ba3	lib/prompbmarshal: return back custom protobuf marshaler for lib/prompbmarshal.WriteRequest The easyproto-based marshaler is 2x slower than the previous custom marshaler, so let's stick with it. This improves the performance for sending data to remote storage at vmagent and reduces CPU usage to pre-v1.97.0 levels.	2024-02-01 06:33:06 +02:00
Aliaksandr Valialkin	b7fd7ee0b6	lib/promauth: follow-up for `fca3b14b7b` - Simplify the code for handling BasicAuthConfig at lib/promauth/config.go - Move the description of the change into correct place at docs/CHANGELOG.md - Put tests for username in front of tests for password at lib/promauth/config_test.go Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5720 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5511	2024-01-31 19:45:16 +02:00
Nihal	fca3b14b7b	Support for username_file in scrape config (basic_auth) similar to Prometheus for having config compatibility (#5720 ) * adding support for username_file in basic_auth of scrape config Signed-off-by: Syed Nihal <syed.nihal@nokia.com> * adding support for username_file in basic_auth of scrape config. See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5511 Signed-off-by: Syed Nihal <syed.nihal@nokia.com> * adding support for username_file in basic_auth of scrape config Signed-off-by: Syed Nihal <syed.nihal@nokia.com> * adding support for username_file in basic_auth of scrape config Signed-off-by: Syed Nihal <syed.nihal@nokia.com> * adding support for username_file in basic_auth of scrape config Signed-off-by: Syed Nihal <syed.nihal@nokia.com> --------- Signed-off-by: Syed Nihal <syed.nihal@nokia.com>	2024-01-31 17:41:16 +00:00
Aliaksandr Valialkin	db4623efc2	app/vmselect/netstorage: properly handle the case when an empty brsPool points to the end of brs.brs This case is possible after a new brsPool is allocated. The fix is to verify whether len(brsPool) >= len(brs.brs) before trying to append a new item to brsPool and sharing its contents with brs.brs. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5733	2024-01-31 10:27:50 +02:00
hagen1778	02492bc1a4	dashboards/single: fix typo in query for `version` annotation The typo falsely produced many version change events. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-01-31 09:13:46 +01:00
Aliaksandr Valialkin	ec0ca8e7eb	app/vmselect/promql: really keep metric names when keep_metric_names modifier is applied to binary operator Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5556	2024-01-31 02:32:55 +02:00
Aliaksandr Valialkin	fcc8b14f86	deployment/docker: upgrade base Docker image from Alpine 3.19.0 to 3.19.1 See https://www.alpinelinux.org/posts/Alpine-3.19.1-released.html	2024-01-30 22:47:18 +02:00
Aliaksandr Valialkin	26488726a8	docs/CHANGELOG.md: cut v1.97.0	2024-01-30 22:45:04 +02:00
Roman Khavronenko	6939c53e48	app/vmselect: set proper timestamp for cached instant responses (#5723 ) * app/vmselect: set proper timestamp for cached instant responses The change updates `getSumInstantValues` to prefer timestamp from the most recent results. Before, timestamp from cached series was used. The old behavior had negative impact on recording rules as they were getting responses with shifted timestamps in past. Subsequent recording or alerting rules fetching results of these recording rules could get no result due to staleness interval. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5659 Signed-off-by: hagen1778 <roman@victoriametrics.com> * wip --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-01-30 20:03:34 +00:00
Yury Molodov	81b5db04f6	vmui: add the ability to expand all tracing entries (#5677 ) (#5726 )	2024-01-30 19:10:10 +00:00
Aliaksandr Valialkin	f768d5d797	docs/CHANGELOG.md: document the enhancement, which reduces initial memory usage when `vmagent` scrapes targets with large responses Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5567	2024-01-30 20:51:13 +02:00
Aliaksandr Valialkin	17f8ed8948	docs/CHANGELOG.md: refer to the related pull request for the bugfix for https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1945	2024-01-30 20:21:44 +02:00
Aliaksandr Valialkin	ea2752ce62	docs/CHANGELOG.md: document the bugfix addressed by the commit `bc7cf4950b` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1945	2024-01-30 20:16:22 +02:00
Aliaksandr Valialkin	5d66ee88bd	lib/storage: do not check the limit for -search.maxUniqueTimeseries when performing /api/v1/labels and /api/v1/label/.../values requests This limit makes little sense for these APIs, since: - These APIs frequently result in scanning of all the time series on the given time range. For example, if extra_filters={datacenter="some_dc"} . - Users expect these APIs shouldn't hit the -search.maxUniqueTimeseries limit, which is intended for limiting resource usage at /api/v1/query and /api/v1/query_range requests. Also limit the concurrency for /api/v1/labels, /api/v1/label/.../values and /api/v1/series requests in order to limit the maximum memory usage and CPU usage for these APIs. This limit shouldn't affect typical use cases for these APIs: - Grafana dashboard load when dashboard labels should be loaded - Auto-suggestion list load when editing the query in Grafana or vmui Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5055	2024-01-29 16:45:12 +01:00
Roman Khavronenko	aaa526e8ff	lib/streamaggr: skip unfinished aggregation state on shutdown by default (#5689 ) Sending unfinished aggregate states tends to produce unexpected anomalies with lower values than expected. The old behavior can be restored by specifying `flush_on_shutdown: true` setting in streaming aggregation config Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-01-26 22:45:23 +01:00
Roman Khavronenko	df59ac7f0e	app/vmalert: fix data race during hot-config reload (#5698 ) * app/vmalert: fix data race during hot-config reload During hot-reload, the logic invokes the group update and rules evaluation interruption simultaneously. Falsely assuming that interruption happens before the update. However, it could happen that group will be updated first and only after the rules evaluation will be cancelled. Which will result in permanent interruption for all rules within the group. The fix caches the cancel context function into local variable first. And only after performs the group update. With cached cancel function we can safely call it without worrying that we cancel the evaluation for already updated group. Signed-off-by: hagen1778 <roman@victoriametrics.com> * Revert "app/vmalert: fix data race during hot-config reload" This reverts commit `a4bb7e8932`. * app/vmalert: fix data race during hot-config reload During hot-reload, the logic invokes the group update and rules evaluation interruption simultaneously. Falsely assuming that interruption happens before the update. However, it could happen that group will be updated first and only after the rules evaluation will be cancelled. Which will result in permanent interruption for all rules within the group. The fix cancels the evaluation context before applying the update, making sure that the context will be cancelled for old group always. Signed-off-by: hagen1778 <roman@victoriametrics.com> * wip Signed-off-by: hagen1778 <roman@victoriametrics.com> --------- Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-01-26 22:42:21 +01:00
Yury Molodov	a7b11eff7c	vmui: fix `Enter` key in query field (#5667 ) (#5681 )	2024-01-26 22:38:32 +01:00
Aliaksandr Valialkin	bb7a419cc3	lib/{mergeset,storage}: make background merge more responsive and scalable - Maintain a separate worker pool per each part type (in-memory, file, big and small). Previously a shared pool was used for merging all the part types. A single merge worker could merge parts with mixed types at once. For example, it could merge simultaneously an in-memory part plus a big file part. Such a merge could take hours for big file part. For the duration of this merge the in-memory part was pinned in memory and couldn't be persisted to disk under the configured -inmemoryDataFlushInterval . Another common issue, which could happen when parts with mixed types are merged, is uncontrolled growth of in-memory parts or small parts when all the merge workers were busy with merging big files. Such growth could lead to significant performance degradation for queries, since every query needs to check ever growing list of parts. This could also slow down the registration of new time series, since VictoriaMetrics searches for the internal series_id in the indexdb for every new time series. The third issue is graceful shutdown duration, which could be very long when a background merge is running on in-memory parts plus big file parts. This merge couldn't be interrupted, since it merges in-memory parts. A separate pool of merge workers per every part type elegantly resolves these issues: - In-memory parts are merged to file-based parts in a timely manner, since the maximum size of in-memory parts is limited. - Long-running merges for big parts do not block merges for in-memory parts and small parts. - Graceful shutdown duration is now limited by the time needed for flushing in-memory parts to files. Merging for file parts is instantly canceled on graceful shutdown now. - Deprecate -smallMergeConcurrency command-line flag, since the new background merge algorithm should automatically self-tune according to the number of available CPU cores. - Deprecate -finalMergeDelay command-line flag, since it wasn't working correctly. It is better to run forced merge when needed - https://docs.victoriametrics.com/#forced-merge - Tune the number of shards for pending rows and items before the data goes to in-memory parts and becomes visible for search. This improves the maximum data ingestion rate and the maximum rate for registration of new time series. This should reduce the duration of data ingestion slowdown in VictoriaMetrics cluster on e.g. re-routing events, when some of vmstorage nodes become temporarily unavailable. - Prevent from possible "sync: WaitGroup misuse" panic on graceful shutdown. This is a follow-up for `fa566c68a6` . Thanks to @misutoth for the inspiration at https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5212 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5190 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3790 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3551 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3337 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3425 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3647 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3641 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/648 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/291	2024-01-26 22:27:47 +01:00
Roman Khavronenko	b11f4ef5ea	app/vmalert: autogenerate `ALERTS_FOR_STATE` time series for alerting rules with `for: 0` (#5680 ) * app/vmalert: autogenerate `ALERTS_FOR_STATE` time series for alerting rules with `for: 0` Previously, `ALERTS_FOR_STATE` was generated only for alerts with `for > 0`. This behavior differs from Prometheus behavior - it generates ALERTS_FOR_STATE time series for alerting rules with `for: 0` as well. Such time series can be useful for tracking the moment when alerting rule became active. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5648 https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3056 Signed-off-by: hagen1778 <roman@victoriametrics.com> * app/vmalert: support ALERTS_FOR_STATE in `replay` mode Signed-off-by: hagen1778 <roman@victoriametrics.com> --------- Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-01-25 15:42:57 +01:00
Alexander Marshalov	806c07ddd5	vmsingle/vmselect returns http status 429 (TooManyRequests) instead of 503 (ServiceUnavailable) when max concurrent requests limit is reached. (#5682 )	2024-01-24 17:55:06 +01:00
Aliaksandr Valialkin	ef12598ad4	lib/promscrape/discovery/kubernetes: do not generate targets for already terminated pods and containers Already terminated pods and containers cannot be scraped and will never resurrect, so there is zero sense in creating scrape targets for them.	2024-01-24 14:57:53 +02:00
Aliaksandr Valialkin	4d961c70f7	app/{vmselect,vmstorage}: return compression of the data passed from vmstorage to vmselect This reverts `cd4f641d32` , since it turned out that disabling compression for vmstorage->vmselect data increases network bandwidth usage by more than 10x on typical production workloads, while it decreases CPU usage at vmstorage by up to 10% and improves query latency by up to 10%. The 10x increase in network usage is too high a price for 10% improvements in query latency and vmstorage CPU usage. This may result in network bandwidth bottlenecks, which can reduce the overall performance and stability of VictoriaMetrics cluster. That's why the vmstorage->vmselect data compression is enabled by default again. The vmstorage->vmselect compression can be disabled by passing -rpc.disableCompression command-line flag to vmstorage. The vmselect->vmselect compression in multi-level cluster setup can be disabled by passing -clusternative.disableCompression command-line flag.	2024-01-24 13:39:28 +02:00
Aliaksandr Valialkin	f888a019fe	lib/streamaggr: expand `%{ENV}` placeholders in stream aggregation configs	2024-01-24 12:31:27 +02:00
Aliaksandr Valialkin	fa566c68a6	lib/mergeset: really limit the number of in-memory parts to 15 It turned out that the registration of new time series slows down linearly with the number of indexdb parts, since VictoriaMetrics needs to check every indexdb part when it searches for TSID by newly ingested metric name. The number of in-memory parts grows when new time series are registered at high rate. The number of in-memory parts grows faster on systems with a big number of CPU cores, because the mergeset maintains per-CPU buffers with newly added entries for the indexdb, and every such entry is transformed eventually into a separate in-memory part. The solution has been suggested in https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5212 by @misutoth - to limit the number of in-memory parts with buffered channel. This solution is implemented in this commit. Additionally, this commit merges per-CPU parts into a single part before adding it to the list of in-memory parts. This reduces CPU load when searching for TSID by newly ingested metric name. The https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5212 recommends setting the limit on the number of in-memory parts to 100, but my internal testing shows that a much lower limit of 15 works with the same efficiency on a system with 16 CPU cores while reducing memory usage for `indexdb/dataBlocks` cache by up to 50%. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5190	2024-01-24 03:38:12 +02:00
Roman Khavronenko	89e3c70ccd	lib/promscrape: respect `0` value for `series_limit` param (#5663 ) * lib/promscrape: respect `0` value for `series_limit` param Respect `0` value for `series_limit` param in `scrape_config` even if global limit was set via `-promscrape.seriesLimitPerTarget`. Previously, `0` value was ignored in favor of `-promscrape.seriesLimitPerTarget`. This behavior aligns with possibility to override `series_limit` value via relabeling with `__series_limit__` label. Signed-off-by: hagen1778 <roman@victoriametrics.com> * Update docs/CHANGELOG.md --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-01-23 13:09:14 +02:00
Aliaksandr Valialkin	114822d585	app/{vmstorage,vmselect}: disable vmstorage->vmselect RPC compression by default in order to improve query performance	2024-01-23 04:24:57 +02:00
Zakhar Bessarab	bf4742526d	lib/storage: print tenant ID in log when discarding or truncating labels (#5658 ) Previously, it was not possible to determine which tenant sends metrics with excessive amount of labels or label values. Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-01-23 04:24:56 +02:00
Yury Molodov	38231d5994	vmui: query report (#5497 ) * vmui: add query analyzer page * vmui: fix tabs for query analyzer * vmui: add help to export query * vmui: add time params to query analyzer * docs/vmui: add query analyzer * vmui: fix validation JSON form --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-01-23 04:23:26 +02:00
Yury Molodov	eb6def0695	vmui: add flag for default timezone setting (#5611 ) * vmui: add flag for default timezone setting #5375 * vmui: validate timezone before client return * Update app/vmselect/vmui.go --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-01-23 04:11:19 +02:00
Yury Molodov	633e6b48ad	vmui: fix cache autocomplete (#5591 ) * vmui: fix the logic of closing the popper #5470 * vmui: fix the logic of caching autocomplete results #5472 --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-01-23 04:06:14 +02:00
Dmytro Kozlov	38b2a5bc44	deployment/docker: add grafana datasource to the docker-compose files (#5363 ) https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3920 https://github.com/VictoriaMetrics/grafana-datasource/issues/113	2024-01-22 15:45:31 +01:00
Aliaksandr Valialkin	d3ee3e0ef5	Revert "lib/promscrape: do not store last scrape response when stale markers … (#5577 )" This reverts commit `cfec258803`. Reason for revert: the original code already doesn't store the last scrape response when stale markers are disabled. The scrapeWork.areIdenticalSeries() function always returns true if stale markers are disabled. This prevents from storing the last response at scrapeWork.processScrapedData(). It looks like the reverted commit could also return back the issue https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3660 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5577	2024-01-22 00:43:48 +02:00
Aliaksandr Valialkin	1c7f990fad	app/vmselect: handle negative time range start in a generic manner inside NewSearchQuery() This is a follow-up for `cf03e11d89` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5553 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5630	2024-01-21 23:45:31 +02:00
Hui Wang	4e3242b02d	lib/promscrape/discovery/kubernetes: fix watcher start order for roles endpoints and endpointslice (#5557 ) * lib/promscrape/discovery/kubernetes: fix watcher start order for roles endpoints and endpointslice Previously the groupWatcher could be mistakenly stopped when requests for pod or services resources take too long. * remove misleading comment * docs/sd_configs.md: mention -promscrape.kubernetes.attachNodeMetadataAll flag in the description for attach_metadata section Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4640 * wip * lib/promscrape/kubernetes: prevent from stopping groupWatcher when there are in-flight apiWatcher.mustStart() calls groupWatcher is stopped if it has zero registered apiWatchers for 14 seconds. But such a groupWatcher can be still in use if apiWatcher for `role: endpoints` or `role: endpointslice` is being registered and the discovery of the associated `pod` and/or `service` objects takes longer than 14 seconds - see the beginning of groupWatcher.startWatchersForRole() function for details. Track the number of in-flight calls to apiWatcher.mustStart() and prevent from stopping the associated groupWatcher if the number of in-flight calls is non-zero. P.S. postponing the discovery of `pod` and/or `service` objects associated with `endpoints` or `endpointslice` roles isn't the best solution, since it slows down initial discovery of `endpoints` and `endpointslice` targets. * typo fix --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-01-21 23:13:15 +02:00
Aliaksandr Valialkin	1f105dde98	all: allow dynamically reading *AuthKey flag values from files and urls Examples: 1) -metricsAuthKey=file:///abs/path/to/file - reads flag value from the given absolute filepath 2) -metricsAuthKey=file://./relative/path/to/file - reads flag value from the given relative filepath 3) -metricsAuthKey=http://some-host/some/path?query_arg=abc - reads flag value from the given url The flag value is automatically updated when the file contents changes.	2024-01-21 22:03:38 +02:00
Nikolay	b3598ba2c1	app/vmauth: adds metric_labels and backend_errors counter (#5585 ) * app/vmauth: adds metric_labels and backend_errors counter it must improve observability for user requests with new metric - per user backend errors counter. it's needed to calculate requests fail rate to the configured backends. metric_labels configuration allows to perform additional aggregations on top of multiple users from configuration section. It could be multiple clients or clients with separate read/write tokens https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5565 * wip --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-01-21 04:40:52 +02:00
Aliaksandr Valialkin	7fba73ce11	lib/promscrape/discovery/kubernetes: add -promscrape.kubernetes.attachNodeMetadataAll command-line flag This flag allows setting attach_metadata.node=true for all the kubernetes_sd_configs defined at -promscrape.config Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4640 Thanks to wasim-nihal for the initial implementation at https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5593	2024-01-21 03:13:56 +02:00
Hui Wang	fad212c39c	app/vmselect/promql: properly handle possible negative results caused… (#5608 ) * app/vmselect/promql: properly handle possible negative results caused by float operations precision error in rollup functions like rate() or increase() * fix test	2024-01-21 02:53:29 +02:00
Nikolay	c9f39fd51f	app/vmselect/netstorage (#5649 ) * app/vmselect/netstorage correctly handle errGlobal set * wip Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5649 --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-01-21 02:47:29 +02:00
Nikolay	8ab0ce3ded	app/vmselect: abort streaming connections for vmselect (#5650 ) * app/vmselect: abort streaming connections for vmselect due to streaming nature of export APIs, curl and similar tools cannot detect errors that happened after http.Header with status 200 was written to it. This PR tracks if body write was already started and closes connection. It allows client to detect unexpected chunk sequence and return error to the caller. Mostly it affects vmselect at cluster version https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5645 * wip Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5645 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5650 --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-01-21 02:12:51 +02:00
Aliaksandr Valialkin	74448a7e57	lib/promscrape/discovery/hetzner: follow-up after `03a97dc678` - docs/sd_configs.md: moved hetzner_sd_configs docs to the correct place according to alphabetical order of SD names, document missing __meta_hetzner_role label. - lib/promscrape/config.go: added missing MustStop() call for Hetzner SD, and moved the code to the correct place according to alphabetical order of SD names. - lib/promscrape/discovery/hetzner: properly handle pagination for hcloud API responses, populate missing __meta_hetzner_role label like Prometheus does. - Properly populate __meta_hetzner_public_ipv6_network label like Prometheus does. - Remove unused SDConfig.Token. - Remove "omitempty" annotation from SDConfig.Role field, since this field is mandatory. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5550 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3154	2024-01-20 17:01:53 +02:00
Hui Wang	cfec258803	lib/promscrape: do not store last scrape response when stale markers … (#5577 ) * lib/promscrape: do not store last scrape response when stale markers are disabled * update changelog	2024-01-20 00:53:41 +08:00
Roman Khavronenko	7e374c227f	app/vmui: send `step` param for instant queries (#5639 ) The change reverts https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3896 due to reasons explained in https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3896#issuecomment-1896704401 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-01-19 08:48:16 +01:00
Artem Navoiev	dab160cd74	docs: changelog fix the link to cluster Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-01-17 15:44:56 +01:00
Roman Khavronenko	cf03e11d89	app/vmselect: properly calculate `start` param for queries with too big look-behind window (#5630 ) Properly determine time range search for instant queries with too big look-behind window like `foo[100y]`. Previously, such queries could return empty responses even if `foo` is present in database. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5553 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-01-17 13:48:06 +01:00
Aliaksandr Valialkin	cc6819869a	docs/CHANGELOG*: move changes for 2023 year to docs/CHANGELOG_2023.md	2024-01-17 13:10:32 +02:00
Aliaksandr Valialkin	1683df11f0	docs/CHANGELOG.md: document v1.93.10 LTS release See https://github.com/VictoriaMetrics/VictoriaMetrics/releases/tag/v1.93.10	2024-01-17 01:45:06 +02:00
Aliaksandr Valialkin	ecce2d6db1	docs/CHANGELOG.md: document v1.87.13 LTS release See https://github.com/VictoriaMetrics/VictoriaMetrics/releases/tag/v1.87.13	2024-01-17 01:04:05 +02:00
hagen1778	b0287867fe	deployment/dashboards: change title `VictoriaMetrics` to `VictoriaMetrics - single-node` The new title should provide better understanding of this dashboard purpose. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-01-16 20:39:52 +01:00
Aliaksandr Valialkin	30d77393a5	docs/CHANGELOG.md: fix a link in the description of `70cd09e736` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5581	2024-01-16 20:54:44 +02:00
Aliaksandr Valialkin	a74f6d63e0	deployment/docker: update Go builder from Go1.21.5 to Go1.21.6	2024-01-16 17:00:16 +02:00
Aliaksandr Valialkin	9d886a2eb0	lib/storage: follow-up for `4b8088e377` - Clarify the bugfix description at docs/CHANGELOG.md - Simplify the code by accessing prefetchedMetricIDs struct under the lock instead of using lockless access to immutable struct. This shouldn't worsen code scalability too much on busy systems with many CPU cores, since the code executed under the lock is quite small and fast. This allows removing cloning of prefetchedMetricIDs struct every time new metric names are pre-fetched. This should reduce load on Go GC, since the cloning of uint64set.Set struct allocates many new objects.	2024-01-16 15:29:57 +02:00
Hui Wang	3ac44baebe	exit vmagent if there is config syntax error in `scrape_config_files` when `-promscrape.config.strictParse=true` (#5560 )	2024-01-16 17:30:02 +08:00
hagen1778	d0e4190969	deployment/alerts: add `job` label to `DiskRunsOutOfSpace` alerting rule So it is easier to understand to which installation the triggered instance belongs. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-01-16 09:49:39 +01:00
Aliaksandr Valialkin	388d020b7c	app/vmselect/promql: follow-up for `ce4f26db02` - Document the bugfix at docs/CHANGELOG.md - Filter out NaN values before sorting as suggested at https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5509#discussion_r1447369218 - Revert unrelated changes in lib/filestream and lib/fs - Use simpler test at app/vmselect/promql/exec_test.go Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5509 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5506	2024-01-16 02:55:08 +02:00
Aliaksandr Valialkin	190a6565ae	app/vmselect/promql: consistently sort results of `a or b` query Previously the order of results returned from `a or b` query could change with each request because the sorting for such query has been disabled in order to satisfy https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4763 . This commit executes `a or b` query as `sortByMetricName(a) or sortByMetricName(b)`. This makes the order of returned time series consistent across requests, while maintaining the requirement from https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4763 , e.g. `b` results are consistently put after `a` results. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5393	2024-01-16 01:30:10 +02:00
Aliaksandr Valialkin	be509b3995	lib/pushmetrics: wait until the background goroutines, which push metrics, are stopped at pushmetrics.Stop() Previously there was a race condition when the background goroutine still could try collecting metrics from already stopped resources after returning from pushmetrics.Stop(). Now the pushmetrics.Stop() waits until the background goroutine is stopped before returning. This is a follow-up for https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5549 and the commit `fe2d9f6646` . Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5548	2024-01-15 13:50:36 +02:00
Aleksandr Stepanov	03a97dc678	vmagent: added hetzner sd config (#5550 ) * added hetzner robot and hetzner cloud sd configs * remove gettoken fun and update docs * Updated CHANGELOG and vmagent docs * Updated CHANGELOG and vmagent docs --------- Co-authored-by: Nikolay <nik@victoriametrics.com>	2024-01-15 10:13:22 +01:00
Roman Khavronenko	4b8088e377	lib/storage: properly check for `storage/prefetchedMetricIDs` cache expiration deadline (#5607 ) Before, this cache was limited only by size. Cache invalidation by time happens with jitter to prevent thundering herd problem. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-01-15 10:03:06 +01:00
rbizos	70cd09e736	Handling negative index in Graphite groupByNode/aliasByNode (#5581 ) Handling the error case with -1 Signed-off-by: Raphael Bizos <r.bizos@criteo.com> Co-authored-by: Nikolay <nik@victoriametrics.com>	2024-01-15 09:57:15 +01:00
Aliaksandr Valialkin	a47127c1a6	app/vmalert/remotewrite: properly calculate vmalert_remotewrite_dropped_rows_total It was calculating the number of dropped time series instead of the number of dropped samples. While at it, drop vmalert_remotewrite_dropped_bytes_total metric, since it was inconsistently calculated - at one place it was calculating raw protobuf-encoded sample sizes, while at another place it was calculating the size of snappy-compressed prompbmarshal.WriteRequest protobuf message. Additionally, this metric has zero practical sense, so just drop it in order to reduce the level of confusion.	2024-01-14 22:55:11 +02:00
Aliaksandr Valialkin	0597718435	lib/protoparser/datadogv2: add support for reading protobuf-encoded requests at /api/v2/series endpoint Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4451 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5094	2024-01-14 21:09:05 +02:00
Dmytro Kozlov	828aca82e9	app/vmctl: add insecure skip verify flags for source and destination addresses for native protocol (#5606 ) https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5595	2024-01-11 14:04:32 +01:00
hagen1778	91ccea236f	app/all: follow-up after `84d710beab` https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5548 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-01-09 13:34:54 +01:00
Dmytro Kozlov	105c6b2eb7	app/vmui: fix broken link for the statistic inaccuracy explanation (#5568 )	2024-01-08 20:13:45 +01:00
hagen1778	463455665b	dashboards: update cluster dashboard * add panels for detailed visualization of traffic usage between vmstorage, vminsert, vmselect components and their clients. New panels are available in the rows dedicated to specific components. * update "Slow Queries" panel to show percentage of the slow queries to the total number of read queries served by vmselect. The percentage value should make it more clear for users whether there is a service degradation. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-01-08 11:58:31 +01:00
Denys Holius	aecfabe318	CHANGELOG.md: fixed wrong links to vmalert-tool documentation page (#5570 )	2024-01-05 07:16:46 -08:00
Hui Wang	1f477aba41	vmalert: automatically add `exported_` prefix for original evaluation… (#5398 ) automatically add `exported_` prefix for original evaluation result label if it's conflicted with external or reserved one, previously it was overridden. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5161 Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2023-12-22 16:07:47 +01:00
Aliaksandr Valialkin	9678235eea	docs/CHANGELOG.md: typo fix after `fb90a56de2`: supperted -> supported	2023-12-21 21:01:42 +02:00
Aliaksandr Valialkin	fb90a56de2	app/{vminsert,vmagent}: preliminary support for /api/v2/series ingestion from new versions of DataDog Agent This commit adds only JSON support - https://docs.datadoghq.com/api/latest/metrics/#submit-metrics , while recent versions of DataDog Agent send data to /api/v2/series in undocumented Protobuf format. The support for this format will be added later. Thanks to @AndrewChubatiuk for the initial implementation at https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5094 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4451	2023-12-21 20:50:55 +02:00
Aliaksandr Valialkin	160cc9debd	app/{vmagent,vmalert}: add the ability to set OAuth2 endpoint params via the corresponding *.oauth2.endpointParams command-line flags This is a follow-up for `5ebd5a0d7b` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5427	2023-12-20 21:35:28 +02:00
Morgan	5ebd5a0d7b	Expose OAuth2 Endpoint Parameters to cli (#5427 ) The user may wish to control the endpoint parameters, for instance to set the audience when requesting an access token. Exposing the parameters as a map allows for additional use cases without requiring modification.	2023-12-20 20:16:43 +02:00
Aliaksandr Valialkin	7a31f8a6c9	app/vmselect/netstorage: make sure that at least a single result is collected from every storage group before deciding whether it is OK to skip results from the remaining storage nodes	2023-12-20 19:55:13 +02:00
Nikolay	7cfde237ec	lib/awsapi: properly assume role with webIdentity token (#5495 ) * lib/awsapi: properly assume role with webIdentity token introduce new irsaRoleArn param for config. It's only needed for authorization with webIdentity token. First credentials obtained with irsa role and the next sts assume call for an actual roleArn made with those credentials. Common use case for it - cross AWS accounts authorization https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3822 * wip --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-12-20 19:05:39 +02:00
Aliaksandr Valialkin	326a77c697	all: add -metrics.exposeMetadata command-line flag, which can be used for adding TYPE and HELP metadata for metrics exposed at /metrics page This may be needed for systems, which require this metadata such as Google Cloud Managed Prometheus. See https://cloud.google.com/stackdriver/docs/managed-prometheus/troubleshooting#missing-metric-type	2023-12-19 03:20:40 +02:00
Aliaksandr Valialkin	4b529562ce	lib/pushmetrics: add -pushmetrics.header and -pushmetrics.disableCompression command-line flags	2023-12-17 19:56:46 +02:00
Aliaksandr Valialkin	873f0deaa6	docs/CHANGELOG.md: typo fix: use proper backtick closing quote instead of single quote	2023-12-17 19:28:15 +02:00
Aliaksandr Valialkin	0379a0eb82	lib/protoparser/opentelemetry: allow ingesting metrics without resource labels Some clients may ingest samples via OpenTelemetry protocol without Resource labels. Previously VictoriaMetrics was silently dropping such samples. The commit `317834f876` added vm_protoparser_rows_dropped_total{type="opentelemetry",reason="resource_not_set"} counter for tracking of such dropped samples. See https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5459 It is better from usability PoV to accept such samples instead of dropping them and incrementing the corresponding counter.	2023-12-17 19:12:58 +02:00
Aliaksandr Valialkin	5ddccbc2b9	docs/CHANGELOG.md: move the description of the bugfix from `9253c24dd6` into correct place The description of the bugfix was incorrectly placed in already released v1.96.0, while it should be placed in tip. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5450	2023-12-17 18:58:42 +02:00
Roman Khavronenko	779bbc2e91	vmctl: rename `vm-native-disable-retries` to `vm-native-disable-per-metric-migration` (#5476 ) The change is supposed to better reflect the meaning of this flag. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-12-15 12:36:28 +01:00
Roman Khavronenko	664fa5cb78	vmctl: retry requests that failed in the very end for `vm-native` (#5475 ) Before, retries happened only on writes into a network connection between source and destination. But errors returned by server after all the data was transmitted were logged, but not retried. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-12-15 11:43:41 +01:00
Hui Wang	9253c24dd6	vmalert: validate schema for `-external.url` (#5450 ) Requests with wrong or no schema in `-external.url` could be rejected by alertmanager. So we validate schema on start up.	2023-12-15 11:13:56 +01:00
Aliaksandr Valialkin	0a6a2e455d	app/vmstorage: add missing -inmemoryDataFlushInterval command-line flag Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3337	2023-12-14 20:52:39 +02:00
Aliaksandr Valialkin	df88baef07	docs/CHANGELOG.md: document the bugfix at `66c76a4d4d` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5414	2023-12-14 12:48:37 +02:00
Aliaksandr Valialkin	68be182075	app/vmauth: add ability to route requests to different backends depending on the request host	2023-12-14 00:46:36 +02:00
Aliaksandr Valialkin	304fe05650	docs/CHANGELOG.md: cut v1.96.0 release	2023-12-13 00:42:56 +02:00
Yury Molodov	1a5cdb4790	vmui: autocomplete usability improvements (#5422 ) * vmui: add show quick tip for autocomplete * vmui: auto-completion usability improvements #5348 * vmui: add const for min symbols in autocomplete * Use proper queries to VictoriaMetrics * vmui: fix comments for autocomplete * app/vmselect: run `make vmui-update` --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-12-13 00:32:41 +02:00
Aliaksandr Valialkin	0f91f83639	app/vmselect: add support for vmstorage groups with independent -replicationFactor per group Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5197 See https://docs.victoriametrics.com/Cluster-VictoriaMetrics.html#vmstorage-groups-at-vmselect Thanks to @zekker6 for the initial pull request at https://github.com/VictoriaMetrics/VictoriaMetrics-enterprise/pull/718	2023-12-13 00:14:45 +02:00
hagen1778	242472086b	app/vmctl: follow-up after `6af732b6f7` Make docs more clear about new feature. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-12-12 13:44:21 +01:00
Dmytro Kozlov	6af732b6f7	app/vmctl: enable range steps in reverse order (#5444 ) See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5376	2023-12-12 13:05:44 +01:00
hagen1778	1e02efd511	docs: fix formatting after a list in CHANGELOG.md Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-12-12 10:28:11 +01:00
hagen1778	39c405ed4d	app/vmctl: follow-up after `27668c9d01` * remove duplications in error messages * mention the change in CHANGELOG.md `27668c9d01` Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-12-11 15:30:47 +01:00
hagen1778	e13dc04fbf	docs: mention alerts change in CHANGELOG.md Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-12-11 15:23:09 +01:00
Aliaksandr Valialkin	635da5fab7	docs/CHANGELOG.md: document v1.93.9 LTS release	2023-12-11 10:39:28 +02:00
Aliaksandr Valialkin	ce8ae450fc	docs/CHANGELOG.md: document v1.87.12	2023-12-10 14:30:22 +02:00
Aliaksandr Valialkin	6d03779870	deployment/docker: update base Docker image from alpine:3.18.5 to alpine:3.19.0 See https://www.alpinelinux.org/posts/Alpine-3.19.0-released.html	2023-12-10 02:28:19 +02:00
Aliaksandr Valialkin	3d3b0e31e0	app/vmselect: add -search.maxResponseSeries command-line flag for limiting the number of time series a single response can return This limit can be used for preventing from high memory usage at Grafana when the response returns too many series. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5372	2023-12-10 00:54:42 +02:00
Aliaksandr Valialkin	c7504daa7a	docs: follow-up after `49552eaa15` Link to the related issue - https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4792 Fix heading for `Modifying HTTP headers` chapter at docs/vmagent.md	2023-12-08 23:56:56 +02:00
Aliaksandr Valialkin	042267541f	app/vmauth: add support for `hot standby` mode via `first_available` load balancing policy vmauth in `hot standby` mode sends requests to the first url_prefix while it is available. If the first url_prefix becomes unavailable, then vmauth falls back to the next url_prefix. This allows building highly available setup as described at https://docs.victoriametrics.com/vmauth.html#high-availability Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4893 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4792	2023-12-08 23:31:07 +02:00
Roman Khavronenko	74b09ab4de	app/vmalert: sanitize label names before sending to Alertmanager (#5442 ) Before, vmalert would send notifications with labels containing characters not supported by Alertmanager validator, resulting in validation errors like `msg="Failed to validate alerts" err="invalid label set: invalid name "foo.bar"` Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-12-08 16:53:35 +03:00
Alexander Marshalov	02a0a7f428	added field `version` to the response for `/api/v1/status/buildinfo` API for using more efficient API in Grafana for receiving label values, added additional info about setup Grafana datasource (#5370 ) (#5437 )	2023-12-07 16:37:36 +02:00
Aliaksandr Valialkin	b39e9257eb	app/vmselect/prometheus: properly encode Prometheus label values at /federate endpoint Prometheus spec says that only \, \n and " must be escaped inside label values. See `995743836e/content/docs/instrumenting/exposition_formats.md (L90)` See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5431	2023-12-07 15:36:01 +02:00
Aliaksandr Valialkin	7cb8ed8271	lib/promscrape: show -promscrape.cluster.memberNum values for vmagent instances, which scrape the given dropped target at /service-discovery page The /service-discovery page contains the list of all the discovered targets after the commit `487f6380d0` on all the vmagent instances in cluster mode ( https://docs.victoriametrics.com/vmagent.html#scraping-big-number-of-targets ). This commit improves debuggability of targets in cluster mode by providing a list of -promscrape.cluster.memberNum values per each target at /service-discovery page, which has been dropped because of sharding, e.g. if this target is scraped by other vmagent instances in the cluster. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5389 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4018	2023-12-07 00:05:32 +02:00
Aliaksandr Valialkin	efbe25a678	deployment/docker: update Go builder from Go1.21.4 to Go1.21.5 See https://github.com/golang/go/issues?q=milestone%3AGo1.21.5+label%3ACherryPickApproved	2023-12-06 22:33:40 +02:00
Dmytro Kozlov	935bec447b	app/vmalert: replace error metrics for gauges with counter metrics (#5217 ) See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5160 Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2023-12-06 19:39:35 +01:00
Aliaksandr Valialkin	65bc460323	lib/promscrape: follow-up for `97373b7786` Substitute O(N^2) algorithm for exposing the `vm_promscrape_scrape_pool_targets` metric with O(N) algorithm, where N is the number of scrape jobs. The previous algorithm could slow down /metrics exposition significantly when -promscrape.config contains thousands of scrape jobs. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5311 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5335	2023-12-06 17:35:50 +02:00
Aliaksandr Valialkin	e4f5039509	app/vmselect: properly adjust the lower bound for the time range where raw samples must be selected for default_rollup() function Previously the lower bound could be too small, which could result in missing values at the beginning of the graph for default_rollup() function. This function is automatically applied to all the series selectors if they aren't explicitly wrapped into a rollup function - see https://docs.victoriametrics.com/MetricsQL.html#implicit-query-conversions While at it, properly take into account `-search.minStalenessInterval` command-line flag when adjusting the lower bound for the selected time range. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5388	2023-12-06 14:20:14 +02:00
Hui Wang	97373b7786	vmagent: add `vm_promscrape_scrape_pool_targets` for scrape jobs like… (#5335 ) * vmagent: export `vm_promscrape_scrape_pool_targets` metric to track the number of targets that each scrape_job discovers * add extra panel for new metric	2023-12-06 15:44:39 +08:00
Aliaksandr Valialkin	bc550e22d7	Revert "lib/protoparser/datadog: follow-up after 543f218fe96574b9b2189c8350bb09afa349e3bb" This reverts commit `98d0f81f21`. Reason for revert: see https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5094#issuecomment-1839789080	2023-12-05 02:19:29 +02:00
Aliaksandr Valialkin	fdbbbf33ca	app/vmagent: add `-enableMultitenantHandlers` command-line flag This flag allows converting tenant id to (vm_account_id, vm_project_id) labels. This flag deprecates `-remoteWrite.multitenantURL` command-line flag, because `-enableMultitenantHandlers` is easier to use and combine with multitenant url at vminsert - https://docs.victoriametrics.com/Cluster-VictoriaMetrics.html#multitenancy-via-labels See https://docs.victoriametrics.com/vmagent.html#multitenancy Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/1505	2023-12-05 01:28:37 +02:00
Dmytro Kozlov	a28cc6ebec	app/vmalert: expose `/vmalert/api/v1/rule` and `/api/v1/rule` API which returns rule status in JSON format (#5397 ) * app/vmalert: expose `/vmalert/api/v1/rule` and `/api/v1/rule` API which returns rule status in JSON format * app/vmalert: hide updates if query param not set * app/vmalert: fix panic (recursion call) * app/vmalert: add needed group name and file name * app/vmalert: fix comment, update behavior * app/vmalert: fix description * app/vmalert: simplify API for /api/v1/rule Signed-off-by: hagen1778 <roman@victoriametrics.com> * app/vmalert: simplify API for /api/v1/rule Signed-off-by: hagen1778 <roman@victoriametrics.com> * app/vmalert: simplify API for /api/v1/rule Signed-off-by: hagen1778 <roman@victoriametrics.com> * app/vmalert: simplify API for /api/v1/rule Signed-off-by: hagen1778 <roman@victoriametrics.com> * app/vmalert: simplify API for /api/v1/rule Signed-off-by: hagen1778 <roman@victoriametrics.com> --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2023-12-04 18:40:33 +03:00
Aliaksandr Valialkin	17900e39d7	app/vminsert/newrelic: simplify the code a bit after `1fb8dc0092` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5416 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5421	2023-12-04 16:52:34 +02:00
Dmytro Kozlov	d1aa15688a	app/vminsert: fix newrelic ingestion in cluster version (#5421 ) Properly pass tenant ID to ingested data from newrelic. Before tenant ID was mistakenly skipped.	2023-12-04 16:52:29 +02:00
Zakhar Bessarab	3532f52f4b	lib/backup/s3remote: remove prev object versions for recursive delete (#719 ) * lib/backup/s3remote: remove prev object versions for recursive delete - fix error caused by sending empty objects list to be deleted. This was possible in case old versions of objects were deleted, but root-level entries were still available. This caused paginator to return an empty page which wasn't skipped. - delete previous versions of objects recursively for S3 remote Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * docs/changelog: add vmbackupmanager fix entry Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * lib/backup/s3remote: unify path construction for S3 objects Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> --------- Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2023-12-04 10:32:05 +02:00
Aliaksandr Valialkin	c7a2e4e90a	deployment/docker: update base Docker image from alpine 3.18.4 to 3.18.5 See https://www.alpinelinux.org/posts/Alpine-3.15.11-3.16.8-3.17.6-3.18.5-released.html	2023-12-03 18:53:51 +02:00
Aliaksandr Valialkin	f62e03b3d2	app/vmselect: do not limit concurrency for static and fast queries Previously concurrency for static and fast queries was limited with the -search.maxConcurrentRequests command-line flag. This could complicate identifying heavy queries via `vmui` at `Top queries` and `Active queries` pages, since `vmui` and these pages couldn't be opened on overloaded vmselect. Thanks to @f41gh7 for the idea.	2023-12-01 17:25:01 +02:00
Aliaksandr Valialkin	487f6380d0	lib/promscrape: show dropped targets because of sharding at /service-discovery page Previously the /service-discovery page didn't show targets dropped because of sharding ( https://docs.victoriametrics.com/vmagent.html#scraping-big-number-of-targets ). Show also the reason why every target is dropped at /service-discovery page. This should improve debugging why particular targets are dropped. While at it, do not remove dropped targets from the list at /service-discovery page until the total number of targets exceeds the limit passed to -promscrape.maxDroppedTargets . Previously the list was cleaned up every 10 minutes from the entries, which weren't updated for the last minute. This could complicate debugging of dropped targets. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5389	2023-12-01 16:48:48 +02:00
Hui Wang	1911320c86	vmalert-tool: fix alert_rule_test case when eval_time is not multiple of evaluation_interval (#5387 ) Co-authored-by: hagen1778 <roman@victoriametrics.com>	2023-12-01 12:17:24 +01:00
Aliaksandr Valialkin	8eddccfbb4	all: expose additional metrics for simplifying debugging of VictoriaMetrics components Updates https://github.com/VictoriaMetrics/metrics/issues/54	2023-11-30 02:06:54 +02:00
Aliaksandr Valialkin	ac65c6b178	lib/promrelabel: add `keep_if_contains` and `drop_if_contains` relabeling actions	2023-11-29 12:22:43 +02:00
Nikolay	41f7940f97	lib/streamaggr: properly reference slice with labels (#5406 ) * lib/streamaggr: properly reference slice with labels by limiting slice capacity. It must fix issues with slice modification: in case of append, a new slice will be allocated instead of modifying the referenced slice https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5402 * Reduce memory allocations when output_relabel_configs adds new labels to output samples --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-11-29 10:03:04 +02:00
hagen1778	98d0f81f21	lib/protoparser/datadog: follow-up after `543f218fe9` * prevent /api/v1 from panic on parsing rows * add tests for Extract function for v1 and v2 api's * separate request types in different pools to prevent different objects mixing * add changelog line `543f218fe9` Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-11-28 15:04:15 +01:00
hagen1778	5424632ba3	docs: mention contributor of PR 5368 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-11-28 11:55:05 +01:00
luckyxiaoqiang	d7897e0d70	app/vmselect/promql: add day_of_year() function (#5368 ) Co-authored-by: dingxiaoqiang <dingxiaoqiang@bytedance.com> Co-authored-by: Roman Khavronenko <roman@victoriametrics.com>	2023-11-28 11:54:00 +01:00
Aliaksandr Valialkin	5034aa0773	app/vmagent: follow-up for `090cb2c9de` - Add Try* prefix to functions, which return bool result in order to improve readability and reduce the probability of missing check for the result returned from these functions. - Call the adjustSampleValues() only once on input samples. Previously it was called on every attempt to flush data to persistent queue. - Properly restore the initial state of WriteRequest passed to tryPushWriteRequest() before returning from this function after unsuccessful push to persistent queue. Previously a part of WriteRequest samples may be lost in such case. - Add -remoteWrite.dropSamplesOnOverload command-line flag, which can be used for dropping incoming samples instead of returning 429 Too Many Requests error to the client when -remoteWrite.disableOnDiskQueue is set and the remote storage cannot keep up with the data ingestion rate. - Add vmagent_remotewrite_samples_dropped_total metric, which counts the number of dropped samples. - Add vmagent_remotewrite_push_failures_total metric, which counts the number of unsuccessful attempts to push data to persistent queue when -remoteWrite.disableOnDiskQueue is set. - Remove vmagent_remotewrite_aggregation_metrics_dropped_total and vm_promscrape_push_samples_dropped_total metrics, because they are replaced with vmagent_remotewrite_samples_dropped_total metric. - Update 'Disabling on-disk persistence' docs at docs/vmagent.md - Update stale comments in the code Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5088 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2110	2023-11-25 12:09:44 +02:00
Nikolay	090cb2c9de	app/vmagent: allow disabling on-disk persistence (#5088 ) * app/vmagent: allow disabling on-disk queue Previously, it wasn't possible to build a data processing pipeline with a chain of vmagents. When remoteWrite for the last vmagent in the chain wasn't accessible, it persisted data only while it had enough disk capacity. When the disk queue was full, it started to silently drop ingested metrics. New flags allow disabling on-disk persistence and immediately returning an error if remoteWrite is not accessible anymore. It blocks any writes and notifies the client that data ingestion isn't possible. Main use case for this feature - use an external queue such as Kafka for data persistence. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2110 * adds test, updates readme * apply review suggestions * update docs for vmagent * makes linter happy --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-11-24 13:42:11 +01:00
Aliaksandr Valialkin	2cd9cda12c	docs: make more visible that the maximum JSON line length, which is accepted by /api/v1/import, is limited by -import.maxLineLen command-line flag value This is a follow-up for `0cf55ded34` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5364	2023-11-24 13:12:51 +02:00
Roman Khavronenko	0cf55ded34	lib/protoparser: decrease `import.maxLineLen` from 100MB to 10MB (#5364 ) Tests showed that importing a single line with 70MB size takes 5.3GiB RSS memory for VictoriaMetrics single-node. In the scenario when user exports and imports data from one VM to another, it could possibly lead to OOM exception for destination VM. Importing a single line with 16MB size takes 1.3GiB RSS memory. Hence, the limit for `import.maxLineLen` was decreased from 100MB to 10MB to improve reliability of VictoriaMetrics during imports. Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-11-24 12:53:04 +02:00
Aliaksandr Valialkin	06d2d933fb	docs/CHANGELOG.md: document Google PubSub support at vmagent (see `752f89f13f` )	2023-11-23 21:13:46 +02:00
Aliaksandr Valialkin	1831c731a3	app/vmagent/remotewrite: do not drop persistent queues when -remoteWrite.multitenantURL is set It is unsafe to drop persistent queues when -remoteWrite.multitenantURL command-line flag is set, since these queues are created on demand when a new sample for the given tenant is pushed to the remote storage. This addresses https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5357 The issue appeared in the commit `f3a51e8b1d` when implementing https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4014	2023-11-23 20:40:39 +02:00
Hui Wang	ae3107153c	lib/protoparser/promremotewrite: fall back to zstd decoding if Snappy-decoding fails (#5344 ) This case is possible after the following steps: 1. vmagent successfully performed handshake with the -remoteWrite.url and the remote storage supports zstd-compressed data. 2. remote storage became unavailable or slow to ingest data, vmagent compresses the collected data into blocks with zstd and puts these blocks to persistent queue on disk. 3. vmagent restarts and the remote storage is unavailable during the handshake, then vmagent falls back to Snappy compression. 4. vmagent starts sending zstd-compressed data from persistent queue to the remote storage, while falsely advertising it sends Snappy-compressed data. 5. The remote storage receives zstd-compressed data and fails unpacking it with Snappy. The solution is the same as `12cd32fd75`, just fall back to zstd decompression if Snappy decompression fails.	2023-11-17 15:51:09 +01:00
Aliaksandr Valialkin	3545633934	docs/CHANGELOG.md: cut v1.95.1	2023-11-16 20:31:59 +01:00
Aliaksandr Valialkin	2ea03cf80d	lib/handshake: add SetReadDeadline and SetWriteDeadline implementations additionally to SetDeadline This is a follow-up for `27a5461785` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5327	2023-11-16 16:48:05 +01:00
Roman Khavronenko	1fbd0dd9d8	lib/handshake: check for deadline in Read and Write methods (#5327 ) The buffered connection could have exceeded the underlying connection deadline during reading or writing to an internal buffer. With this change, buffered connection struct additionally checks for a deadline in Read/Write methods. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-11-16 16:47:46 +01:00
Aliaksandr Valialkin	61035419d5	docs/CHANGELOG.md: remove duplicate word `query` after `2cbdb1db22`	2023-11-16 16:24:03 +01:00
Aliaksandr Valialkin	2cbdb1db22	app/vmselect/promql: properly handle duplicate series when merging cached results with the results obtained from the database evalRollupFuncNoCache() may return time series with identical labels (aka duplicate series) when performing queries satisfying all the following conditions: - It must select time series with multiple metric names. For example, {__name__=~"foo|bar"} - The series selector must be wrapped into rollup function, which drops metric names. For example, rate({__name__=~"foo|bar"}) - The rollup function must be wrapped into aggregate function, which has no streaming optimization. For example, quantile(0.9, rate({__name__=~"foo|bar"})) In this case VictoriaMetrics shouldn't return `cannot merge series: duplicate series found` error. Instead, it should fall back to query execution with disabled cache. Also properly store the merged results. Previously they were incorrectly stored because of a typo introduced in the commit `41a0fdaf39` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5332 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5337	2023-11-16 16:01:40 +01:00
hagen1778	d389a4fcf3	dashboards: use `version` instead of `short_version` in annotations `short_version` label won't show the difference if various flavors of the same version were deployed. But `version` will. For example, on the sandbox env we test VM builds before new version release. Without this change, the version update won't be visible on dashboard. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-11-16 09:26:47 +01:00
Aliaksandr Valialkin	91f5c24f82	docs/CHANGELOG.md: cut v1.95.0 release	2023-11-15 17:45:52 +01:00
Aliaksandr Valialkin	741013a33f	docs/CHANGELOG.md: document v1.93.8 LTS release	2023-11-15 17:12:44 +01:00
Aliaksandr Valialkin	5bfa2a3e97	docs/CHANGELOG.md: document v1.87.11 LTS release	2023-11-15 15:53:05 +01:00
Aliaksandr Valialkin	6a533023b1	docs/CHANGELOG.md: consistently prepend command-line flags with a single dash	2023-11-14 21:44:19 +01:00
hagen1778	feff13851c	docs: clarify vmalert flag changes Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-11-14 21:18:58 +01:00
Nikolay	3121d76bee	lib/querytracer: makes package concurrent safe to use (#5322 ) * lib/querytracer: makes package concurrent safe to use it must fix various issues with concurrent code usage. Especially, when it's not reasonable to wait for all goroutines to be finished * wip --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-11-14 20:59:08 +01:00
hagen1778	d3ae2b2f62	dashboards: update description for RSS and anonymous memory panels to be consistent for single-node, cluster and vmagent dashboards. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-11-14 09:50:06 +01:00
hagen1778	d6ae082598	deployment/dashboards: respect `job` and `instance` filters for `alerts` annotation in cluster and single-node dashboards Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-11-14 09:38:15 +01:00
Aliaksandr Valialkin	43e3302803	docs/CHANGELOG.md: document `0e056ddb2d` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5203	2023-11-14 01:24:05 +01:00
Zakhar Bessarab	37997abd14	vmcluster: re-routing enhancement (#5293 ) * app/vmstorage: close vminsert connections gradually before stopping storage Implements graceful shutdown approach suggested here - https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4922#issuecomment-1768146878 Test results for this can be found here - https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4922#issuecomment-1790640274 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * app/vmstorage: update graceful shutdown logic - close connections from vminsert in deterministic order - update flag description - lower default timeout to 25 seconds. 25 seconds value was chosen because the lowest default value used in default configuration deployments is 30s (default value in Kubernetes and ansible-playbooks). Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * docs/cluster: add information about re-routing enhancement during restart Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * docs/changelog: add entry for new command-line flag Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * {app/vmstorage,lib/ingestserver}: address review feedback Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * docs/cluster: add note to update workload scheduler timeout Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * wip --------- Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-11-14 01:03:44 +01:00
Aliaksandr Valialkin	8eed04b2c6	app/vmauth: add ability to drop the specified number of `/`-delimited prefix parts from request path This can be done via `drop_src_path_prefix_parts` option at `url_map` and `user` levels. See https://docs.victoriametrics.com/vmauth.html#dropping-request-path-prefix	2023-11-13 22:32:22 +01:00
Aliaksandr Valialkin	0feaeca3c1	lib/protoparser/promremotewrite: fall back to Snappy decoding if zstd decoding fails This case is possible after the following steps: 1. vmagent tries to perform handshake with the -remoteWrite.url in order to determine whether the remote storage supports zstd-compressed data. 2. The remote storage is unavailable during the handshake. In this case vmagent falls back to Snappy compression for the data sent to the remote storage. 3. vmagent compresses the collected data into blocks with Snappy and puts these blocks to persistent queue on disk. 4. The remote storage becomes available. 5. vmagent restarts, performs the handshake with the remote storage and detects that it supports zstd-compressed data. 6. vmagent starts sending Snappy-compressed data from persistent queue to the remote storage, while falsely advertizing it sends zstd-compressed data. 7. The remote storage receives Snappy-compressed data and fails unpacking it with zstd. The solution is to just fall back to Snappy decompression if zstd decompression fails. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5301	2023-11-13 21:19:08 +01:00
Aliaksandr Valialkin	8af56ea2ed	lib/htmlcomponents: use relative links for the top page and for favicon.ico This allows hiding VictoriaMetrics components behind proxies with arbitrary path prefixes. For example, vmagent HTTP handlers can be served via /vmagent/ path prefix: - http://proxy/vmagent/targets - http://proxy/vmagent/service-discovery The path prefix can be arbitrary. For example, below are vmagent urls for /tenantID/vmagent/ path prefix: - http://proxy/tenantID/vmagent/targets - http://proxy/tenantID/vmagent/service-discovery While at it, consistently serve favicon.ico from any path directory. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5306 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5307	2023-11-13 20:29:05 +01:00
Aliaksandr Valialkin	3e93fa61ad	lib/regexutil: properly handle alternate regexps surrounded by .+ or .* Previously the following regexps were improperly handled: .+foo|bar.+ and .*foo|bar.* This could lead to unexpected regexp match results. See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5297 Thanks to @Haleygo for the initial attempt to fix the issue at https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5308	2023-11-13 18:23:38 +01:00
Aliaksandr Valialkin	ba058a4514	docs/CHANGELOG.md: remove trailing whitespace after `bffd30b57a`	2023-11-13 09:24:29 +01:00
Aliaksandr Valialkin	eded218e8c	app/vmauth: properly pass `Host` header to backends Previously the `Host` header remained unchanged when passing it in requests to backends. This may work improperly if the backend uses host-based routing. While at it, allow http/2.0 requests to backends. While VictoriaMetrics components do not accept http/2.0 requests, other backends can require such requests. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5240	2023-11-13 09:05:39 +01:00
Aliaksandr Valialkin	61594d2bd8	app/vmauth: follow-up for `323f3720ed` - Re-use identically configured http.Transport across multiple users. This fixes handling of the limit on the number of connection, which can be established per each backend via -maxIdleConnsPerBackend command-line flag. This limit stopped working after `323f3720ed` - Add docs about backend TLS setup at https://docs.victoriametrics.com/vmauth.html#backend-tls-setup - Add ability to disable backend TLS verification for all the users via -backend.tlsInsecureSkipVerify command-line flag. This flag may be useful when -auth.config contains big number of users, and every user must disable backend TLS verification. - Add ability to specify TLS Root CA via tls_ca_file option at per-user basis and via -backend.tlsCAFile command-line flag across all the users. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5240	2023-11-13 08:33:10 +01:00
Aliaksandr Valialkin	bfec8a3751	app/vmauth: improve docs a bit after `323f3720ed` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5240	2023-11-11 12:49:28 +01:00
Aliaksandr Valialkin	230230cf0b	lib/logger: add `-loggerMaxArgLen` command-line flag for fine-tuning the maximum length of logged args	2023-11-11 12:30:08 +01:00
Aliaksandr Valialkin	80213f07fa	app/vmselect/promql: optimize instant queries with min_over_time() and max_over_time() rollup functions This is a follow-up for `41a0fdaf39`	2023-11-11 12:10:03 +01:00
Aliaksandr Valialkin	2db1a664e1	deployment: update Go builder from Go1.21.3 to Go1.21.4 See https://github.com/golang/go/issues?q=milestone%3AGo1.21.4+label%3ACherryPickApproved	2023-11-10 22:28:44 +01:00
Aliaksandr Valialkin	010dc15d16	lib/blockcache: do not cache entries, which were attempted to be accessed 1 or 2 times Previously entries which were accessed only 1 time weren't cached. It turned out that some rarely executed heavy queries may read indexdb block twice in a row instead of once. There is no need in caching such a block then. This change should eliminate cache size spikes for indexdb/dataBlocks when such heavy queries are executed. Expose -blockcache.missesBeforeCaching command-line flag, which can be used for fine-tuning the number of cache misses needed before storing the block in the cache.	2023-11-10 22:28:03 +01:00
Zakhar Bessarab	73a1862182	docs/changelog: document vmbackupmanager bugfix (#5303 ) Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2023-11-08 18:51:14 +01:00
Roman Khavronenko	bffd30b57a	app/vmalert: update remote-write process (#5284 ) * app/vmalert: update remote-write process * automatically retry remote-write requests on closed connections. The change should reduce the amount of logs produced in environments with short-living connections or environments without support of keep-alive on network balancers. * increment `vmalert_remotewrite_errors_total` metric if all retries to send remote-write request failed. Before, this metric was incremented only if remote-write client's buffer is overloaded. * increment `vmalert_remotewrite_dropped_rows_total` and `vmalert_remotewrite_dropped_bytes_total` metrics if remote-write client's buffer is overloaded. Before, these metrics were incremented only after unsuccessful HTTP calls. Signed-off-by: hagen1778 <roman@victoriametrics.com> * Update docs/CHANGELOG.md --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: Hui Wang <haley@victoriametrics.com>	2023-11-08 14:53:07 +08:00
Yury Molodov	f90d2ec843	vmui: display query error on Explore metrics page (#5272 ) https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5202	2023-11-03 16:23:19 +01:00
Zakhar Bessarab	323f3720ed	app/vmauth: add option to skip TLS verification (#5256 ) Add `tls_insecure_skip_verify` option on per-user basis which allows to disable TLS verification for all requests to backend on behalf of this user. See: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5240 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2023-11-03 12:04:17 +01:00
Aliaksandr Valialkin	65db6609eb	docs/CHANGELOG.md: update the description of the optimization for SLO/SLI-like queries according to latest changes See commits `4497a08e3d` and `92826b0b4a`	2023-11-02 20:05:05 +01:00
Roman Khavronenko	b5254199c6	app/vmalert: add label `file` pointing to the group's filename to metrics (#5281 ) The filename should help identifying alerting rules belonging to specific groups with identical names but different filenames. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5267 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-11-02 16:01:31 +01:00
Hui Wang	90d45574bf	vmalert: reduce restore query request for each alerting rule (#5265 ) reduce the number of queries for restoring alerts state on start-up. The change should speed up the restore process and reduce pressure on `remoteRead.url`.	2023-11-02 15:22:13 +01:00
Aliaksandr Valialkin	dd33fc0c76	docs/CHANGELOG.md: typo fix: tis -> this	2023-11-02 08:33:40 +01:00
Aliaksandr Valialkin	87a86ec9db	docs/CHANGELOG.md: document v1.93.7 LTS release	2023-11-02 08:21:00 +01:00
Aliaksandr Valialkin	ed70a40669	app/vmagent/remotewrite: add -remoteWrite.shardByURL.labels command-line flag This command-line flag can be used for specifying a list of labels used for sharding among -remoteWrite.url entries when -remoteWrite.shardByURL command-line flag is set. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4942	2023-11-01 23:08:54 +01:00
Alexander Marshalov	828ddd4e4f	vmauth: add browser authorization request for http requests without… (#5234 ) * vmauth: add browser authorization request for http requests without credentials to a route that is not in the `unauthorized_user` section (when `unauthorized_user` is specified). * add link to issue in CHANGELOG * Extend vmauth docs * wip --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-11-01 20:59:46 +01:00
Aliaksandr Valialkin	da887b49e7	app/vmui: show query execution duration in the header of query input field This should simplify the process of query optimization	2023-11-01 16:43:51 +01:00
Hui Wang	e482eeff58	vmalert: support specifying full http url in notifier static_configs target (#5261 ) * vmalert: support specifying full http or https urls in notifier static_configs target address * show right label results in ui	2023-11-01 19:53:50 +08:00
Aliaksandr Valialkin	c4c6ee9485	app/vmui: fix non-working `Disable cache` checkbox at `JSON` and `Table` views	2023-10-31 22:58:06 +01:00
Aliaksandr Valialkin	ea81f6fc36	app/vmselect/promql: add outliers_iqr(q) and outlier_iqr_over_time(m[d]) functions These functions allow detecting anomalies in series and samples using Interquartile range method. See Outliers section at https://en.wikipedia.org/wiki/Interquartile_range for more details.	2023-10-31 22:10:31 +01:00
Aliaksandr Valialkin	41a0fdaf39	app/vmselect/promql: optimize repeated SLI-like instant queries with lookbehind windows >= 1d Repeated instant queries with long lookbehind windows, which contain one of the following rollup functions, are optimized via partial result caching: - sum_over_time() - count_over_time() - avg_over_time() - increase() - rate() The basic idea of optimization is to calculate rf(m[d] @ t) as rf(m[offset] @ t) + rf(m[d] @ (t-offset)) - rf(m[offset] @ (t-d)) where rf(m[d] @ (t-offset)) is cached query result, which was calculated previously The offset may be in the range of up to 1 hour.	2023-10-31 19:25:23 +01:00
Aliaksandr Valialkin	714af89b13	lib/httpserver: follow-up for `0638bbe69c` - Replace spaces with underscores in the `reason` label value for the vm_http_request_errors_total metric in order be consistent with Prometheus-like naming - Clarify the description for the change at docs/CHANGELOG.md Updates https://github.com/victoriaMetrics/victoriaMetrics/issues/4590 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5166	2023-10-31 18:52:39 +01:00
Aliaksandr Valialkin	4ac95b6f49	docs/CHANGELOG.md: move the description for -http.header.* command-line flags from SECURITY to FEATURE The SECURITY label should be applied only to changes, which fix security issues. The change at `ad839aa492` adds new command-line flags, which can be used for improving security in some cases. They do not fix any security issues. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5111	2023-10-31 16:23:08 +01:00
hagen1778	f6208965ce	dashboards/cluster: fix description about `max` threshold for `Concurrent selects` panel. Before, it was mistakenly implying that `max` is equal to the double of available CPUs. Addresses https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5214 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-31 16:05:33 +01:00
Roman Khavronenko	a950873fff	app/vmselect: expose `vm_memory_intensive_queries_total` counter metric (#5208 ) The new metric gets increased each time `-search.logQueryMemoryUsage` memory limit is exceeded by a query. This metric should help to identify expensive and heavy queries without inspecting the logs. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-31 13:31:09 +01:00
hagen1778	a8051d48c4	docs: follow-up for `0638bbe69c` Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-31 12:54:30 +01:00
hagen1778	aaf9e3d526	dashboards/vmalert: add new panel `Missed evaluations` The new panel is supposed to indicate alerting groups that miss their evaluations. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-31 10:35:19 +01:00
hagen1778	9866974a53	deployment/alerts: add `TooManyMissedIterations` alerting rule The new rule for vmalert is supposed to detect groups that miss their evaluations due to slow queries. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-31 10:35:18 +01:00
hagen1778	8874b525b7	dashboards: fix `Errors rate to Alertmanager` filter The panel `Errors rate to Alertmanager` had `group` label filter applied to the expression, while the metric `vmalert_alerts_send_errors_total` doesn't have that label. This resulted into always empty results. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-31 10:16:45 +01:00
Hui Wang	abcb21aa5e	vmalert: fix alert firing state in replay mode (#5192 ) fix possible missing firing states for alerting rules in replay mode Before, if one firing stage was bigger than a single query request range, like a rule with a big `for`, the alerting rule couldn't be detected as firing. Co-authored-by: hagen1778 <roman@victoriametrics.com>	2023-10-30 13:54:18 +01:00
Dima Lazerka	ad839aa492	lib/httpserver: add flags to specify HSTS / Frame-Options / CSP headers for httpserver (#5111 ) support `Strict-Transport-Security`, `Content-Security-Policy` and `X-Frame-Options` HTTP headers in all VictoriaMetrics components. The values for headers can be specified by users via the following flags: `-http.header.hsts`, `-http.header.csp` and `-http.header.frameOptions`. Co-authored-by: hagen1778 <roman@victoriametrics.com>	2023-10-30 11:33:38 +01:00
Roman Khavronenko	29cebd82fb	lib/storage: log warning about RO mode only on state change (#5191 ) Before, vmstorage would log the same message each second producing excessive amount of logs. See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5159 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-30 10:52:57 +01:00
Aliaksandr Valialkin	632d788b63	lib/promscrape/discovery/kubernetes: stop all the url watchers, which belong to a particular groupWatcher, at once Previously url watchers for pod, service and node objects could be mistakenly closed when service discovery was set up only for endpoints and endpointslice roles, since watchers for these roles may start pod, service and node url watchers with nil apiWatcher passed to groupWatcher.startWatchersForRole(). Now all the url watchers, which belong to a particular groupWatcher, are stopped at once when this groupWatcher has no apiWatcher subscribers. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5216 The issue has been introduced in v1.93.5 when addressing https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4850	2023-10-27 13:51:35 +02:00
Hui Wang	7c90ce39cb	do not print redundant error logs when failed to scrape consul or no… (#5239 ) * do not print redundant error logs when failed to scrape consul or nomad target prometheus performs the same because it uses consul lib which just drops the error(`1806bcb38c/api/api.go (L1134)`)	2023-10-27 13:31:55 +08:00
Aliaksandr Valialkin	d5a599badc	lib/promauth: follow-up for `e16d3f5639` - Make sure that invalid/missing TLS CA file or TLS client certificate files at vmagent startup don't prevent from processing the corresponding scrape targets after the file becomes correct, without the need to restart vmagent. Previously scrape targets with invalid TLS CA file or TLS client certificate files were permanently dropped after the first attempt to initialize them, and they didn't appear until the next vmagent reload or the next change in other places of the loaded scrape configs. - Make sure that TLS CA is properly re-loaded from file after it changes without the need to restart vmagent. Previously the old TLS CA was used until vmagent restart. - Properly handle errors during http request creation for the second attempt to send data to remote system at vmagent and vmalert. Previously failed request creation could result in nil pointer dereferencing, since the returned request is nil on error. - Add more context to the logged error during AWS sigv4 request signing before sending the data to -remoteWrite.url at vmagent. Previously it could miss details on the source of the request. - Do not create a new HTTP client per second when generating OAuth2 token needed to put in Authorization header of every http request issued by vmagent during service discovery or target scraping. Re-use the HTTP client instead until the corresponding scrape config changes. - Cache error at lib/promauth.Config.GetAuthHeader() in the same way as the auth header is cached, e.g. the error is cached for a second now. This should reduce load on CPU and OAuth2 server when auth header cannot be obtained because of temporary error. - Share tls.Config.GetClientCertificate function among multiple scrape targets with the same tls_config. Cache the loaded certificate and the error for one second. This should significantly reduce CPU load when scraping big number of targets with the same tls_config. 
- Allow loading TLS certificates from HTTP and HTTPS urls by specifying these urls at `tls_config->cert_file` and `tls_config->key_file`. - Improve test coverage at lib/promauth - Skip unreachable or invalid files specified at `scrape_config_files` during vmagent startup, since these files may become valid later. Previously vmagent was exiting in this case. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4959	2023-10-25 23:19:37 +02:00
Aliaksandr Valialkin	eed5206376	lib/promauth: properly parse string contents for ca, cert and key fields at tls_config Previously yaml parser wasn't accepting string values for these fields, because it was mistakenly expecting a list of uint8 values instead.	2023-10-25 23:12:21 +02:00
hagen1778	a216fe6728	app/vmalert: follow-up after `c9375cac5e` Descriptions were updated in an attempt to make them clearer for readers, re-phrasing and linking missing docs. `eval_delay` was added to tests to verify it can be unmarshalled. `eval_delay` is now applied before timestamp alignment to make it more predictable. Before, if delay < interval the timestamp wouldn't be aligned. `eval_delay` and `eval_offset` were added to API output. `PreviouslySentSeriesToRW` converted to private `previouslySentSeriesToRW`. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-25 13:07:13 +02:00
Hui Wang	c9375cac5e	vmalert: add `-rule.evalDelay` flag and `eval_delay` as group attribute (#5185 ) Also mark `-datasource.lookback` as will be deprecated, see https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5155.	2023-10-25 11:54:18 +02:00
hagen1778	003ef3a518	deployment/alerts: make `TooHighMemoryUsage` more tolerable to spikes Using `min_over_time` should reduce the amount of false positives when component is running in near-the-threshold state. Now it should trigger only if all collected samples were above the threshold on 10m interval. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-24 09:39:46 +02:00
Alexander Marshalov	33484d3365	lib/streamaggr: respect `streamAgg.dropInput` with empty stream aggr config (#5213 ) https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5207	2023-10-20 15:55:58 +02:00
Roman Khavronenko	b8b6e120ff	app/vmselect: limit the number of parallel workers by 32 (#5195 ) * app/vmselect: limit the number of parallel workers by 32 The change should improve performance and memory usage during query processing on machines with big number of CPU cores. The number of parallel workers for query processing is controlled via `-search.maxWorkersPerQuery` command-line flag. By default, the number of workers is limited by the number of available CPU cores, but not more than 32. The limit can be increased via `-search.maxWorkersPerQuery`. Signed-off-by: hagen1778 <roman@victoriametrics.com> * wip - The `-search.maxWorkersPerQuery` command-line flag doesn't limit resource usage, so move it from the `resource usage limits` to `troubleshooting` chapter at docs/Single-server-VictoriaMetrics.md - Make more clear the description for the `-search.maxWorkersPerQuery` command-line flag - Add the description of `-search.maxWorkersPerQuery` to docs/Cluster-VictoriaMetrics.md - Limit the maximum value, which can be passed to `-search.maxWorkersPerQuery`, to GOMAXPROCS, because bigger values may worsen query performance and increase CPU usage - Improve the description of the change at docs/CHANGELOG.md. Mark it as FEATURE instead of BUGFIX, since it is closer to a feature than to a bugfix. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5087 --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-10-18 19:51:37 +02:00
hagen1778	fd2d07ba33	lib/storage: follow-up after `188cfe3a85` See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5159 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-17 15:45:14 +02:00
Hui Wang	e16d3f5639	fix inconsistent behaviors with prometheus when scraping (#5153 ) * fix inconsistent behaviors with prometheus when scraping 1. address https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4959. skip job with wrong syntax in `scrape_configs` with error logs instead of exiting; 2. show error messages on vmagent /targets ui if there are wrong auth configs in `scrape_configs`, previously will print error logs and do scrape without auth header; 3. don't send requests if there are wrong auth configs in: 1. vmagent remoteWrite; 2. vmalert datasource/remoteRead/remoteWrite/notifier. * add changelogs * address review comments * fix ut	2023-10-17 17:58:19 +08:00
hagen1778	c2d252c045	dashboards/vmalert: respect job and instance filters in `No data errors` Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-17 09:40:39 +02:00
hagen1778	edba9f6266	dashboards/vmalert: use `desc` sorting for tooltips on panels Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-17 09:31:09 +02:00
Aliaksandr Valialkin	14f3d844fe	docs/CHANGELOG.md: document v1.93.6 LTS release See https://github.com/VictoriaMetrics/VictoriaMetrics/releases/tag/v1.93.6	2023-10-17 00:53:18 +02:00
Aliaksandr Valialkin	daaf2b0e61	docs/CHANGELOG.md: document v1.87.10 release See https://github.com/VictoriaMetrics/VictoriaMetrics/releases/tag/v1.87.10	2023-10-16 23:25:38 +02:00
Aliaksandr Valialkin	da77f4deeb	app/vmselect/promql: add labels_equal(q, "label1", "label2", ...) function This function returns q series, which have identical values for the listed labels "label1", "label2", ... See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5148	2023-10-16 21:50:11 +02:00
Aliaksandr Valialkin	6c3dd16a16	app/vmagent/remotewrite: move sas var initialization closer to the place where it is used This makes the code slightly easier to understand. This is a follow-up for `1d3d989be5` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5170	2023-10-16 20:52:56 +02:00
Aliaksandr Valialkin	bdb743c88d	app/vmselect/promql: add drop_empty_series() function for dropping empty series before performing additional calculations This can be useful in the following queries: drop_empty_series(temperature <= 30) default 40 This query drops temperature series with all the values bigger than 30 on the selected time range, while replacing gaps in the remaining series with 40. The query without drop_empty_series: (temperature <= 30) default 40 would leave all the temperature series with all the values bigger than 30 on the selected time range, and replace all their values with 40. This is not what could be expected in some cases like here - https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5071	2023-10-16 20:44:56 +02:00
hagen1778	1d3d989be5	app/vmagent/remotewrite: follow-up after `4f102ff945` Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-16 16:00:24 +02:00
Alexander Marshalov	b248413a07	fixed error when creating a full backup using the `-origin` flag (#5180 ) * fixed error when creating a full backup using the `-origin` flag (#5144) * Update docs/CHANGELOG.md --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-10-16 12:02:51 +02:00
Roman Khavronenko	3594214a16	lib/vmselect: bump maxSearchQuerySize to 5MB (#5158 ) See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5154#issuecomment-1757216612 https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5154 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-15 19:24:38 +02:00
Artem Navoiev	f5c46b8176	docs fix bad links Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2023-10-14 14:56:06 +02:00
Haleygo	dc28196237	vmalert-tool: implement unittest (#4789 ) 1. split package rule under /app/vmalert, expose needed objects 2. add vmalert-tool with unittest subcmd https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2945	2023-10-13 13:54:33 +02:00
Aliaksandr Valialkin	930a36df40	app/vmui: small UX enhancements - Reduce vertical space usage, so more information is available on the screen without the need to scroll. - Show information for lines with higher values at the top of the legend under the graph. This should simplify graph analysis when it contains many lines.	2023-10-12 19:54:19 +02:00
Aliaksandr Valialkin	d984598e30	deployment/docker: update Go builder from Go1.21.1 to Go1.21.3 See https://github.com/golang/go/issues?q=milestone%3AGo1.21.2+label%3ACherryPickApproved and https://github.com/golang/go/issues?q=milestone%3AGo1.21.3+label%3ACherryPickApproved	2023-10-12 09:41:41 +02:00
Aliaksandr Valialkin	31f7ef0811	app/{vmselect,vlselect}: enable caching of static contents from /vmui/static/ folder at client side This should improve repeated VMUI page load times on slow networks See https://developer.chrome.com/docs/lighthouse/performance/uses-long-cache-ttl/	2023-10-12 09:33:40 +02:00
hagen1778	d43566605b	dashboards: fix vminsert/vmstorage/vmselect metrics filtering Fix vminsert/vmstorage/vmselect metrics filtering when dashboard is used to display data from many sub-clusters with unique job names. Before, only one specific job could have been accounted for component-specific panels, instead of all available jobs for the component. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-11 12:09:04 +02:00
Zakhar Bessarab	2fc7e9f47e	lib/backup: add `-deleteAllObjectVersions` command-line flag (#5147 ) New flag enforces removal of all versions of the object in remote object storage. See: - https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5121 - https://docs.victoriametrics.com/vmbackup.html#permanent-deletion-of-objects-in-s3-compatible-storages	2023-10-10 14:13:23 +02:00
Yury Molodov	6dc5306c9b	vmui: transfer Top Queries time interval #5097 (#5145 ) https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5097	2023-10-10 13:58:39 +02:00
Nikolay	1f91f22b5f	app/vmselect: reduce lock contention for heavy aggregation requests (#5119 ) reduce lock contention for heavy aggregation requests previously lock contention may happen on machine with big number of CPU due to enabled string interning. sync.Map was a choke point for all aggregation requests. Now instead of interning, new string is created. It may increase CPU and memory usage for some cases. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5087	2023-10-10 13:45:20 +02:00
Haleygo	2aa0f5fc41	vmalert: add `evalAlignment` for rule group and fix evaluation timestamp (#5066 ) * vmalert: add `query_time_alignment` for rule group 1. add `eval_alignment` attribute for group which by default is true. So group rule query stamp will be aligned with interval and propagated to ALERT metrics and the messages for alertmanager; 2. deprecate `datasource.queryTimeAlignment` flag. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5049	2023-10-10 12:41:19 +02:00
Dmytro Kozlov	244c887825	app/vmalert: hide sensitive info in the vmalert (#5059 ) Strip sensitive information such as auth headers or passwords from datasource, remote-read, remote-write or notifier URLs in log messages or UI. This behavior is enabled by default and is controlled via `-datasource.showURL`, `-remoteRead.showURL`, `-remoteWrite.showURL` or `-notifier.showURL` cmd-line flags. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5044	2023-10-10 11:40:27 +02:00
Yury Molodov	c5044cdba9	vmui: enhancement of autocomplete feature (#5051 ) https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4993 https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3006	2023-10-10 10:38:08 +02:00
Dmytro Kozlov	f60c08a7bd	app/(vminsert\|vmagent): add support for new relic infrastructure agent (#4712 ) Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2023-10-05 14:39:51 +02:00
Aliaksandr Valialkin	75dd7b30ba	lib/filestream: add `-filestream.disableFadvise` command-line flag for unconditional disabling of `fadvise` syscall This may be needed in rare cases when performing backups on systems with big number of CPU cores and big value passed to -concurrency command-line flag. See https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5120	2023-10-04 16:19:46 +02:00
hagen1778	de651165bd	alerting: account for `vmauth` component for alerts `ServiceDown` and `TooManyRestarts` Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-03 16:45:33 +02:00
Aliaksandr Valialkin	f13a96f42c	docs/CHANGELOG.md: cut v1.94.0	2023-10-02 22:33:35 +02:00
Yury Molodov	f39045eca6	vmui: add storage for query history (#5022 ) * vmui: add storage for query history * docs/vmui: add storage for query history	2023-10-02 21:41:03 +02:00
Roman Khavronenko	a4bd73ec7e	lib/promscrape: make concurrency control optional (#5073 ) * lib/promscrape: make concurrency control optional Before, `-maxConcurrentInserts` was limiting all calls to `promscrape.Parse` function: during ingestion and scraping. This behavior is incorrect. Cmd-line flag `-maxConcurrentInserts` should have effect only on ingestion. Since both pipelines use the same `promscrape.Parse` function, we extend it to make concurrency limiter optional. So caller can decide whether concurrency should be limited or not. This commit makes `c53b5788b4` obsolete. Signed-off-by: hagen1778 <roman@victoriametrics.com> * Revert "dashboards: move `Concurrent inserts` panel to Troubleshooting section" This reverts commit `c53b5788b4`. --------- Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-02 21:32:11 +02:00
Dmytro Kozlov	34961dd4b8	app/vmagent: fix check of the DataDog agent path requests when requests have trailing slashes (#5106 ) * app/vmagent: fix check of the DataDog agent path requests when requests have trailing slashes * app/vmagent: fix CHANGELOG.md description * wip * wip --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-10-02 21:18:03 +02:00
Aliaksandr Valialkin	859977d591	Revert "lib/promscrape: add metric `vm_promscrape_scrapes_skipped_total` (#5074 )" This reverts commit `74301cdbf5`. Reason for revert: vmagent already provides better approach for detecting slow scrape targets via the following query: scrape_duration_seconds / scrape_timeout_seconds > 1 This query depends on automatically generated per-target metrics. See https://docs.victoriametrics.com/vmagent.html#automatically-generated-metrics for more details. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5074	2023-10-02 20:59:56 +02:00
Aliaksandr Valialkin	71668637ce	app/vmselect/promql: follow-up for `896c85a4a4` - Clarify the description of the change at docs/CHANGELOG.md - Make sure that bitmap_*(X, NaN) returns NaN Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4996 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5021	2023-10-02 20:08:26 +02:00
Roman Khavronenko	74301cdbf5	lib/promscrape: add metric `vm_promscrape_scrapes_skipped_total` (#5074 ) * lib/promscrape: add metric `vm_promscrape_scrapes_skipped_total` add metric `vm_promscrape_scrapes_skipped_total`to show whether vmagent skips the scrapes. This could happen if vmagent is overloaded or target is responding too slow for configured `scrape_interval`. The follow-up commit should add a corresponding alerting rule and panel to vmagent dashboard. Signed-off-by: hagen1778 <roman@victoriametrics.com> * deployment/docker: add `TooManyScrapeSkips` alerting rule for vmagent Signed-off-by: hagen1778 <roman@victoriametrics.com> * dashboards: add panels `Scrape duration 0.99 quantile` and `Skipped scrapes` to vmagent dashboard Signed-off-by: hagen1778 <roman@victoriametrics.com> --------- Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-02 17:12:12 +02:00
Aliaksandr Valialkin	5e49b72126	docs/CHANGELOG.md: follow-up for `f0e33700fc` Mention that the statistic inaccuracy is related to cardinality explorer	2023-10-01 21:33:31 +02:00
Aliaksandr Valialkin	859859aa1c	app/vmagent: follow-up for `cfef814750` - Properly handle /insert/multitenant/api/put url for opentsdb handler at vmagent - Document that the bug has been introduced in v1.93.2 at docs/CHANGELOG.md - Add a link to multitenant url docs in bugfix description Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5061 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4910	2023-10-01 21:09:32 +02:00
Dmytro Kozlov	896c85a4a4	app/vmselect: fix bitmap_*() functions behavior (#5021 ) Related issue: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4996 Signed-off-by: dmitryk-dk d.kozlov@victoriametrics.com Signed-off-by: dmitryk-dk d.kozlov@victoriametrics.com Co-authored-by: Nikolay <nik@victoriametrics.com>	2023-09-29 12:03:01 +02:00
Dmytro Kozlov	f0e33700fc	vmui: update information about tsdb usage in cluster version (#5004 ) * vmui: update information about tsdb usage in cluster version * vmui: cleanup * vmui: add CHANGELOG.md * vmui: cleanup * vmui: update logic, move information to the visible place * app/vmui: remove values fetch, update documentation for cardinality explorer * app/vmui: update CHANGELOG.md	2023-09-29 11:47:45 +02:00
hagen1778	c53b5788b4	dashboards: move `Concurrent inserts` panel to Troubleshooting section Moved because this panel is related to both: scraped and ingested data. Before, it could have given a misleading impression that it is related to ingested metrics only. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-09-26 14:26:40 +02:00
Alexander Marshalov	34a9d1f818	fixed ingestion via multitenant url for opentsdbhttp (#5061 ) (#5064 )	2023-09-26 11:18:34 +02:00
Roman Khavronenko	4d1b572f46	Docker add vmauth (#5057 ) * docker-compose: add vmauth to cluster env vmauth acts as a balancer and used as an example of how to interconnect VM components via vmauth. Signed-off-by: hagen1778 <roman@victoriametrics.com> * docker-compose: add vmauth to cluster env vmauth acts as a balancer and used as an example of how to interconnect VM components via vmauth. Signed-off-by: hagen1778 <roman@victoriametrics.com> --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: Nikolay <nik@victoriametrics.com>	2023-09-26 10:50:10 +02:00
Aliaksandr Valialkin	717c53af27	lib/storage: stop exposing vm_merge_need_free_disk_space metric This metric confuses users and provides no useful information. See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/686#issuecomment-1733844128	2023-09-25 16:52:39 +02:00
Aliaksandr Valialkin	3b9605dba5	app/vmselect/promql: do not sort `q1 or q2` results This makes sure that `q2` series are returned after `q1` series in the same way as Prometheus does See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4763	2023-09-25 16:14:16 +02:00
Aliaksandr Valialkin	a740159541	app/vmselect/promql: completely substitute median_over_time() WITH template with regular median_over_time() rollup function This is a follow-up for `34d7a670d0` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5034	2023-09-25 15:28:12 +02:00
Zakhar Bessarab	34d7a670d0	app/vmselect/promql: add implementation of median_over_time for rollup functions list (#5042) `median_over_time` is handled by predefined WITH template in MetricsQL library which translates it to `quantile_over_time(0.5)` This makes it impossible to use `median_over_time` as a usual rollup function for `aggr_over_time`. See: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5034 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2023-09-25 14:01:00 +02:00
Roman Khavronenko	ec50375991	docs/changelog: add link to sandbox (#5050 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-09-25 14:00:41 +02:00
Zakhar Bessarab	8d99c12a7d	lib/promscrape/discovery/kubernetes: suppress context.Cancelled error in logs (#5048) lib/promscrape/discovery/kubernetes: suppress context.Cancelled error in logs It is possible that context.Cancelled will appear after k8s watcher was closed due to reload (see https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4850). Logging an error misinforms user and looks like vmagent discovery will stop working even though this does not affect discovery. Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2023-09-22 13:01:33 +02:00
Zakhar Bessarab	760cdcec68	lib/backup: fix issue with inconsistent copying of appliedRetention.txt (#5027 ) * lib/backup: fix issue with inconsistent copying of appliedRetention.txt appliedRetention.txt can be modified in place, so it should be always copied just the same as parts.json Updates: https://github.com/victoriaMetrics/victoriaMetrics/issues/5005 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * docs: add changelog entry for appliedRetention.txt copying fix Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> --------- Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2023-09-21 11:25:19 +02:00
Roman Khavronenko	462c918251	app/vmauth: update config reload routine (#5019 ) * expose metrics `vmauth_config_last_reload_` for tracking the state of config reloads, similarly to vmagent/vmalert components. do not print logs like `SIGHUP received...` once per configured `-configCheckInterval` cmd-line flag. This log will be printed only if config reload was invoked manually. * prevent configuration reloading if there were no changes in config. This improves memory usage when `-configCheckInterval` cmd-line flag is configured and config has extensive list of regexp expressions requiring additional memory on parsing. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-09-20 15:04:52 +02:00
Aliaksandr Valialkin	28aed4d098	docs/CHANGELOG.md: publish changes for v1.93.5	2023-09-19 10:50:25 +02:00
Aliaksandr Valialkin	582f1f8fda	docs/CHANGELOG.md: clarify the description of bugfixes at `f7dda12b4d` and `b6ad581b45` This is a follow-up for `8b01bc4a5c` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/4999 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5009	2023-09-19 00:22:32 +02:00
Aliaksandr Valialkin	76af32d869	lib/promscrape/discovery/kubernetes: follow-up after `eeb862f3ff` - Move the bugfix description to the correct place in docs/CHANGELOG.md - Prevent from logging of 'context canceled' errors after the url watcher is stopped, since these errors are expected and may confuse users. - Remove unused urlWatcher.refCount field. - Remove unused urlWatcher.close() method. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4850	2023-09-18 17:06:39 +02:00
Aliaksandr Valialkin	4d01bc6d52	lib/backup: properly copy parts.json files inside indexdb directory additional to data directory This is a follow-up for `264ffe3fa1` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5005 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5006	2023-09-18 16:16:50 +02:00
Nikolay	8b01bc4a5c	docs: reflect recent changes at change logs (#5015 )	2023-09-18 08:22:10 +02:00
Zakhar Bessarab	eeb862f3ff	lib/promscrape/discovery/kubernetes: fix leaking api watcher (#4861) * lib/promscrape/discovery/kubernetes: fix leaking api watcher goroutine which was polling k8s API had no execution control. This led to leaking goroutines during config reload. See: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4850 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * lib/promscrape/discovery/kubernetes: use reference counting for urlWatcher cleanup Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * lib/promscrape/discovery/kubernetes: remove waitgroup sync for goroutines polling API server This is unnecessary since the context is cancelled and new requests will not be sent. Also, using waitgroup will increase time required to perform reload which might result in missed scrapes. Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * lib/promscrape/discovery/kubernetes: clarify comment Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * Apply suggestions from code review * lib/promscrape/discovery/kubernetes: address review feedback Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> --------- Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> Co-authored-by: Nikolay <nik@victoriametrics.com>	2023-09-15 19:40:13 +02:00
Zakhar Bessarab	264ffe3fa1	lib/backup: force copying of parts.json (#5006 ) * lib/backup: force copying of parts.json Copying of parts.json is required because `part.key()` comparison can create same key value for files with different contents. This will result in inconsistent backup being created or restored. See: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5005 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * lib/backup: ensure parts.json is only copied once Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> --------- Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> Co-authored-by: Nikolay <nik@victoriametrics.com>	2023-09-15 19:04:38 +02:00
Zakhar Bessarab	2a362e7397	docs: add changelog entry for downsampling.period and dedup.minScrapeInterval verification (#5000 ) * docs: add changelog entry for downsampling.period and dedup.minScrapeInterval verification - added changelog entry - documented requirements for dedup.minScrapeInterval and downsampling.period being multiples of each other Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * docs: `make docs-sync` Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> --------- Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2023-09-15 15:14:16 +02:00
Dmytro Kozlov	d5f9619984	vmagent: add validation of MetricsQL functions (#4991 ) Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-09-15 13:15:23 +02:00
Aliaksandr Valialkin	151f363552	docs/CHANGELOG.md: document v1.87.9	2023-09-10 21:41:23 +02:00
Aliaksandr Valialkin	bb8eda0b0f	docs/CHANGELOG.md: document v1.93.4	2023-09-10 19:47:38 +02:00
Aliaksandr Valialkin	0bbc6a5b43	app/vmagent/remotewrite: fix data race when extra labels are added to samples before sending them to multiple remote storage systems See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4972	2023-09-08 23:24:00 +02:00
Aliaksandr Valialkin	a315694dd9	app/vmauth: add ability to specify response status codes for retrying requests during load-balancing Response status codes for retrying can be specified via retry_status_codes list See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4893	2023-09-08 23:23:15 +02:00
Roman Khavronenko	6351d07da8	vmalert: correctly add duplicated params to the query (#4955 ) Fix the bug when Group's `params` fields with multiple values were overriding each other instead of adding up. The bug was introduced in this commit `eccecdf177` starting from v1.91.1 https://github.com/VictoriaMetrics/VictoriaMetrics/releases/tag/v1.91.1 https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4908 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-09-08 09:32:48 +02:00
Aliaksandr Valialkin	b80d338287	app/vmauth: retry requests at other backends on 5xx response status codes This should allow implementing high availability scheme described at https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4792#issuecomment-1674338561 See also https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4893	2023-09-08 00:46:37 +02:00
Aliaksandr Valialkin	dd10f94951	app/vmselect: return 503 status code when partial responses are denied and some of vmstorage nodes are temporarily unavailable This should help detecting this case and automatic retrying the query at healthy cluster replica in another availability zone. This commit is needed as a preparation for automatic query retry at another backend at vmauth on 5xx errors as described at https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4792#issuecomment-1674338561	2023-09-07 16:11:39 +02:00
Aliaksandr Valialkin	9de440c803	lib/logger: increase the maximum log arg size from 200 to 500 The 200 chars limit turned out to be too small for typical log messages emitted by VictoriaMetrics components This is a follow-up for `87fea7d8ac`	2023-09-07 16:11:08 +02:00
Aliaksandr Valialkin	87fea7d8ac	lib/logger: limit the maximum arg length, which can be emitted to log lines This should prevent from emitting too long lines when too long args are passed to logger.* functions. For example, too long MetricsQL queries or too long data samples.	2023-09-07 15:22:46 +02:00
Aliaksandr Valialkin	9bccc5aab2	docs/CHANGELOG.md: return back accidentally deleted line at `45c0e4bb31`	2023-09-07 12:03:04 +02:00
Aliaksandr Valialkin	2dc33e0ddc	all: update Go builder from Go1.21.0 to Go1.21.1 See https://github.com/golang/go/issues?q=milestone%3AGo1.21.1+label%3ACherryPickApproved	2023-09-07 11:36:16 +02:00
Aliaksandr Valialkin	5f85dd7f80	docs/CHANGELOG.md: clarify the scope of recent bugfixes	2023-09-07 11:25:11 +02:00
Aliaksandr Valialkin	448baf12a3	deployment/docker: properly build armv5 production builds for GOARCH=arm Pass GOARM=5 when building GOARCH=arm production builds, since the default value for this env var has been changed to GOARM=6 since Go1.21.0. See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4965 and https://github.com/golang/go/issues/62475	2023-09-07 11:18:53 +02:00
Haleygo	45c0e4bb31	vmalert: add `eval_offset` for group (#4693 ) Adds `eval_offset` attribute for Groups. If specified, Group will be evaluated at the exact time offset on the range of [0...evaluationInterval]. The setting might be useful for cron-like rules which must be evaluated at specific moments of time. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3409 Signed-off-by: Haley Wang <pipilong.25@gmail.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2023-09-06 16:29:59 +02:00
Aliaksandr Valialkin	138e02da37	docs/CHANGELOG.md: document the bugfix at `7db72dd7e6` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4947	2023-09-06 12:17:21 +02:00
Yury Molodov	7b92f1d038	vmui: fix render heatmap (#4957 )	2023-09-06 10:26:45 +02:00
hagen1778	f9e47a9abe	docs: fix broken link in vmctl references Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-09-04 12:45:46 +02:00
Yury Molodov	d19072a2d9	feat: add the option to see the latest queries (#4718 ) (#4759 ) Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-09-04 11:29:11 +02:00
Aliaksandr Valialkin	7a716095dc	docs/CHANGELOG.md: document 1.93.3 release	2023-09-02 10:21:20 +02:00
Aliaksandr Valialkin	82ccae1c02	docs/CHANGELOG.md: document v1.87.8	2023-09-02 01:54:07 +02:00
Nikolay	b9a5ea03fa	lib/vmselectapi: do not send empty label names for labelNames request (#4936 ) * lib/vmselectapi: do not send empty label names for labelNames request it breaks cluster communication, since vmselect incorrectly reads request buffer, leaving unread data on it https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4932 * typo fix * wip --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-09-01 23:26:43 +02:00
Aliaksandr Valialkin	8632683990	docs/CHANGELOG.md: document bugfix at `7c19d01e9a` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4870	2023-09-01 18:00:12 +02:00
Aliaksandr Valialkin	8847fbd34f	docs/CHANGELOG.md: document v1.93.2	2023-09-01 17:33:01 +02:00
Yury Molodov	c112dd7367	vmui: support for Prometheus data on the cardinality page (#4713 ) * feat: add cardinality support for prometheus (#4320) * docs/CHANGELOG.md: add cardinality support for prometheus --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-09-01 10:51:44 +02:00
Aliaksandr Valialkin	4bcc086965	app/vmauth: add tests for ResponseHeaders This is a follow-up for `b18eed3427` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4825	2023-09-01 09:21:12 +02:00
Alexander Marshalov	b18eed3427	vmauth: added ability to set and remove response headers (#4825 ) (#4914 ) * added ability to set and clear response headers (#4825) Signed-off-by: Alexander Marshalov <_@marshalov.org> * added ability to set and clear response headers (#4825) Signed-off-by: Alexander Marshalov <_@marshalov.org> * fix review comment Signed-off-by: Alexander Marshalov <_@marshalov.org> --------- Signed-off-by: Alexander Marshalov <_@marshalov.org>	2023-08-31 14:26:51 +02:00
Nikolay	dc4b974a48	app/vminsert: fixes readonly check (#4892) * app/vminsert: fixes readonly check Previously vminsert didn't check the readOnly state for vmstorage, since the check was never performed for a nil buffer. In this case, every 30 seconds the storage node lost its readonly state and received some data. It caused re-routing and a possible slowdown for ingestion https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4870 * wip --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-08-30 16:25:20 +02:00
Nikolay	00685b627f	lib/promscrape/k8s_sd: set resourceVersion to 0 by default for watch … (#4901) * lib/promscrape/k8s_sd: set resourceVersion to 0 by default for watch requests it must reduce load for kubernetes ETCD servers. Since requests without resourceVersion performs force cache sync at kubernetes API server with ETCD more info at https://kubernetes.io/docs/reference/using-api/api-concepts/#semantics-for-watch https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4855 * wip --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-08-30 16:03:41 +02:00
Aliaksandr Valialkin	1c0e065216	app/vmselect/promql: add support for `_` delimiters in numeric values For example, 1_234_567_890 is equivalent to 1234567890, while 1.234_567_890 is equivalent to 1.234567890	2023-08-30 14:33:41 +02:00
Zakhar Bessarab	137fa19d9c	app/vmselect: fix panic when using `/select/multitenant` endpoint (#4912 ) app/vmselect: fix panic when using `/select/multitenant` endpoint Such requests must be rejected as not found since vmselect does not support multitenant endpoint. See: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4910 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2023-08-30 14:16:09 +02:00
Nikolay	c37d7dd567	deployment/docker: disable provenance in buildx (#4911 ) * deployment/docker: disable provenance in buildx it must fix an issue with multi-platform manifest generation at buildx >= 0.10 backward compatibility was broken and generated image cannot be used with docker systems that doesn't support oci. disabling attestat temporary fixes it. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4907 https://docs.docker.com/build/attestations/slsa-provenance/ * Update docs/CHANGELOG.md --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-08-29 16:29:14 +02:00
Aliaksandr Valialkin	d087334049	app/{vminsert,vmselect}: follow-up after `2b7b3293c1` - Document the change at docs/CHANGELOG.md - Set the default value for -vmstorageUserTimeout to 3 seconds. This is much better than the 0 value, which means that TCP connection to unreachable vmstorage could block for up to 16 minutes. - Document -vmstorageUserTimeout at docs/Cluster-VictoriaMetrics.md	2023-08-29 12:18:53 +02:00
Roman Khavronenko	e8db78eaa4	dashboards: provide copies of Grafana dashboards alternated with Vict… (#4905 ) dashboards: provide copies of Grafana dashboards alternated with VictoriaMetrics datasource Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-08-29 11:06:55 +02:00
Aliaksandr Valialkin	379b92cc10	docs/CHANGELOG.md: add links to stream parsing mode in descriptions for `6e8611f301` and `6788704152`	2023-08-29 10:47:32 +02:00
Aliaksandr Valialkin	154c691f47	docs/CHANGELOG.md: remove unneeded `utm_source` and `utm_medium` query args in the link to Google Lighthouse Remove the line about consistent rounding of values in vmui, since it looks like it has been broken and needs to be returned back. See https://github.com/VictoriaMetrics/VictoriaMetrics/pull/4872#issuecomment-1696981947 for details. This is a follow-up for `e865989fa9`	2023-08-29 10:27:28 +02:00
Aliaksandr Valialkin	0e31415b34	docs/CHANGELOG.md: remove another blank line in order to reduce the difference with lts-1.93 branch	2023-08-29 09:48:27 +02:00
Aliaksandr Valialkin	58a6bb7bd1	docs/CHANGELOG.md: remove superfluous blank lines	2023-08-28 10:00:27 +02:00
Aliaksandr Valialkin	5b96a96535	docs/CHANGELOG.md: move the bugfix line into correct place after `ddf87b32ed`	2023-08-28 09:59:41 +02:00
Aliaksandr Valialkin	1b6d37b8e2	docs/CHANGELOG.md: explicitly mention that the bug in 1.93.0 may lead to data loss Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4873	2023-08-28 09:52:29 +02:00
Aliaksandr Valialkin	d8a4f01fe9	docs/CHANGELOG.md: return back the line accidentally deleted at `6abd575cbe` The line has been originally added in `481a2c70fd`	2023-08-28 09:46:42 +02:00
Aliaksandr Valialkin	e4fff13697	docs/CHANGELOG.md: clarify the description of `b7d07e5acf` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4817	2023-08-28 09:12:29 +02:00
Zakhar Bessarab	6e8611f301	lib/promscrape/client: sync timeout for HostClient and http.Client (#4889 ) Initially, stream parse mode was reading data from response and parsing it on flight. This was causing longer delay to read the whole response and required increasing timeout value to allow data processing while reading. So that `908e35affd` increased timeout value to fix this. But after `74c00a8762` response in stream parse mode is saved into memory and then parsed eliminating necessity of having timeout value higher that for usual scrape. Updates: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4847 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2023-08-25 15:47:11 +02:00
hagen1778	e865989fa9	docs: follow-up after `72167a697e` `72167a697e` Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-08-25 15:43:23 +02:00
Aliaksandr Valialkin	f1c2508243	lib/promscrape: add -promscrape.cluster.memberLabel command-line flag This flag allows specifying an additional label to add to all the scraped metrics. The flag must contain label name to add. The label value will be equal to -promscrape.cluster.memberNum. This functionality can help when there is a need to differentiate metrics scraped by distinct vmagent instances in the cluster according to https://docs.victoriametrics.com/vmagent.html#scraping-big-number-of-targets Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4247 See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4247#issuecomment-1692279393	2023-08-24 22:03:54 +02:00
hagen1778	4ebe8bb1d5	app/vmagent: follow-up after `6788704152` https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4884 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-08-24 11:36:42 +02:00
Roman Khavronenko	992a1c0a3a	vmagent: retry failed write request on the closed connection (#4857) * vmagent: retry failed write request on the closed connection Retry failed write request on the closed connection immediately, without waiting for backoff. This should improve data delivery speed and reduce amount of error logs emitted by vmagent when using idle connections. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4139 Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmagent: retry failed write request on the closed connection Re-instantiate request before retry as body could have been already spoiled. Signed-off-by: hagen1778 <roman@victoriametrics.com> --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: Nikolay <nik@victoriametrics.com>	2023-08-24 00:08:04 +02:00
Roman Khavronenko	ddf87b32ed	vmalert: correctly re-instantiate HTTP req on retries (#4864) * vmalert: correctly re-instantiate HTTP req on retries Previously, request retry to datasource re-used existing HTTP request. But if request object was already partially processed (body was read), then retry will be unsuccessful. The change re-instantiates HTTP request object before retry. Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: review fix Signed-off-by: hagen1778 <roman@victoriametrics.com> --------- Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-08-24 00:04:05 +02:00
hagen1778	59dee2e714	docs: mention 1.93.0 contains a bug Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-08-23 15:55:31 +02:00
Nikolay	6abd575cbe	docs: release docs 1.93.1 (#4875 ) * docs: mention v1.93.1 release * deployment/docker: bumps image for v1.93.1 release	2023-08-23 15:52:58 +02:00
hagen1778	946e370b26	docs: mention breaking change to indexdb introduced in 1.92.0 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-08-23 14:27:55 +02:00
Nikolay	c5aac34b68	lib/storage: properly calculate nextRotationTimestamp (#4874) Because of a typo, unix millis were used instead of unix seconds for the current timestamp calculation https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4873	2023-08-23 13:22:53 +02:00
Yury Molodov	ca44b8da1f	vmui: change warning display for text fields (#4848 ) (#4863 ) https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4848	2023-08-21 15:42:55 +02:00
hagen1778	ea2fbcf0e6	vmselect: follow-up after `7349f18c55` `7349f18c55` Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-08-21 15:34:21 +02:00
Tamara Vashchuk	7349f18c55	vmui: Add button to prettify query (#4694) * Add button to prettify query Just capitalizes query text for now * Add /prettify-query API handler * Replace UI prettifier using prettifier API * Add showing server errors Had to pass setQueryErrors from useFetchQuery.ts * Use serverUrl from global AppState * Change icon to AutoAwsome icon + added style change color when button is active * Add async/await to prettifyQuery function * Doc public function for lint * Minor async fix * Removed extra blank lines * Extract usePrettifyQuery hook * Made more generic style for :active button * Refactor usePrettifyQuery However, prettify errors don't clean up query errors, but should * Add prettyQuery functionality to CHANGELOG.md * Reuse queryErrors * Unhide errors on start --------- Co-authored-by: Tamara <toma.vashchuk@gmail.com>	2023-08-18 20:12:48 +03:00
Dmytro Kozlov	b7d07e5acf	lib/protoparser: handle unexpected EOF error when parsing lines in prometheus exposition format (#4851 ) Previously only io.EOF was handled, and io.ErrUnexpectedEOF was ignored, but it may happen if the client interrupts the connection. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4817	2023-08-18 08:55:42 +02:00
Aliaksandr Valialkin	54f522ac25	docs/stream-aggregation.md: clarify the usage of `-remoteWrite.label` after the fix at `a27c2f3773` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4247	2023-08-17 15:18:37 +02:00
Aliaksandr Valialkin	cd9f86afe1	lib/envflag: do not allow unsupported form for boolean command-line flags in the form `-boolFlag value` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4845	2023-08-17 13:26:53 +02:00
Dmytro Kozlov	39623ae428	app/vmctl: fix migration process if tenant have no data (#4799 ) app/vmctl: don't interrupt migration process if tenant has no data Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: Alexander Marshalov <_@marshalov.org>	2023-08-16 14:54:51 +02:00
Roman Khavronenko	6da32a27ac	vmbackup: correctly check if specified `-dst` belongs to specified `-storageDataPath` (#4841 ) See this issue https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4837 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-08-16 14:45:35 +02:00
Alexander Marshalov	a27c2f3773	fixed applying `remoteWrite.label` for pushed metrics (#4247 ) (#4824 ) vmagent: properly add extra labels before sending data to remote storage labels from `remoteWrite.label` are now added to sent metrics just before they are pushed to `remoteWrite.url` after all relabelings, including stream aggregation relabelings (#4247) https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4247 Signed-off-by: Alexander Marshalov <_@marshalov.org> Co-authored-by: Roman Khavronenko <roman@victoriametrics.com>	2023-08-15 13:47:48 +02:00
hagen1778	481a2c70fd	dashboard: fix display of ingested rows rate Fix display of ingested rows rate for `Samples ingested/s` and `Samples rate` panels for vmagent's dashboard. Previously, not all ingested protocols were accounted in these panels. An extra panel `Rows rate` was added to `Ingestion` section to display the split for rows ingested rate by protocol. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-08-15 08:45:10 +02:00
Aliaksandr Valialkin	fdae53a75b	lib/promrelabel: properly replace `:` char with `_` in metric names when -usePromCompatibleNaming command-line flag is set This addresses https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3113#issuecomment-1275077071 comment from @johnseekins	2023-08-14 16:14:42 +02:00
Aliaksandr Valialkin	63e3571e8c	lib/promrelabel: stop emitting DEBUG log lines when parsing `if` expressions These lines were accidentally left in the commit `62651570bb` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4635	2023-08-14 15:24:31 +02:00
Aliaksandr Valialkin	a0f695f5de	app/vmbackup: add ability to make server-side copying of existing backups	2023-08-13 17:24:24 -07:00
Aliaksandr Valialkin	88b620b8c8	docs/CHANGELOG.md: document that v1.93.x is a new line of LTS releases	2023-08-12 15:31:57 -07:00
Aliaksandr Valialkin	11329c3d16	docs/CHANGELOG.md: document changes in the v1.87.7 LTS release	2023-08-12 14:49:12 -07:00
Aliaksandr Valialkin	fae59146ad	docs/CHANGELOG.md: document LTS release v1.79.14 See https://github.com/VictoriaMetrics/VictoriaMetrics/releases/tag/v1.79.14	2023-08-12 12:28:10 -07:00
Aliaksandr Valialkin	59f7d810c9	docs/CHANGELOG.md: cut v1.93.0	2023-08-12 06:01:10 -07:00
Aliaksandr Valialkin	e1235267a0	deployment/docker/Makefile: upgrade base Docker image from alpine:3.18.2 to alpine:3.18.3 See https://alpinelinux.org/posts/Alpine-3.15.10-3.16.7-3.17.5-3.18.3-released.html	2023-08-12 05:59:48 -07:00
Aliaksandr Valialkin	05f109ad58	docs/CHANGELOG.md: split changelog into per-year pages in order to keep the size of CHANGELOG pages under control Make sure that links to particular releases - https://docs.victoriametrics.com/CHANGELOG.html#vXXYY - continue working.	2023-08-12 05:48:43 -07:00
Nikolay	d144e39592	lib/protoparser/openetelemetry: fixes panic (#4821 ) Opentelemetry format allows histograms with non-counter buckets. In this case it makes no sense to add buckets into database and save only counter with _count suffix. It could be used as gauge. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4814 Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-08-12 05:09:18 -07:00
Nikolay	8faa17493b	opentelemetry: return human readable error for json encoding. (#4822 ) Opentelemetry parser supports only protobuf atm. Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-08-12 05:05:16 -07:00
Nikolay	f111ddb862	lib/promscrape: adds validation for proxy_url scheme (#4823 ) * lib/promscrape: adds validation for proxy_url scheme adds tests https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4811 * Update lib/proxy/proxy.go * Update lib/proxy/proxy.go --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-08-12 05:03:08 -07:00
Aliaksandr Valialkin	072d891ed9	app/vmselect: prevent from panic when lookbehind window inside rollup function is parsed into negative value Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4795	2023-08-12 04:47:53 -07:00
Zakhar Bessarab	1bd7637fe1	lib/promrelabel: fix relabeling if clause (#4816 ) * lib/promrelabel: fix relabeling if clause being applied to labels outside of current context Relabeling is applied to each metric row separately, but in order to lower amount of memory allocations it is reusing labels. Functions which are working on current metric row labels are supposed to use only current metric labels by using provided offset, but if clause matcher was using the whole labels set instead of local metrics. This led to invalid relabeling results such as one described here: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4806 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * docs/CHANGELOG.md: document the bugfix Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1998 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4806 --------- Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-08-11 06:37:48 -07:00
Aliaksandr Valialkin	e0017b4d47	all: update Go builder from Go1.20.7 to Go1.21.0 See https://tip.golang.org/doc/go1.21 and https://go.dev/blog/go1.21	2023-08-11 06:25:54 -07:00
Aliaksandr Valialkin	4c4bcdf0b1	docs/CHANGELOG.md: add a link to stream aggregation for the description of the bugfix at `a4a1884237` This makes the description more clear. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4804	2023-08-11 05:38:30 -07:00
Aliaksandr Valialkin	271743f892	docs/CHANGELOG.md: add missing context to the description of the fix at `be5c4818f5`	2023-08-11 05:26:16 -07:00
Aliaksandr Valialkin	be5c4818f5	lib/httpserver: properly quote the returned address from GetQuotedRemoteAddr() for requests with X-Forwarded-For header Make sure that the quoted address can be used as JSON string. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/4676#issuecomment-1663203424 This is a follow up for `252643d100` and `ac0b7e0421` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/4676	2023-08-11 05:19:50 -07:00
Aliaksandr Valialkin	ac0b7e0421	Revert "vmui: change the response for active queries (#4782 )" This reverts commit `252643d100`. Reason for revert: the commit incorrectly fixes the issue. The `remoteAddr` must be properly quoted inside lib/httpserver.GetQuotedRemoteAddr(). It isn't quoted properly if the request contains X-Forwarded-For header. The proper fix will be included in the follow-up commit. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/4676	2023-08-11 05:06:40 -07:00
Aliaksandr Valialkin	b50ed5ddd1	app/vmctl: follow-up after `5aed369132` - Fix default value for --remote-read-disable-path-append - Clarify description for the change at docs/CHANGELOG.md Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4655 TODO: address the comment at https://github.com/VictoriaMetrics/VictoriaMetrics/pull/4744	2023-08-11 03:46:10 -07:00
Aliaksandr Valialkin	efb81185a7	docs/CHANGELOG.md: remove superfluous information from the line, which describes the upgrade from Go1.20.6 to Go1.20.7	2023-08-11 03:10:10 -07:00
Aliaksandr Valialkin	e49e4f372b	docs/CHANGELOG.md: clarify the change at `e3ef3df938` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4697	2023-08-11 03:06:41 -07:00
hagen1778	c36259fca5	docs: mention `honor_timestamps` change in changelog https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4697 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-08-10 14:46:39 +02:00
Zakhar Bessarab	a4a1884237	{vmagent/remotewrite,vminsert/common}: fix dropInput and keepInput flags inconsistency (#4809 ) {vmagent/remotewrite,vminsert/common}: fix dropInput and keepInput flags inconsistency Sync behavior for dropInput and keepInput flags between single-node and vmagent. Fix vmagent not respecting dropInput flag and reverse logic for keepInput.	2023-08-10 14:27:21 +02:00
Yury Molodov	252643d100	vmui: change the response for active queries (#4782 ) * fix: change the response to a valid json (#4676) * vmui/docs: fix response of active queries https://github.com/VictoriaMetrics/VictoriaMetrics/pull/4676	2023-08-10 12:27:28 +02:00
Yury Molodov	cc7bfaca6c	vmui: allow displaying the full error message on click (#4760 ) https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4719	2023-08-10 11:34:25 +02:00
Haleygo	bd8ecfb551	docs: add changelog for `4c815ed59b` (#4805 )	2023-08-10 08:26:55 +02:00
Roman Khavronenko	1d4a0796f4	vmalert: cleanup config reload metrics handling (#4790 ) * rename `configErr` to `lastConfigErr` to reduce confusion * add tests to verify metrics and msg are set properly * fix mistake when config success metric wasn't restored after an error Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-08-07 21:58:40 +02:00
hagen1778	d890038a94	dashboards: correctly calculate `Bytes per point` value Correctly calculate `Bytes per point` value for single-server and cluster VM dashboards. Before, the calculation mistakenly accounted for the number of entries in indexdb in denominator, which could have shown lower values than expected. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-08-03 16:22:50 +02:00
Roman Khavronenko	4c854c3ae2	security: bump go version from 1.20.6 to 1.20.7 (#4773 ) The update includes a security fix to the crypto/tls package, as well as bug fixes to the assembler and the compiler. See the list of issues addressed in Go1.20.7 here: https://github.com/golang/go/issues?q=milestone%3AGo1.20.7+label%3ACherryPickApproved Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-08-03 11:48:37 +02:00
hagen1778	c47138e1b0	dashboards: add panels for absolute value of mem and cpu usage by vmalert See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4627 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-08-03 11:14:14 +02:00
hagen1778	2e4d0d0e41	alerts: move `ConcurrentFlushesHitTheLimit` alert to health alerts The `ConcurrentFlushesHitTheLimit` could be related to components like vminsert, vmstorage, vm-single-node and vmagent. Moving this alert to the `health` section of alerts will be beneficial for all components and will remove the duplicates from single/cluster alerts. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-08-03 10:46:26 +02:00
hagen1778	1043fc1fd9	alerts: add docs section for the full list of alerting rules The change also includes update of all references in other docs to the alerting rules. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-08-03 10:46:25 +02:00
hagen1778	e311a7bf80	dashboards: add `Concurrent inserts` panel to vmagent's dashboard The new panel is supposed to show whether the number of concurrent inserts processed by vmagent is reaching the limit. The panel contains a recommendation on what to do if the limit is reached. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-08-03 10:46:25 +02:00
hagen1778	061f68fe5e	docs: follow-up after `df37a47d4b` https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4415 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-08-02 14:35:37 +02:00
Yury Molodov	8f4961fbbd	vmui: display partial response warning (#4742 ) https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4721	2023-08-02 14:21:52 +02:00
Dmytro Kozlov	5aed369132	app/vmctl: add flag where the user can define path to the source remote read protocol (#4744 ) https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4655	2023-08-01 16:43:00 +02:00
Roman Khavronenko	833ab331b1	vmctl: allow disabling binary export protocol (#4716 ) Binary export API protocol can be disabled via `-vm-native-disable-binary-protocol` cmd-line flag when migrating data from VictoriaMetrics. Disabling binary protocol can be useful for deduplication of the exported data before ingestion. For this, deduplication need to be configured at `-vm-native-src-addr` side and `-vm-native-disable-binary-protocol` should be set on vmctl side. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-08-01 09:45:50 +02:00
Zakhar Bessarab	6289a21d24	docs: add changelog entry for #4704 (#4753 ) Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2023-08-01 11:33:46 +04:00
Dmytro Kozlov	d322ee4b35	app/vmctl: add support the `week` step for time-based chunks (#4743 ) https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4738	2023-07-31 16:55:59 +02:00
Roman Khavronenko	216d4091f7	vmalert: remove deprecated in v1.79.0 web links with `*/status` suffix (#4747 ) Links of form `/api/v1/<groupID>/<alertID>/status` were deprecated in favour of `/api/v1/alerts?group_id=<>&alert_id=<>` links in v1.79.0. See more details here https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2825 This change removes code responsible for deprecated functionality. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-07-31 16:51:41 +02:00
Roman Khavronenko	9ede3e996b	vmalert: remove deprecated in v1.61.0 `-rule.configCheckInterval` (#4745 ) Use `-configCheckInterval` command-line flag instead. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-07-31 16:39:57 +02:00
hagen1778	1a43ee11d1	docs: mention `3f6efab6ae` in changelog Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-07-31 15:27:26 +02:00
hagen1778	4283eb4626	docs: remove anchors from the 1.92 release Adding anchors to the 1.92 changelog breaks consistency of navigation section at https://docs.victoriametrics.com/CHANGELOG.html All other releases do not have subsections, so should 1.92. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-07-31 09:21:05 +02:00
Aliaksandr Valialkin	8e38efaa7b	docs/CHANGELOG.md: move bugfix description to `tip` chapter, since it isn't released yet Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4697	2023-07-28 23:01:37 -07:00
Aliaksandr Valialkin	d18ff993e6	lib/promscrape: add a comment why `honor_timestamps` is set to false by default This should prevent from returning it back to true in the future Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4697	2023-07-28 21:36:32 -07:00
Aliaksandr Valialkin	e3ef3df938	lib/promscrape: use local scrape timestamp for scraped metrics unless `honor_timestamps: true` is set explicitly This fixes the case with gaps for metrics collected from cadvisor, which exports invalid timestamps, which break staleness detection at VictoriaMetrics side. See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4697 , https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4697#issuecomment-1654614799 and https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4697#issuecomment-1656540535 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1773	2023-07-28 21:11:26 -07:00
Zakhar Bessarab	8f257889cc	docs/CHANGELOG.md: cut v1.92.1 (#4735 ) Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2023-07-28 11:07:15 +02:00
Roman Khavronenko	9f1b9b86cc	vmalert: revert unittest feature (#4734 ) * Revert "vmalert: unittest support stale datapoint (#4696)" This reverts commit `0b44df7ec8`. * Revert "docs: specify min version and limitations for vmalert's unit tests" This reverts commit `a24541bd` Signed-off-by: hagen1778 <roman@victoriametrics.com> * Revert "vmalert: init unit test (#4596)" This reverts commit `da60a68d` Signed-off-by: hagen1778 <roman@victoriametrics.com> * docs: mention unittest revert in changelog Signed-off-by: hagen1778 <roman@victoriametrics.com> --------- Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-07-28 10:42:02 +02:00
Aliaksandr Valialkin	f3b5c9c9fb	docs/CHANGELOG.md: delimit changes from update notes	2023-07-27 17:06:15 -07:00
Aliaksandr Valialkin	5d01d545ce	docs/CHANGELOG.md: cut v1.92.0	2023-07-27 14:55:46 -07:00
Aliaksandr Valialkin	3d73640815	lib/promscrape/discovery: close unused HTTP connections to service discovery servers This should prevent from connection leaks See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4724	2023-07-27 14:48:56 -07:00
Nikolay	46ecbbea26	lib/protoparser: adds opentelemetry parser (#2570 ) * lib/protoparser: adds opentelemetry parser app/{vmagent,vminsert}: adds opentelemetry ingestion path Adds ability to ingest data with opentelemetry protocol protobuf and json encoding is supported data converted into prometheus protobuf timeseries each data type has own converter and it may produce multiple timeseries from single datapoint (for summary and histogram). only cumulative aggregationFamily is supported for sum(prometheus counter) and histogram. Apply suggestions from code review Co-authored-by: Roman Khavronenko <roman@victoriametrics.com> updates deps fixes tests wip wip wip wip lib/protoparser/opentelemetry: moves to vtprotobuf generator go mod vendor lib/protoparse/opentelemetry: reduce memory allocations * wip - Remove support for JSON parsing, since it is too fragile and is rarely used in practice. The most clients send OpenTelemetry metrics in protobuf. The JSON parser can be added in the future if needed. - Remove unused code from lib/protoparser/opentelemetry/pb and lib/protoparser/opentelemetry/proto - Do not re-use protobuf message between ParseStream() calls, since there is high chance of high fragmentation of the re-used message because of too complex nested structure of the message. * wip * wip * wip --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-07-27 13:26:45 -07:00
Aliaksandr Valialkin	584400c2f0	docs/CHANGELOG.md: add a link to Pushgateway protocol in the bugfix description for `74237ce5c0` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4692	2023-07-27 13:12:12 -07:00
Alexander Marshalov	7e5555f9c7	fixed label values decoding for pushgateway compatibility (#4727 ) Fixed decoding of label values with slash for pushgateway and prometheus golang client compatibility + added some tests. (#4962)	2023-07-27 17:09:28 +02:00
Haleygo	ae0e4a8c90	vmalert: add `keep_firing_for` field for alerting rule (#4669 ) vmalert: support `keep_firing_for` field for alerting rule https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4529 --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2023-07-27 15:13:13 +02:00
Aliaksandr Valialkin	6b6b61137f	app/vmagent: add ability to shard outgoing data among multiple remote storage systems Add -remoteWrite.shardByURL command-line flag, which instructs vmagent to spread evenly outgoing time series data among the configured remote storage systems specified via -remoteWrite.url . Samples for the same time series go to the same -remoteWrite.url . This allows building horizontally scalable stream aggregation when samples for counter and histogram series must be aggregated by the same second-level vmagent instance. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4637	2023-07-24 18:15:26 -07:00
Aliaksandr Valialkin	62651570bb	lib/promrelabel: add support for a list of series selectors at IfExpression This makes possible specifying a list of series selectors at the following places: - Inside `if` option at relabeling rules - Inside `match` option at stream aggregation rules Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4635	2023-07-24 17:08:52 -07:00
Aliaksandr Valialkin	52c13e9515	lib/streamaggr: follow-up for `736197179e` - Use a byte slice instead of a map for tracking indexes for matching series. This improves performance, since access by slice index is faster than access by map key. - Re-use the byte slice for tracking indexes for matching series. This removes unnecessary memory allocations and improves stream aggregation performance a bit. - Add an ability to return to the previous behaviour by specifying -remoteWrite.streamAggr.dropInput command-line flag. In this case all the input samples are dropped when stream aggregation is enabled. - Backport the new stream aggregation behaviour from vmagent to single-node VictoriaMetrics when -streamAggr.config option is set. - Improve docs regarding this change at docs/CHANGELOG.md - Document the new behavior at docs/stream-aggregation.md Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4243 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/4575	2023-07-24 17:05:26 -07:00
Zakhar Bessarab	736197179e	{lib/streamaggr,vmagent/remotewrite}: breaking change for keepInput flag (#4575 ) * {lib/streamaggr,vmagent/remotewrite}: breaking change for keepInput flag Changes default behaviour of keepInput flag to write series which did not match any aggregators to the remote write. See: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4243 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * Update app/vmagent/remotewrite/remotewrite.go Co-authored-by: Roman Khavronenko <roman@victoriametrics.com> --------- Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> Co-authored-by: Roman Khavronenko <roman@victoriametrics.com> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-07-24 16:33:30 -07:00
Nikolay	544fba6826	lib/storage: pre-create timeseries before indexDB rotation (#4652 ) * lib/storage: pre-create timeseries before indexDB rotation during an hour before indexDB rotation start creating records at the next indexDB it must improve performance during switch for the next indexDB and remove ingestion issues. Since there is no need for creation new index records for timeseries already ingested into current indexDB https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4563 * lib/storage: further work on indexdb rotation optimization - Document the change at docs/CHAGNELOG.md - Move back various caches from indexDB to Storage. This makes the change less intrusive. The dateMetricIDCache now takes into account indexDB generation, so it stores (date, metricID) entries for both the current and the next indexDB. - Consolidate the code responsible for idbNext pre-filling into prefillNextIndexDB() function. This improves code readability and maintainability a bit. - Rewrite and simplify the code responsible for calculating the next retention timestamp. Add various tests for corner cases of this code. - Remove indexdb pre-filling from RegisterMetricNames() function, since this function is rarely called. It is OK to add indexdb entries on demand in this function. This simplifies the code. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1401 * docs/CHANGELOG.md: refer to https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4563 --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-07-22 15:20:21 -07:00
Zakhar Bessarab	866b150f0f	app/vmalert/datasource/graphite: allow overriding "from" parameter for datasource queries (#4687 ) * app/vmalert/datasource/graphite: allow overriding "from" parameter for datasource queries Fixes construction of URL parameters for graphite render to allow overriding "from" parameter. See: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4685 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * app/vmalert/datasource/graphite: update flow for building URL parameters Makes flow of building URL parameters same as Prometheus datasource has: 1) Setting all default values 2) Merging those values with provided `extraParams` Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * Update docs/CHANGELOG.md Co-authored-by: Roman Khavronenko <roman@victoriametrics.com> --------- Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> Co-authored-by: Roman Khavronenko <roman@victoriametrics.com>	2023-07-21 14:28:10 +04:00
Aliaksandr Valialkin	49bd2905fa	lib/promscrape: follow-up after `6aa50ca954` - Improve docs - Hide `debug relabeling` column when -promscrape.dropOriginalLabels command-line flag is set - Inline the code from the added template functions, since the code is harder to follow with the template functions, especially when these functions have misleading names. Also, these functions are used only in one place, e.g. they do not reduce the amounts of code. - Hide `click to show original labels` title at `labels` column when original labels aren't available. - Show the reason why original labels aren't available at /service-discovery page. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4597	2023-07-20 19:14:33 -07:00
Aliaksandr Valialkin	b8ba2d5f1a	app/{vmselect,vlselect}: run `make vmui-update vmui-logs-update` after recent changes to VMUI Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4604 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/4676 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/4294	2023-07-20 17:26:03 -07:00
Alexander Marshalov	70773f53d7	allow configuring staleness interval in stream aggregation (#4667 ) (#4670 ) --------- Signed-off-by: Alexander Marshalov <_@marshalov.org> Co-authored-by: Roman Khavronenko <roman@victoriametrics.com>	2023-07-20 16:07:33 +02:00
Haleygo	da60a68d09	vmalert: init unit test (#4596 ) vmalert: support unit tests See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2945 --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2023-07-20 15:07:10 +02:00
Dmytro Kozlov	6aa50ca954	app/vmagent: fix creating target id if `--promscrape.dropOriginalLabels` flag was used (#4616 ) * app/vmagent: fix creating target id if `--promscrape.dropOriginalLabels` flag was used * app/vmagent: hide links if OriginalLabels was dropped * app/vmagent: update CHANGELOG.md and added information to the docs * app/vmagent: fix comments	2023-07-20 10:13:39 +02:00
Yury Molodov	6a96fd8ed5	vmui: add Active Queries page (#4653 ) * feat: add page to display a list of active queries (#4598) * app/vmagent: code formatting * fix: remove console --------- Co-authored-by: dmitryk-dk <kozlovdmitriyy@gmail.com>	2023-07-19 15:47:21 -07:00
Aliaksandr Valialkin	262932f517	vendor: update github.com/VictoriaMetrics/metricsql from v0.60.0 to v0.61.1 This adds support for passing durations via WITH template vars: - `WITH (w = 5m) m[w]` is transformed to `m[5m]` - `WITH (f(w, step, off) = m[w:step] offset off) f(5m, 10s, 1h)` is transformed to `m[5m:10s] offset 1h` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4025 Updates https://github.com/VictoriaMetrics/metricsql/issues/12 See also the initial implementation by @lujiajing1126 at https://github.com/VictoriaMetrics/metricsql/pull/13	2023-07-19 14:59:46 -07:00
hagen1778	4dcf7563ff	docs: typo Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-07-19 16:56:24 +02:00
Roman Khavronenko	25317b4e70	vmalert: follow-up after `d4ac4b7813` (#4659 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-07-18 15:53:37 +02:00
hagen1778	99f4f6a653	docs: mention change from `6f3fee197e` Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-07-18 11:48:06 +02:00
Aliaksandr Valialkin	9c3717412a	docs/VictoriaLogs: add CHANGELOG.md	2023-07-17 23:14:05 -07:00
Aliaksandr Valialkin	8815080030	app/vmselect/promql: add the ability to copy all the labels from `one` side of group_left()/group_right() operation This is performed by specifying `*` inside group_left()/group_right(). Also allow specifying prefix for the copied labels via `group_left(...) prefix "..."` and `group_right(...) prefix "..."` syntax. For example, the following query adds all the namespace-related labels to pod info, and prefixes all the copied label names with "ns_" prefix: kube_pod_info on(namespace) group_left(*) prefix "ns_" kube_namespace_labels This resolves the following StackOverflow questions: - https://stackoverflow.com/questions/76661818/how-to-add-namespace-labels-to-pod-labels-in-prometheus - https://stackoverflow.com/questions/76653997/how-can-i-make-a-new-copy-of-kube-namespace-labels-metric-with-a-different-name	2023-07-17 19:07:39 -07:00
Aliaksandr Valialkin	be31bdc88c	app/vmselect/promql: recommend to use `(a op b) keep_metric_names` instead of `a op b keep_metric_names` The `a op b keep_metric_names` is ambiguous with `a op (b keep_metric_names)` when `b` is a transform or rollup function. For example, `a + rate(b) keep_metric_names`. So it is better to use more clear syntax: `(a op b) keep_metric_names` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3710	2023-07-16 23:46:34 -07:00
Zakhar Bessarab	e2367b6d1c	metricsql: add support of using keep_metric_names for binary operations (#4109 ) * metricsql: add support of using keep_metric_names for binary operations This should help to avoid confusion with queries like one in the issue #3710. Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * wip --------- Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-07-16 03:00:39 -07:00
Aliaksandr Valialkin	4cb024d8a3	all: add support for `or` filters in series selectors This commit adds ability to select series matching distinct filters via a single series selector. For example, the following selector selects series with either {env="prod",job="a"} or {env="dev",job="b"} labels: {env="prod",job="a" or env="dev",job="b"} The `or` filter is supported in all the VictoriaMetrics tools now. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3997 Uses https://github.com/VictoriaMetrics/metricsql/pull/14	2023-07-16 00:06:33 -07:00

2203 commits