github-mirrors/VictoriaMetrics

mirror of https://github.com/VictoriaMetrics/VictoriaMetrics.git synced 2024-11-21 14:44:00 +00:00

Author	SHA1	Message	Date
hagen1778	febba3971b	make go vet happy Address `non-constant format string in call` check: https://github.com/golang/go/issues/60529 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-08-19 21:15:33 +02:00
Roman Khavronenko	e58dde6925	lib/httputils: parse URL before creating HTTP transport (#6820 ) https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6740 --------- Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-08-16 11:32:04 +02:00
Zakhar Bessarab	5390ee2413	app/vmseleсt/promql: fix calculation of histogram buckets This issue was introduced in `6a4bd5049b` See: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6714 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2024-08-15 10:11:41 +02:00
Nikolay	9f42fccfc2	app/vminsert: returns back memory optimisation (#6794 ) Production workload shows that it's useful optimisation. Channel based objects pool allows to handle irregural data ingestion requests and make memory allocations more smooth. It's improves sync.Pool efficiency, since objects from sync.Pool removed after 2 GC cycles. With GOGC=30 value, GC runs significantly more often. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6733 ### Checklist The following checks are mandatory: - [ ] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). --------- Signed-off-by: f41gh7 <nik@victoriametrics.com> Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `f255800da3`) Signed-off-by: hagen1778 <roman@victoriametrics.com> # Conflicts: # app/vminsert/common/insert_ctx_pool.go	2024-08-13 10:56:33 -04:00
ccliu	d134a310f3	vmagent: resolve the issue where usePromCompatibleNaming is not working (#6776 ) Describe Your Changes When I use usePromCompatibleNaming with vmagent to process data that needs to be formatted from different sources such as InfluxDB, I find that it doesn’t work However, it works in vminsert. I found that vminsert uses the HasRelabeling method to determine whether to relabel. ```go func HasRelabeling() bool { pcs := pcsGlobal.Load() return pcs.Len() > 0 \|\| usePromCompatibleNaming } ``` in vmagent, the decision to relabel is determined only by pcsGlobal.Len() > 0. However, in the applyRelabeling method, the usePromCompatibleNaming logic is also used to determine whether to relabel in the error handling. ```go func (rctx relabelCtx) applyRelabeling(tss []prompbmarshal.TimeSeries, pcs promrelabel.ParsedConfigs) []prompbmarshal.TimeSeries { if pcs.Len() == 0 && !usePromCompatibleNaming { // Nothing to change. return tss } ``` So I think that the logic for determining whether to relabel in vmagent is not as expected. Checklist The following checks are mandatory: [✅]My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). --------- Co-authored-by: Roman Khavronenko <hagen1778@gmail.com>	2024-08-13 10:32:05 -04:00
jackyin	5f5bc46b3e	vlogs: add select/deselect all button to table settings in UI (#6680 ) fix #6668, just add select all and "unselect all" func. https://github.com/user-attachments/assets/0c31385b-def0-4618-aa9c-5ba4bb6f56c3 --------- Co-authored-by: Yury Molodov <yurymolodov@gmail.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2024-08-13 10:20:07 -04:00
Hui Wang	62d19369a3	stream aggregation: do not allow to enable `-stream.keepInput` and `k… (#6723 ) …eep_metric_names` options in stream aggregation config together With aggregated data and raw data under the same metric, results would be confusing. --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2024-08-13 08:54:35 -04:00
Hui Wang	8f5c26d788	app/vmagent/remotewrite: make `-remoteWrite.streamAggr.ignoreFirstIntervals` of array type (#6744 ) Make `-remoteWrite.streamAggr.ignoreFirstIntervals` of array type so it could accept multiple values which can be applied to the corresponding`-remoteWrite.url`. --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2024-08-07 09:53:50 +02:00
Hui Wang	4863605469	app/vmagent/remotewrite: fix `-streamAggr.dropInputLabels` behavior (#6743 ) Fix `-streamAggr.dropInputLabels` behavior when global deduplication is enabled without `-streamAggr.config`. Previously, `-remoteWrite.streamAggr.dropInputLabels` is misapplied. --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2024-08-07 09:48:15 +02:00
hagen1778	9726e6c1a2	app/vmalert: rm unnecessary err check The error check was needed before `a84491324d` It was kept by mistake and makes no sense to have rn. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-08-07 09:09:24 +02:00
Yury Molodov	04c2232e45	vmui/logs: add display top streams in the hits graph (#6647 ) ### Describe Your Changes - Adds support for displaying the top 5 log streams in the hits graph, grouping the remaining streams into an "other" label. #6545 - Adds options to customize the graph display with bar, line, stepped line, and points views. ### Checklist The following checks are mandatory: - [x] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/).	2024-08-06 16:28:44 +02:00
Zakhar Bessarab	58b6c54da2	app/vlinsert/elasticsearch: add fake response for logstash requests (#6742 ) ### Describe Your Changes This is needed in order to support standard Elasticsearch output in Logstash pipelines. See: https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6660 ### Checklist The following checks are mandatory: - [x] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). --------- Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2024-08-06 15:43:33 +02:00
Hui Wang	c1b54779a2	vmalert: respect HTTP headers defined in notifier configuration file (#6762 ) Co-authored-by: Roman Khavronenko <roman@victoriametrics.com>	2024-08-06 15:37:25 +02:00
hagen1778	f283126084	fix typos in comments Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-08-06 14:54:49 +02:00
Zakhar Bessarab	9877a5e7d5	app/{vminsert,vmagent}: add healthcheck for influx ingestion endpoints (#6749 ) ### Describe Your Changes This is useful for clients which validate InfluxDB is available before data ingestion can be started. See: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6653 ### Checklist The following checks are mandatory: - [x] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). --------- Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2024-08-05 09:34:54 +02:00
Dmytro Kozlov	6f401daacb	vmctl: add `--backoff-retries`, `--backoff-factor`, `--backoff-min-duration` global command-line flags (#6639 ) ### Describe Your Changes Added `--vm-backoff-retries`, `--vm-backoff-factor`, `--vm-backoff-min-duration` and `--vm-native-backoff-retries`, `--vm-native-backoff-factor`, `--vm-native-backoff-min-duration` command-line flags to the `vmctl` app. Those changes will help to configure the retry backoff policy for different situations. Related issue: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6622 ### Checklist The following checks are mandatory: - [X] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2024-08-03 19:12:48 +02:00
Yury Molodov	e06a19d85f	vmui/logs: improve UI functionality (#6688 ) * add a toggle button to the "Group" tab that allows users to expand or collapse all groups at once * introduce the ability to select a key for grouping logs within the "Group" tab * display the number of entries within each log group. * move the Markdown toggle to the general settings panel in the upper left corner.	2024-08-02 15:48:36 +02:00
Yury Molodov	a05317f61f	vmui/logs: add fields for tenant configuration (#6661 ) Added fields for configuring AccountID and ProjectID #6631	2024-08-02 09:57:39 +02:00
f41gh7	996b623585	make vmui-update	2024-08-01 14:45:09 +02:00
Yury Molodov	53919327b2	vmui: fix auto-completion triggers (#6566 ) ### Describe Your Changes - Fixes auto-complete triggers according to [these comments](https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5866#issuecomment-2065273421). - Fixes loading and displaying suggestions when there is no metric in the expression. Related issue: #6153 - Adds quotes when inserting label values. Related issue: #6260 - [x] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/).	2024-07-31 15:00:14 +02:00
Aliaksandr Valialkin	9dde5b8ee3	app/{vmselect,vlselect}: run `make vmui-update vmui-logs-update` after `efd70b2c52`	2024-07-27 13:50:31 +02:00
Aliaksandr Valialkin	83f2ce4910	app/vmauth: verify how backend response headers are propagated to vmauth client	2024-07-27 13:44:49 +02:00
Hui Wang	b515a7b69b	security: upgrade base docker image (Alpine) from 3.20.1 to 3.20.2 (#6684 ) See https://www.alpinelinux.org/posts/Alpine-3.20.1-released.html >including security fix for: OpenSSL CVE-2024-5535	2024-07-23 13:20:06 +02:00
Zakhar Bessarab	d88d0f382b	app/vmauth: change response code when all backend are not available (#6676 ) ### Describe Your Changes Change response code to 502 to align it with behaviour of other existing reverse proxies. Currently, the following reverse proxies will return 502 in case an upstream is not available: nginx, traefik, caddy, apache. Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2024-07-22 17:31:18 +02:00
Aliaksandr Valialkin	dad3eefd74	app/vmauth: test how User-Agent header is set in requests to backend	2024-07-20 11:43:24 +02:00
Aliaksandr Valialkin	e87b4d3768	app/vmauth: verify the correctness of X-Forwarded-For header processing at TestRequestHandler()	2024-07-20 11:28:14 +02:00
Aliaksandr Valialkin	cb76ff5c56	app/vmauth: add missing tests for requestHandler()	2024-07-20 11:22:36 +02:00
Aliaksandr Valialkin	78b1571eb8	app/vmauth: add more tests for requestHandler()	2024-07-20 10:19:45 +02:00
Aliaksandr Valialkin	0a8c9c5ee7	docs/vmauth.md: document the case with default url_prefix additionally to url_map	2024-07-20 09:46:01 +02:00
Aliaksandr Valialkin	9e0c37be2d	app/vmauth: properly proxy requests to backend paths ending with / Previously the traling / was incorrectly removed when proxying requests from http://vmauth/ While at it, add more tests for requestHandler()	2024-07-19 17:29:04 +02:00
Aliaksandr Valialkin	add2db12b2	app/vmauth: properly proxy HTTP requests without body The Request.Body for requests without body can be nil. This could break readTrackingBody.Read() logic, which could incorrectly return "cannot read data after closing the reader" error in this case. Fix this by initializing the readTrackingBody.r with zeroReader. While at it, properly set Host header if it is specified in 'headers' section. It must be set net/http.Request.Host instead of net/http.Request.Header.Set(), since the net/http.Client overwrites the Host header with the value from req.Host before sending the request. While at it, add tests for requestHandler(). Additional tests for various requestHandler() cases will be added in future commits. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6445 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5707 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5240 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6525	2024-07-19 16:24:12 +02:00
Yury Molodov	efd70b2c52	vmui/logs: switched requests to sequential execution (#6624 ) ### Describe Your Changes This PR changes `/select/logsql/query` and `/select/logsql/hits` to execute sequentially Fixed https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6558#issuecomment-2219298984 ### Checklist The following checks are mandatory: - [x] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/).	2024-07-18 11:55:42 +02:00
Aliaksandr Valialkin	c8bc2f0ee5	app/vmselect/vmui: run `make vmui-update` after `959a4383c5`	2024-07-17 23:09:18 +02:00
Aliaksandr Valialkin	eaed0465d2	all: substitute double "the the" with "the" This is a follow-up for `8786a08d27` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6600	2024-07-17 14:28:12 +02:00
Aliaksandr Valialkin	9c4b0334f2	all: consistently use stringsutil.JSONString() for formatting JSON strings with fmt.* functions instead of using "%q" formatter The %q formatter may result in incorrectly formatted JSON string if the original string contains special chars such as \x1b . They must be encoded as \u001b , otherwise the resulting JSON string cannot be parsed by JSON parsers. This is a follow-up for `c0caa69939` See https://github.com/VictoriaMetrics/victorialogs-datasource/issues/24	2024-07-17 13:52:13 +02:00
rtm0	bdc0e688e8	Fix inconsistent error handling in Storage.AddRows() (#6583 ) ### Describe Your Changes `Storage.AddRows()` returns an error only in one case: when `Storage.updatePerDateData()` fails to unmarshal a `metricNameRaw`. But the same error is treated as a warning when it happens inside `Storage.add()` or returned by `Storage.prefillNextIndexDB()`. This commit fixes this inconsistency by treating the error returned by `Storage.updatePerDateData()` as a warning as well. As a result `Storage.add()` does not need a return value anymore and so doesn't `Storage.AddRows()`. Additionally, this commit adds a unit test that checks all cases that result in a row not being added to the storage. --------- Signed-off-by: Artem Fetishev <wwctrsrx@gmail.com> Co-authored-by: Nikolay <nik@victoriametrics.com>	2024-07-17 12:07:14 +02:00
Aliaksandr Valialkin	7ed719b46a	app/vmauth: properly handle the case when zero backend hosts are resolved at SRV DNS When zero backend hosts are resolved, then vmauth must return 'no backend hosts' error instead of crashing with panic This is a follow-up for `590aeccd7d` and `3a45bbb4e0` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6401	2024-07-17 11:31:05 +02:00
Aliaksandr Valialkin	7ee5797493	app/vmauth: pool readTrackingBody structs in order to reduce pressure on Go GC - use pool for readTrackingBody structs in order to reduce pressure on Go GC - allow re-reading partially read request body - add missing tests for various cases of readTrackingBody usage This is a follow-up for `ad6af95183` and `4d66e042e3`. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6445 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6446 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6533	2024-07-17 11:06:18 +02:00
Aliaksandr Valialkin	277aad18d8	app/vmauth: use more clear names for the field and function added at `e666d64f1d` - Rename overrideHostHeader() function to hasEmptyHostHeader() - Rename overrideHostHeader field at UserInfo to useBackendHostHeader This should simplify the future maintenance of the code Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6525	2024-07-16 19:08:38 +02:00
Aliaksandr Valialkin	ad6af95183	Revert "app/vmauth: reader pool to reduce gc & mem alloc (#6533 )" This reverts commit `4d66e042e3`. Reasons for revert: - The commit makes unrelated invalid changes to docs/CHANGELOG.md - The changes at app/vmauth/main.go are too complex. It is better splitting them into two parts: - pooling readTrackingBody struct for reducing pressure on GC - avoiding to use readTrackingBody when -maxRequestBodySizeToRetry command-line flag is set to 0 Let's make this in the follow-up commits! Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6445 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6533	2024-07-16 18:59:16 +02:00
Aliaksandr Valialkin	590aeccd7d	app/vmauth: follow-up for `3a45bbb4e0` - Move the test for SRV discovery into a separate function. This allows verifying round-robin discovery across SRV records. - Restore the original netutil.Resolver after the test finishes, so it doesn't interfere with other tests. - Move the description of the bugfix into the correct place at docs/CHANGELOG.md - it should be placed under v1.102.0-rc2 instead of v1.102.0-rc1. - Remove unneeded code in URLPrefix.sanitizeAndInitialize(), since it is expected this function is called only once for finishing URLPrefix initializiation. In this case URLPrefix.nextDiscoveryDeadline and URLPrefix.n are equal to 0 according to https://pkg.go.dev/sync/atomic#Uint64 - Properly fix the bug at URLPrefix.discoverBackendAddrsIfNeeded() - it is expected that hostToAddrs map uses the original hostname keys, including 'srv+' prefix, so it shouldn't be removed when looping over up.busOriginal. Instead, the 'srv+' prefix must be removed from the hostname only locally before passing the hostname to netutil.Resolver.LookupSRV. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6401	2024-07-16 10:40:51 +02:00
Aliaksandr Valialkin	88e02b6352	app/vmauth: clarify the description for -idleConnTimeout command-line flag This is a follow-up for `d44058bcd6` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6388	2024-07-16 09:39:15 +02:00
Aliaksandr Valialkin	233e5f0a9e	lib/httpserver: skip basic auth check for additional request paths, which should call httpserver.CheckAuthFlag() This is a follow-up for `61dce6f2a1` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6338 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6329	2024-07-16 01:00:45 +02:00
Aliaksandr Valialkin	e3d5714f6f	app/vminsert: increase default value for -maxLabelValueLen command-line flag from 1KiB to 4KiB It has been appeared that the standard Kubernetes monitoring can generate labels with sizes up to 4KiB This is a follow-up for `a5d1013042` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6176	2024-07-15 23:32:36 +02:00
Aliaksandr Valialkin	a468a6e985	lib/{httputils,netutil}: move httputils.GetStatDialFunc to netutil.NewStatDialFunc - Rename GetStatDialFunc to NewStatDialFunc, since it returns new function with every call - NewStatDialFunc isn't related to http in any way, so it must be moved from lib/httputils to lib/netutil - Simplify the implementation of NewStatDialFunc by removing sync.Map from there. - Use netutil.NewStatDialFunc at app/vmauth and lib/promscrape/discoveryutils - Use gauge instead of counter type for *_conns metric This is a follow-up for `d7b5062917` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6299	2024-07-15 23:02:34 +02:00
Aliaksandr Valialkin	db557b86ee	app/vmagent/remotewrite: follow-up for `f153f54d11` - Move the remaining code responsible for stream aggregation initialization from remotewrite.go to streamaggr.go . This improves code maintainability a bit. - Properly shut down streamaggr.Aggregators initialized inside remotewrite.CheckStreamAggrConfigs(). This prevents from potential resource leaks. - Use separate functions for initializing and reloading of global stream aggregation and per-remoteWrite.url stream aggregation. This makes the code easier to read and maintain. This also fixes INFO and ERROR logs emitted by these functions. - Add an ability to specify `name` option in every stream aggregation config. This option is used as `name` label in metrics exposed by stream aggregation at /metrics page. This simplifies investigation of the exposed metrics. - Add `path` label additionally to `name`, `url` and `position` labels at metrics exposed by streaming aggregation. This label should simplify investigation of the exposed metrics. - Remove `match` and `group` labels from metrics exposed by streaming aggregation, since they have little practical applicability: it is hard to use these labels in query filters and aggregation functions. - Rename the metric `vm_streamaggr_flushed_samples_total` to less misleading `vm_streamaggr_output_samples_total` . This metric shows the number of samples generated by the corresponding streaming aggregation rule. This metric has been added in the commit `861852f262` . See https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6462 - Remove the metric `vm_streamaggr_stale_samples_total`, since it is unclear how it can be used in practice. This metric has been added in the commit `861852f262` . See https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6462 - Remove Alias and aggrID fields from streamaggr.Options struct, since these fields aren't related to optional params, which could modify the behaviour of the constructed streaming aggregator. Convert the Alias field to regular argument passed to LoadFromFile() function, since this argument is mandatory. - Pass Options arg to LoadFromFile() function by reference, since this structure is quite big. This also allows passing nil instead of Options when default options are enough. - Add `name`, `path`, `url` and `position` labels to `vm_streamaggr_dedup_state_size_bytes` and `vm_streamaggr_dedup_state_items_count` metrics, so they have consistent set of labels comparing to the rest of streaming aggregation metrics. - Convert aggregator.aggrStates field type from `map[string]aggrState` to `[]aggrOutput`, where `aggrOutput` contains the corresponding `aggrState` plus all the related metrics (currently only `vm_streamaggr_output_samples_total` metric is exposed with the corresponding `output` label per each configured output function). This simplifies and speeds up the code responsible for updating per-output metrics. This is a follow-up for the commit `2eb1bc4f81` . See https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6604 - Added missing urls to docs ( https://docs.victoriametrics.com/stream-aggregation/ ) in error messages. These urls help users figuring out why VictoriaMetrics or vmagent generates the corresponding error messages. The urls were removed for unknown reason in the commit `2eb1bc4f81` . - Fix incorrect update for `vm_streamaggr_output_samples_total` metric in flushCtx.appendSeriesWithExtraLabel() function. While at it, reduce memory usage by limiting the maximum number of samples per flush to 10K. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5467 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6268	2024-07-15 20:24:01 +02:00
Aliaksandr Valialkin	202e5704e6	vendor: update github.com/VictoriaMetrics/metrics from v1.34.1 to v1.35.0 Fix potential memory leaks across VictoriaMetrics codebase after metrics.UnregisterSet(s) call because of missing s.UnregisterAllMetrics() call. This is a follow-up for `6a6e34ab8e` . It is OK if some vmauth metrics aren't visible for a few microseconds when the previous metrics are unregistered and new metrics weren't registered yet. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6247 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4690 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6252 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5805	2024-07-15 10:43:37 +02:00
Aliaksandr Valialkin	f3ccbe181d	app/vmagent/remotewrite: do not spend CPU time on an attempt to send data to blocked queue if some queues are unblocked Previously remotewrite.TryPush() was trying to send data to remote storages with blocked persistent queues, if some persistent queues to other remote storage systems were unblocked. This resulted in excess CPU usage on relabeling and stream aggregation for the remote storage with blocked queues. The solution is to check whether some peristent storages have blocked queues and skip them before applying per- -remoteWrite.url relabeling and streaming aggregation. While at it, properly update per- -remoteWrite.url vmagent_remotewrite_samples_dropped_total and vmagent_remotewrite_push_failures_total counters when global streaming aggregation cannot send data to remote storage systems because of blocked queues. See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5467 and https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6268 . This is a follow-up for `87fd400dfc` and `f153f54d11` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6248 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6065	2024-07-15 09:38:17 +02:00
Aliaksandr Valialkin	cfc72cb129	docs/CHANGELOG.md: use new link to VictoriaMetrics cluster docs instead of old link The old link was changed globally to the new link in the commit `f4b1cbfef0` . Unfortunately, old links are still posted in new commits :( This is a follow-up for `680b8c25c8` . While at it, remove duplicate 'len(*remoteWriteURLs) > 0' check in the remotewrite.Init() functions, since this check is already made at the beginning of the function. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6253	2024-07-13 03:02:40 +02:00
Aliaksandr Valialkin	0145b65f25	app/vmagent/remotewrite: follow-up for `87fd400dfc` - Drop samples and return true from remotewrite.TryPush() at fast path when all the remote storage systems are configured with the disabled on-disk queue, every in-memory queue is full and -remoteWrite.dropSamplesOnOverload is set to true. This case is quite common, so it should be optimized. Previously additional CPU time was spent on per-remoteWriteCtx relabeling and other processing in this case. - Properly count the number of dropped samples inside remoteWriteCtx.pushInternalTrackDropped(). Previously dropped samples were counted only if -remoteWrite.dropSamplesOnOverload flag is set. In reality, the samples are dropped when they couldn't be sent to the queue because in-memory queue is full and on-disk queue is disabled. The remoteWriteCtx.pushInternalTrackDropped() function is called by streaming aggregation for pushing the aggregated data to the remote storage. Streaming aggregation cannot wait until the remote storage processes pending data, so it drops aggregated samples in this case. - Clarify the description for -remoteWrite.disableOnDiskQueue command-line flag at -help output, so it is clear that this flag can be set individually per each -remoteWrite.url. - Make the -remoteWrite.dropSamplesOnOverload flag global. If some of the remote storage systems are configured with the disabled on-disk queue, then there is no sense in keeping samples on some of these systems, while dropping samples on the remaining systems, since this will result in global stall on the remote storage system with the disabled on-disk queue and with the -remoteWrite.dropSamplesOnOverload=false flag. vmagent will always return false from remotewrite.TryPush() in this case. This will result in infinite duplicate samples written to the remaining remote storage systems. That's why the -remoteWrite.dropSamplesOnOverload is forcibly set to true if more than one -remoteWrite.disableOnDiskQueue flag is set. This allows proceeding with newly scraped / pushed samples by sending them to the remaining remote storage systems, while dropping them on overloaded systems with the -remoteWrite.disableOnDiskQueue flag set. - Verify that the remoteWriteCtx.TryPush() returns true in the TestRemoteWriteContext_TryPush_ImmutableTimeseries test. - Mention in vmagent docs that the -remoteWrite.disableOnDiskQueue command-line flag can be set individually per each -remoteWrite.url. See https://docs.victoriametrics.com/vmagent/#disabling-on-disk-persistence Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6248 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6065	2024-07-13 02:25:19 +02:00

1 2 3 4 5 ...

3278 commits