### Describe Your Changes
Log when the data response from vmselect is partial during
rule (recording and alerting rule) evaluations.
vmselect returns `isPartial: true` when the data could not be fully fetched
from the scattered vmstorage nodes. A rule evaluated on such a response may
drift from the real values because of the missing points. This is an important
event that should be logged, so users can see how often it happens,
as it may lead to false positive alerts.
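A minimal sketch of the idea, assuming a simplified response struct rather than vmalert's actual datasource types (the log wording is illustrative):

```go
package datasource

import (
	"encoding/json"
	"log"
)

// promResponse is a simplified view of the /api/v1/query response;
// vmselect in cluster setups adds the top-level "isPartial" flag.
type promResponse struct {
	Status    string          `json:"status"`
	IsPartial bool            `json:"isPartial"`
	Data      json.RawMessage `json:"data"`
}

// parseAndWarn parses the response body and logs a warning when the
// datasource reported a partial result for the evaluated query.
func parseAndWarn(body []byte, query string) (*promResponse, error) {
	var r promResponse
	if err := json.Unmarshal(body, &r); err != nil {
		return nil, err
	}
	if r.IsPartial {
		// hypothetical log message; the real wording in vmalert may differ
		log.Printf("WARN: datasource returned partial response for query %q; "+
			"rule evaluation may be inaccurate due to missing points", query)
	}
	return &r, nil
}
```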
### Checklist
The following checks are **mandatory**:
- [x] My change adheres [VictoriaMetrics contributing
guidelines](https://docs.victoriametrics.com/contributing/).
---------
Signed-off-by: emreya <emre.yazici@adyen.com>
Signed-off-by: emreya <e.yazici1990@gmail.com>
Signed-off-by: Emre Yazici <e.yazici1990@gmail.com>
(cherry picked from commit 56f60e8be9)
* vmalert: properly attach tenant labels `vm_account_id` and `vm_project_id` to alerting rules when enabling `-clusterMode`
Previously, these labels were lost in alert messages to Alertmanager. The bug was introduced in [v1.112.0](https://github.com/VictoriaMetrics/VictoriaMetrics/releases/tag/v1.112.0).
- Move lib/httputil.Transport to lib/promauth.NewTLSTransport. Remove the first arg to this function (URL),
since it has zero relation to the created transport.
- Move lib/httputil.TLSConfig to lib/promauth.NewTLSConfig. Re-use the existing functionality
from lib/promauth.Config for creating TLS config. This enables the following features:
- Ability to load key, cert and CA files from http urls.
- Ability to change the key, cert and CA files without the need to restart the service.
It automatically re-loads the new files after they change.
Commit 9ca74d1fff introduced an issue with metrics registration. Because metrics.Summary was always registered in the global state of the metrics package, vmalert's memory and CPU usage grew after multiple configuration reloads.
This commit addresses the issue and registers metrics.Summary properly. Metrics for groups and rules must now be explicitly registered via the group.Init method before group.Start. This simplifies metrics usage and ensures that all needed metrics are registered and the group is ready to start.
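A rough sketch of the approach using the github.com/VictoriaMetrics/metrics package; the Group fields, metric name and Init/Close shape are illustrative rather than vmalert's exact API:

```go
package rule

import (
	"fmt"

	"github.com/VictoriaMetrics/metrics"
)

// Group is a simplified stand-in for vmalert's rule group.
type Group struct {
	Name         string
	set          *metrics.Set
	execDuration *metrics.Summary
}

// Init registers the group metrics in a dedicated Set instead of the
// package-global state, so repeated config reloads don't leak metrics.
func (g *Group) Init() {
	g.set = metrics.NewSet()
	g.execDuration = g.set.NewSummary(
		fmt.Sprintf(`vmalert_iteration_duration_seconds{group=%q}`, g.Name))
	metrics.RegisterSet(g.set)
}

// Close drops all metrics owned by the group; on the next reload a fresh
// Set is created, so nothing accumulates in the global registry.
func (g *Group) Close() {
	g.set.UnregisterAllMetrics()
}
```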
Related issue:
https://github.com/VictoriaMetrics/VictoriaMetrics/issues/8532
1. Fix a possible data race on the group checksum when reload is called
concurrently. Previously the impact was minor, but the group could be updated
one extra time.
2. Remove the unnecessary g.mu.RLock() and compute group.id at newGroup creation. A change of group.ID()
indicates that the type or interval has changed and the group is effectively new.
Related PR:
https://github.com/VictoriaMetrics/VictoriaMetrics/pull/8540
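A sketch of computing such an ID once at creation time (the hashed fields are an assumption; vmalert's actual ID may be built differently):

```go
package rule

import (
	"fmt"
	"hash/fnv"
	"time"
)

// newGroupID computes a stable identifier once, at newGroup creation,
// so reads of the ID no longer need g.mu.RLock(). If the type or the
// evaluation interval changes, the ID changes and the group is treated as new.
func newGroupID(file, name, typ string, interval time.Duration) uint64 {
	h := fnv.New64a()
	fmt.Fprintf(h, "%s\xff%s\xff%s\xff%d", file, name, typ, interval)
	return h.Sum64()
}
```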
Previous commit 9ca74d1fff added a regression for notifier metrics exposed by vmalert. vmalert returned new notifier instances for the blackhole notifier type and registered new metrics every time the get-notifiers function was called. The duplicate metric registrations led to an OOM crash.
This commit properly initializes blackhole notifier instances and registers their metrics only once, during application start.
Related issue:
https://github.com/VictoriaMetrics/VictoriaMetrics/issues/8532
### Describe Your Changes
Fix many spelling errors and some grammar, including misspellings in
filenames.
The change also renames the metric `vm_mmaped_files` to `vm_mmapped_files` to fix a typo.
While this is a breaking change, the metric isn't used in alerts or dashboards,
so the impact on users should be low.
The change also deprecates `cspell` as it is much heavier and less usable.
---------
Co-authored-by: Andrii Chubatiuk <achubatiuk@victoriametrics.com>
Co-authored-by: Andrii Chubatiuk <andrew.chubatiuk@gmail.com>
### Describe Your Changes
Add the `vmalert_alerts_send_latency_seconds` metric for
alertmanager.notifier.
It measures how long Alertmanager calls take to send alerts, per notifier,
so the latency of each notifier used by vmalert can be observed.
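A sketch of how such a per-notifier summary could be maintained with the github.com/VictoriaMetrics/metrics package (the `addr` label name and the wrapper shape are assumptions):

```go
package notifier

import (
	"fmt"
	"time"

	"github.com/VictoriaMetrics/metrics"
)

// sendAlerts wraps the actual send call and records how long the
// Alertmanager request took for this particular notifier address.
func sendAlerts(addr string, send func() error) error {
	latency := metrics.GetOrCreateSummary(
		fmt.Sprintf(`vmalert_alerts_send_latency_seconds{addr=%q}`, addr))
	start := time.Now()
	err := send()
	latency.UpdateDuration(start) // observes time.Since(start) in seconds
	return err
}
```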
### Checklist
The following checks are **mandatory**:
- [x] My change adheres [VictoriaMetrics contributing
guidelines](https://docs.victoriametrics.com/contributing/).
---------
Signed-off-by: emreya <e.yazici1990@gmail.com>
Co-authored-by: Hui Wang <haley@victoriametrics.com>
Co-authored-by: Roman Khavronenko <hagen1778@gmail.com>
VictoriaLogs inserts the `_time` field as a label in the result when a query uses the
[time buckets stats
pipe](https://docs.victoriametrics.com/victorialogs/logsql/#stats-by-time-buckets),
which makes the result meaningless and may lead to cardinality issues.
```
curl --location --request POST 'https://play-vmlogs.victoriametrics.com/select/logsql/stats_query?query=_time%3A1m%20%7C%20stats%20by%20(_time%3A10s)%20count%20()%20as%20total'

{"status":"success","data":{"resultType":"vector","result":[{"metric":{"__name__":"total","_time":"2025-01-24T12:31:30Z"},"value":[1737721904.4476516,"12"]},{"metric":{"__name__":"total","_time":"2025-01-24T12:31:10Z"},"value":[1737721904.4476516,"10"]},{"metric":{"__name__":"total","_time":"2025-01-24T12:31:00Z"},"value":[1737721904.4476516,"10"]},{"metric":{"__name__":"total","_time":"2025-01-24T12:31:20Z"},"value":[1737721904.4476516,"12"]},{"metric":{"__name__":"total","_time":"2025-01-24T12:30:50Z"},"value":[1737721904.4476516,"10"]},{"metric":{"__name__":"total","_time":"2025-01-24T12:30:40Z"},"value":[1737721904.4476516,"9"]}]}}
```
---------
Signed-off-by: hagen1778 <roman@victoriametrics.com>
Co-authored-by: hagen1778 <roman@victoriametrics.com>
Since the funcs `ParseDuration` and `ParseTimeMsec` are used in vlogs,
vmalert, victoriametrics and other components, importing promutils only
for this reason forces them to export the irrelevant
`vm_rows_invalid_total{type="prometheus"}` metric.
This change removes the `vm_rows_invalid_total{type="prometheus"}` metric
from the /metrics page of these components.
### Describe Your Changes
Please provide a brief description of the changes you made. Be as
specific as possible to help others understand the purpose and impact of
your modifications.
### Checklist
The following checks are **mandatory**:
- [ ] My change adheres [VictoriaMetrics contributing
guidelines](https://docs.victoriametrics.com/contributing/).
---------
Signed-off-by: hagen1778 <roman@victoriametrics.com>
Previously, if rule group parameters were changed, metrics related to alerting rules could be deleted due to a bug in the `utils/metrics` package.
This commit introduces a `metrics.Set` per rule group. It holds the group and alerting rule metrics, properly unregisters alerting rule metrics and addresses the issue.
In addition:
- expose group metrics only once the group is started - this helps to avoid
exposing metrics for groups which are created during YAML unmarshaling
and only used to update an existing group;
- properly close rules which are discarded after updating existing rules,
so that their metrics are also correctly closed;
- detect file renames and properly recreate groups "moved" between files.
Related issue:
https://github.com/VictoriaMetrics/VictoriaMetrics/issues/8229
Currently, vmalert uses a fixed 10-second client timeout for notifiers,
which can prevent large sets of alerts from being sent successfully.
This change introduces the `-notifier.sendTimeout` flag to vmalert to control the
client timeout for the notifiers.
Related issue:
https://github.com/VictoriaMetrics/VictoriaMetrics/issues/8287
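A minimal sketch of how such a flag maps to the notifier HTTP client (not the exact vmalert wiring):

```go
package notifier

import (
	"flag"
	"net/http"
	"time"
)

var sendTimeout = flag.Duration("notifier.sendTimeout", 10*time.Second,
	"Timeout for sending alerts to the notifier")

// newClient builds the HTTP client used for Alertmanager requests;
// the previously hard-coded 10s timeout becomes configurable.
func newClient() *http.Client {
	return &http.Client{Timeout: *sendTimeout}
}
```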
### Describe Your Changes
There is an issue described in #8040; this should fix it:
- The alerts slice is shared across multiple goroutines (since send() is
called concurrently).
- `alerts[:0]` creates a new slice header, but it still references the
same underlying array.
- Appending (append(alertsToSend, a)) modifies the underlying array,
which may also be used by another goroutine.
Solution: Use a separate slice copy for each goroutine.
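Roughly, the difference looks like this (the `Alert` type and `keep` filter are simplified stand-ins):

```go
package notifier

// Alert and keep() stand in for vmalert's actual types; the point is only
// the slice handling.
type Alert struct{ Name string }

func keep(a Alert) bool { return a.Name != "" }

// buggyFilter reuses the caller's backing array: alerts[:0] creates a new
// slice header, but appends still write into memory shared with every other
// goroutine that received the same alerts slice.
func buggyFilter(alerts []Alert) []Alert {
	alertsToSend := alerts[:0]
	for _, a := range alerts {
		if keep(a) {
			alertsToSend = append(alertsToSend, a)
		}
	}
	return alertsToSend
}

// fixedFilter gives each goroutine its own copy, so concurrent send() calls
// can no longer stomp on each other's data.
func fixedFilter(alerts []Alert) []Alert {
	alertsToSend := make([]Alert, 0, len(alerts))
	for _, a := range alerts {
		if keep(a) {
			alertsToSend = append(alertsToSend, a)
		}
	}
	return alertsToSend
}
```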
### Checklist
The following checks are **mandatory**:
- [x] My change adheres [VictoriaMetrics contributing
guidelines](https://docs.victoriametrics.com/contributing/).
---------
Co-authored-by: Evgeny Kuzin <evgeny@hudson-trading.com>
Co-authored-by: Hui Wang <haley@victoriametrics.com>
Mention explicitly that `remoteWrite.concurrency` depends on the number
of available CPU cores. Updated the docs to remove the auto-printed default value.
Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/8151
Signed-off-by: hagen1778 <roman@victoriametrics.com>
The new version has additional checks and reduced resource consumption, so
it doesn't time out for our internal repos.
To make the linter happy, I addressed the "redefinition of the built-in
function" lint error.
----
Signed-off-by: hagen1778 <roman@victoriametrics.com>
Commit c7fc0d0d2f enabled skipping alerts
when no labels are present for an alert. This made the clause which
was adding a comma for the JSON list incorrect, as it is not possible to
determine whether the next alert will be skipped.
This fix renders all alert labels in advance, which allows properly formatting the
JSON payload for the Alertmanager notification.
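A simplified sketch of the approach (the `Alert` type and rendering are illustrative, not vmalert's actual payload code):

```go
package notifier

import (
	"encoding/json"
	"strings"
)

type Alert struct {
	Labels      map[string]string
	Annotations map[string]string
}

// buildPayload renders every alert up front and only then joins them with
// commas, so the separator never depends on whether the *next* alert is
// going to be skipped for having no labels.
func buildPayload(alerts []Alert) (string, error) {
	rendered := make([]string, 0, len(alerts))
	for _, a := range alerts {
		if len(a.Labels) == 0 {
			continue // Alertmanager requires at least one label pair
		}
		b, err := json.Marshal(a)
		if err != nil {
			return "", err
		}
		rendered = append(rendered, string(b))
	}
	return "[" + strings.Join(rendered, ",") + "]", nil
}
```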
Related issue:
https://github.com/VictoriaMetrics/VictoriaMetrics/issues/7985
Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>
Previously, since the labels slice was reused for both `ALERTS` and
`ALERTS_FOR_STATE`, metrics could get incorrect labels, affecting the
restore process. The fix is tested under `TestAlertingRule_Exec:
"for-pending=>empty"`.
The bug was introduced in
282f13cf11.
Affected versions: v1.106.1, v1.107...v1.108.x
related issue:
https://github.com/VictoriaMetrics/VictoriaMetrics/issues/7796
Previously, after a configuration reload the `externalURL` templating function defined for external templates could be lost, since it was added only at the initial `Load` call and never copied during the template reload process.
External templates for vmalert can be defined via the `-rule.templates` flag.
This commit properly reloads external templates. It no longer copies mutated templates and instead fully reloads them each time there are any changes.
The previous commit b09272ccac added a regression which could lead to overwrites of the global
template state.
The issue is related to how `vmalert` inherits templates. It has global templates, which can be changed via the `-rule.templates` flag, and local templates defined per labels/annotations for rules and groups.
During labels/annotations templating the state can be changed via the `define` syntax.
This commit restores the previous behavior with a `Clone` call for templates before templating labels/annotations.
Affected releases:
- 1.106.1
- v1.102.7
- v1.97.12
Related issue:
https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6894
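A sketch with text/template showing why the Clone matters (the surrounding vmalert plumbing is omitted and the function shape is an assumption):

```go
package templates

import (
	"strings"
	"text/template"
)

// executeWithLocalDefines runs label/annotation templating for a single rule.
// Cloning first means any `define` blocks coming from the rule only live in
// the clone, so the shared global template state is never overwritten.
func executeWithLocalDefines(global *template.Template, localDefines, text string, data any) (string, error) {
	tmpl, err := global.Clone()
	if err != nil {
		return "", err
	}
	if localDefines != "" {
		// per-rule {{define ...}} blocks are parsed only into the clone
		if _, err := tmpl.Parse(localDefines); err != nil {
			return "", err
		}
	}
	if _, err := tmpl.Parse(text); err != nil {
		return "", err
	}
	var b strings.Builder
	if err := tmpl.Execute(&b, data); err != nil {
		return "", err
	}
	return b.String(), nil
}
```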
Previously, when an alert got resolved shortly before the vmalert
process shut down, this could result in false alerts.
This change switches vmalert to using a MetricsQL function during alerts state restore, which makes
state restoration incompatible with PromQL.
---------
Co-authored-by: Roman Khavronenko <roman@victoriametrics.com>
1. Avoid storing the last evaluation results outside of rules, and check for
stale time series as soon as possible;
2. remove the duplicated template `Clone()`.
This pull request primarily reduces memory usage when rules produce
large volumes of results, as seen in
https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6894.
The CPU time spent on garbage collection remains high and may be
addressed in a separate PR.
Auto-adjust the `-remoteWrite.concurrency` cmd-line flag based on the number of
available CPU cores, in the same way as vmagent does. With this change,
the default behavior of vmalert in high-loaded installations should
become more resilient. This change also reduces
`-remoteWrite.flushInterval` from `5s` to `2s` to provide better data
freshness.
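Conceptually, the new defaults look something like this (a sketch; the real code may use cgroup-aware CPU detection rather than runtime.NumCPU):

```go
package remotewrite

import (
	"flag"
	"runtime"
	"time"
)

var (
	// The default follows the number of available CPU cores instead of a
	// fixed value of 1, similar to what vmagent does.
	concurrency = flag.Int("remoteWrite.concurrency", runtime.NumCPU(),
		"Number of concurrent writers to the remote write endpoint")
	// Flushing more often improves data freshness for generated series.
	flushInterval = flag.Duration("remoteWrite.flushInterval", 2*time.Second,
		"How often to flush buffered data to the remote write endpoint")
)
```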
---------
Signed-off-by: hagen1778 <roman@victoriametrics.com>
Co-authored-by: Nikolay <nik@victoriametrics.com>
This commit properly adds the `group_name` and `file` fields to the recording rules web API response at `/api/v1/rules`.
Previously, these fields were blank.
Related issue https://github.com/victoriaMetrics/victoriaMetrics/issues/7297
Signed-off-by: Antoine Deschênes <antoine.deschenes@linux.com>
A group
[sleeps](daa7183749/app/vmalert/rule/group.go (L320))
for a random duration before starting the evaluation, and during the sleep
`g.updateCh <- new` is blocked since there is no `<-g.updateCh` receiver
waiting.
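A sketch of the problematic pattern and a non-blocking alternative (struct fields and the update handling are simplified, not vmalert's actual code):

```go
package rule

import (
	"math/rand"
	"time"
)

type groupConfig struct{ Interval time.Duration }

type group struct {
	interval time.Duration // assumed > 0
	updateCh chan groupConfig
}

// start waits a random duration before the first evaluation. A plain
// time.Sleep here is what made `g.updateCh <- new` block: nobody was reading
// from updateCh during the sleep. Selecting on both channels keeps the
// sender unblocked even if a reload arrives before the first evaluation.
func (g *group) start() {
	randSleep := time.Duration(rand.Int63n(int64(g.interval)))
	t := time.NewTimer(randSleep)
	select {
	case <-t.C:
		// initial jitter elapsed, proceed to the evaluation loop
	case cfg := <-g.updateCh:
		t.Stop()
		g.interval = cfg.Interval // apply the reloaded config right away
	}
	// ... evaluation loop would follow here
}
```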
---------
Signed-off-by: hagen1778 <roman@victoriametrics.com>
Co-authored-by: hagen1778 <roman@victoriametrics.com>
### Describe Your Changes
This PR adds support for parsing multi-document YAML files separated by
`\n---\n`.
The issue is
[6753](https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6753)
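A sketch of parsing such multi-document files with gopkg.in/yaml.v3 (the `GroupsFile` structure is illustrative, not vmalert's config types):

```go
package config

import (
	"bytes"
	"errors"
	"io"

	"gopkg.in/yaml.v3"
)

type GroupsFile struct {
	Groups []map[string]any `yaml:"groups"`
}

// parseMultiDoc decodes every YAML document separated by "\n---\n"
// and merges their groups, instead of stopping after the first document.
func parseMultiDoc(data []byte) ([]map[string]any, error) {
	dec := yaml.NewDecoder(bytes.NewReader(data))
	var all []map[string]any
	for {
		var f GroupsFile
		if err := dec.Decode(&f); err != nil {
			if errors.Is(err, io.EOF) {
				break // no more documents
			}
			return nil, err
		}
		all = append(all, f.Groups...)
	}
	return all, nil
}
```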
### Checklist
The following checks are **mandatory**:
- [x] My change adheres [VictoriaMetrics contributing
guidelines](https://docs.victoriametrics.com/contributing/).
---------
Signed-off-by: kirti purohit <kirti.purohit@hpe.com>
Co-authored-by: kirti purohit <kirti.purohit@hpe.com>
Co-authored-by: Jiekun <jiekun@victoriametrics.com>
Co-authored-by: hagen1778 <roman@victoriametrics.com>
### Describe Your Changes
The flags docs mention a flag that does not exist (and never existed).
Perhaps it was a typo.
`s/retryMaxInterval/retryMaxTime/g`
### Checklist
The following checks are **mandatory**:
- [x] My change adheres [VictoriaMetrics contributing
guidelines](https://docs.victoriametrics.com/contributing/).
Signed-off-by: Artem Fetishev <rtm@victoriametrics.com>
The change should help users to understand what happens on labels
conflict.
### Describe Your Changes
Please provide a brief description of the changes you made. Be as
specific as possible to help others understand the purpose and impact of
your modifications.
### Checklist
The following checks are **mandatory**:
- [ ] My change adheres [VictoriaMetrics contributing
guidelines](https://docs.victoriametrics.com/contributing/).
Signed-off-by: hagen1778 <roman@victoriametrics.com>
* `remoteWrite.maxQueueSize` from `100_000` to `1_000_000`, this should
improve resiliency of recording rules that produce many series;
* `remoteWrite.maxBatchSize` from `1_000` to `10_000`, this should be
more efficient from a networking perspective;
* `remoteWrite.concurrency` from `1` to `4`, this should improve the speed
of sending the generated series.
The new settings should improve remote write performance of vmalert with
default settings.
### Describe Your Changes
Please provide a brief description of the changes you made. Be as
specific as possible to help others understand the purpose and impact of
your modifications.
### Checklist
The following checks are **mandatory**:
- [ ] My change adheres [VictoriaMetrics contributing
guidelines](https://docs.victoriametrics.com/contributing/).
---------
Signed-off-by: hagen1778 <roman@victoriametrics.com>
Co-authored-by: Hui Wang <haley@victoriametrics.com>
Also tried to make the spell checker catch "Authorisation" in the future and fixed a lot
of other misspellings along the way, but didn't manage to make it catch
"Authorisation" in the end.
- Fix misspelled "Authorization" header name
- Fix misspelled "organization"
- Fix more misspellings
…pair
`alert_relabel_configs` in the [notifier
config](https://docs.victoriametrics.com/vmalert/#notifier-configuration-file)
can drop all alert labels when used to filter alert messages of different tenants
to different notifiers.
Alertmanager would report an error like `msg="Failed to validate alerts"
err="at least one label pair required"` in this case, but the rest of
the alerts inside the same request would still be valid in Alertmanager, so
it's not severe.
Recent versions of `docker build` started generating the InvalidDefaultArgInFrom warning if a Dockerfile contains
an ARG without a default value. While this warning doesn't affect building Docker packages via `make package-*` commands,
it is better to suppress it, so that it doesn't clutter the `make package-*` output with noise,
which could hide real issues in the future.
This change is made in an attempt to reduce memory usage by vmalert when
parsing big instant responses from VM/Prometheus.
In
a5c427bac4
vmalert switched from the std json lib to the fastjson lib in order to reduce
the amount of allocations, as according to profiles of high-loaded vmalert
the CPU is mostly spent on GC.
But switching to fastjson resulted in excessive memory usage for cases
when vmalert has to parse long JSON lines, which usually happens when
an instant response contains many `metric` objects.
In this change we do mixed parsing:
1. The slice of `metric` objects is parsed with the std lib to keep memory low.
2. Each `metric` object is parsed with fastjson to reduce allocations.
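In rough terms the mixed approach looks like this (struct names are simplified, not the actual datasource types):

```go
package datasource

import (
	"encoding/json"
	"fmt"

	"github.com/valyala/fastjson"
)

// parseInstantMixed splits the work: encoding/json only walks the envelope
// and leaves every result entry as a raw blob (keeping memory low for long
// result lines), while fastjson parses each blob (keeping allocations low).
func parseInstantMixed(body []byte) error {
	var envelope struct {
		Data struct {
			Result []json.RawMessage `json:"result"`
		} `json:"data"`
	}
	if err := json.Unmarshal(body, &envelope); err != nil {
		return fmt.Errorf("cannot parse response envelope: %w", err)
	}
	var p fastjson.Parser
	for _, raw := range envelope.Data.Result {
		v, err := p.ParseBytes(raw)
		if err != nil {
			return fmt.Errorf("cannot parse metric object: %w", err)
		}
		labels := v.GetObject("metric") // label name/value pairs
		value := v.Get("value")         // [timestamp, "value"] pair
		_ = labels
		_ = value
	}
	return nil
}
```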
The benchmark results are the following:
```
pkg: github.com/VictoriaMetrics/VictoriaMetrics/app/vmalert/datasource
BenchmarkParsePrometheusResponse/Instant_std+fastjson-10 1760 668959 ns/op 280147 B/op 5781 allocs/op
MBs allocated at heap: 493.078392
mallocs: 18655472
BenchmarkParsePrometheusResponse/Instant_fastjson-10 6109 198258 ns/op 172839 B/op 5548 allocs/op
MBs allocated at heap: 1056.384464
mallocs: 34457184
BenchmarkParsePrometheusResponse/Instant_std-10 1287 950987 ns/op 451677 B/op 9619 allocs/op
MBs allocated at heap: 580.802976
mallocs: 13351636
```
The benchmark function code with mem measurement is available here
https://gist.github.com/hagen1778/b9c3ca7f8ca7d6b21aec9777112c5810
The benchmark contains 3 results:
1. Instant_std+fastjson is the implementation in this change
2. Instant_fastjson-10 is the implementation from
a5c427bac4
3. BenchmarkParsePrometheusResponse/Instant_std-10 is the implementation
before
a5c427bac4
According to these results, the new implementation is slower than the
previous one, but faster than before switching to fastjson. It also has a
lower number of allocations and roughly the same heap memory allocation
with GC turned off.
---------
Other changes:
1. rm BenchmarkMetrics as it doesn't measure anything
2. simplify BenchmarkParsePrometheusResponse into
BenchmarkPromInstantUnmarshal
### Describe Your Changes
Please provide a brief description of the changes you made. Be as
specific as possible to help others understand the purpose and impact of
your modifications.
### Checklist
The following checks are **mandatory**:
- [ ] My change adheres [VictoriaMetrics contributing
guidelines](https://docs.victoriametrics.com/contributing/).
Signed-off-by: hagen1778 <roman@victoriametrics.com>
to allow configuring additional headers in each request to the
corresponding notifier.
Other flags like `-datasource.headers` and `-remoteWrite.headers` already
use `^^` as the delimiter, so it's consistent to use it in `-notifier.headers`
as well.
related https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3260
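A sketch of how such a `^^`-delimited value could be applied to notifier requests (the flag handling in vmalert may differ):

```go
package notifier

import (
	"net/http"
	"strings"
)

// applyHeaders parses a value like "TenantID: foo^^Authorization: Bearer abc"
// and sets each pair on the outgoing request to the notifier.
func applyHeaders(req *http.Request, headersFlag string) {
	if headersFlag == "" {
		return
	}
	for _, h := range strings.Split(headersFlag, "^^") {
		kv := strings.SplitN(h, ":", 2)
		if len(kv) != 2 {
			continue // skip malformed entries in this sketch
		}
		req.Header.Set(strings.TrimSpace(kv[0]), strings.TrimSpace(kv[1]))
	}
}
```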
vmalert can integrate with an Alertmanager that supports multi-tenancy by
adding the tenantID header `X-Scope-OrgID` to requests.
In a multitenant setup, vmalert can also filter which alerts are sent to different
notifier addresses (or with different header settings) using
`alert_relabel_configs`.
Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3260
---------
Signed-off-by: hagen1778 <roman@victoriametrics.com>
Co-authored-by: hagen1778 <roman@victoriametrics.com>