github-mirrors/VictoriaMetrics

mirror of https://github.com/VictoriaMetrics/VictoriaMetrics.git synced 2024-12-01 14:47:38 +00:00

Author	SHA1	Message	Date
Roman Khavronenko	c6e0780f4b	app/vmalert: update parsing for instant responses (#6859) This change is made in an attempt to reduce memory usage by vmalert when parsing big instant responses from VM/Prometheus. In `a5c427bac4` vmalert switched from the std json lib to the fastjson lib in order to reduce the amount of allocations, since according to profiles of highly loaded vmalert instances the CPU is mostly spent on GC. But switching to fastjson resulted in excessive memory usage for cases when vmalert has to parse long json lines, which usually happens when an instant response contains many `metric` objects. In this change we do mixed parsing: 1. Slice of `metric` objects is parsed with std lib to keep mem low 2. Each `metric` object is parsed with fastjson to reduce allocs The benchmark results are the following: ``` pkg: github.com/VictoriaMetrics/VictoriaMetrics/app/vmalert/datasource BenchmarkParsePrometheusResponse/Instant_std+fastjson-10 1760 668959 ns/op 280147 B/op 5781 allocs/op MBs allocated at heap: 493.078392 mallocs: 18655472 BenchmarkParsePrometheusResponse/Instant_fastjson-10 6109 198258 ns/op 172839 B/op 5548 allocs/op MBs allocated at heap: 1056.384464 mallocs: 34457184 BenchmarkParsePrometheusResponse/Instant_std-10 1287 950987 ns/op 451677 B/op 9619 allocs/op MBs allocated at heap: 580.802976 mallocs: 13351636 ``` The benchmark function code with mem measurement is available here https://gist.github.com/hagen1778/b9c3ca7f8ca7d6b21aec9777112c5810 The benchmark contains 3 results: 1. Instant_std+fastjson is the implementation in this change 2. Instant_fastjson-10 is the implementation from `a5c427bac4` 3. BenchmarkParsePrometheusResponse/Instant_std-10 is the implementation before `a5c427bac4` According to these results, the new implementation is slower than the previous one, but faster than the one used before switching to fastjson. It also has a lower number of allocations and roughly the same memory allocation on heap with GC turned off. --------- Other changes: 1. rm BenchmarkMetrics as it doesn't measure anything 2. simplify BenchmarkParsePrometheusResponse into BenchmarkPromInstantUnmarshal Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-08-22 23:56:11 +02:00
Aliaksandr Valialkin	d6415b2572	all: consistently use 'any' instead of 'interface{}' 'any' type is supported starting from Go1.18. Let's consistently use it instead of 'interface{}' type across the code base, since `any` is easier to read than 'interface{}'.	2024-07-10 00:23:26 +02:00
Roman Khavronenko	a5c427bac4	app/vmalert/datasource: reduce number of allocations when parsing instant responses (#6272 ) Allocations are reduced by implementing custom json parser via fastjson lib. The change also re-uses `promInstant` object in attempt to reduce number of allocations when parsing big responses, as usually happens with heavy recording rules. ``` name old allocs/op new allocs/op delta ParsePrometheusResponse/Instant-10 9.65k ± 0% 5.60k ± 0% ~ (p=1.000 n=1+1) ``` Signed-off-by: hagen1778 <roman@victoriametrics.com> --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `4f0525852f`)	2024-05-16 09:35:58 +02:00
Hui Wang	349564fd82	vmalert: deprecate cmd-line flag `-datasource.lookback` (#5877 ) * vmalert: deprecate cmd-line flag `-datasource.lookback` * fix lint * review fixes Signed-off-by: hagen1778 <roman@victoriametrics.com> --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `e80b44f19d`)	2024-03-12 16:17:40 +01:00
Aliaksandr Valialkin	36a1fdca6c	all: consistently use %w instead of %s when an error is passed to fmt.Errorf() This allows consistently using errors.Is() for verifying whether the given error wraps some other known error.	2023-10-26 09:44:40 +02:00
Haleygo	b52f1d1f0a	vmalert: add `evalAlignment` for rule group and fix evaluation timestamp (#5066) * vmalert: add `query_time_alignment` for rule group 1. add `eval_alignment` attribute for group, which is true by default, so the group rule query timestamp will be aligned with the interval and propagated to ALERT metrics and the messages for alertmanager; 2. deprecate `datasource.queryTimeAlignment` flag. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5049 (cherry picked from commit `2aa0f5fc41`)	2023-10-10 12:45:37 +02:00
Haleygo	0212219f6c	vmalert: add `eval_offset` for group (#4693 ) Adds `eval_offset` attribute for Groups. If specified, Group will be evaluated at the exact time offset on the range of [0...evaluationInterval]. The setting might be useful for cron-like rules which must be evaluated at specific moments of time. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3409 Signed-off-by: Haley Wang <pipilong.25@gmail.com> Co-authored-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `45c0e4bb31`)	2023-09-07 10:59:14 +02:00
Haleygo	3c2308fd52	vmalert:fix query request using rfc3339 format (#4577 ) vmalert: consistently use time.RFC3339 format for time in queries Co-authored-by: hagen1778 <roman@victoriametrics.com>	2023-07-09 11:03:10 -07:00
Roman Khavronenko	109e55f865	vmalert: allow disabling of `step` param attached to instant queries (#4574) vmalert: allow disabling of `step` param attached to instant queries This might be useful for using vmalert with datasources that do not support this param, unlike VictoriaMetrics. See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4573 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-07-06 23:13:56 -07:00
Roman Khavronenko	4edb97f4da	app/vmalert: detect alerting rules which don't match any series at all (#4198 ) app/vmalert: detect alerting rules which don't match any series at all vmalert starts to understand /query responses which contain object: ``` "stats":{"seriesFetched": "42"} ``` If object is present, vmalert parses it and populates a new field `SeriesFetched`. This field is then used to populate the new metric `vmalert_alerting_rules_last_evaluation_series_fetched` and to display warnings in the vmalert's UI. If response doesn't contain the new object (Prometheus or VictoriaMetrics earlier than v1.90), then `SeriesFetched=nil`. In this case, UI will contain no additional warnings. And `vmalert_alerting_rules_last_evaluation_series_fetched` will be set to `-1`. Negative value of the metric will help to compile correct alerting rule in follow-up. Thanks for the initial implementation to @Haleygo See https://github.com/VictoriaMetrics/VictoriaMetrics/pull/4056 See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4039 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-05-09 21:48:59 -07:00
Roman Khavronenko	a922308438	vmalert: reduce allocations for Prometheus resp parse (#3435 ) Method `metrics()` now pre-allocates slices for labels and results from query responses. This reduces the number of allocations on the hot path for instant requests. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-12-05 00:18:11 -08:00
Roman Khavronenko	31ca22109e	vmalert: fix replay step param (#3428) A recent change to the default value of the `datasource.queryStep` flag resulted in a situation where replay mode was always running queries with step=`datasource.queryStep`, when it should always use the rule's evaluation interval. The fix applies not only to replay mode, but to all Range requests. Now the step param is set individually for each mode. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-12-02 19:09:30 -08:00
Roman Khavronenko	d814c83b21	vmalert: remove a line added for debug (#2611 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-05-20 14:08:57 +03:00
Roman Khavronenko	2aeb00f98f	vmalert: support `scalar` type in response (#2610 ) See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2607 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-05-20 14:08:19 +03:00
Aliaksandr Valialkin	83ff4c411d	app/vmalert: apply `-remoteRead.disablePathAppend` to `-datasource.url` in the same way as for the `-remoteRead.url` This is a follow-up for `0e2486df56` The related pull requests: - https://github.com/VictoriaMetrics/VictoriaMetrics/pull/1536 - https://github.com/VictoriaMetrics/VictoriaMetrics/pull/1712	2022-05-13 16:59:16 +03:00
Roman Khavronenko	35bf5bf688	Vmalert compliance improvements (#2320 ) * vmalert: add support for `sortByLabel` template function * vmalert: update API according to Prometheus conformance program The changes to the API, field names and URL path has been made according to the Prometheus specification for `alert_generator` https://github.com/prometheus/compliance/blob/main/alert_generator/specification.md * vmalert: fix the timestamp of the evaluated rules The timestamp used for alert's `EndsAt` was calculated before sending the notification. While the correct way is to use the timestamp taken right before rules evaluation. * vmalert: add `-datasource.queryTimeAlignment` flag The flag is supposed to provide ability to disable `time` param alignment when executing rules. By default, this flag is enabled, so it remains backward compatible. The flag was introduced to achieve better compatibility with Prometheus behaviour according to https://github.com/prometheus/compliance/blob/main/alert_generator/specification.md Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-03-16 13:22:26 +02:00
Roman Khavronenko	1f0301c809	vmalert: always convert `step` value to seconds for better compatibility (#1955 ) When using `vmalert` with older Prometheus versions, the passed `step=2m` may be parsed by Prometheus with an err: "cannot parse \"2m0s\" to a valid duration". In order to improve compatibility vmalert will always convert step duration to seconds. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1943 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2021-12-17 20:17:09 +02:00
Roman Khavronenko	582c063698	vmalert: introduce additional HTTP URL params per-group configuration (#1892 ) * vmalert: introduce additional HTTP URL params per-group configuration The new group field `params` allows to configure custom HTTP URL params per each group. These params will be applied to every request before executing rule's expression. Hot config reload is also supported. Field `extra_filter_labels` was deprecated in favour of `params` field. vmalert will print deprecation log message if config file contains the deprecated field. `params` fields are supported by both Prometheus and Graphite datasource types. Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: provide more examples for `params` field Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: set higher priority for `params` setting If there would be a conflict between URL params set in `datasource.url` flag and params in group definition the latter will have higher priority. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2021-12-02 14:51:54 +02:00
Alexander Rickardsson	0e1dbcd039	vmalert: add disablePathAppend to remote read (#1712 ) * vmalert: add disablePathAppend to remoteRead * docs: add docs for remoteRead.disablePathAppend	2021-10-18 14:59:17 +03:00
Alexander Rickardsson	63571e1334	vmalert: Redact passwords from error messages (#1713 )	2021-10-18 14:59:17 +03:00
Roman Khavronenko	1cf4f5a715	Vmalert extra params (#1587 ) * vmalert: allow extra GET params in datasource package ExtraParams will be added as GET params to every HTTP request made by datasource. The `roundDigits` param, for example, was substituted by corresponding extra param. * vmalert: add nocache=1 param for replay process The `nocache=1` param is VictoriaMetrics specific parameter which prevents it from caching and boundaries aligning for queries. We set it to avoid cache pollution in `replay` mode and also to avoid unnecessary time range boundaries alignment. * vmalert: mention nocache=1 in replay description * vmalert: fix bug with unused param	2021-09-01 12:20:01 +03:00
Roman Khavronenko	f3cb2158a3	vmalert: fix mistake with object reuse while parsing response (#1370) * vmalert: fix mistake with object reuse while parsing response During the refactoring, a wrong optimisation was applied in the parse function, which caused metric fields to be reset. The change removes the optimisation. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1369 * vmalert: add test to cover multiple metrics in one response	2021-06-11 11:30:07 +03:00
Roman Khavronenko	5aa7846900	vmalert: support rules backfilling (aka `replay`) (#1358 ) * vmalert: support rules backfilling (aka `replay`) vmalert can `replay` configured rules in the past and backfill results via remote write protocol. It supports MetricsQL/PromQL storage as data source, and can backfill data to remote write compatible storage. Supports recording and alerting rules `replay`. See more details in README. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/836 * vmalert: review fixes * vmalert: readme fixes	2021-06-09 12:30:54 +03:00

23 commits