github-mirrors/VictoriaMetrics

mirror of https://github.com/VictoriaMetrics/VictoriaMetrics.git synced 2024-12-01 14:47:38 +00:00

Author	SHA1	Message	Date
Hui Wang	c4fe23794a	vmalert: fix blocking hot-reload process if the old rule group hasn't started yet (#7258 ) Group [sleeps](`daa7183749/app/vmalert/rule/group.go (L320)`) random duration before start the evaluation, and during the sleep, `g.updateCh <- new` will be blocked since there is no `<-g.updateCh` waiting. --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2024-10-18 11:18:24 +02:00
Nikolay	635bdd130b	lib/storage: properly unmarshal SearchQuery (#7277 ) After adding multitenant query feature at v1.104.0, searchQuery wasn't properly unmarshalled at bottom vmselect in multi-level cluster setup. It resulted into empty query responses. This commit adds fallback to Unmarshal method of SearchQuery to fill TenantTokens. It allows to properly execute search requests at vmselect side. Related issue: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/7270 --------- Signed-off-by: f41gh7 <nik@victoriametrics.com> Co-authored-by: Roman Khavronenko <roman@victoriametrics.com>	2024-10-17 10:52:35 -03:00
Hui Wang	ab0d31a7b0	vmagent: fix type of command-line flag `-streamAggr.dedupInterval` (#7081 ) Previously unit `m` is not correctly supported. --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2024-10-17 13:27:59 +02:00
Zakhar Bessarab	65e9d19f3c	lib/flagutil/dict: properly update default value in case there is no key value set (#7211 ) ### Describe Your Changes If a dict flag has only one value without a prefix it is supposed to replace default value. Previously, when flag was set to `-flag=2` and the default value in `NewDictInt` was set to 1 the resulting value for any `flag.Get()` call would be 1 which is not expected. This commit updates default value for the flag in case there is only one entry for flag and the entry is a number without a key. This affects cluster version and specifically `replicationFactor` flag usage with vmstorage [node groups](https://docs.victoriametrics.com/cluster-victoriametrics/#vmstorage-groups-at-vmselect). Previously, the following configuration would effectively be ignored: ``` /path/to/vmselect \ -replicationFactor=2 \ -storageNode=g1/host1,g1/host2,g1/host3 \ -storageNode=g2/host4,g2/host5,g2/host6 \ -storageNode=g3/host7,g3/host8,g3/host9 ``` Changes from this PR will force default value for `replicationFactor` flag to be set to `2` which is expected as the result of this configuration. --------- Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2024-10-17 12:05:47 +02:00
Hui Wang	4984e71da6	vmalert-tool: add more syntax checks for `input_series` and `exp_samples` (#7263 ) address https://github.com/VictoriaMetrics/VictoriaMetrics/issues/7224, allow using ``` exp_samples: - labels: '{}' ``` for prometheus compatibility. --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2024-10-17 11:00:34 +02:00
Hui Wang	c90adf566e	vmalert-tool: reduce victoriametrics health check interval (#7256 ) address https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6970. This reduces the hard limit on duration for completing the test when users run vmalert-tool on slow hosts. --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2024-10-17 10:51:12 +02:00
rusttech	87910e4fa8	app/vmctl: fixes opentsdb source metric tags Previously it was incorrectly used append for pre-allocated slice of labels. This commit fixes slice append by allocating zero length slice with needed capacity. --------- Co-authored-by: Nikolay <nik@victoriametrics.com>	2024-10-16 10:35:17 +02:00
hagen1778	e347d90531	docs: update anchor level to fix menu rendering in changelog Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-10-15 20:40:58 +02:00
Yury Molodov	86029de0d4	vmui: fix alert display with long messages (#7228 ) ### Describe Your Changes Fix `Alert` component to prevent it from overflowing the screen when displaying long messages. Related issue: #7207 ### Checklist The following checks are mandatory: - [x] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). --------- Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2024-10-15 16:35:57 +02:00
Yury Molodov	6c9772b101	vmui: add the ability to cancel running queries (#7204 ) ### Describe Your Changes - Added functionality to cancel running queries on the Explore Logs and Query pages. - The loader was changed from a spinner to a top bar within the block. This still indicates loading, but solves the issue of the spinner "flickering," especially during graph dragging. Related issue: #7097 https://github.com/user-attachments/assets/98e59aeb-905b-4b9d-bbb2-688223b22a82 ### Checklist The following checks are mandatory: - [ ] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/).	2024-10-15 14:48:40 +02:00
Zakhar Bessarab	a8d8987825	lib/jwt: accept scope encoded as a slice (#790 ) Some IDPs encode scope as a slice of strings. Handle this gracefully by encoding a slice back to string. Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> - [x] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). (cherry picked from commit `f61d8c3ebb`) --------- Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com> Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-10-15 14:40:46 +02:00
Andrii Chubatiuk	daa7183749	lib/protoparser/influx: enable batch processing by default (#7165 ) ### Describe Your Changes Fixes https://github.com/VictoriaMetrics/VictoriaMetrics/issues/7090 ### Checklist The following checks are mandatory: - [ ] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2024-10-15 11:48:40 +02:00
hagen1778	22d3f67908	docs: fix typos in change line Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-10-11 16:43:50 +02:00
Andrii Chubatiuk	9eb0c1fd86	lib/protoparser/opentelemetry: added exponential histograms support (#6354 ) ### Describe Your Changes added opentelemetry exponential histograms support. Such histograms are automatically converted into VictoriaMetrics histogram with `vmrange` buckets. ### Checklist The following checks are mandatory: - [ ] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2024-10-11 13:44:52 +02:00
kirti purohit	008b649658	vmalert: parse multi doc yaml (#6995 ) ### Describe Your Changes This PR adds the feature to parse a multi yaml doc following the `\n---\n` The issue is [6753](https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6753) ### Checklist The following checks are mandatory: - [x] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). --------- Signed-off-by: kirti purohit <kirti.purohit@hpe.com> Co-authored-by: kirti purohit <kirti.purohit@hpe.com> Co-authored-by: Jiekun <jiekun@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2024-10-08 14:28:32 +02:00
Zakhar Bessarab	eefae85450	vmagent: add support of HTTP2 client for Kubernetes SD (#7114 ) ### Describe Your Changes Currently, vmagent always uses a separate `http.Client` for every group watcher in Kubernetes SD. With a high number of group watchers this leads to large amount of opened connections. This PR adds 2 changes to address this: - re-use of existing `http.Client` - in case `http.Client` is connecting to the same API server and uses the same parameters it will be re-used between group watchers - HTTP2 support - this allows to reuse connections more efficiently due to ability of using streaming via existing connections. See this issue for the details and test results - https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5971 ### Checklist The following checks are mandatory: - [ ] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). --------- Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> Co-authored-by: Roman Khavronenko <roman@victoriametrics.com>	2024-10-08 10:36:31 +02:00
Zakhar Bessarab	9b6efb5e81	make: add darwin builds for cluster (#7195 ) ### Describe Your Changes Add darwin `amd64` and `arm64` builds for cluster binaries build. ### Checklist The following checks are mandatory: - [x] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `b9115d6882`) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-10-08 10:19:58 +02:00
hagen1778	3f2bfd2ff6	docs: move `Retry-After` to the 1.104.0 notes It was mistakenly place to 1.103.0 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-10-03 15:23:03 +02:00
hagen1778	4086cef01c	docs: rm incorrectly placed bugfix change from v1.103 The change was present in v1.103 by mistake. In fact, it was released in v1.104 See `c193e6d43e` Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-10-03 09:55:07 +02:00
hagen1778	feba481ac2	docs: re-qualify `-search.maxDeleteSeries` change into feature Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-10-02 21:18:54 +02:00
hagen1778	ce81a86fc2	docs: re-order changes by priority Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-10-02 21:15:59 +02:00
hagen1778	41850995d3	docs: rm `vm_rows_ignored_total{reason="nan_value"}` It was reverted in `0d4f4b8f7d` Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-10-02 21:13:53 +02:00
hagen1778	8592fc3162	docs: add link to docs for multitenant reads Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-10-02 16:27:20 +02:00
hagen1778	36acde1d11	docs: add missing release notes Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-10-02 14:20:55 +02:00
f41gh7	776c501cb2	CHANGELOG.md: cut v1.104.0 release	2024-10-01 16:55:04 +02:00
f41gh7	076a1f84e1	vmselect: add support of multi-tenant queries Added ability to query data across multiple tenants. See: VictoriaMetrics/VictoriaMetrics#1434 Currently, the following endpoints work with multi-tenancy: - /prometheus/api/v1/query - /prometheus/api/v1/query_range - /prometheus/api/v1/series - /prometheus/api/v1/labels - /prometheus/api/v1/label/<label_name>/values - /prometheus/api/v1/status/active_queries - /prometheus/api/v1/status/top_queries - /prometheus/api/v1/status/tsdb - /prometheus/api/v1/export - /prometheus/api/v1/export/csv - /vmui A note regarding VMUI: endpoints such as `active_queries` and `top_queries` have been updated to indicate whether query was a single-tenant or multi-tenant, but UI needs to be updated to display this info. cc: @Loori-R --------- Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> Signed-off-by: f41gh7 <nik@victoriametrics.com> Co-authored-by: f41gh7 <nik@victoriametrics.com>	2024-10-01 16:49:46 +02:00
Nikolay	fbaa026ae6	dashboards: updates operator dashboard (#7139 ) * Replaces deprecated graphs with Timeseries panels * Adds new latency dashboards for rest client and golang scheduler * Adds new overview panels * Adds VM Datasource version of dashboard --------- Signed-off-by: f41gh7 <nik@victoriametrics.com> Co-authored-by: Roman Khavronenko <roman@victoriametrics.com>	2024-09-30 15:35:39 +02:00
Zhu Jiekun	7bb8853a5c	feature: [vmagent] Add service discovery support for OVH Cloud VPS and dedicated server (#6160 ) ### Describe Your Changes related issue: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6071 #### Added - Added service discovery support for OVH Cloud: - VPS. - Dedicated server. #### Docs - `CHANGELOG.md`, `sd_configs.md`, `vmagent.md` are updated. #### Note - Useful links: - OVH Cloud VPS API: https://eu.api.ovh.com/console/#/vps~GET - OVH Cloud Dedicated server API: https://eu.api.ovh.com/console/#/dedicated/server~GET - OVH Cloud SDK: https://github.com/ovh/go-ovh - Prometheus SD: https://prometheus.io/docs/prometheus/latest/configuration/configuration/#ovhcloud_sd_config Tested on OVH Cloud VPS and dedicated server. <img width="1722" alt="image" src="https://github.com/VictoriaMetrics/VictoriaMetrics/assets/30280396/d3f0adc8-b0ef-423e-9379-8a9b9b0792ee"> <img width="1724" alt="image" src="https://github.com/VictoriaMetrics/VictoriaMetrics/assets/30280396/18b5b730-3512-4fc0-8b2c-f2450ac550fd"> --- Signed-off-by: Jiekun <jiekun@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2024-09-30 14:42:46 +02:00
Hui Wang	664f337c70	stream aggregation: fix possible duplicated aggregation results (#7118 ) When ingesting samples with the same labels(duplicated samples or samples with the same labels after `by` or `without` options). They could register different entries for the same labelset in LabelsCompressor. For example, both index 99 and 100 can be assigned to label `foo=1` in two concurrent pushes. Then due to differing label indexes in encoded keys, the samples will appear as distinct in aggrState, resulting in duplicated results after decompressing the label indexes. `fbde238cdc/lib/streamaggr/streamaggr.go (L933)` In this pull request, since we need to store `idxToLabel` first to ensure the idx can be searched after `lc.labelToIdxStore`, the `lc.idxToLabel` still could contain a duplicated entries [100]="foo=1". But given the low likelihood of this issue and the size of idxToLabel, it should be fine.	2024-09-30 14:24:59 +02:00
f41gh7	758f42fc12	docs: add Update Note for upcoming release changes Signed-off-by: f41gh7 <nik@victoriametrics.com>	2024-09-30 12:37:30 +02:00
Artem Fetishev	ed5da38ede	Introduce a flag for limiting the number of time series to delete (#7091 ) ### Describe Your Changes Introduce the `-search.maxDeleteSeries` flag that limits the number of time series that can be deleted with a single `/api/v1/admin/tsdb/delete_series` call. Currently, any number can be deleted and if the number is big (millions) then the operation may result in unaccounted CPU and memory usage spikes which in some cases may result in OOM kill (see #7027). The flag limits the number to 30k by default and the users may override it if needed at the vmstorage start time. --------- Signed-off-by: Artem Fetishev <rtm@victoriametrics.com> Co-authored-by: Nikolay <nik@victoriametrics.com>	2024-09-30 10:02:21 +02:00
Nikolay	3bbb2aed72	fscore: rollback trailing space trim (#7106 ) Previous commit `201fd6de1e` removed trailing space trim from data read from file. But common practice is to remove such trailing space. And it leaded to the authorization errors for the major group of users. In first place, this change must help to mitigate an issue with kubernetes. When authorization information was read from Secret content. Changes to the operator was made to mitigate such problem at commit `1cf64358c8` We could introduce later optional flag for VictoriaMetrics to disable trim space behavior. Related issues: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6986 https://github.com/VictoriaMetrics/VictoriaMetrics/issues/7089 https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6947 --------- Signed-off-by: f41gh7 <nik@victoriametrics.com> Co-authored-by: Zhu Jiekun <jiekun@victoriametrics.com>	2024-09-29 10:59:25 +02:00
Artem Navoiev	14a0396f53	docs: changelog fix typo in url Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-09-28 23:30:19 +02:00
Artem Navoiev	96efe99eef	docs: mention new create backup api in docs and changelog (#7104 ) ### Describe Your Changes Please provide a brief description of the changes you made. Be as specific as possible to help others understand the purpose and impact of your modifications. ### Checklist The following checks are mandatory: - [x] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-09-28 14:28:58 -07:00
Yury Molodov	25a9802ca4	vmui: add link to vmalert (#7088 ) ### Describe Your Changes Add link to VMalert when proxy is enabled. The link is displayed when the `-vmalert.proxyURL` flag is present. #5924 ![image](https://github.com/user-attachments/assets/c45ca884-8912-4bd9-a867-df5919f278a1) ### Checklist The following checks are mandatory: - [ ] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-09-27 13:22:22 +02:00
Hui Wang	fbde238cdc	stream aggregation: support configuring multiple labels per `remoteWrite… (#7073 ) ….url` using `-remoteWrite.streamAggr.dropInputLabels` Before, labels were set to all the `remoteWrite.url`. address https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6780 --------- Co-authored-by: Roman Khavronenko <roman@victoriametrics.com>	2024-09-27 12:21:09 +02:00
Yury Molodov	c896bf340d	vmui: add functionality to preserve selected columns (#7037 ) ### Describe Your Changes 1) Changed table settings from a popup to a modal window to simplify future functionality additions. 2) Added functionality to save selected columns when data is modified or the page is reloaded. See #7016. <details> <summary>Example screenshots</summary> <img alt="demo-1" width="600" src="https://github.com/user-attachments/assets/a5d9a910-363c-4931-8b12-18ea8b3d97d8"/> </details> ### Checklist The following checks are mandatory: - [x] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). --------- Co-authored-by: Roman Khavronenko <roman@victoriametrics.com>	2024-09-27 11:52:01 +02:00
Roman Khavronenko	6b1b47df54	app/vmalert: bump default values for sending data to `remoteWrite.url` (#7084 ) * `remoteWrite.maxQueueSize` from `100_000` to `1_000_000`, this should improve resiliency of recording rules that produce many series; * `remoteWrite.maxBatchSize` from `1_000` to `10_000`, this should be more efficient to send from netwroking perspective; * `remoteWrite.concurrency` from `1` to `4`, this should imrpove speed of sending the generated series. The new settings should improve remote write performance of vmalert with default settings. ### Describe Your Changes Please provide a brief description of the changes you made. Be as specific as possible to help others understand the purpose and impact of your modifications. ### Checklist The following checks are mandatory: - [ ] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: Hui Wang <haley@victoriametrics.com>	2024-09-25 15:01:39 +02:00
Zhu Jiekun	5319acb8ed	vmagent: remote write respect Retry-After in header (#6124 ) ### Describe Your Changes related issue: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6097 #### Changed - Remote write retry policy in `vmagent` is changed into: 1. Respect `Retry-After` duration if exists. 2. Otherwise, calculate next retry duration by backoff policy (x2) and max retry duration limit. #### Docs - `CHANGELOG.md`. --- ### Checklist The following checks are mandatory: - [x] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). --------- Co-authored-by: Zakhar Bessarab <me@zekker-dev.tk> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2024-09-24 12:44:03 +02:00
Dmytro Kozlov	cbeb7d50e8	lib/promscrape: show only unhealthy targets if `show_only_unhealthy` filter is enabled (#6960 ) ### Describe Your Changes It is better to show only unhealthy targets instead of all of them when `show_only_unhealthy` filter is enabled. Related issue: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3536 ### Checklist The following checks are mandatory: - [x] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). --------- Co-authored-by: Roman Khavronenko <roman@victoriametrics.com>	2024-09-24 12:18:24 +02:00
Roman Khavronenko	4d0b41e63b	deployment: add panel and alerts for displying go scheduler latency (#7078 ) The panel and alerting rule should help to understand whether VM component doesn't have enough CPU resources or gets throttled. The alert is applicable for all VM components. The panel was added to vmalert, vmagent, vmsingle, vm clusert and victorialogs dashes. ------------------- This alerting rule should have help us identify resource shortage for sandbox vmagent - see [this link](https://play.victoriametrics.com/select/accounting/1/6a716b0f-38bc-4856-90ce-448fd713e3fe/prometheus/graph/#/?g0.range_input=23d13h25m25s424ms&g0.end_input=2024-09-23T14%3A11%3A00&g0.relative_time=none&g0.tab=0&g0.expr=histogram_quantile%280.99%2C+sum%28rate%28go_sched_latencies_seconds_bucket%7Bjob%3D%22vmagent-monitoring-vmagent%22%7D%5B5m%5D%29%29+by+%28le%2C+job%2C+instance%29%29+%3E+0.1) for example. We weren't aware of resource shortage, because VM metrics assumed this vmagent had 1vCPU while in fact its limit was 0.2vCPU. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-09-23 16:54:42 +02:00
Aliaksandr Valialkin	3964889705	app/vmselect/promql: consistently replace `NaN` data points with non-`NaN` values for `range_first` and `range_last` functions It is expected that range_first and range_last functions return non-nan const value across all the points if the original series contains at least a single non-NaN value. Previously this rule was violated for NaN data points in the original series. This could confuse users. While at it, add tests for series with NaN values across all the range_* and running_* functions, in order to maintain consistent handling of NaN values across these functions.	2024-09-23 14:59:29 +02:00
Aliaksandr Valialkin	57183c9b61	docs/changelog/CHANGELOG.md: moved the description of the fix for proper usage of -streamAggr.dedupInterval and -remoteWrite.streamAggr.dedupInterval from FEATURE to BUGFIX section The previous behaviour was incorrect, since it is unexpected that the -streamAggr.dedupInterval and -remoteWrite.streamAggr.dedupInterval is applied to processed samples only if -streamAggr.config isn't set. This is a follow-up for `d523015f27` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6711	2024-09-23 08:56:33 +02:00
Aliaksandr Valialkin	0ada781cf2	docs/changelog/CHANGELOG.md: document bugfix for https://github.com/VictoriaMetrics/VictoriaMetrics/issues/7009 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/7064 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/7009 This is a follow-up for `55febc0920`	2024-09-22 21:57:48 +02:00
Hui Wang	d6d02d7aeb	vmalert: fix variable `$activeAt` value when templating rule annotation in replay mode Co-authored-by: Roman Khavronenko <roman@victoriametrics.com>	2024-09-20 11:07:40 +02:00
hagen1778	6167bccc5a	docs: fix more typos in the changelog Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-09-20 10:54:43 +02:00
hagen1778	59281d5358	docs: rm update node about loggerMaxArgLen as it doesn't have incompatibility effect Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-09-20 10:42:07 +02:00
hagen1778	6726a5aaed	docs: fix typo in link in change line about NaN Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-09-20 10:39:13 +02:00
Thomas Danielsson	258201af04	docs: fix typo in the changelog Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-09-20 08:46:29 +02:00
f41gh7	61721303fd	docs/changelog: mention vmagent kafka consumer bugfix Changes were made to the enteprise repository	2024-09-19 15:35:48 +02:00
Yury Molodov	b0bdb92729	vmui: change the `query_range` request method from `GET` to `POST` (#7039 ) ### Describe Your Changes change the `/query_range` and `/query` requests method from `GET` to `POST`. See #6288. ### Checklist The following checks are mandatory: - [x] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2024-09-19 14:30:54 +02:00
Roman Khavronenko	e115b85770	lib/logger: increase default value of `-loggerMaxArgLen` cmd-line fla… (#7008 ) …g from 1e3 to 5e3 This should improve visibility on errors produced by very long queries. The change is classified as BUG in order to port it to LTS releases. ### Describe Your Changes Please provide a brief description of the changes you made. Be as specific as possible to help others understand the purpose and impact of your modifications. ### Checklist The following checks are mandatory: - [ ] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: Mathias Palmersheim <mathias@victoriametrics.com>	2024-09-19 14:29:18 +02:00
Aliaksandr Valialkin	b82e2cabc5	app/vmselect/promql: properly calculate `c1 and c2` and `c1 or c2` by upgrading github.com/VictoriaMetrics/metricsql to v0.79.0 The fix is in the https://github.com/VictoriaMetrics/metricsql/pull/34 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6637	2024-09-18 17:38:19 +02:00
Nikolay	d8f8822fa5	lib/storage: consistently check for missing metricID index records (#6967 ) * Previously, only metricID->metricName missing index records were tracked with deadline But it was possible a case for missing metricID->TSID index records. IndexDB metrics fix exposed misleading metric for such missing records. * This commit adds check for metricID->TSID missing index records. And delete missing metricID entry if it hit 60 second deadline. Related issue https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6931 Signed-off-by: f41gh7 <nik@victoriametrics.com>	2024-09-16 10:05:08 +02:00
Nikolay	264c2ec6bd	lib/fs: properly call windows APIs (#6998 ) Previously we manually imported system windows DDLs and made direct syscall. But golang exposes syscall wrappers with sys/windows package. It seems, that direct syscall was broken at 1.23 golang release. It was `GetDiskFreeSpace` syscall in our case. This commit replaces all manual syscalls with wrappers Related issue: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6973 Related golang issue: https://github.com/golang/go/issues/69029 Signed-off-by: f41gh7 <nik@victoriametrics.com>	2024-09-13 12:22:25 +02:00
Dima Lazerka	8207879fa3	docs: fixes misspelled typos Also tried to make it catch "Authorisation" in the future, fixed a lot of other misspells along the way, but didn't make it catch "Authorisation" anyway. - Fix misspelled "Authorization" header name - Fix misspelled "organization" - Fix more misspells	2024-09-13 12:14:24 +02:00
Hui Wang	ae4d376e41	vmalert: do not send message to alertmanager when alert has no label … (#6823 ) …pair `alert_relabel_configs` in [notifier config](https://docs.victoriametrics.com/vmalert/#notifier-configuration-file) can drop alert labels when used to filter different tenant alert message to different notifier. alertmanager would report error like `msg="Failed to validate alerts" err="at least one label pair required"` in this case, but the rest of the alerts inside one request would still be valid in alertmanager, so it's not severe.	2024-09-09 13:34:48 +02:00
Aliaksandr Valialkin	4fbdde5852	deployment/docker: update base Alpine docker image from 3.20.2 to 3.20.3 See https://alpinelinux.org/posts/Alpine-3.17.10-3.18.9-3.19.4-3.20.3-released.html	2024-09-08 19:26:48 +02:00
Aliaksandr Valialkin	5261a84119	deployment: update Go builder from Go1.23.0 to Go1.23.1 See https://github.com/golang/go/issues?q=milestone%3AGo1.23.1+label%3ACherryPickApproved	2024-09-06 22:51:15 +02:00
f41gh7	feafb30266	docs/changelog: mention storage changes After `a5424e95b3` Signed-off-by: f41gh7 <nik@victoriametrics.com>	2024-09-06 18:05:11 +02:00
Zakhar Bessarab	9f7ee4c0bb	Vmgateway no prefix string (#784 ) * app/vmgateway: allow skipping Bearer prefix, parsing access as string - allow disabling of "Bearer" prefix check - This is needed in order to support OIDC systems where identity token is provided separately from access token and it does not contain "Bearer" prefix(such as Azure Entra ID, ex AD).a - support parsing "vm_access" claim as a string - This is helpful for systems where claims can only be mapped to string. Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * docs/changelog: mention vmgateway updates Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> --------- Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2024-09-06 16:20:20 +02:00
f41gh7	d7be0e7c9a	docs/changelog: mention storage NaN changes follow-up after `39294b4919` Signed-off-by: f41gh7 <nik@victoriametrics.com>	2024-09-05 16:56:32 +02:00
Zhu Jiekun	c193e6d43e	lib/discovery/azure: fix host check in next link in Azure SD (#6915 ) Previous bugfix at `49f63b2` only partially fixed pagination host validation error. Before this fix it was: ``` unexpected nextLink host \"management.azure.com\", expecting \"https://management.azure.com\" ``` Now we only check the `Host` without schema. However, when Azure respond `nextLink` in `Host:Port` format, the `nextLink` check will fail: ``` unexpected nextLink host \"management.azure.com:443\", expecting \"management.azure.com\" ``` This pull request further relaxes the checks by only checking the `Hostname`. --- related issue: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6912	2024-09-05 16:48:09 +02:00
Hui Wang	b48f5f3e59	lib/storage: fix metric `vm_object_references{type="indexdb"}` (#6937 ) follow up `4ecc370acb` ### Describe Your Changes Please provide a brief description of the changes you made. Be as specific as possible to help others understand the purpose and impact of your modifications. ### Checklist The following checks are mandatory: - [ ] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/).	2024-09-05 16:42:49 +02:00
f41gh7	be66aa5f4e	docs/changelog: mention enterprise changes Signed-off-by: f41gh7 <nik@victoriametrics.com>	2024-09-04 15:39:34 +02:00
f41gh7	b8bbea8896	docs/changelog: moves victorialogs changes to proper file Signed-off-by: f41gh7 <nik@victoriametrics.com>	2024-09-04 15:36:33 +02:00
Andrii Chubatiuk	01430a155c	vlinsert: added opentelemetry logs support Commit adds the following changes: * Adds support of OpenTelemetry logs for Victoria Logs with protobuf encoded messages * json encoding is not supported for the following reasons: - It brings a lot of fragile code, which works inefficiently. - json encoding is impossible to use with language SDK. * splits metrics and logs structures at lib/protoparser/opentelemetry/pb package. * adds docs with examples for opentelemetry logs. --- Related issue: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4839 Co-authored-by: AndrewChubatiuk <andrew.chubatiuk@gmail.com> Co-authored-by: f41gh7 <nik@victoriametrics.com>	2024-09-03 20:12:05 +02:00
hagen1778	4dcb6a3719	dashboards/vmagent: fix legend captions for stream aggregation related panels. Before they were displaying wrong label names. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-09-03 14:23:35 +02:00
Hui Wang	d523015f27	stream aggregation: perform deduplication for all received data when … (#6711 ) …specifying `-streamAggr.dedupInterval` or `-remoteWrite.streamAggr.dedupInterval` command-line flag [The documentation](https://docs.victoriametrics.com/stream-aggregation/) contains conflicting descriptions regarding deduplication for non-matched series when `-remoteWrite.streamAggr.config` and / or `-streamAggr.config` are set: 1. Statement below says all the received data is deduplicated: >[vmagent](https://docs.victoriametrics.com/vmagent/) supports relabeling, deduplication and stream aggregation for all the received data, scraped or pushed. Then, the collected data will be forwarded to specified -remoteWrite.url destinations. The data processing order is the following: >1. all the received data is relabeled according to the specified [-remoteWrite.relabelConfig](https://docs.victoriametrics.com/vmagent/#relabeling) (if it is set) >2. all the received data is deduplicated according to specified [-streamAggr.dedupInterval](https://docs.victoriametrics.com/stream-aggregation/#deduplication) (if it is set to duration bigger than 0) 2. Another statement says the deduplication is performed individually for the matching samples >The de-deduplication is performed after applying [relabeling](https://docs.victoriametrics.com/vmagent/#relabeling) and before performing the aggregation. If the -remoteWrite.streamAggr.config and / or -streamAggr.config is set, then the de-duplication is performed individually per each [stream aggregation config](https://docs.victoriametrics.com/stream-aggregation/#stream-aggregation-config) for the matching samples after applying [input_relabel_configs](https://docs.victoriametrics.com/stream-aggregation/#relabeling). Considering the following deduplication use cases: 1. To apply deduplication(globally or for specific remoteWrite destination) for all the received data, scraped or pushed --- using `-streamAggr.dedupInterval` or `-remoteWrite.streamAggr.dedupInterval`. 2. To deduplicate and aggregate metrics that match the rule `match` filters --- using `-remoteWrite.streamAggr.config` and specifiying `dedup_interval` option in [stream aggregation config](https://docs.victoriametrics.com/stream-aggregation/#stream-aggregation-config). 3. To deduplicate all the received data while having `streamAggr.config` for some metrics --- no way for a single vmagent now, need to set up two level vmagents This PR implements case3. --------- Co-authored-by: Roman Khavronenko <roman@victoriametrics.com>	2024-09-03 10:47:05 +02:00
hagen1778	d5755e55ef	docs/CHANGELOG.md: update changelog with LTS release notes Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-08-29 13:27:27 +02:00
hagen1778	5aeb759df9	docs/CHANGELOG.md: cut v1.103.0 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-08-28 13:48:31 +02:00
hagen1778	e71cfdcfa5	docs: pre-release doc update * typo fix * mention version starting from features are available Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-08-28 13:47:29 +02:00
f41gh7	40d55199fd	docs/changelog: mention bugfix Signed-off-by: f41gh7 <nik@victoriametrics.com>	2024-08-28 11:51:02 +02:00
Nikolay	4ecc370acb	lib/storage: properly add previous indexDB metrics (#6890 ) Previously, some extIndexDB metrics were not registered. It resulted into missing metrics, if metric value was added to the extIndexDB. It's a usual case for search requests at both indexes. Current commit updates all metrics from extIndexDB according to the current IndexDB. It must fix such cases Related issue: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6868 ### Describe Your Changes Please provide a brief description of the changes you made. Be as specific as possible to help others understand the purpose and impact of your modifications. ### Checklist The following checks are mandatory: - [ ] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/).	2024-08-28 11:14:28 +02:00
rtm0	9fcfba3927	lib/storage: properly handle maxMetrics limit at metricID search `TL;DR` This PR improves the metric IDs search in IndexDB: - Avoid seaching for metric IDs twice when `maxMetrics` limit is exceeded - Use correct error type for indicating that the `maxMetrics` limit is exceded - Simplify the logic of deciding between per-day and global index search A unit test has been added to ensure that this refactoring does not break anything. --- Function calls before the fix: ``` idb.searchMetricIDs \|__ is.searchMetricIDs \|__ is.searchMetricIDsInternal \|__ is.updateMetricIDsForTagFilters \|__ is.tryUpdatingMetricIDsForDateRange \| \| \|__ is.getMetricIDsForDateAndFilters ``` - `searchMetricIDsInternal` searches metric IDs for each filter set. It maintains a metric ID set variable which is updated every time the `updateMetricIDsForTagFilters` function is called. After each successful call, the function checks the length of the updated metric ID set and if it is greater than `maxMetrics`, the function returns `too many timeseries` error. - `updateMetricIDsForTagFilters` uses either per-day or global index to search metric IDs for the given filter set. The decision of which index to use is made is made within the `tryUpdatingMetricIDsForDateRange` function and if it returns `fallback to global search` error then the function uses global index by calling `getMetricIDsForDateAndFilters` with zero date. - `tryUpdatingMetricIDsForDateRange` first checks if the given time range is larger than 40 days and if so returns `fallback to global search` error. Otherwise it proceeds to searching for metric IDs within that time range by calling `getMetricIDsForDateAndFilters` for each date. - `getMetricIDsForDateAndFilters` searches for metric IDs for the given date and returns `fallback to global search` error if the number of found metric IDs is greater than `maxMetrics`. Problems with this solution: 1. The `fallback to global search` error returned by `getMetricIDsForDateAndFilters` in case when maxMetrics is exceeded is misleading. 2. If `tryUpdatingMetricIDsForDateRange` proceeds to date range search and returns `fallback to global search` error (because `getMetricIDsForDateAndFilters` returns it) then this will trigger global search in `updateMetricIDsForTagFilters`. However the global search uses the same maxMetrics value which means this search is destined to fail too. I.e. the same search is performed twice and fails twice. 3. `too many timeseries` error is already handled in `searchMetricIDsInternal` and therefore handing this error in `updateMetricIDsForTagFilters` is redundant 4. updateMetricIDsForTagFilters is a better place to make a decision on whether to use per-day or global index. Solution: 1. Use a dedicated error for `too many timeseries` case 2. Handle `too many timeseries` error in `searchMetricIDsInternal` only 3. Move the per-day or global search decision from `tryUpdatingMetricIDsForDateRange` to `updateMetricIDsForTagFilters` and remove `fallback to global search` error. --------- Signed-off-by: Artem Fetishev <wwctrsrx@gmail.com> Co-authored-by: Nikolay <nik@victoriametrics.com>	2024-08-27 21:39:03 +02:00
rtm0	eef6943084	lib/storage: properly register index records with RegisterMetricNames Once the timeseries is in tsidCache, new entries won't be created in per-day index because the RegisterMetricNames() code does consider different dates for the same timeseries. So this case has been added. The same bug exists for AddRows() but it is not manifested because the index entries are finally created in updatePerDateData(). RegisterMetricNames also updated to increase the newTimeseriesCreated counter because it actually creates new time series in index. A unit tests has been added that check all possible data patterns (different metric names and dates) and code branches in both RegisterMetricNames and AddRows. The total number of new unit tests is around 100 which increaded the running time of storage tests by 50%. --------- Signed-off-by: Artem Fetishev <wwctrsrx@gmail.com> Co-authored-by: Roman Khavronenko <hagen1778@gmail.com>	2024-08-27 21:33:53 +02:00
hagen1778	6f17ee0d0f	docs: typo fix Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-08-27 15:49:29 +02:00
Zhu Jiekun	e97e966f82	lib/promrelabel: follow-up for `8958cecad6` In the previous commit `8958cecad6` the default ports (80/443) were removed for both the `scrapeURL` and `instance` label values for those targets without a port in `__address__`. Different values in the `instance` label generate new time series. This commit reverts the changes made to the `instance` label. Now, for those targets: - `scrapeURL` will remain unchanged. - The `instance` label value will include the default port. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6792	2024-08-27 13:04:26 +02:00
YuDong Tang	295f2aa8ca	app/vmselect:add command-line flag -search.inmemoryBufSizeBytes (#6869 ) add command-line flag `-search.inmemoryBufSizeBytes` for configuring size of in-memory buffers used by vmselect during processing of vmstorage responses. A new summary metric `vm_tmp_blocks_inmemory_file_size_bytes` is exposed to show the size of the buffer during requests processing. The new setting can be used by experienced users to adjust memory usage by vmselect when processing many small read requests. Instead of allocating 4MB buffers each time, vmselect can be instructed to lower the buffer size via `-search.inmemoryBufSizeBytes`. To make the decision whether this flag needs to be adjusted users can consult with `vm_tmp_blocks_inmemory_file_size_bytes` which shows the actual size of buffers used during query processing. ---------- The detailed information of this PR can be found in https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6851 ### Checklist The following checks are mandatory: - [ ] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). --------- Co-authored-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `cab3ef8294`)	2024-08-26 14:48:53 +02:00
Yury Akudovich	d0f5a9d77a	app/vmagent: add `remoteWrite.retryMinInterval` and `remoteWrite.retryMaxTime` flags (#6289 ) ## Describe Your Changes Add RemoteWrite Retry Controls This PR introduces two new flags to the remote write functionality: - remoteWrite.retryMinInterval - remoteWrite.retryMaxTime These flags provide finer control over the retry behavior for remoteWrite operations, allowing users to customize the minimum interval between retries and the maximum duration for retry attempts. Fixes #5486. ## Checklist - [x] The following checks are mandatory: My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). --------- Signed-off-by: Yury Akudovich <ya@matterlabs.dev> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2024-08-23 14:05:51 +02:00
Roman Khavronenko	70a94ea492	app/vmalert: update parsing for instant responses (#6859 ) This change is made in attempt to reduce memory usage by vmalert when parsing big instant responses from VM/Prometheus. In `a5c427bac4` vmalert switched from std json lib to fastjson lib in order to reduce amount of allocations, as according to highloaded profiles of vmalert the CPU is mostly spent on GC. But switching to fastjson resulted into excessive memory usage for cases when vmalert has to parse long json lines, which usually happens when instant response contains many `metric` objects. In this change we do a mixed parsing: 1. Slice of `metric` objects is parsed with std lib to keep mem low 2. Each `metric` object is parsed with fastjson to reduce allocs The benchmark results are the following: ``` pkg: github.com/VictoriaMetrics/VictoriaMetrics/app/vmalert/datasource BenchmarkParsePrometheusResponse/Instant_std+fastjson-10 1760 668959 ns/op 280147 B/op 5781 allocs/op MBs allocated at heap: 493.078392 mallocs: 18655472 BenchmarkParsePrometheusResponse/Instant_fastjson-10 6109 198258 ns/op 172839 B/op 5548 allocs/op MBs allocated at heap: 1056.384464 mallocs: 34457184 BenchmarkParsePrometheusResponse/Instant_std-10 1287 950987 ns/op 451677 B/op 9619 allocs/op MBs allocated at heap: 580.802976 mallocs: 13351636 ``` The benchmark function code with mem measurement is available here https://gist.github.com/hagen1778/b9c3ca7f8ca7d6b21aec9777112c5810 The benchmark contains 3 results: 1. Instant_std+fastjson is the implementation in this change 2. Instant_fastjson-10 is the implementation from `a5c427bac4` 3. BenchmarkParsePrometheusResponse/Instant_std-10 is implementation before `a5c427bac4` According to these results, this new implementation is slower than previous, but faster than before switching to fastjson. It also has lower number of allocations and roughly the same memory allocation on heap with GC turned off. --------- Other changes: 1. rm BenchmarkMetrics as it doesn't measure anything 2. simplify BenchmarkParsePrometheusResponse into BenchmarkPromInstantUnmarshal ### Describe Your Changes Please provide a brief description of the changes you made. Be as specific as possible to help others understand the purpose and impact of your modifications. ### Checklist The following checks are mandatory: - [ ] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-08-22 17:36:11 +02:00
Yury Molodov	e35237920a	vmui: add column search in table settings (#6804 ) ### Describe Your Changes Add search functionality to the column display settings in the table #6668 ![image](https://github.com/user-attachments/assets/e9bd52c3-6428-4d4f-8b7f-d83dd80b6912) ### Checklist The following checks are mandatory: - [ ] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2024-08-22 16:57:26 +02:00
Roman Khavronenko	9dc8d1debd	docs: move changelog to dir (#6853 ) Moving changelog-related docs to a separate dir should make it easier to navigate in `docs/` folder. ----------- The change shouldn't have any visual changes or changes to the links. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-08-21 17:26:54 +02:00

1 2 3

133 commits