github-mirrors/VictoriaMetrics

mirror of https://github.com/VictoriaMetrics/VictoriaMetrics.git synced 2024-11-21 14:44:00 +00:00

Author	SHA1	Message	Date
Nikolay	fbaa026ae6	dashboards: updates operator dashboard (#7139 ) * Replaces deprecated graphs with Timeseries panels * Adds new latency dashboards for rest client and golang scheduler * Adds new overview panels * Adds VM Datasource version of dashboard --------- Signed-off-by: f41gh7 <nik@victoriametrics.com> Co-authored-by: Roman Khavronenko <roman@victoriametrics.com>	2024-09-30 15:35:39 +02:00
Zhu Jiekun	7bb8853a5c	feature: [vmagent] Add service discovery support for OVH Cloud VPS and dedicated server (#6160 ) ### Describe Your Changes related issue: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6071 #### Added - Added service discovery support for OVH Cloud: - VPS. - Dedicated server. #### Docs - `CHANGELOG.md`, `sd_configs.md`, `vmagent.md` are updated. #### Note - Useful links: - OVH Cloud VPS API: https://eu.api.ovh.com/console/#/vps~GET - OVH Cloud Dedicated server API: https://eu.api.ovh.com/console/#/dedicated/server~GET - OVH Cloud SDK: https://github.com/ovh/go-ovh - Prometheus SD: https://prometheus.io/docs/prometheus/latest/configuration/configuration/#ovhcloud_sd_config Tested on OVH Cloud VPS and dedicated server. <img width="1722" alt="image" src="https://github.com/VictoriaMetrics/VictoriaMetrics/assets/30280396/d3f0adc8-b0ef-423e-9379-8a9b9b0792ee"> <img width="1724" alt="image" src="https://github.com/VictoriaMetrics/VictoriaMetrics/assets/30280396/18b5b730-3512-4fc0-8b2c-f2450ac550fd"> --- Signed-off-by: Jiekun <jiekun@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2024-09-30 14:42:46 +02:00
Hui Wang	664f337c70	stream aggregation: fix possible duplicated aggregation results (#7118 ) When ingesting samples with the same labels (duplicated samples or samples with the same labels after `by` or `without` options), they could register different entries for the same labelset in LabelsCompressor. For example, both index 99 and 100 can be assigned to label `foo=1` in two concurrent pushes. Then, due to differing label indexes in encoded keys, the samples will appear as distinct in aggrState, resulting in duplicated results after decompressing the label indexes. `fbde238cdc/lib/streamaggr/streamaggr.go (L933)` In this pull request, since we need to store `idxToLabel` first to ensure the idx can be searched after `lc.labelToIdxStore`, the `lc.idxToLabel` could still contain a duplicated entry [100]="foo=1". But given the low likelihood of this issue and the size of idxToLabel, it should be fine.	2024-09-30 14:24:59 +02:00
f41gh7	758f42fc12	docs: add Update Note for upcoming release changes Signed-off-by: f41gh7 <nik@victoriametrics.com>	2024-09-30 12:37:30 +02:00
Artem Fetishev	ed5da38ede	Introduce a flag for limiting the number of time series to delete (#7091 ) ### Describe Your Changes Introduce the `-search.maxDeleteSeries` flag that limits the number of time series that can be deleted with a single `/api/v1/admin/tsdb/delete_series` call. Currently, any number can be deleted and if the number is big (millions) then the operation may result in unaccounted CPU and memory usage spikes which in some cases may result in OOM kill (see #7027). The flag limits the number to 30k by default and the users may override it if needed at the vmstorage start time. --------- Signed-off-by: Artem Fetishev <rtm@victoriametrics.com> Co-authored-by: Nikolay <nik@victoriametrics.com>	2024-09-30 10:02:21 +02:00
Nikolay	3bbb2aed72	fscore: rollback trailing space trim (#7106 ) Previous commit `201fd6de1e` removed trailing space trimming from data read from files. But the common practice is to remove such trailing space, and the change led to authorization errors for a major group of users. In the first place, this change should help mitigate an issue with Kubernetes, where authorization information is read from Secret content. Changes to the operator were made to mitigate this problem in commit `1cf64358c8`. We could later introduce an optional flag for VictoriaMetrics to disable the trim space behavior. Related issues: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6986 https://github.com/VictoriaMetrics/VictoriaMetrics/issues/7089 https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6947 --------- Signed-off-by: f41gh7 <nik@victoriametrics.com> Co-authored-by: Zhu Jiekun <jiekun@victoriametrics.com>	2024-09-29 10:59:25 +02:00
Artem Navoiev	14a0396f53	docs: changelog fix typo in url Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-09-28 23:30:19 +02:00
Artem Navoiev	96efe99eef	docs: mention new create backup api in docs and changelog (#7104 ) ### Describe Your Changes Please provide a brief description of the changes you made. Be as specific as possible to help others understand the purpose and impact of your modifications. ### Checklist The following checks are mandatory: - [x] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-09-28 14:28:58 -07:00
Yury Molodov	25a9802ca4	vmui: add link to vmalert (#7088 ) ### Describe Your Changes Add link to VMalert when proxy is enabled. The link is displayed when the `-vmalert.proxyURL` flag is present. #5924 ![image](https://github.com/user-attachments/assets/c45ca884-8912-4bd9-a867-df5919f278a1) ### Checklist The following checks are mandatory: - [ ] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-09-27 13:22:22 +02:00
Hui Wang	fbde238cdc	stream aggregation: support configuring multiple labels per `remoteWrite… (#7073 ) ….url` using `-remoteWrite.streamAggr.dropInputLabels` Before, labels were set to all the `remoteWrite.url`. address https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6780 --------- Co-authored-by: Roman Khavronenko <roman@victoriametrics.com>	2024-09-27 12:21:09 +02:00
Yury Molodov	c896bf340d	vmui: add functionality to preserve selected columns (#7037 ) ### Describe Your Changes 1) Changed table settings from a popup to a modal window to simplify future functionality additions. 2) Added functionality to save selected columns when data is modified or the page is reloaded. See #7016. <details> <summary>Example screenshots</summary> <img alt="demo-1" width="600" src="https://github.com/user-attachments/assets/a5d9a910-363c-4931-8b12-18ea8b3d97d8"/> </details> ### Checklist The following checks are mandatory: - [x] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). --------- Co-authored-by: Roman Khavronenko <roman@victoriametrics.com>	2024-09-27 11:52:01 +02:00
Roman Khavronenko	6b1b47df54	app/vmalert: bump default values for sending data to `remoteWrite.url` (#7084 ) * `remoteWrite.maxQueueSize` from `100_000` to `1_000_000`, this should improve resiliency of recording rules that produce many series; * `remoteWrite.maxBatchSize` from `1_000` to `10_000`, this should be more efficient to send from a networking perspective; * `remoteWrite.concurrency` from `1` to `4`, this should improve the speed of sending the generated series. The new settings should improve remote write performance of vmalert with default settings. ### Describe Your Changes Please provide a brief description of the changes you made. Be as specific as possible to help others understand the purpose and impact of your modifications. ### Checklist The following checks are mandatory: - [ ] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: Hui Wang <haley@victoriametrics.com>	2024-09-25 15:01:39 +02:00
Zhu Jiekun	5319acb8ed	vmagent: remote write respect Retry-After in header (#6124 ) ### Describe Your Changes related issue: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6097 #### Changed - Remote write retry policy in `vmagent` is changed into: 1. Respect `Retry-After` duration if exists. 2. Otherwise, calculate next retry duration by backoff policy (x2) and max retry duration limit. #### Docs - `CHANGELOG.md`. --- ### Checklist The following checks are mandatory: - [x] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). --------- Co-authored-by: Zakhar Bessarab <me@zekker-dev.tk> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2024-09-24 12:44:03 +02:00
Dmytro Kozlov	cbeb7d50e8	lib/promscrape: show only unhealthy targets if `show_only_unhealthy` filter is enabled (#6960 ) ### Describe Your Changes It is better to show only unhealthy targets instead of all of them when `show_only_unhealthy` filter is enabled. Related issue: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3536 ### Checklist The following checks are mandatory: - [x] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). --------- Co-authored-by: Roman Khavronenko <roman@victoriametrics.com>	2024-09-24 12:18:24 +02:00
Roman Khavronenko	4d0b41e63b	deployment: add panel and alerts for displaying go scheduler latency (#7078 ) The panel and alerting rule should help to understand whether a VM component doesn't have enough CPU resources or gets throttled. The alert is applicable for all VM components. The panel was added to vmalert, vmagent, vmsingle, vm cluster and victorialogs dashes. ------------------- This alerting rule should have helped us identify resource shortage for sandbox vmagent - see [this link](https://play.victoriametrics.com/select/accounting/1/6a716b0f-38bc-4856-90ce-448fd713e3fe/prometheus/graph/#/?g0.range_input=23d13h25m25s424ms&g0.end_input=2024-09-23T14%3A11%3A00&g0.relative_time=none&g0.tab=0&g0.expr=histogram_quantile%280.99%2C+sum%28rate%28go_sched_latencies_seconds_bucket%7Bjob%3D%22vmagent-monitoring-vmagent%22%7D%5B5m%5D%29%29+by+%28le%2C+job%2C+instance%29%29+%3E+0.1) for example. We weren't aware of the resource shortage, because VM metrics assumed this vmagent had 1vCPU while in fact its limit was 0.2vCPU. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-09-23 16:54:42 +02:00
Aliaksandr Valialkin	3964889705	app/vmselect/promql: consistently replace `NaN` data points with non-`NaN` values for `range_first` and `range_last` functions It is expected that range_first and range_last functions return non-nan const value across all the points if the original series contains at least a single non-NaN value. Previously this rule was violated for NaN data points in the original series. This could confuse users. While at it, add tests for series with NaN values across all the range_* and running_* functions, in order to maintain consistent handling of NaN values across these functions.	2024-09-23 14:59:29 +02:00
Aliaksandr Valialkin	57183c9b61	docs/changelog/CHANGELOG.md: moved the description of the fix for proper usage of -streamAggr.dedupInterval and -remoteWrite.streamAggr.dedupInterval from FEATURE to BUGFIX section The previous behaviour was incorrect, since it is unexpected that the -streamAggr.dedupInterval and -remoteWrite.streamAggr.dedupInterval is applied to processed samples only if -streamAggr.config isn't set. This is a follow-up for `d523015f27` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6711	2024-09-23 08:56:33 +02:00
Aliaksandr Valialkin	0ada781cf2	docs/changelog/CHANGELOG.md: document bugfix for https://github.com/VictoriaMetrics/VictoriaMetrics/issues/7009 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/7064 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/7009 This is a follow-up for `55febc0920`	2024-09-22 21:57:48 +02:00
Hui Wang	d6d02d7aeb	vmalert: fix variable `$activeAt` value when templating rule annotation in replay mode Co-authored-by: Roman Khavronenko <roman@victoriametrics.com>	2024-09-20 11:07:40 +02:00
hagen1778	6167bccc5a	docs: fix more typos in the changelog Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-09-20 10:54:43 +02:00
hagen1778	59281d5358	docs: rm update note about loggerMaxArgLen as it doesn't have incompatibility effect Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-09-20 10:42:07 +02:00
hagen1778	6726a5aaed	docs: fix typo in link in change line about NaN Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-09-20 10:39:13 +02:00
Thomas Danielsson	258201af04	docs: fix typo in the changelog Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-09-20 08:46:29 +02:00
f41gh7	61721303fd	docs/changelog: mention vmagent kafka consumer bugfix Changes were made to the enterprise repository	2024-09-19 15:35:48 +02:00
Yury Molodov	b0bdb92729	vmui: change the `query_range` request method from `GET` to `POST` (#7039 ) ### Describe Your Changes change the `/query_range` and `/query` requests method from `GET` to `POST`. See #6288. ### Checklist The following checks are mandatory: - [x] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2024-09-19 14:30:54 +02:00
Roman Khavronenko	e115b85770	lib/logger: increase default value of `-loggerMaxArgLen` cmd-line fla… (#7008 ) …g from 1e3 to 5e3 This should improve visibility on errors produced by very long queries. The change is classified as BUG in order to port it to LTS releases. ### Describe Your Changes Please provide a brief description of the changes you made. Be as specific as possible to help others understand the purpose and impact of your modifications. ### Checklist The following checks are mandatory: - [ ] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: Mathias Palmersheim <mathias@victoriametrics.com>	2024-09-19 14:29:18 +02:00
Aliaksandr Valialkin	b82e2cabc5	app/vmselect/promql: properly calculate `c1 and c2` and `c1 or c2` by upgrading github.com/VictoriaMetrics/metricsql to v0.79.0 The fix is in the https://github.com/VictoriaMetrics/metricsql/pull/34 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6637	2024-09-18 17:38:19 +02:00
Nikolay	d8f8822fa5	lib/storage: consistently check for missing metricID index records (#6967 ) * Previously, only missing metricID->metricName index records were tracked with a deadline. But metricID->TSID index records could also be missing. The IndexDB metrics fix exposed a misleading metric for such missing records. * This commit adds a check for missing metricID->TSID index records and deletes the missing metricID entry if it hits the 60-second deadline. Related issue https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6931 Signed-off-by: f41gh7 <nik@victoriametrics.com>	2024-09-16 10:05:08 +02:00
Nikolay	264c2ec6bd	lib/fs: properly call windows APIs (#6998 ) Previously we manually imported system windows DLLs and made direct syscalls. But golang exposes syscall wrappers with the sys/windows package. It seems that direct syscalls were broken in the 1.23 golang release. It was the `GetDiskFreeSpace` syscall in our case. This commit replaces all manual syscalls with wrappers. Related issue: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6973 Related golang issue: https://github.com/golang/go/issues/69029 Signed-off-by: f41gh7 <nik@victoriametrics.com>	2024-09-13 12:22:25 +02:00
Dima Lazerka	8207879fa3	docs: fixes misspelled typos Also tried to make it catch "Authorisation" in the future, fixed a lot of other misspells along the way, but didn't make it catch "Authorisation" anyway. - Fix misspelled "Authorization" header name - Fix misspelled "organization" - Fix more misspells	2024-09-13 12:14:24 +02:00
Hui Wang	ae4d376e41	vmalert: do not send message to alertmanager when alert has no label … (#6823 ) …pair `alert_relabel_configs` in [notifier config](https://docs.victoriametrics.com/vmalert/#notifier-configuration-file) can drop alert labels when used to filter different tenant alert message to different notifier. alertmanager would report error like `msg="Failed to validate alerts" err="at least one label pair required"` in this case, but the rest of the alerts inside one request would still be valid in alertmanager, so it's not severe.	2024-09-09 13:34:48 +02:00
Aliaksandr Valialkin	4fbdde5852	deployment/docker: update base Alpine docker image from 3.20.2 to 3.20.3 See https://alpinelinux.org/posts/Alpine-3.17.10-3.18.9-3.19.4-3.20.3-released.html	2024-09-08 19:26:48 +02:00
Aliaksandr Valialkin	5261a84119	deployment: update Go builder from Go1.23.0 to Go1.23.1 See https://github.com/golang/go/issues?q=milestone%3AGo1.23.1+label%3ACherryPickApproved	2024-09-06 22:51:15 +02:00
f41gh7	feafb30266	docs/changelog: mention storage changes After `a5424e95b3` Signed-off-by: f41gh7 <nik@victoriametrics.com>	2024-09-06 18:05:11 +02:00
Zakhar Bessarab	9f7ee4c0bb	Vmgateway no prefix string (#784 ) * app/vmgateway: allow skipping Bearer prefix, parsing access as string - allow disabling of "Bearer" prefix check - This is needed in order to support OIDC systems where identity token is provided separately from access token and it does not contain "Bearer" prefix (such as Azure Entra ID, ex AD). - support parsing "vm_access" claim as a string - This is helpful for systems where claims can only be mapped to string. Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * docs/changelog: mention vmgateway updates Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> --------- Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2024-09-06 16:20:20 +02:00
f41gh7	d7be0e7c9a	docs/changelog: mention storage NaN changes follow-up after `39294b4919` Signed-off-by: f41gh7 <nik@victoriametrics.com>	2024-09-05 16:56:32 +02:00
Zhu Jiekun	c193e6d43e	lib/discovery/azure: fix host check in next link in Azure SD (#6915 ) Previous bugfix at `49f63b2` only partially fixed the pagination host validation error. Before this fix it was: ``` unexpected nextLink host \"management.azure.com\", expecting \"https://management.azure.com\" ``` Now we only check the `Host` without the scheme. However, when Azure responds with `nextLink` in `Host:Port` format, the `nextLink` check will fail: ``` unexpected nextLink host \"management.azure.com:443\", expecting \"management.azure.com\" ``` This pull request further relaxes the checks by only checking the `Hostname`. --- related issue: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6912	2024-09-05 16:48:09 +02:00
Hui Wang	b48f5f3e59	lib/storage: fix metric `vm_object_references{type="indexdb"}` (#6937 ) follow up `4ecc370acb` ### Describe Your Changes Please provide a brief description of the changes you made. Be as specific as possible to help others understand the purpose and impact of your modifications. ### Checklist The following checks are mandatory: - [ ] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/).	2024-09-05 16:42:49 +02:00
f41gh7	be66aa5f4e	docs/changelog: mention enterprise changes Signed-off-by: f41gh7 <nik@victoriametrics.com>	2024-09-04 15:39:34 +02:00
f41gh7	b8bbea8896	docs/changelog: moves victorialogs changes to proper file Signed-off-by: f41gh7 <nik@victoriametrics.com>	2024-09-04 15:36:33 +02:00
Andrii Chubatiuk	01430a155c	vlinsert: added opentelemetry logs support Commit adds the following changes: * Adds support of OpenTelemetry logs for Victoria Logs with protobuf encoded messages * json encoding is not supported for the following reasons: - It brings a lot of fragile code, which works inefficiently. - json encoding is impossible to use with language SDK. * splits metrics and logs structures at lib/protoparser/opentelemetry/pb package. * adds docs with examples for opentelemetry logs. --- Related issue: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4839 Co-authored-by: AndrewChubatiuk <andrew.chubatiuk@gmail.com> Co-authored-by: f41gh7 <nik@victoriametrics.com>	2024-09-03 20:12:05 +02:00
hagen1778	4dcb6a3719	dashboards/vmagent: fix legend captions for stream aggregation related panels. Before they were displaying wrong label names. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-09-03 14:23:35 +02:00
Hui Wang	d523015f27	stream aggregation: perform deduplication for all received data when … (#6711 ) …specifying `-streamAggr.dedupInterval` or `-remoteWrite.streamAggr.dedupInterval` command-line flag [The documentation](https://docs.victoriametrics.com/stream-aggregation/) contains conflicting descriptions regarding deduplication for non-matched series when `-remoteWrite.streamAggr.config` and / or `-streamAggr.config` are set: 1. Statement below says all the received data is deduplicated: >[vmagent](https://docs.victoriametrics.com/vmagent/) supports relabeling, deduplication and stream aggregation for all the received data, scraped or pushed. Then, the collected data will be forwarded to specified -remoteWrite.url destinations. The data processing order is the following: >1. all the received data is relabeled according to the specified [-remoteWrite.relabelConfig](https://docs.victoriametrics.com/vmagent/#relabeling) (if it is set) >2. all the received data is deduplicated according to specified [-streamAggr.dedupInterval](https://docs.victoriametrics.com/stream-aggregation/#deduplication) (if it is set to duration bigger than 0) 2. Another statement says the deduplication is performed individually for the matching samples >The de-deduplication is performed after applying [relabeling](https://docs.victoriametrics.com/vmagent/#relabeling) and before performing the aggregation. If the -remoteWrite.streamAggr.config and / or -streamAggr.config is set, then the de-duplication is performed individually per each [stream aggregation config](https://docs.victoriametrics.com/stream-aggregation/#stream-aggregation-config) for the matching samples after applying [input_relabel_configs](https://docs.victoriametrics.com/stream-aggregation/#relabeling). Considering the following deduplication use cases: 1. 
To apply deduplication (globally or for a specific remoteWrite destination) for all the received data, scraped or pushed --- using `-streamAggr.dedupInterval` or `-remoteWrite.streamAggr.dedupInterval`. 2. To deduplicate and aggregate metrics that match the rule `match` filters --- using `-remoteWrite.streamAggr.config` and specifying the `dedup_interval` option in [stream aggregation config](https://docs.victoriametrics.com/stream-aggregation/#stream-aggregation-config). 3. To deduplicate all the received data while having `streamAggr.config` for some metrics --- no way for a single vmagent now, need to set up two-level vmagents. This PR implements case 3. --------- Co-authored-by: Roman Khavronenko <roman@victoriametrics.com>	2024-09-03 10:47:05 +02:00
hagen1778	d5755e55ef	docs/CHANGELOG.md: update changelog with LTS release notes Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-08-29 13:27:27 +02:00
hagen1778	5aeb759df9	docs/CHANGELOG.md: cut v1.103.0 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-08-28 13:48:31 +02:00
hagen1778	e71cfdcfa5	docs: pre-release doc update * typo fix * mention the versions starting from which features are available Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-08-28 13:47:29 +02:00
f41gh7	40d55199fd	docs/changelog: mention bugfix Signed-off-by: f41gh7 <nik@victoriametrics.com>	2024-08-28 11:51:02 +02:00
Nikolay	4ecc370acb	lib/storage: properly add previous indexDB metrics (#6890 ) Previously, some extIndexDB metrics were not registered. This resulted in missing metrics if the metric value was added to the extIndexDB. It's a usual case for search requests at both indexes. The current commit updates all metrics from extIndexDB according to the current IndexDB. This should fix such cases. Related issue: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6868 ### Describe Your Changes Please provide a brief description of the changes you made. Be as specific as possible to help others understand the purpose and impact of your modifications. ### Checklist The following checks are mandatory: - [ ] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/).	2024-08-28 11:14:28 +02:00
rtm0	9fcfba3927	lib/storage: properly handle maxMetrics limit at metricID search `TL;DR` This PR improves the metric IDs search in IndexDB: - Avoid searching for metric IDs twice when `maxMetrics` limit is exceeded - Use correct error type for indicating that the `maxMetrics` limit is exceeded - Simplify the logic of deciding between per-day and global index search A unit test has been added to ensure that this refactoring does not break anything. --- Function calls before the fix: ``` idb.searchMetricIDs \|__ is.searchMetricIDs \|__ is.searchMetricIDsInternal \|__ is.updateMetricIDsForTagFilters \|__ is.tryUpdatingMetricIDsForDateRange \| \| \|__ is.getMetricIDsForDateAndFilters ``` - `searchMetricIDsInternal` searches metric IDs for each filter set. It maintains a metric ID set variable which is updated every time the `updateMetricIDsForTagFilters` function is called. After each successful call, the function checks the length of the updated metric ID set and if it is greater than `maxMetrics`, the function returns `too many timeseries` error. - `updateMetricIDsForTagFilters` uses either per-day or global index to search metric IDs for the given filter set. The decision of which index to use is made within the `tryUpdatingMetricIDsForDateRange` function and if it returns `fallback to global search` error then the function uses global index by calling `getMetricIDsForDateAndFilters` with zero date. - `tryUpdatingMetricIDsForDateRange` first checks if the given time range is larger than 40 days and if so returns `fallback to global search` error. Otherwise it proceeds to searching for metric IDs within that time range by calling `getMetricIDsForDateAndFilters` for each date. - `getMetricIDsForDateAndFilters` searches for metric IDs for the given date and returns `fallback to global search` error if the number of found metric IDs is greater than `maxMetrics`. Problems with this solution: 1. 
The `fallback to global search` error returned by `getMetricIDsForDateAndFilters` in case when maxMetrics is exceeded is misleading. 2. If `tryUpdatingMetricIDsForDateRange` proceeds to date range search and returns `fallback to global search` error (because `getMetricIDsForDateAndFilters` returns it) then this will trigger global search in `updateMetricIDsForTagFilters`. However the global search uses the same maxMetrics value which means this search is destined to fail too. I.e. the same search is performed twice and fails twice. 3. `too many timeseries` error is already handled in `searchMetricIDsInternal` and therefore handling this error in `updateMetricIDsForTagFilters` is redundant 4. updateMetricIDsForTagFilters is a better place to make a decision on whether to use per-day or global index. Solution: 1. Use a dedicated error for `too many timeseries` case 2. Handle `too many timeseries` error in `searchMetricIDsInternal` only 3. Move the per-day or global search decision from `tryUpdatingMetricIDsForDateRange` to `updateMetricIDsForTagFilters` and remove `fallback to global search` error. --------- Signed-off-by: Artem Fetishev <wwctrsrx@gmail.com> Co-authored-by: Nikolay <nik@victoriametrics.com>	2024-08-27 21:39:03 +02:00
rtm0	eef6943084	lib/storage: properly register index records with RegisterMetricNames Once the timeseries is in tsidCache, new entries won't be created in per-day index because the RegisterMetricNames() code does not consider different dates for the same timeseries. So this case has been added. The same bug exists for AddRows() but it is not manifested because the index entries are finally created in updatePerDateData(). RegisterMetricNames is also updated to increase the newTimeseriesCreated counter because it actually creates new time series in the index. Unit tests have been added that check all possible data patterns (different metric names and dates) and code branches in both RegisterMetricNames and AddRows. The total number of new unit tests is around 100, which increased the running time of storage tests by 50%. --------- Signed-off-by: Artem Fetishev <wwctrsrx@gmail.com> Co-authored-by: Roman Khavronenko <hagen1778@gmail.com>	2024-08-27 21:33:53 +02:00


57 commits