github-mirrors/VictoriaMetrics

mirror of https://github.com/VictoriaMetrics/VictoriaMetrics.git synced 2024-12-11 14:53:49 +00:00

Author	SHA1	Message	Date
Aliaksandr Valialkin	64e615e6cc	lib/storage: reduce the contention on dateMetricIDCache mutex when new time series are registered at high rate The dateMetricIDCache puts recently registered (date, metricID) entries into mutable cache protected by the mutex. The dateMetricIDCache.Has() checks for the entry in the mutable cache when it isn't found in the immutable cache. Access to the mutable cache is protected by the mutex. This means this access is slow on systems with many CPU cores. The mutabe cache was merged into immutable cache every 10 seconds in order to avoid slow access to mutable cache. This means that ingestion of new time series to VictoriaMetrics could result in significant slowdown for up to 10 seconds because of bottleneck at the mutex. Fix this by merging the mutable cache into immutable cache after len(cacheItems) / 2 cache hits under the mutex, e.g. when the entry is found in the mutable cache. This should automatically adjust intervals between merges depending on the addition rate for new time series (aka churn rate): - The interval will be much smaller than 10 seconds under high churn rate. This should reduce the mutex contention for mutable cache. - The interval will be bigger than 10 seconds under low churn rate. This should reduce the uneeded work on merging of mutable cache into immutable cache.	2024-01-22 18:14:30 +02:00
dependabot[bot]	9c153d0710	build(deps): bump github/codeql-action from 2 to 3 (#5462 ) Bumps [github/codeql-action](https://github.com/github/codeql-action) from 2 to 3. - [Release notes](https://github.com/github/codeql-action/releases) - [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md) - [Commits](https://github.com/github/codeql-action/compare/v2...v3) --- updated-dependencies: - dependency-name: github/codeql-action dependency-type: direct:production update-type: version-update:semver-major ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-01-22 01:49:40 +02:00
Aliaksandr Valialkin	c6f6f094c5	Revert "lib/promscrape: do not store last scrape response when stale markers … (#5577 )" This reverts commit `cfec258803`. Reason for revert: the original code already doesn't store the last scrape response when stale markers are disabled. The scrapeWork.areIdenticalSeries() function always returns true is stale markers are disabled. This prevents from storing the last response at scrapeWork.processScrapedData(). It looks like the reverted commit could also return back the issue https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3660 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5577	2024-01-22 01:46:12 +02:00
Aliaksandr Valialkin	3230525c36	docs: use persistent links to Grafana dashboards These links do not depend on the dashboard name, so they do not break after the renaming of the dashboard. This is a follow-up for `ff33e60a3d`	2024-01-22 01:45:42 +02:00
Aliaksandr Valialkin	d4a1a28543	app/vmselect: handle negative time range start in a generic manner inside NewSearchQuery() This is a follow-up for `cf03e11d89` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5553 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5630	2024-01-22 01:39:27 +02:00
Artem Navoiev	fbbd5ab1e7	docs vmanomaly fix anchor Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-01-22 01:36:22 +02:00
Hui Wang	49fa92c1d0	lib/promscrape/discovery/kubernetes: fix watcher start order for roles endpoints and endpointslice (#5557 ) * lib/promscrape/discovery/kubernetes: fix watcher start order for roles endpoints and endpointslice Previously the groupWatcher could be mistakenly stopped when requests for pod or services resources take too long. * remove mislead comment * docs/sd_configs.md: mention -promscrape.kubernetes.attachNodeMetadataAll flag in the description for attach_metadata section Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4640 * wip * lib/promscrape/kubernetes: prevent from stopping groupWatcher when there are in-flight apiWatcher.mustStart() calls groupWatcher is stopped if it has zero registered apiWatchers during 14 seconds. But such a groupWatcher can be still in use if apiWatcher for `role: endpoints` or `role: endpointslice` is being registered and the discovery of the associated `pod` and/or `service` objects takes longer than 14 seconds - see the beginning of groupWatcher.startWatchersForRole() function for details. Track the number of in-flight calls to apiWatcher.mustStart() and prevent from stopping the associated groupWatcher if the number of in-flight calls is non-zero. P.S. postponing the discovery of `pod` and/or `service` objects associated with `endpoints` or `endpointslice` roles isn't the best solution, since it slows down initial discovery of `endpoints` and `endpointslice` targets. * typo fix --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-01-22 01:33:17 +02:00
Aliaksandr Valialkin	885ee160c2	all: allow dynamically reading *AuthKey flag values from files and urls Examples: 1) -metricsAuthKey=file:///abs/path/to/file - reads flag value from the given absolute filepath 2) -metricsAuthKey=file://./relative/path/to/file - reads flag value from the given relative filepath 3) -metricsAuthKey=http://some-host/some/path?query_arg=abc - reads flag value from the given url The flag value is automatically updated when the file contents changes.	2024-01-22 01:23:23 +02:00
Fred Navruzov	b76c489e57	- fix 404 links after renaming (#5653 ) - improve wording on diagram - swap enterprise/about chapters for page clarity	2024-01-22 01:14:54 +02:00
Fred Navruzov	11c5b5d3ab	docs: vmanomaly - add component interaction diagram (#5652 ) * add interaction diagram for vmanomaly components * small docs fixes * resolve suggestions	2024-01-22 01:14:16 +02:00
Aliaksandr Valialkin	5f5fcab217	all: call atomic.Load* in front of atomic.CompareAndSwap* at places where the atomic.CompareAndSwap* returns false most of the time This allows avoiding slow inter-CPU synchornization induced by atomic.CompareAndSwap*	2024-01-22 01:13:41 +02:00
Aliaksandr Valialkin	6d91d10cbd	docs/sd_configs.md: mention -promscrape.kubernetes.attachNodeMetadataAll flag in the description for attach_metadata section Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4640	2024-01-22 01:13:19 +02:00
Aliaksandr Valialkin	be5faef552	lib/promscrape: code cleanup: send stale markers immediately after generating automatic metrics This cleanup has been extracted from https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5557/files#diff-6b205cf6637d7b65a5c45d9417d08822d4efad94227268cb196f61aa2a0fc0f7	2024-01-22 01:12:56 +02:00
Aliaksandr Valialkin	e15f07d989	all: consistently clear prompbmarshal.Label by assigning an empty struct instead of zeroing Name and Value individually	2024-01-22 01:11:59 +02:00
Aliaksandr Valialkin	2f94bef59c	lib/storage/partition.go: remove misleading comment, which falsely states that inmemoryParts isn't visible to search Thanks to @satjd for raising attention to this comment at https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5410	2024-01-22 01:11:36 +02:00
Nikolay	73c51072e6	app/vmauth: adds metric_labels and backend_errors counter (#5585 ) * app/vmauth: adds metric_labels and backend_errors counter it must improve observability for user requests with new metric - per user backend errors counter. it's needed to calculate requests fail rate to the configured backends. metric_labels configuration allows to perform additional aggregations on top of multiple users from configuration section. It could be multiple clients or clients with separate read/write tokens https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5565 * wip --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-01-22 01:09:51 +02:00
Yury Molodov	0582ec5c8c	vmui: add autofocus to input for desktop version #5479 (#5592 )	2024-01-22 01:09:27 +02:00
Aliaksandr Valialkin	2c7c812a9d	lib/promscrape/discovery/kubernetes: add -promscrape.kubernetes.attachNodeMetadataAll command-line flag This flag allows setting attach_metadata.node=true for all the kubernetes_sd_configs defined at -promscrape.config Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4640 Thanks to wasim-nihal for the initial implementation at https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5593	2024-01-22 01:08:52 +02:00
Hui Wang	e086ef16da	app/vmselect/promql: properly handle possible negative results caused… (#5608 ) * app/vmselect/promql: properly handle possible negative results caused by float operations precision error in rollup functions like rate() or increase() * fix test	2024-01-22 01:04:50 +02:00
Nikolay	d79a7c36b0	app/vmselect/netstorage (#5649 ) * app/vmselect/netstorage correctly handle errGlobal set * wip Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5649 --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-01-22 01:01:38 +02:00
Nikolay	e196c61e36	app/vmselect: abort streaming connections for vmselect (#5650 ) * app/vmselect: abort streaming connections for vmselect due to streaming nature of export APIs, curl and simmilr tools cannot detect errors that happened after http.Header with status 200 was written to it. This PR tracks if body write was already started and closes connection. It allows client to detect not expected chunk sequence and return error to the caller. Mostly it affects vmselect at cluster version https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5645 * wip Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5645 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5650 --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-01-22 00:54:32 +02:00
Aliaksandr Valialkin	c05982bfa7	lib/promscrape/discovery/hetzner: follow-up after `03a97dc678` - docs/sd_configs.md: moved hetzner_sd_configs docs to the correct place according to alphabetical order of SD names, document missing __meta_hetzner_role label. - lib/promscrape/config.go: added missing MustStop() call for Hetzner SD, and moved the code to the correct place according to alphabetical order of SD names. - lib/promscrape/discovery/hetzner: properly handle pagination for hloud API responses, populate missing __meta_hetzner_role label like Prometheus does. - Properly populate __meta_hetzner_public_ipv6_network label like Prometheus does. - Remove unused SDConfig.Token. - Remove "omitempty" annotation from SDConfig.Role field, since this field is mandatory. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5550 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3154	2024-01-22 00:53:23 +02:00
Artem Navoiev	1e14c3177b	vmanomly use proper title for overiview doc Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-01-22 00:52:56 +02:00
Hui Wang	66eb013b54	lib/promscrape: do not store last scrape response when stale markers … (#5577 ) * lib/promscrape: do not store last scrape response when stale markers are disabled * update changelog	2024-01-22 00:52:25 +02:00
Artem Navoiev	75920ab3b1	docs add docs titles Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-01-22 00:28:54 +02:00
Artem Navoiev	c8fc813e00	vmanonamly docs fix sorting for jekill as far it doesnt support of sorting the folders Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-01-22 00:28:17 +02:00
Artem Navoiev	f3b4ca6a77	vmanomaly remove unused pages from menu Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-01-22 00:27:49 +02:00
Artem Navoiev	7eff777467	vmanomly specify the right menu parent for overview page Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-01-22 00:27:26 +02:00
Artem Navoiev	0ec4134ab6	docs vmanmoly fix sorting Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-01-22 00:26:59 +02:00
Artem Navoiev	b5363ffab2	Move vmanomaly page to its own section (#5646 ) * docs: move vmanomaly overview page to its section Signed-off-by: Artem Navoiev <tenmozes@gmail.com> * add alias for backward compatibility Signed-off-by: Artem Navoiev <tenmozes@gmail.com> * fix links Signed-off-by: Artem Navoiev <tenmozes@gmail.com> * change title Signed-off-by: Artem Navoiev <tenmozes@gmail.com> --------- Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-01-22 00:26:37 +02:00
Roman Khavronenko	fe4934f0ec	app/vmui: send `step` param for instant queries (#5639 ) The change reverts https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3896 due to reasons explained in https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3896#issuecomment-1896704401 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-01-22 00:25:31 +02:00
Fred Navruzov	3b8f4714c3	docs: vmanomaly slight improvements (#5637 ) * - better messaging - update links to dockerhub in guides - added anomaly_score to FAQ - improve model section (sort + use cases) - slight refactor of a guide * rename guide & change refs * change wording in installation options * - update remaining text reference - add cross-link to component sections in guide * add docs/.jekyll-metadata to .gitignore	2024-01-22 00:24:39 +02:00
hagen1778	a99d26633b	docs: remove slug from Grafana dashboard URLs Each Grafana dashboard has unique ID which can be used to fetch the dashboard from grafana.com: https://grafana.com/grafana/dashboards/11176 The same dashboard can be accessed via URL with slug: https://grafana.com/grafana/dashboards/11176-victoriametrics-cluster/ But using slug implies that any change to dashboard name will break the link. So it is better to just use ID, so the dashboard URL will never break. This is follow-up for `ff33e60a3d` Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-01-22 00:23:30 +02:00
Artem Navoiev	70743c7014	fix link for grafana dashbaord for single node after its renaming Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-01-22 00:21:57 +02:00
Artem Navoiev	79b308fa54	docs: vmanomaly update vmanomaly + vmalert guide (#5636 ) * docs: vmanomaly update vmanomaly + vmalert guide Signed-off-by: Artem Navoiev <tenmozes@gmail.com> * docs: vmanomaly update vmanomaly + vmalert guide. Update docker compose and monitoring section Signed-off-by: Artem Navoiev <tenmozes@gmail.com> * typos and fixes Signed-off-by: Artem Navoiev <tenmozes@gmail.com> --------- Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-01-22 00:19:15 +02:00
Artem Navoiev	8154d16420	docs: changelog fix the link to cluster Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-01-21 23:54:08 +02:00
Artem Navoiev	b1fe59df8b	vmanomaly docs fix broken relative links Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-01-21 23:53:39 +02:00
Artem Navoiev	547b62a807	docs/anomaly-detection/components/models.md add sort:1 Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-01-21 23:53:10 +02:00
Artem Navoiev	7c62010d04	vmanonaly docs add .html for the section document models Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-01-21 23:52:41 +02:00
Artem Navoiev	b66b059231	vmanomaly docs simplify the strcuture (#5634 ) * vmanomaly docs simplify the strcuture Signed-off-by: Artem Navoiev <tenmozes@gmail.com> * fix links Signed-off-by: Artem Navoiev <tenmozes@gmail.com> --------- Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-01-21 23:52:07 +02:00
hagen1778	07a55a484a	docs/setup-size: rm tolerable churn rate % It is likely this value was borrowed from panel `Slow inserts` panel from Grafana dasbhoard for VM single/cluster installations. This is a mistake. There is no such thing as "tolerable churn rate", as tolerancy depends on the amount of allocated resources. Although, it is unclear what is meant by 5%. If it refers to 5% of new time series per second, then such churn rate is extremely high. It would mean that the avg life of a time series is 20s. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-01-21 23:48:24 +02:00
Roman Khavronenko	148e14b3f2	app/vmselect: properly calculate `start` param for queries with too big look-behind window (#5630 ) Properly determine time range search for instant queries with too big look-behind window like `foo[100y]`. Previously, such queries could return empty responses even if `foo` is present in database. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5553 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-01-21 23:47:09 +02:00
Aliaksandr Valialkin	41d6c8a7dd	lib/storage: do not prefetch metric names for small number of metricIDs This eliminates prefetchedMetricIDsLock lock contention for queries, which return less than 500 time series. This is a follow-up for `9d886a2eb0`	2024-01-17 13:50:01 +02:00
Aliaksandr Valialkin	ecb0a3d27d	docs/keyConcepts.md: typo fixes after `b7ffee2644` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5555	2024-01-17 13:27:37 +02:00
Aliaksandr Valialkin	b7ffee2644	docs/keyConcepts.md: document /internal/force_flush handler Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5555	2024-01-17 13:23:46 +02:00
Aliaksandr Valialkin	014bba6d8e	docs/CHANGELOG*: move changes for 2023 year to docs/CHANGELOG_2023.md	2024-01-17 13:12:58 +02:00
hagen1778	d4ae2cb5d0	docs/troubleshooting: mention query latency in unexpected query results See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5555 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-01-17 13:12:35 +02:00
Aliaksandr Valialkin	36e7689946	LICENSE: update the current year from 2023 to 2024	2024-01-17 01:48:12 +02:00
Aliaksandr Valialkin	a130b86533	docs/CHANGELOG.md: document v1.93.10 LTS release See https://github.com/VictoriaMetrics/VictoriaMetrics/releases/tag/v1.93.10	2024-01-17 01:45:22 +02:00
Aliaksandr Valialkin	09f23b0296	lib/promscrape: cosmetic changes after `3ac44baebe` - Rename mustLoadScrapeConfigFiles() to loadScrapeConfigFiles(), since now it may return error. - Split too long line with the error message into two lines in order to improve readability a bit. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5508 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5560	2024-01-17 01:07:16 +02:00

... 34 35 36 37 38 ...

9283 commits