github-mirrors/VictoriaMetrics

mirror of https://github.com/VictoriaMetrics/VictoriaMetrics.git synced 2024-12-01 14:47:38 +00:00

Author	SHA1	Message	Date
Aliaksandr Valialkin	950edad2e3	deployment/docker: update Go builder from Go1.21.1 to Go1.21.3 See https://github.com/golang/go/issues?q=milestone%3AGo1.21.2+label%3ACherryPickApproved and https://github.com/golang/go/issues?q=milestone%3AGo1.21.3+label%3ACherryPickApproved	2023-10-15 18:58:02 +02:00
Dmytro Kozlov	d8748a3398	app/vmagent: fix check of the DataDog agent path requests when requests have trailing slashes (#5106 ) * app/vmagent: fix check of the DataDog agent path requests when requests have trailing slashes * app/vmagent: fix CHANGELOG.md description * wip * wip --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-10-02 21:22:39 +02:00
Aliaksandr Valialkin	d55ca23837	app/vmagent: follow-up for `cfef814750` - Properly handle /insert/multitenant/api/put url for opentsdb handler at vmagent - Document that the bug has been introduced in v1.93.2 at docs/CHANGELOG.md - Add a link to multitenant url docs in bugfix description Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5061 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4910	2023-10-01 21:06:47 +02:00
Alexander Marshalov	d8cd6edd55	fixed ingestion via multitenant url for opentsdbhttp (#5061 ) (#5063 )	2023-10-01 21:05:47 +02:00
Aliaksandr Valialkin	7bbaff4d8a	app/vmselect/netstorage: run `make fmt` after `58326dbf25`	2023-09-10 15:25:49 +02:00
Roman Khavronenko	ada9afc74c	vmalert: correctly add duplicated params to the query (#4955 ) Fix the bug when Group's `params` fields with multiple values were overriding each other instead of adding up. The bug was introduced in this commit `eccecdf177` starting from v1.91.1 https://github.com/VictoriaMetrics/VictoriaMetrics/releases/tag/v1.91.1 https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4908 Signed-off-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `6351d07da8`)	2023-09-08 09:37:54 +02:00
Aliaksandr Valialkin	8ff9235717	app/vmselect: return 503 status code when partial responses are denied and some of vmstorage nodes are temporarily unavailable This should help detecting this case and automatic retrying the query at healthy cluster replica in another availability zone. This commit is needed as a preparation for automatic query retry at another backend at vmauth on 5xx errors as described at https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4792#issuecomment-1674338561	2023-09-07 16:17:14 +02:00
Aliaksandr Valialkin	841465d98a	all: update Go builder from Go1.21.0 to Go1.21.1 See https://github.com/golang/go/issues?q=milestone%3AGo1.21.1+label%3ACherryPickApproved	2023-09-07 11:42:57 +02:00
Aliaksandr Valialkin	9603d06057	lib/auth: add NewTokenPossibleMultitenant() for parsing auth token, which can be multitenant Disallow parsing multitenant token at auth.NewToken(). Use auth.NewTokenPossibleMultitenant() at vminsert only. All the other callers should call auth.NewToken(), since they do not support multitenant token. This is a follow-up for `f0c06b428e` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4910	2023-09-06 12:12:32 +02:00
Nikolay	561dd2900a	app/vminsert: properly close vmstorage connection (#4935) * app/vminsert: properly close vmstorage connection Previously vmstorage could get stuck in a broken state until vminsert restarted, since vmstorage was marked as read-only and the connection to it was broken. The checkReadonly function never marked the connection as broken https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4870 * wip --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-09-01 18:03:53 +02:00
Nikolay	ab4c3817ed	app/vminsert: fixes readonly check (#4892) * app/vminsert: fixes readonly check Previously vminsert didn't check the readOnly state for vmstorage, since the check was never performed for a nil buffer. In this case, every 30 seconds the storage node lost its readonly state and received some data. This caused re-routing and a possible slowdown of ingestion https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4870 * wip --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-08-30 16:27:23 +02:00
Zakhar Bessarab	30c869dfc4	app/vmselect: fix panic when using `/select/multitenant` endpoint (#4912 ) app/vmselect: fix panic when using `/select/multitenant` endpoint Such requests must be rejected as not found since vmselect does not support multitenant endpoint. See: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4910 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2023-08-30 15:24:29 +02:00
Aliaksandr Valialkin	27f790458b	lib/promrelabel: properly replace `:` char with `_` in metric names when -usePromCompatibleNaming command-line flag is set This addresses https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3113#issuecomment-1275077071 comment from @johnseekins	2023-08-17 13:54:59 +02:00
Roman Khavronenko	0783be0d15	vmbackup: correctly check if specified `-dst` belongs to specified `-storageDataPath` (#4841 ) See this issue https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4837 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-08-17 13:54:17 +02:00
Roman Khavronenko	655d64b27f	vmctl: interrupt explore procedure in influx mode if no numeric fields were found (#4576 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-08-12 13:41:50 -07:00
Dmytro Kozlov	ea38e35205	app/vmctl: fix panic `--remote-read-filter-time-start` flag not defined (#4605 ) * app/vmctl: fix panic `--remote-read-filter-time-start` flag not defined * app/vmctl: update CHANGELOG.md --------- Co-authored-by: Nikolay <nik@victoriametrics.com>	2023-08-12 13:39:43 -07:00
Roman Khavronenko	9b83737a75	vmalert: check for negative offset for missed rounds (#4628 ) It could happen for low evaluation intervals and irregular delays during execution that evaluation time would get a negative offset. This could result into cumulative discrepancy between the actual time and evaluation time for rules. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-08-12 13:36:13 -07:00
Haleygo	22954607ba	vmalert: fix evalTS after modify group interval (#4629 )	2023-08-12 13:34:33 -07:00
Haleygo	f5a25ba980	vmselect: fix result in Prometheus query when time is small (#4578 ) vmselect: fix result in Prometheus query when time is small Co-authored-by: Roman Khavronenko <roman@victoriametrics.com>	2023-08-12 13:32:46 -07:00
Aliaksandr Valialkin	973bbd16b0	lib/promscrape/discovery: close unused HTTP connections to service discovery servers This should prevent from connection leaks See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4724	2023-08-12 13:30:35 -07:00
Zakhar Bessarab	1a75119a92	app/vmagent/remotewrite: fix vmagent panic on shutdown (#4407) app/vmagent/remotewrite: fix vmagent panic on shutdown Currently, when vmagent is stopping, it first flushes pending series in the remote write context and then proceeds to stop streaming aggregation. This leaves streaming aggregation unable to write results into pending timeseries (since it is already nil) and causes a panic. As a result, some aggregation results can be lost almost silently. The fix reorders the flow to first stop streaming aggregation and flush all pending time series after that. Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2023-08-12 13:20:15 -07:00
Aliaksandr Valialkin	d3e5c2acf2	app/vmselect/netstorage: follow-up after `11ac551d52` - Clarify the scope of the fix at docs/CHANGELOG.md - Handle the case when -search.maxSamplesPerSeries limit is exceeded in the same way as the -search.maxSamplesPerQuery limit. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/4472	2023-08-12 13:11:01 -07:00
Roman Khavronenko	324f5eca63	app/vmselect/netstorage: properly process `-search.maxSamplesPerQuery` limit (#4472 ) Properly return the error to user when `-search.maxSamplesPerQuery` limit is exceeded. Before, user could have received a partial response instead. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-08-12 13:06:56 -07:00
Aliaksandr Valialkin	0506bead84	Upgrade Go builder from 1.20.4 to 1.21.0	2023-08-12 12:35:43 -07:00
Roman Khavronenko	3d820c0da8	vmalert: do not return nil `rules` for /api/v1/rules (#4344 ) The fix addresses a case when vmalert is configured with a group which has `name`, but doesn't have `rules` configured. In this case it still returns a `nil` instead of `[]` slice. Fixing this via current commit. See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4221 Signed-off-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `66ed6fe62f`)	2023-06-05 11:45:52 +02:00
Roman Khavronenko	ea920edd32	vmalert: properly form assets address if httpPrefix set (#4351 ) Properly form path to static assets in WEB UI if `http.pathPrefix` set. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4349 Signed-off-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `51cea6cad4`) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-06-05 11:45:51 +02:00
Roman Khavronenko	aeb386c98a	vmalert: fix nil map assignment (#4392 ) * vmalert: fix nil map assignment The storage instance with nil map params was created for remote-read purposes. And before change `7a9ae9de0d` this map was ignored in ApplyParams. Now, it started to be used and vmalert panics in runtime. The fix properly inits map for at `NewVMStorage` and verifies it is not nil on assignment in `ApplyParams`. Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: add to changelog Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: properly clone Storage params Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: properly clone Storage params Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: properly clone Storage params Signed-off-by: hagen1778 <roman@victoriametrics.com> --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `de94812088`) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-06-05 11:45:50 +02:00
Roman Khavronenko	1e8562fbd2	app/vmalert: follow-up after `7a9ae9de0d` (#4381 ) `7a9ae9de0d` Signed-off-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `eccecdf177`)	2023-06-05 11:45:50 +02:00
gsakun	283e2873ed	app/vmalert: fix datasource.roundDigits Parameter (#4341 ) app/vmalert: fix querybuild clone and extraParams merge logic See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4340 (cherry picked from commit `20dc3db71e`)	2023-06-05 11:45:50 +02:00
Nikolay	dc98abf28b	app/vmauth: do not return invalid credentials (#4288 ) at http response by default https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4188 based on https://github.com/VictoriaMetrics/VictoriaMetrics/pull/4190 Thanks @raj-kumar-j for init implementation	2023-05-17 00:11:50 -07:00
Aliaksandr Valialkin	3741a8d532	deployment/docker: update base docker image from 3.17.3 to 3.18.0 See https://www.alpinelinux.org/posts/Alpine-3.18.0-released.html	2023-05-12 17:33:48 -07:00
Aliaksandr Valialkin	ffddf0b15b	all: update Go builder from Go1.20.3 to Go1.20.4 See https://github.com/golang/go/issues?q=milestone%3AGo1.20.4+label%3ACherryPickApproved	2023-05-09 22:34:55 -07:00
Roman Khavronenko	977c43b88f	vmselect: exit early from queue on context cancel (#4223) * vmselect: exit early from queue on context cancel When `-search.maxConcurrentRequests` is reached, vmselect puts the request in the queue. It is expected that requests in the queue will be processed as soon as there is enough capacity to do so. However, it could happen that while the request was waiting its turn, the client has already cancelled it (closed the connection, or just closed the tab with the UI). In this case, we should de-queue such requests to avoid spending extra resources on them. Signed-off-by: hagen1778 <roman@victoriametrics.com> * app/vmselect: address review comments Signed-off-by: hagen1778 <roman@victoriametrics.com> --------- Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-05-08 22:58:55 -07:00
Roman Khavronenko	07e6b74dfe	vmalert: fix API to return non-nil values (#4222 ) Properly return empty slices instead of nil for `/api/v1/rules` and `/api/v1/alerts` API handlers. This improves compatibility with Grafana. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4221 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-05-08 21:49:25 -07:00
Nikolay	ff8eceb9f2	app/vminsert: correctly allocate buffer for storagenodes (#554 ) in case of dynamic discovery number of nodes may change and we have to allocate new buffer for this case otherwise vminsert may panic	2023-05-08 08:52:59 -07:00
Aliaksandr Valialkin	d780490618	app/vmstorage: deprecate -bigMergeConcurrency command-line flag Improperly configured -bigMergeConcurrency command-line flag usually leads to uncontrolled growth of unmerged parts, which, in turn, increases CPU usage and query durations. So it is better deprecating this flag. In rare cases -smallMergeConcurrency command-line flag can be used instead for controlling the concurrency of background merges.	2023-04-13 23:42:12 -07:00
Aliaksandr Valialkin	b6c475e48c	all: update Go builder from Go1.20.2 to Go1.20.3 See https://github.com/golang/go/issues?q=milestone%3AGo1.20.3+label%3ACherryPickApproved	2023-04-05 13:41:57 -07:00
Max Golionko	4922ba85e9	fix: app/vmui/Dockerfile-web to reduce vulnerabilities (#4044 ) The following vulnerabilities are fixed with an upgrade: - https://snyk.io/vuln/SNYK-ALPINE317-OPENSSL-3368755 - https://snyk.io/vuln/SNYK-ALPINE317-OPENSSL-3368755 - https://snyk.io/vuln/SNYK-ALPINE317-OPENSSL-5291795 - https://snyk.io/vuln/SNYK-ALPINE317-OPENSSL-5291795 Co-authored-by: snyk-bot <snyk-bot@snyk.io>	2023-03-31 22:49:40 -07:00
Aliaksandr Valialkin	1c18b37604	app/vmselect/promql: follow-up for `79e1c6a6fc` - Document the fix at docs/CHANGELOG.md - Add tests with multiple adjacent zero buckets - Simplify the fix a bit Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/296 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/4021	2023-03-27 18:06:10 -07:00
Ze'ev Klapow	8f33561797	fix le buckets when adjacent vmrange is empty (#4021) There is a bug here where if you have a single bucket like: foo{vmrange="4.084e+02...4.642e+02"} 2 123 The expected output is three le encoded buckets like: foo{le="4.084e+02"} 0 123 foo{le="4.642e+02"} 2 123 foo{le="+Inf"} 2 123 This correctly encodes the start and end of the vmrange. If, however, the input contains the previous bucket, and that bucket is empty, then you only get the end le and +Inf out currently, i.e: foo{vmrange="7.743e+05...8.799e+05"} 5 123 foo{vmrange="6.813e+05...7.743e+05"} 0 123 results in: foo{le="8.799e+05"} 5 123 foo{le="+Inf"} 5 123 This causes issues when you go to compute a quantile because this means that the assumed lower bound of the buckets is 0 and thus we interpolate between 0->end rather than the vmrange start->end as expected.	2023-03-27 18:06:08 -07:00
Aliaksandr Valialkin	e6727fa51c	app/vmselect/netstorage: reduce the contention at fs.ReaderAt stats collection on systems with big number of CPU cores This optimization is based on the profile provided at https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3966#issuecomment-1483208419	2023-03-25 16:45:31 -07:00
Aliaksandr Valialkin	f47e26025c	app/vmselect/netstorage: document why runtime.Gosched() is removed at `28f054bb00` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3966	2023-03-25 16:45:31 -07:00
Zakhar Bessarab	1c34cc33e5	vmselect/netstorage: remove direct calls to `Gosched` to reduce amount of locks for global scope using `runtime.Gosched` requires acquiring global lock to check if there are any other goroutines to perform tasks. with the latest versions of runtime it can pause running goroutines automatically without requiring to call `Gosched` directly. Updates #3966 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2023-03-25 16:45:31 -07:00
Aliaksandr Valialkin	b0e64f95ad	app/vmselect/netstorage: reduce the number of calls to runtime.Gosched() at timeseriesWorker() and unpackWorker() Call runtime.Gosched() only when there is a work to steal from other workers. Simplify the timeseriesWorker() and unpackWorker() code a bit by inlining stealTimeseriesWork() and stealUnpackWork(). This should reduce CPU usage when processing queries on systems with big number of CPU cores. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3966	2023-03-25 16:43:47 -07:00
Aliaksandr Valialkin	511db8c453	app/vmselect/promql: typo fix after `e7f46a0aab`	2023-03-25 01:25:49 -07:00
Aliaksandr Valialkin	cee0fb8071	app/vmselect/promql: follow-up for `7205c79c5a` - Allocate and initialize seriesByWorkerID slice in a single go instead of initializing every item in the list separately. This should reduce CPU usage a bit. - Properly set anti-false sharing padding at timeseriesWithPadding structure - Document the change at docs/CHANGELOG.md Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3966	2023-03-25 01:25:08 -07:00
Zakhar Bessarab	71044221f6	app/vmselect/promql: use lock-less approach to gather results of parallel processing for `evalRollup` funcs (#4004 ) vmselect/promql: refactor `evalRollupNoIncrementalAggregate` to use lock-less approach for parallel workers computation Locking there is causing issues when running on highly multi-core system as it introduces lock contention during results merge. New implementation uses lock less approach to store results per workerID and merges final result in the end, this is expected to significantly reduce lock contention and CPU usage for systems with high number of cores. Related: #3966 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * vmselect/promql: add pooling for `timeseriesWithPadding` to reduce allocations Related: #3966 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * vmselect/promql: refactor `evalRollupFuncWithSubquery` to avoid using locks Uses same approach as `evalRollupNoIncrementalAggregate` to remove locking between workers and reduce lock contention. Related: #3966 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> --------- Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2023-03-25 01:24:20 -07:00
Aliaksandr Valialkin	d8e218d99e	app/vmselect/promql: pass workerID to the callback inside doParallel() This opens the possibility to remove tssLock from evalRollupFuncWithSubquery() in the follow-up commit from @zekker6 in order to speed up the code for systems with many CPU cores. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3966	2023-03-25 01:23:06 -07:00
Aliaksandr Valialkin	9b5245a836	app/vmselect/promql: fix TestIncrementalAggr test on systems less than 3 CPU cores This is a follow-up for `4856a4cf5a`	2023-03-25 01:21:43 -07:00
Aliaksandr Valialkin	c780c6a280	app/vmselect: optimize incremental aggregates a bit Substitute sync.Map with an ordinary slice indexed by workerID. This should reduce the overhead when updating the incremental aggregate state	2023-03-24 23:50:31 -07:00
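
Several of the vmselect commits above (`c780c6a280`, `71044221f6`, `cee0fb8071`) describe the same idea: give every worker its own slot in a slice indexed by workerID, pad the slots against false sharing, and merge the results once at the end instead of locking during the work. The following is only a minimal sketch of that general technique, not the VictoriaMetrics code. The names `paddedSum` and `parallelSum` are hypothetical, and the 64-byte cache line size is an assumption.

```go
package main

import (
	"fmt"
	"sync"
)

// paddedSum holds one worker's partial result.
// The padding keeps neighbouring slots on separate cache lines,
// assuming 64-byte cache lines and an 8-byte int (hypothetical layout).
type paddedSum struct {
	sum int
	_   [56]byte
}

// parallelSum splits values across the given number of workers.
// Each goroutine writes only to partial[workerID], so no mutex or
// sync.Map is needed while the work is running.
func parallelSum(values []int, workers int) int {
	if workers < 1 {
		workers = 1
	}
	// Allocate all per-worker slots in a single call.
	partial := make([]paddedSum, workers)

	var wg sync.WaitGroup
	for workerID := 0; workerID < workers; workerID++ {
		wg.Add(1)
		go func(workerID int) {
			defer wg.Done()
			// Worker i handles items i, i+workers, i+2*workers, ...
			for i := workerID; i < len(values); i += workers {
				partial[workerID].sum += values[i]
			}
		}(workerID)
	}
	wg.Wait()

	// Merge the per-worker results once, after all workers have finished.
	total := 0
	for i := range partial {
		total += partial[i].sum
	}
	return total
}

func main() {
	fmt.Println(parallelSum([]int{1, 2, 3, 4, 5, 6, 7, 8, 9, 10}, 4))
}
```

The trade-off in this sketch is a little extra memory per worker in exchange for no shared writes on the hot path; the only synchronization point is the final `wg.Wait()`.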


2383 commits
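
The vmalert fixes listed above for `/api/v1/rules` and `/api/v1/alerts` (#4222, #4344) rest on a general property of Go's `encoding/json`: a nil slice is encoded as `null`, while an empty non-nil slice is encoded as `[]`. The sketch below illustrates only that property. The `group` type and the `encodeGroup` helper are hypothetical and are not taken from vmalert.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// group is a hypothetical stand-in for an API response item.
type group struct {
	Name  string   `json:"name"`
	Rules []string `json:"rules"`
}

// encodeGroup marshals a group to JSON. When forceNonNil is set,
// a nil rules slice is replaced with an empty one before encoding.
func encodeGroup(name string, rules []string, forceNonNil bool) string {
	if forceNonNil && rules == nil {
		rules = []string{}
	}
	data, err := json.Marshal(group{Name: name, Rules: rules})
	if err != nil {
		return ""
	}
	return string(data)
}

func main() {
	fmt.Println(encodeGroup("empty", nil, false)) // {"name":"empty","rules":null}
	fmt.Println(encodeGroup("empty", nil, true))  // {"name":"empty","rules":[]}
}
```

Clients that iterate over the `rules` field without a null check tend to handle the second form more gracefully, which matches the Grafana compatibility motivation stated in the commit message for #4222.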