github-mirrors/VictoriaMetrics

mirror of https://github.com/VictoriaMetrics/VictoriaMetrics.git synced 2024-12-11 14:53:49 +00:00

Author	SHA1	Message	Date
Zakhar Bessarab	30c869dfc4	app/vmselect: fix panic when using `/select/multitenant` endpoint (#4912 ) app/vmselect: fix panic when using `/select/multitenant` endpoint Such requests must be rejected as not found since vmselect does not support multitenant endpoint. See: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4910 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2023-08-30 15:24:29 +02:00
Aliaksandr Valialkin	27f790458b	lib/promrelabel: properly replace `:` char with `_` in metric names when -usePromCompatibleNaming command-line flag is set This addresses https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3113#issuecomment-1275077071 comment from @johnseekins	2023-08-17 13:54:59 +02:00
Roman Khavronenko	0783be0d15	vmbackup: correctly check if specified `-dst` belongs to specified `-storageDataPath` (#4841 ) See this issue https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4837 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-08-17 13:54:17 +02:00
Roman Khavronenko	655d64b27f	vmctl: interrupt explore procedure in influx mode if no numeric fields were found (#4576 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-08-12 13:41:50 -07:00
Dmytro Kozlov	ea38e35205	app/vmctl: fix panic `--remote-read-filter-time-start` flag not defined (#4605 ) * app/vmctl: fix panic `--remote-read-filter-time-start` flag not defined * app/vmctl: update CHANGELOG.md --------- Co-authored-by: Nikolay <nik@victoriametrics.com>	2023-08-12 13:39:43 -07:00
Roman Khavronenko	9b83737a75	vmalert: check for negative offset for missed rounds (#4628 ) It could happen for low evaluation intervals and irregular delays during execution that evaluation time would get a negative offset. This could result into cumulative discrepancy between the actual time and evaluation time for rules. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-08-12 13:36:13 -07:00
Haleygo	22954607ba	vmalert: fix evalTS after modify group interval (#4629 )	2023-08-12 13:34:33 -07:00
Haleygo	f5a25ba980	vmselect: fix result in Prometheus query when time is small (#4578 ) vmselect: fix result in Prometheus query when time is small Co-authored-by: Roman Khavronenko <roman@victoriametrics.com>	2023-08-12 13:32:46 -07:00
Aliaksandr Valialkin	973bbd16b0	lib/promscrape/discovery: close unused HTTP connections to service discovery servers This should prevent from connection leaks See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4724	2023-08-12 13:30:35 -07:00
Zakhar Bessarab	1a75119a92	app/vmagent/remotewrite: fix vmagent panic on shutdown (#4407 ) app/vmagent/remotewrite: fix vmagent panic on shutdown Currently, when vmagent is stopping it first flushes pending series in remote write context and proceeds to stop streaming aggregation. This leads to streaming aggregation being unable to write results into pending timeseries (since it is already nil) and panic. This can lead to losing some aggregation results being lost almost silently. The fix is reordering flow to first stop streaming aggregation and flush all pending time series after that. Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2023-08-12 13:20:15 -07:00
Aliaksandr Valialkin	d3e5c2acf2	app/vmselect/netstorage: follow-up after `11ac551d52` - Clarify the scope of the fix at docs/CHANGELOG.md - Handle the case when -search.maxSamplesPerSeries limit is exceeded in the same way as the -search.maxSamplesPerQuery limit. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/4472	2023-08-12 13:11:01 -07:00
Roman Khavronenko	324f5eca63	app/vmselect/netstorage: properly process `-search.maxSamplesPerQuery` limit (#4472 ) Properly return the error to user when `-search.maxSamplesPerQuery` limit is exceeded. Before, user could have received a partial response instead. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-08-12 13:06:56 -07:00
Aliaksandr Valialkin	0506bead84	Upgrade Go builder from 1.20.4 to 1.21.0	2023-08-12 12:35:43 -07:00
Roman Khavronenko	3d820c0da8	vmalert: do not return nil `rules` for /api/v1/rules (#4344 ) The fix addresses a case when vmalert is configured with a group which has `name`, but doesn't have `rules` configured. In this case it still returns a `nil` instead of `[]` slice. Fixing this via current commit. See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4221 Signed-off-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `66ed6fe62f`)	2023-06-05 11:45:52 +02:00
Roman Khavronenko	ea920edd32	vmalert: properly form assets address if httpPrefix set (#4351 ) Properly form path to static assets in WEB UI if `http.pathPrefix` set. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4349 Signed-off-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `51cea6cad4`) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-06-05 11:45:51 +02:00
Roman Khavronenko	aeb386c98a	vmalert: fix nil map assignment (#4392 ) * vmalert: fix nil map assignment The storage instance with nil map params was created for remote-read purposes. And before change `7a9ae9de0d` this map was ignored in ApplyParams. Now, it started to be used and vmalert panics in runtime. The fix properly inits map for at `NewVMStorage` and verifies it is not nil on assignment in `ApplyParams`. Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: add to changelog Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: properly clone Storage params Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: properly clone Storage params Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: properly clone Storage params Signed-off-by: hagen1778 <roman@victoriametrics.com> --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `de94812088`) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-06-05 11:45:50 +02:00
Roman Khavronenko	1e8562fbd2	app/vmalert: follow-up after `7a9ae9de0d` (#4381 ) `7a9ae9de0d` Signed-off-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `eccecdf177`)	2023-06-05 11:45:50 +02:00
gsakun	283e2873ed	app/vmalert: fix datasource.roundDigits Parameter (#4341 ) app/vmalert: fix querybuild clone and extraParams merge logic See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4340 (cherry picked from commit `20dc3db71e`)	2023-06-05 11:45:50 +02:00
Nikolay	dc98abf28b	app/vmauth: do not return invalid credentials (#4288 ) at http response by default https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4188 based on https://github.com/VictoriaMetrics/VictoriaMetrics/pull/4190 Thanks @raj-kumar-j for init implementation	2023-05-17 00:11:50 -07:00
Aliaksandr Valialkin	3741a8d532	deployment/docker: update base docker image from 3.17.3 to 3.18.0 See https://www.alpinelinux.org/posts/Alpine-3.18.0-released.html	2023-05-12 17:33:48 -07:00
Aliaksandr Valialkin	ffddf0b15b	all: update Go builder from Go1.20.3 to Go1.20.4 See https://github.com/golang/go/issues?q=milestone%3AGo1.20.4+label%3ACherryPickApproved	2023-05-09 22:34:55 -07:00
Roman Khavronenko	977c43b88f	vmselect: exit early from queue on context cancel (#4223 ) * vmselect: exit early from queue on context cancel When `-search.maxConcurrentRequests` is reached, vmselect puts request in the queue. It is expected, that requests in the queue will be processed as soon as it would be enough capacity to do so. However, it could happen that while request was waiting its turn, the client could have already cancel it (close the connection, or just close the tab with UI). In this case, we should de-queue such requests to avoid spending extra resources on them. Signed-off-by: hagen1778 <roman@victoriametrics.com> * app/vmselect: address review comments Signed-off-by: hagen1778 <roman@victoriametrics.com> --------- Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-05-08 22:58:55 -07:00
Roman Khavronenko	07e6b74dfe	vmalert: fix API to return non-nil values (#4222 ) Properly return empty slices instead of nil for `/api/v1/rules` and `/api/v1/alerts` API handlers. This improves compatibility with Grafana. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4221 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-05-08 21:49:25 -07:00
Nikolay	ff8eceb9f2	app/vminsert: correctly allocate buffer for storagenodes (#554 ) in case of dynamic discovery number of nodes may change and we have to allocate new buffer for this case otherwise vminsert may panic	2023-05-08 08:52:59 -07:00
Aliaksandr Valialkin	d780490618	app/vmstorage: deprecate -bigMergeConcurrency command-line flag Improperly configured -bigMergeConcurrency command-line flag usually leads to uncontrolled growth of unmerged parts, which, in turn, increases CPU usage and query durations. So it is better deprecating this flag. In rare cases -smallMergeConcurrency command-line flag can be used instead for controlling the concurrency of background merges.	2023-04-13 23:42:12 -07:00
Aliaksandr Valialkin	b6c475e48c	all: update Go builder from Go1.20.2 to Go1.20.3 See https://github.com/golang/go/issues?q=milestone%3AGo1.20.3+label%3ACherryPickApproved	2023-04-05 13:41:57 -07:00
Max Golionko	4922ba85e9	fix: app/vmui/Dockerfile-web to reduce vulnerabilities (#4044 ) The following vulnerabilities are fixed with an upgrade: - https://snyk.io/vuln/SNYK-ALPINE317-OPENSSL-3368755 - https://snyk.io/vuln/SNYK-ALPINE317-OPENSSL-3368755 - https://snyk.io/vuln/SNYK-ALPINE317-OPENSSL-5291795 - https://snyk.io/vuln/SNYK-ALPINE317-OPENSSL-5291795 Co-authored-by: snyk-bot <snyk-bot@snyk.io>	2023-03-31 22:49:40 -07:00
Aliaksandr Valialkin	1c18b37604	app/vmselect/promql: follow-up for `79e1c6a6fc` - Document the fix at docs/CHANGELOG.md - Add tests with multiple adjancent zero buckets - Simplify the fix a bit Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/296 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/4021	2023-03-27 18:06:10 -07:00
Ze'ev Klapow	8f33561797	fix le buckets when adjacent vmrange is empty (#4021 ) There is a bug here where if you have a single bucket like: foo{vmrange="4.084e+02...4.642e+02"} 2 123 The expected output is three le encoded buckets like: foo{le="4.084e+02"} 0 123 foo{le="4.642e+02"} 2 123 foo{le="+Inf"} 2 123 This correctly encodes the start and end of the vmrange. If however, the input contains the previous bucket, and that bucket is empty then you only get the end le and +Inf out currently, i.e: foo{vmrange="7.743e+05...8.799e+05"} 5 123 foo{vmrange="6.813e+05...7.743e+05"} 0 123 results in: foo{le="8.799e+05"} 5 123 foo{le="+Inf"} 5 123 This causes issues when you go to compute a quantile because this means that the assumed lower bound of the buckets is 0 and this we interpolate between 0->end rather than the vmrange start->end as expected.	2023-03-27 18:06:08 -07:00
Aliaksandr Valialkin	e6727fa51c	app/vmselect/netstorage: reduce the contention at fs.ReaderAt stats collection on systems with big number of CPU cores This optimization is based on the profile provided at https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3966#issuecomment-1483208419	2023-03-25 16:45:31 -07:00
Aliaksandr Valialkin	f47e26025c	app/vmselect/netstorage: document why runtime.Gosched() is removed at `28f054bb00` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3966	2023-03-25 16:45:31 -07:00
Zakhar Bessarab	1c34cc33e5	vmselect/netstorage: remove direct calls to `Gosched` to reduce amount of locks for global scope using `runtime.Gosched` requires acquiring global lock to check if there are any other goroutines to perform tasks. with the latest versions of runtime it can pause running goroutines automatically without requiring to call `Gosched` directly. Updates #3966 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2023-03-25 16:45:31 -07:00
Aliaksandr Valialkin	b0e64f95ad	app/vmselect/netstorage: reduce the number of calls to runtime.Gosched() at timeseriesWorker() and unpackWorker() Call runtime.Gosched() only when there is a work to steal from other workers. Simplify the timeseriesWorker() and unpackWroker() code a bit by inlining stealTimeseriesWork() and stealUnpackWork(). This should reduce CPU usage when processing queries on systems with big number of CPU cores. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3966	2023-03-25 16:43:47 -07:00
Aliaksandr Valialkin	511db8c453	app/vmselect/promql: typo fix after `e7f46a0aab`	2023-03-25 01:25:49 -07:00
Aliaksandr Valialkin	cee0fb8071	app/vmselect/promql: follow-up for `7205c79c5a` - Allocate and initialize seriesByWorkerID slice in a single go instead of initializing every item in the list separately. This should reduce CPU usage a bit. - Properly set anti-false sharing padding at timeseriesWithPadding structure - Document the change at docs/CHANGELOG.md Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3966	2023-03-25 01:25:08 -07:00
Zakhar Bessarab	71044221f6	app/vmselect/promql: use lock-less approach to gather results of parallel processing for `evalRollup` funcs (#4004 ) vmselect/promql: refactor `evalRollupNoIncrementalAggregate` to use lock-less approach for parallel workers computation Locking there is causing issues when running on highly multi-core system as it introduces lock contention during results merge. New implementation uses lock less approach to store results per workerID and merges final result in the end, this is expected to significantly reduce lock contention and CPU usage for systems with high number of cores. Related: #3966 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * vmselect/promql: add pooling for `timeseriesWithPadding` to reduce allocations Related: #3966 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * vmselect/promql: refactor `evalRollupFuncWithSubquery` to avoid using locks Uses same approach as `evalRollupNoIncrementalAggregate` to remove locking between workers and reduce lock contention. Related: #3966 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> --------- Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2023-03-25 01:24:20 -07:00
Aliaksandr Valialkin	d8e218d99e	app/vmselect/promql: pass workerID to the callback inside doParallel() This opens the possibility to remove tssLock from evalRollupFuncWithSubquery() in the follow-up commit from @zekker6 in order to speed up the code for systems with many CPU cores. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3966	2023-03-25 01:23:06 -07:00
Aliaksandr Valialkin	9b5245a836	app/vmselect/promql: fix TestIncrementalAggr test on systems less than 3 CPU cores This is a follow-up for `4856a4cf5a`	2023-03-25 01:21:43 -07:00
Aliaksandr Valialkin	c780c6a280	app/vmselect: optimize incremental aggregates a bit Substitute sync.Map with an ordinary slice indexed by workerID. This should reduce the overhead when updating the incremental aggregate state	2023-03-24 23:50:31 -07:00
oliverpool	a3bc64e7ed	app/vmselect/promql: add test to ensure 8-byte alignment (#3948 ) See `0af9e2b693`	2023-03-24 23:49:04 -07:00
Aliaksandr Valialkin	8a878fa169	app/vmbackup: simplify code a bit after `5ba347bd2c` Unconditionally call deleteSnapshot() func just after making the snapshot, either successful or unsuccessful Related issue: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2055	2023-03-24 22:20:20 -07:00
Zakhar Bessarab	91a5661b86	app/vmbackup: delete created snapshot in case of error during backup (#4008 ) Related issue: #2055 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-03-24 22:20:20 -07:00
Aliaksandr Valialkin	1050d6bc0f	app/vmselect/promql: prevent from `cannot unmarshal timeseries from rollupResultCache` panic after the upgrade to v1.89.0 The issue has been introduced in `0af9e2b693`	2023-03-12 19:10:48 -07:00
Aliaksandr Valialkin	5a9300be13	app/vmselect: remove data race on updating EvalConfig.IsPartialResponse from concurrently running goroutines This properly returns `is_partial: true` for partial responses.	2023-03-12 16:57:28 -07:00
Aliaksandr Valialkin	c8f8aa85a2	app/vmselect/promql: prevent from SIGBUS crash on architecures, which deny unaligned access to 8-byte words (e.g. ARM) Thanks to @oliverpool for nailing down the root cause of the issue and for the initial attempt to fix it at https://github.com/VictoriaMetrics/VictoriaMetrics/pull/3927	2023-03-12 16:56:37 -07:00
Roman Khavronenko	3d2cfc4f33	security: bump go version to 1.20.2 (#3935 ) upgrade Go builder from Go1.20.1 to Go1.20.2 See the list of issues addressed in Go1.20.2 here (https://github.com/golang/go/issues?q=milestone%3AGo1.20.2+label%3ACherryPickApproved). Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-03-12 01:38:51 -08:00
Aliaksandr Valialkin	7f34158db5	app/vmselect/netstorage: do not intern string representation of MetricName for time series received from vmstorage It has been appeared that this interning may lead to increased memory usage and increased CPU usage when vmselect performs queries, which select big number of time series. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3692 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3863	2023-03-12 01:36:07 -08:00
Aliaksandr Valialkin	bc144e2b05	all: follow-up for `7a3e16e774` - Sync the description for -httpListenAddr.useProxyProtocol command-line flag at vmagent and vmauth, so it is consistent with the description at vmauth and victoria-metrics - Add a sample of panic text to docs/CHANGELOG.md, so it could be googled - Mention the -httpListenAddr.useProxyProtocol command-line flag in the description for the bugfix Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3335	2023-03-12 01:19:55 -08:00
Nikolay	c80d0aaaf0	lib/netutil: fixes panic at proxy protocol (#3905 ) it may occur if non proxy protocol message received by tcp server. Listener Accept method must return only non-recoverable errors. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3335	2023-03-12 01:12:53 -08:00
Dmytro Kozlov	d161873857	app/vmctl: skip series if measurement not found (#3869 ) app/vmctl: skip measurements with no fields for influxdb mode	2023-02-27 12:04:48 -08:00

1 2 3 4 5 ...

2372 commits