github-mirrors/VictoriaMetrics

mirror of https://github.com/VictoriaMetrics/VictoriaMetrics.git synced 2024-11-21 14:44:00 +00:00

Author	SHA1	Message	Date
Aliaksandr Valialkin	d577657fb7	lib/streamaggr: follow-up for `ff72ca14b9` - Make sure that the last successfully loaded config is used on hot-reload failure - Properly cleanup resources occupied by already initialized aggregators when the current aggregator fails to be initialized - Expose distinct vmagent_streamaggr_config_reload* metrics per each -remoteWrite.streamAggr.config This should simplify monitoring and debugging failed reloads - Remove race condition at app/vminsert/common.MustStopStreamAggr when calling sa.MustStop() while sa could be in use at realoadSaConfig() - Remove lib/streamaggr.aggregator.hasState global variable, since it may negatively impact scalability on system with big number of CPU cores at hasState.Store(true) call inside aggregator.Push(). - Remove fine-grained aggregator reload - reload all the aggregators on config change instead. This simplifies the code a bit. The fine-grained aggregator reload may be returned back if there will be demand from real users for it. - Check -relabelConfig and -streamAggr.config files when single-node VictoriaMetrics runs with -dryRun flag - Return back accidentally removed changelog for v1.87.4 at docs/CHANGELOG.md Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3639	2023-03-31 22:30:38 -07:00
Roman Khavronenko	4a49577028	vmalert: use `missingkey=zero` for templating (#4040 ) Replace empty labels with "" instead of "<no value>" during templating, as Prometheus does. See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4012 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-03-30 16:57:00 +04:00
Zakhar Bessarab	ec45f1bc5f	lib/fs: verify response code when reading configuration over HTTP (#4036 ) Verifying status code helps to avoid misleading errors caused by attempt to parse unsuccessful response. Related issue: #4034 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2023-03-30 13:18:00 +02:00
Alexander Marshalov	ff72ca14b9	added hot reload support for stream aggregation configs (#3969 ) (#3970 ) added hot reload support for stream aggregation configs (#3969) Signed-off-by: Alexander Marshalov <_@marshalov.org>	2023-03-29 18:05:58 +02:00
Aliaksandr Valialkin	94cabf29b0	lib/flagutil: ArrayString: support commas inside quoted strings and inside `[]`, `{}` and `()` braces Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3915	2023-03-28 21:22:55 -07:00
Aliaksandr Valialkin	aea6df8197	app/vmagent/remotewrite: cosmetic updates after `f3a51e8b1d` - Compare directory names instead of paths to directory when determining which persistent queues must be deleted This is less error-prone solution, since paths to the same directory can differ, which could lead to accidental directory removal for the existing -remoteWrite.url - Log the `removed %d dangling queues` message when at least a single queue has been removed - Consistently use filepath.Join() for creating paths to persistent queues. This is needed for Windows support (see https://github.com/VictoriaMetrics/VictoriaMetrics/issues/70 ) - Clarify the description of the change at docs/CHANGELOG.md Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4014	2023-03-27 18:33:07 -07:00
Zakhar Bessarab	f3a51e8b1d	app/vmagent: add `-remoteWrite.removeDanglingQueues` flag (#4017 ) * app/vmagent: add `-remoteWrite.removeDanglingQueues` flag which allows to automatically remove dangling persistent queue contents Related issue: #4014 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * app/vmagent: address review feedback - remove persistent queues files by default - rename `remoteWrite.removeDanglingQueues` to `remoteWrite.keepDanglingQueues` - update docs to reflect changed behaviour Related issue: #4014 * Apply suggestions from code review --------- Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-03-27 18:15:28 -07:00
Aliaksandr Valialkin	02ee4ffd4d	app/vmselect/promql: follow-up for `79e1c6a6fc` - Document the fix at docs/CHANGELOG.md - Add tests with multiple adjancent zero buckets - Simplify the fix a bit Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/296 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/4021	2023-03-27 18:03:36 -07:00
Yury Molodov	3214b1c315	vmui: heatmap (#3780 ) * fix: add stroke and font for all axes * feat: add util for generate gradient * feat: add heatmap plugin * feat: add heatmap legend * feat: add heatmap graph (#3384) * vmui: add heatmap graph (#3384) * feat: add convert Prometheus to VictoriaMetrics histogram * fix: prevent re-render graph * feat: reset step for heatmap * feat: normalize heatmap data * fix: format heatmap legend * wip * app/vmselect/vmui: run `make vmui-update` --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-03-26 00:30:02 -07:00
Aliaksandr Valialkin	72a0b49330	docs/CHANGELOG.md: document v1.87.4 LTS release	2023-03-25 22:43:59 -07:00
Aliaksandr Valialkin	811f4a9380	app/{vmbackup,vmrestore}: publish vmbackup and vmrestore binaries for Windows Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/70	2023-03-25 15:08:21 -07:00
Aliaksandr Valialkin	e7f46a0aab	app/vmselect/promql: follow-up for `7205c79c5a` - Allocate and initialize seriesByWorkerID slice in a single go instead of initializing every item in the list separately. This should reduce CPU usage a bit. - Properly set anti-false sharing padding at timeseriesWithPadding structure - Document the change at docs/CHANGELOG.md Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3966	2023-03-24 23:34:37 -07:00
Zakhar Bessarab	5ba347bd2c	app/vmbackup: delete created snapshot in case of error during backup (#4008 ) Related issue: #2055 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-03-24 21:49:58 -07:00
Aliaksandr Valialkin	27f9a1eda2	docs/CHANGELOG.md: cosmetic fixes: remove trailing whitespace and consistently use `-flag` instead of `--flag`	2023-03-24 15:44:33 -07:00
Alexander Marshalov	7c86dcc4fa	allowed using dashes and dots in environment variables names (#4009 ) * allowed using dashes and dots in environment variables names for templating config files with envtemplate (#3999) Signed-off-by: Alexander Marshalov <_@marshalov.org> * Apply suggestions from code review --------- Signed-off-by: Alexander Marshalov <_@marshalov.org> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-03-24 15:43:05 -07:00
Aliaksandr Valialkin	c1d871a45a	docs/vmauth.md: follow-up for `36edba9bfb` - Document `-configCheckInterval` command-line flag in `quick start` section - Clarify the addition of `-configCheckInterval` at docs/CHANGELOG.md Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3990	2023-03-24 13:22:37 -07:00
Dmytro Kozlov	ba505dd357	docs: follow up after `dc2c712a29` (#4001 )	2023-03-23 18:27:55 +01:00
Yury Molodov	023c65968f	vmui: display errors for each query individually (#3987 ) (#3994 )	2023-03-23 13:10:59 +01:00
Alexander Marshalov	36edba9bfb	added configCheckInterval flag for vmauth (#3990 ) (#3991 ) * added configCheckInterval flag for vmauth (#3990) Signed-off-by: Alexander Marshalov <_@marshalov.org>	2023-03-23 09:34:12 +01:00
Nikolay	a2f716b6cc	lib/netutil: log only parsing errors for proxy-protocol (#3985 ) * lib/netutil: log only parsing errors for proxy-protocol Previosly every error was logged. With configured TCP health checks at load-balancer or kubernetes, vmauth spams a lot of false positive error message into logs * Update docs/CHANGELOG.md Co-authored-by: Roman Khavronenko <roman@victoriametrics.com> * Update lib/netutil/tcplistener.go Co-authored-by: Roman Khavronenko <roman@victoriametrics.com> --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com> Co-authored-by: Roman Khavronenko <roman@victoriametrics.com>	2023-03-21 10:22:39 -07:00
Dmytro Kozlov	e79cd24807	lib/promrelabel: make target url from labels on target relabel page (#3882 ) * lib/promrelabel: make target url from labels on target relabel page * wip --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-03-20 22:07:52 -07:00
Aliaksandr Valialkin	8d709f3483	docs/CHANGELOG.md: cosmetic fixes	2023-03-20 14:14:20 -07:00
Dmytro Kozlov	8da9502df6	app/vmctl: automatically check tty (#3938 ) app/vmctl: automatically detect if TTY is available	2023-03-20 11:16:08 +01:00
Yury Molodov	d4525bd2d0	vmui: support for drag'n'drop in the "Trace analyzer" page (#3971 ) vmui: add drag-and-drop support for the trace analyzer page	2023-03-20 11:07:18 +01:00
Yury Molodov	a2af2e5a1b	vmui: improve usability of date/time picker (#3968 ) * vmui: allow manually set input date and time * vmui/docs: improve usability of date/time picker	2023-03-20 09:22:49 +01:00
Aliaksandr Valialkin	43b24164ef	all: add Windows build for VictoriaMetrics This commit changes background merge algorithm, so it becomes compatible with Windows file semantics. The previous algorithm for background merge: 1. Merge source parts into a destination part inside tmp directory. 2. Create a file in txn directory with instructions on how to atomically swap source parts with the destination part. 3. Perform instructions from the file. 4. Delete the file with instructions. This algorithm guarantees that either source parts or destination part is visible in the partition after unclean shutdown at any step above, since the remaining files with instructions is replayed on the next restart, after that the remaining contents of the tmp directory is deleted. Unfortunately this algorithm doesn't work under Windows because it disallows removing and moving files, which are in use. So the new algorithm for background merge has been implemented: 1. Merge source parts into a destination part inside the partition directory itself. E.g. now the partition directory may contain both complete and incomplete parts. 2. Atomically update the parts.json file with the new list of parts after the merge, e.g. remove the source parts from the list and add the destination part to the list before storing it to parts.json file. 3. Remove the source parts from disk when they are no longer used. This algorithm guarantees that either source parts or destination part is visible in the partition after unclean shutdown at any step above, since incomplete partitions from step 1 or old source parts from step 3 are removed on the next startup by inspecting parts.json file. This algorithm should work under Windows, since it doesn't remove or move files in use. This algorithm has also the following benefits: - It should work better for NFS. - It fits object storage semantics. The new algorithm changes data storage format, so it is impossible to downgrade to the previous versions of VictoriaMetrics after upgrading to this algorithm. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3236 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3821 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/70	2023-03-19 01:36:51 -07:00
Aliaksandr Valialkin	6460475e3b	lib/{mergeset,storage}: prevent from long wait time when creating a snapshot under high data ingestion rate Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3551 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/3873	2023-03-19 00:15:30 -07:00
Nikolay	91cbb9063d	Vmagent kafka updates (#535 ) * app/vmagent: allow vm proto for kafka consumer and producer it should reduce network usage up to 50%. According to benchmarks without any encoding at kafka topic, it reduces traffic up to 50%. With enabled zstd at kafka topic, it shows no diffence in traffic. So it doesn't make much sense to use it. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1225 * mention eb61a7dd68b834b08d01727a918f207700348ada at changelog * app/vmagent: bumps kafka lib version it allows compiling vmagent for arm64 machines fixes https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2271 * mention d19b1a888248c96cfd7ccee00ba6f596d89be1d7 at change log * app/vmagent: adds natural concurrency for kafka consumer it should improve performance for data consumption https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1957 * mention change 0c143bb22ca2e7e0b7eec9bc84a94ee2b41626ca * Update app/vmagent/kafka/consumer.go Co-authored-by: Roman Khavronenko <roman@victoriametrics.com> * Update app/vmagent/kafka/consumer_cgo.go Co-authored-by: Roman Khavronenko <roman@victoriametrics.com> --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com> Co-authored-by: Roman Khavronenko <roman@victoriametrics.com>	2023-03-15 13:03:44 -07:00
Zakhar Bessarab	6a5d236245	lib/storage: log original labels set when label value is truncated (#3952 ) lib/storage: log original labels set when label value is truncated	2023-03-14 10:59:40 +01:00
Aliaksandr Valialkin	3e7bfe1200	docs/CHANGELOG.md: document v1.87.3	2023-03-13 00:20:51 -07:00
Aliaksandr Valialkin	02ffe05750	docs/CHANGELOG.md: document v1.79.11 LTS release	2023-03-12 23:22:53 -07:00
Aliaksandr Valialkin	388d6ee16e	docs/CHANGELOG.md: cut v1.89.1	2023-03-12 19:14:19 -07:00
Aliaksandr Valialkin	e8225d7d6b	app/vmselect/promql: prevent from `cannot unmarshal timeseries from rollupResultCache` panic after the upgrade to v1.89.0 The issue has been introduced in `0af9e2b693`	2023-03-12 19:09:39 -07:00
Aliaksandr Valialkin	911bab4f6a	docs/CHANGELOG.md: cut v1.89.0	2023-03-12 17:29:44 -07:00
Aliaksandr Valialkin	468de76e9a	app/vmselect: remove data race on updating EvalConfig.IsPartialResponse from concurrently running goroutines This properly returns `is_partial: true` for partial responses.	2023-03-12 16:54:08 -07:00
Aliaksandr Valialkin	0af9e2b693	app/vmselect/promql: prevent from SIGBUS crash on architecures, which deny unaligned access to 8-byte words (e.g. ARM) Thanks to @oliverpool for nailing down the root cause of the issue and for the initial attempt to fix it at https://github.com/VictoriaMetrics/VictoriaMetrics/pull/3927	2023-03-12 16:32:08 -07:00
Yury Molodov	01367faa39	vmui: remove send step param for instant queries (#3931 ) * fix: remove step param for instant queries (#3896) * vmui: remove send step param for instant queries * Update docs/CHANGELOG.md --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-03-12 03:09:56 -07:00
Aliaksandr Valialkin	a52413ce0a	docs/CHANGELOG.md: document `113a89904d`	2023-03-12 01:58:18 -08:00
Aliaksandr Valialkin	b19de3fa12	docs/CHANGELOG.md: yet another typo fix	2023-03-12 01:06:40 -08:00
Aliaksandr Valialkin	2f1d24fccf	docs/CHANGELOG.md: typo fix	2023-03-12 01:04:14 -08:00
Aliaksandr Valialkin	b5db69fe05	app/vmselect/netstorage: do not intern string representation of MetricName for time series received from vmstorage It has been appeared that this interning may lead to increased memory usage and increased CPU usage when vmselect performs queries, which select big number of time series. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3692 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3863	2023-03-12 00:52:35 -08:00
Aliaksandr Valialkin	babc9e9815	docs/CHANGELOG.md: document `927d9da270`	2023-03-12 00:25:00 -08:00
Aliaksandr Valialkin	e3488c6cbc	docs/CHANGELOG.md: typo fixes	2023-03-12 00:09:26 -08:00
Aliaksandr Valialkin	48e32b325e	docs/CHANGELOG.md: document c9f44daaee8f4282d9ed41e3ba799c7a33841313	2023-03-11 23:55:13 -08:00
Roman Khavronenko	856c2db144	vmalert: support concurrent reading from object storage (#532 ) * vmalert: support concurrent reading from object storage Config reading from GCS or S3 can be slow if object storage contains a big number of files. Object storages are usually fast for downloading and are slow for individual operations. If there would be thousands of files to read, vmalert could spend significant time for retrieving those because it is done sequentially. The change introduces ability to read configs from object storage concurrently. By default, both GCS and S3 are now read with 50 concurrent readers. This significantly reduces the load time: * loading 500 files with concurrency=1 takes 27s * loading 500 files with concurrency=50 takes <1s * vmalert: add note to Changelog * vmalert: cleanup * vmalert: use ticker properly * app/vmalert: improve status reporting during config loading * vmalert: support concurrent reading from object storage Config reading from GCS or S3 can be slow if object storage contains a big number of files. Object storages are usually fast for downloading and are slow for individual operations. If there would be thousands of files to read, vmalert could spend significant time for retrieving those because it is done sequentially. The change introduces ability to read configs from object storage concurrently. By default, both GCS and S3 are now read with 50 concurrent readers. This significantly reduces the load time: * loading 500 files with concurrency=1 takes 27s * loading 500 files with concurrency=50 takes <1s * app/vmalert: make linter happy	2023-03-11 23:51:23 -08:00
Dmytro Kozlov	3c9058c168	app/vmctl: add support of basic auth and barer token (#3921 ) app/vmctl: add support of basic auth and bearer token	2023-03-09 14:53:29 +01:00
Roman Khavronenko	d66bae212b	app/vmalert: log number of configration files found for each specified `-rule` (#3936 ) The change also introduces `List` method to `FS` interface. The `List` method can be used for wildcard support in object storage FS. Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: Nikolay <nik@victoriametrics.com>	2023-03-09 14:46:19 +01:00
Dmytro Kozlov	7f54c181bb	app/vmctl: follow up after `09e3742a82` (#3937 ) app/vmctl: follow up after `09e3742a82`	2023-03-09 13:28:55 +01:00
Roman Khavronenko	3de7fc5c71	security: bump go version to 1.20.2 (#3935 ) upgrade Go builder from Go1.20.1 to Go1.20.2 See the list of issues addressed in Go1.20.2 here (https://github.com/golang/go/issues?q=milestone%3AGo1.20.2+label%3ACherryPickApproved). Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-03-09 13:20:54 +01:00
Aliaksandr Valialkin	1b5dc9f91d	all: follow-up for `7a3e16e774` - Sync the description for -httpListenAddr.useProxyProtocol command-line flag at vmagent and vmauth, so it is consistent with the description at vmauth and victoria-metrics - Add a sample of panic text to docs/CHANGELOG.md, so it could be googled - Mention the -httpListenAddr.useProxyProtocol command-line flag in the description for the bugfix Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3335	2023-03-08 01:26:55 -08:00

1 2 3 4 5 ...

1291 commits