github-mirrors/VictoriaMetrics

mirror of https://github.com/VictoriaMetrics/VictoriaMetrics.git synced 2024-11-21 14:44:00 +00:00

Author	SHA1	Message	Date
Aliaksandr Valialkin	5582a24ecf	lib/streamaggr: add tests for keep_metric_names and drop_input_labels options	2024-03-06 18:34:04 +02:00
Aliaksandr Valialkin	b4b38f782c	app/vmagent/remotewrite: clarify the reason behind the default value for -remoteWrite.queues in the same way as the reason for -maxConcurrentInserts is defined at `73f5fb0f0c`	2024-03-06 13:43:08 +02:00
hagen1778	73f5fb0f0c	lib/writeconcurrencylimiter: mention dependency on CPU cores for `-maxConcurrentInserts` flag The change also removes misleading `default` value from README for `maxConcurrentInserts` cmd-line flag. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-03-05 18:55:38 +01:00
Aliaksandr Valialkin	da611ad628	app/{vmagent,vminsert}: add `-streamAggr.dropInputSamples` command-line flag for dropping the specified labels from input samples before deduplication and streaming aggregation	2024-03-05 02:15:01 +02:00
Aliaksandr Valialkin	ed523b5bbc	app/{vminsert,vmagent}: allow using -streamAggr.dedupInterval without -streamAggr.config This allows performing online de-duplication of incoming samples	2024-03-05 00:45:30 +02:00
Aliaksandr Valialkin	22d63ac7cd	lib/streamaggr: do not reset aggregation state after the aggregation took longer than the configured interval It is better from user PoV preserving this state until the next flush	2024-03-04 20:03:06 +02:00
Aliaksandr Valialkin	32653db7d5	lib/streamaggr: add missing "s" suffix in the warning message when the de-duplication or aggregation couldnt be finished in a timely manner	2024-03-04 19:37:58 +02:00
Aliaksandr Valialkin	6319d029a8	lib/streamaggr: benchmark only flush routines in BenchmarkDedupAggrFlushSerial and BenchmarkAggregatorsFlushSerial	2024-03-04 19:12:28 +02:00
Aliaksandr Valialkin	074abd5bee	Revert "lib/streamaggr: do not flush dedup shards in parallel" This reverts commit `eb40395a1c`. Reason for revert: it has been appeared that the performance gain on multiple CPU cores wasn't visible because the benchmark was generating incorrect pushSample.key. See a207e0bf687d65f5198207477248d70c69284296	2024-03-04 19:12:28 +02:00
Aliaksandr Valialkin	e70177c5fb	lib/streamaggr: properly generate pushSample.key in benchmarks	2024-03-04 19:12:27 +02:00
Aliaksandr Valialkin	b232968bb4	lib/streamaggr: reduce the number of pointers at "total" aggregation state This should reduce load on GC when scanning heap objects.	2024-03-04 19:12:27 +02:00
Aliaksandr Valialkin	d42667fc41	lib/streamaggr: use multiple job label values in BenchmarkAggregatorsPush instead of single value This should make the benchmark closer to production cases	2024-03-04 19:12:26 +02:00
Aliaksandr Valialkin	f5bbffd45f	lib/streamaggr: use multiple job labels in BenchmarkAggregatorsPush	2024-03-04 19:12:26 +02:00
Aliaksandr Valialkin	eb40395a1c	lib/streamaggr: do not flush dedup shards in parallel This significantly increases CPU usage on systems with many CPU cores, while doesn't reduce flush latency too much	2024-03-04 17:00:20 +02:00
Aliaksandr Valialkin	946814afee	lib/streamaggr: reduce memory allocations when registering new series in deduplication and aggregation structs	2024-03-04 17:00:19 +02:00
Aliaksandr Valialkin	925f60841f	lib/streamaggr: make aggregate.runFlusher() more roubst and clear	2024-03-04 17:00:19 +02:00
Aliaksandr Valialkin	aa5e7e268c	lib/streamaggr: properly drop samples on the first incomplete interval Previously samples were dropped on the first incomplete interval and the next complete interval. Also make sure that the de-duplication is performed just before flushing the aggregate state. This should help the case then dedup_interval = interval.	2024-03-04 17:00:18 +02:00
Aliaksandr Valialkin	86494518da	lib/streamaggr: explicitly call resetSeries after flushSeries This makes the code less fragile	2024-03-04 06:01:18 +02:00
Aliaksandr Valialkin	ac3cf3f357	lib/streamaggr: enable time alignment for aggregate flushed to multiples of interval For example, if `interval: 1m`, then data flush occurs at the end of every minute, while `interval: 1h` leads to data flush at the end of every hour. Add `no_align_flush_to_interval` option, which can be used for disabling the alignment.	2024-03-04 05:42:58 +02:00
Aliaksandr Valialkin	138a4d1c2b	lib/streamaggr: ignore the first sample in new time series during staleness_interval seconds after the stream aggregation start for total and increase outputs	2024-03-04 01:49:26 +02:00
Aliaksandr Valialkin	0422ae01ba	lib/streamaggr: flush dedup state and aggregation state in parallel on all the available CPU cores This should reduce the time needed for aggregation state flush on systems with many CPU cores	2024-03-04 01:21:50 +02:00
Aliaksandr Valialkin	3c06b3af92	lib/streamaggr: add a benchmark for flushing dedup state	2024-03-04 01:16:30 +02:00
Aliaksandr Valialkin	9648c88b71	lib/streamaggr: add a benchmark for measuring the performance of aggregator.flush	2024-03-04 00:45:48 +02:00
Aliaksandr Valialkin	54a1c506e3	lib/streamaggr: add a benchmark for de-duplicating of 1M samples	2024-03-04 00:26:59 +02:00
Aliaksandr Valialkin	614d34e539	lib/prompbmarshal: use clear() instead of a loop for clearing tss inside ResetTimeSeries()	2024-03-03 23:40:34 +02:00
Aliaksandr Valialkin	4e65636b44	lib/promutils: optimize LabelsCompressor.Decompress by using a specialized labelsMap struct instead of sync.Map The labelsMap struct employs the fact that label indexes are condensed around 0, so it stores the referred labels in a slice instead of map and uses slice index as label key. This allows increasing the LabelsCompressor.Decompress performance by up to 3x. This also reduces the latency of data flush in stream aggregation.	2024-03-03 23:21:25 +02:00
Aliaksandr Valialkin	28a9e92b5e	lib/streamaggr: huge pile of changes - Reduce memory usage by up to 5x when de-duplicating samples across big number of time series. - Reduce memory usage by up to 5x when aggregating across big number of output time series. - Add lib/promutils.LabelsCompressor, which is going to be used by other VictoriaMetrics components for reducing memory usage for marshaled []prompbmarshal.Label. - Add `dedup_interval` option at aggregation config, which allows setting individual deduplication intervals per each aggregation. - Add `keep_metric_names` option at aggregation config, which allows keeping the original metric names in the output samples. - Add `unique_samples` output, which counts the number of unique sample values. - Add `increase_prometheus` and `total_prometheus` outputs, which ignore the first sample per each newly encountered time series. - Use 64-bit hashes instead of marshaled labels as map keys when calculating `count_series` output. This makes obsolete https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5579 - Expose various metrics, which may help debugging stream aggregation: - vm_streamaggr_dedup_state_size_bytes - the size of data structures responsible for deduplication - vm_streamaggr_dedup_state_items_count - the number of items in the deduplication data structures - vm_streamaggr_labels_compressor_size_bytes - the size of labels compressor data structures - vm_streamaggr_labels_compressor_items_count - the number of entries in the labels compressor - vm_streamaggr_flush_duration_seconds - a histogram, which shows the duration of stream aggregation flushes - vm_streamaggr_dedup_flush_duration_seconds - a histogram, which shows the duration of deduplication flushes - vm_streamaggr_flush_timeouts_total - counter for timed out stream aggregation flushes, which took longer than the configured interval - vm_streamaggr_dedup_flush_timeouts_total - counter for timed out deduplication flushes, which took longer than the configured dedup_interval - Actualize docs/stream-aggregation.md The memory usage reduction increases CPU usage during stream aggregation by up to 30%. This commit is based on https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5850 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5898	2024-03-02 02:42:50 +02:00
Aliaksandr Valialkin	eb8e95516f	lib/streamaggr: allow one second aggregation interval	2024-03-01 21:33:16 +02:00
Aliaksandr Valialkin	cf2e80a869	lib/promrelabel: use clear() function inside CleanLabels()	2024-03-01 21:33:15 +02:00
Aliaksandr Valialkin	c8c2c5f8e5	lib/fs: fix GOOS=windows build after `f8baf29b6e`	2024-03-01 01:46:29 +02:00
Aliaksandr Valialkin	5aa3dfbd20	lib/protoparser/opentelemetry/firehose: verify that the full response is parsed properly in ProcessRequestBody This is a follow-up for `bf9cb84575` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5899	2024-03-01 00:39:10 +02:00
Andrii Chubatiuk	bf9cb84575	opentelemetry: fix firehose message parsing (#5899 ) Co-authored-by: Andrii Chubatiuk <wachy@Andriis-MBP-2.lan>	2024-03-01 00:23:54 +02:00
Aliaksandr Valialkin	6a8dc74ee7	lib/mergeset: use unsafe.Slice and unsafe.String instead of deprecated reflect.SliceHeader with unsafe conversion from slice header to string header	2024-02-29 17:29:33 +02:00
Aliaksandr Valialkin	38e0397ebd	lib/bytesutil: use unsafe.String instead of unsafe conversion of slice header to string header	2024-02-29 17:27:51 +02:00
Aliaksandr Valialkin	e959f54351	lib/fs: properly handle the case when data=nil is passed to mUnmap	2024-02-29 17:26:07 +02:00
Aliaksandr Valialkin	c75bfd5b07	lib/storage: use unsafe.Slice instead of deprecated reflect.SliceHeader	2024-02-29 17:24:34 +02:00
Aliaksandr Valialkin	bb48d416fc	lib/protoparser/csvimport: unse unsafe.Slice instead of deprecated reflect.SliceHeader	2024-02-29 17:19:57 +02:00
Aliaksandr Valialkin	f8baf29b6e	lib/fs: use unsafe.Slice instead of deprecated reflect.SliceHeader	2024-02-29 17:18:33 +02:00
Aliaksandr Valialkin	7a04f99c72	lib/fastnum: use unsafe.Slice() instead of deprecated reflect.SliceHeader	2024-02-29 17:17:13 +02:00
Aliaksandr Valialkin	a3cf3d7de1	lib/bytesutil: make BenchmarkToUnsafeString and BenchmarkToUnsafeBytes more reliable This is needed for https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5880	2024-02-29 17:11:03 +02:00
helen	8266b77d0e	Optimize TouUnsafeBytes to make it leaner, more standards-compliant and (#5880 ) slightly faster.	2024-02-29 17:10:10 +02:00
XLONG96	a5795f533d	lib/logstorage: avoid panic when parsing regex with stream filter (#5897 )	2024-02-29 15:31:54 +02:00
Aliaksandr Valialkin	04d13f6149	app/{vminsert,vmagent}: follow-up after `67a55b89a4` - Document the ability to read OpenTelemetry data from Amazon Firehose at docs/CHANGELOG.md - Simplify parsing Firehose data. There is no need in trying to optimize the parsing with fastjson and byte slice tricks, since OpenTelemetry protocol is really slooow because of over-engineering. It is better to write clear code for better maintanability in the future. - Move Firehose parser from /lib/protoparser/firehose to lib/protoparser/opentelemetry/firehose, since it is used only by opentelemetry parser. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5893	2024-02-29 14:38:23 +02:00
Andrii Chubatiuk	67a55b89a4	{vmagent,vminsert}: added firehose http destination opentelemetry data ingestion support (#5893 ) Co-authored-by: Andrii Chubatiuk <wachy@Andriis-MBP-2.lan> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-02-29 14:03:24 +02:00
Aliaksandr Valialkin	6f203ebc9f	lib/streamaggr: make the BenchmarkAggregatorsPushByJobAvg closer to production case with long list of labels per sample	2024-02-29 02:39:16 +02:00
Hui Wang	8c33ba537a	chore: add actual request size in error message (#5889 )	2024-02-28 22:33:08 +08:00
Aliaksandr Valialkin	7e1dd8ab9d	lib: consistently use atomic.* types instead of atomic.* functions See `ea9e2b19a5`	2024-02-24 02:07:53 +02:00
Aliaksandr Valialkin	d5ca67e667	lib/backup/actions: expose vm_backups_downloaded_bytes_total metric in order to be consistent with vm_backups_uploaded_bytes_total metric	2024-02-24 01:14:50 +02:00
Aliaksandr Valialkin	906a35bdbb	lib/backup/actions: update vm_backups_uploaded_bytes_total metric along the file upload instead of after the file upload This solves two issues: 1. The vm_backups_uploaded_bytes_total metric will grow more smoothly 2. This prevents from int overflow at metrics.Counter.Add() when uploading files bigger than 2GiB	2024-02-24 01:07:20 +02:00
Aliaksandr Valialkin	ece86cd314	lib/backup/actions: consistently use atomic.* types instead of atomic.* functions See `ea9e2b19a5`	2024-02-24 01:02:21 +02:00

1 2 3 4 5 ...

2374 commits