github-mirrors/VictoriaMetrics

mirror of https://github.com/VictoriaMetrics/VictoriaMetrics.git synced 2024-11-21 14:44:00 +00:00

Author	SHA1	Message	Date
Aliaksandr Valialkin	f5bbffd45f	lib/streamaggr: use multiple job labels in BenchmarkAggregatorsPush	2024-03-04 19:12:26 +02:00
Aliaksandr Valialkin	eb40395a1c	lib/streamaggr: do not flush dedup shards in parallel This significantly increases CPU usage on systems with many CPU cores, while doesn't reduce flush latency too much	2024-03-04 17:00:20 +02:00
Aliaksandr Valialkin	946814afee	lib/streamaggr: reduce memory allocations when registering new series in deduplication and aggregation structs	2024-03-04 17:00:19 +02:00
Aliaksandr Valialkin	925f60841f	lib/streamaggr: make aggregate.runFlusher() more roubst and clear	2024-03-04 17:00:19 +02:00
Aliaksandr Valialkin	aa5e7e268c	lib/streamaggr: properly drop samples on the first incomplete interval Previously samples were dropped on the first incomplete interval and the next complete interval. Also make sure that the de-duplication is performed just before flushing the aggregate state. This should help the case then dedup_interval = interval.	2024-03-04 17:00:18 +02:00
Aliaksandr Valialkin	86494518da	lib/streamaggr: explicitly call resetSeries after flushSeries This makes the code less fragile	2024-03-04 06:01:18 +02:00
Aliaksandr Valialkin	ac3cf3f357	lib/streamaggr: enable time alignment for aggregate flushed to multiples of interval For example, if `interval: 1m`, then data flush occurs at the end of every minute, while `interval: 1h` leads to data flush at the end of every hour. Add `no_align_flush_to_interval` option, which can be used for disabling the alignment.	2024-03-04 05:42:58 +02:00
Aliaksandr Valialkin	138a4d1c2b	lib/streamaggr: ignore the first sample in new time series during staleness_interval seconds after the stream aggregation start for total and increase outputs	2024-03-04 01:49:26 +02:00
Aliaksandr Valialkin	0422ae01ba	lib/streamaggr: flush dedup state and aggregation state in parallel on all the available CPU cores This should reduce the time needed for aggregation state flush on systems with many CPU cores	2024-03-04 01:21:50 +02:00
Aliaksandr Valialkin	3c06b3af92	lib/streamaggr: add a benchmark for flushing dedup state	2024-03-04 01:16:30 +02:00
Aliaksandr Valialkin	9648c88b71	lib/streamaggr: add a benchmark for measuring the performance of aggregator.flush	2024-03-04 00:45:48 +02:00
Aliaksandr Valialkin	54a1c506e3	lib/streamaggr: add a benchmark for de-duplicating of 1M samples	2024-03-04 00:26:59 +02:00
Aliaksandr Valialkin	614d34e539	lib/prompbmarshal: use clear() instead of a loop for clearing tss inside ResetTimeSeries()	2024-03-03 23:40:34 +02:00
Aliaksandr Valialkin	4e65636b44	lib/promutils: optimize LabelsCompressor.Decompress by using a specialized labelsMap struct instead of sync.Map The labelsMap struct employs the fact that label indexes are condensed around 0, so it stores the referred labels in a slice instead of map and uses slice index as label key. This allows increasing the LabelsCompressor.Decompress performance by up to 3x. This also reduces the latency of data flush in stream aggregation.	2024-03-03 23:21:25 +02:00
Aliaksandr Valialkin	28a9e92b5e	lib/streamaggr: huge pile of changes - Reduce memory usage by up to 5x when de-duplicating samples across big number of time series. - Reduce memory usage by up to 5x when aggregating across big number of output time series. - Add lib/promutils.LabelsCompressor, which is going to be used by other VictoriaMetrics components for reducing memory usage for marshaled []prompbmarshal.Label. - Add `dedup_interval` option at aggregation config, which allows setting individual deduplication intervals per each aggregation. - Add `keep_metric_names` option at aggregation config, which allows keeping the original metric names in the output samples. - Add `unique_samples` output, which counts the number of unique sample values. - Add `increase_prometheus` and `total_prometheus` outputs, which ignore the first sample per each newly encountered time series. - Use 64-bit hashes instead of marshaled labels as map keys when calculating `count_series` output. This makes obsolete https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5579 - Expose various metrics, which may help debugging stream aggregation: - vm_streamaggr_dedup_state_size_bytes - the size of data structures responsible for deduplication - vm_streamaggr_dedup_state_items_count - the number of items in the deduplication data structures - vm_streamaggr_labels_compressor_size_bytes - the size of labels compressor data structures - vm_streamaggr_labels_compressor_items_count - the number of entries in the labels compressor - vm_streamaggr_flush_duration_seconds - a histogram, which shows the duration of stream aggregation flushes - vm_streamaggr_dedup_flush_duration_seconds - a histogram, which shows the duration of deduplication flushes - vm_streamaggr_flush_timeouts_total - counter for timed out stream aggregation flushes, which took longer than the configured interval - vm_streamaggr_dedup_flush_timeouts_total - counter for timed out deduplication flushes, which took longer than the configured dedup_interval - Actualize docs/stream-aggregation.md The memory usage reduction increases CPU usage during stream aggregation by up to 30%. This commit is based on https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5850 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5898	2024-03-02 02:42:50 +02:00
Aliaksandr Valialkin	eb8e95516f	lib/streamaggr: allow one second aggregation interval	2024-03-01 21:33:16 +02:00
Aliaksandr Valialkin	cf2e80a869	lib/promrelabel: use clear() function inside CleanLabels()	2024-03-01 21:33:15 +02:00
Aliaksandr Valialkin	c8c2c5f8e5	lib/fs: fix GOOS=windows build after `f8baf29b6e`	2024-03-01 01:46:29 +02:00
Aliaksandr Valialkin	5aa3dfbd20	lib/protoparser/opentelemetry/firehose: verify that the full response is parsed properly in ProcessRequestBody This is a follow-up for `bf9cb84575` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5899	2024-03-01 00:39:10 +02:00
Andrii Chubatiuk	bf9cb84575	opentelemetry: fix firehose message parsing (#5899 ) Co-authored-by: Andrii Chubatiuk <wachy@Andriis-MBP-2.lan>	2024-03-01 00:23:54 +02:00
Aliaksandr Valialkin	6a8dc74ee7	lib/mergeset: use unsafe.Slice and unsafe.String instead of deprecated reflect.SliceHeader with unsafe conversion from slice header to string header	2024-02-29 17:29:33 +02:00
Aliaksandr Valialkin	38e0397ebd	lib/bytesutil: use unsafe.String instead of unsafe conversion of slice header to string header	2024-02-29 17:27:51 +02:00
Aliaksandr Valialkin	e959f54351	lib/fs: properly handle the case when data=nil is passed to mUnmap	2024-02-29 17:26:07 +02:00
Aliaksandr Valialkin	c75bfd5b07	lib/storage: use unsafe.Slice instead of deprecated reflect.SliceHeader	2024-02-29 17:24:34 +02:00
Aliaksandr Valialkin	bb48d416fc	lib/protoparser/csvimport: unse unsafe.Slice instead of deprecated reflect.SliceHeader	2024-02-29 17:19:57 +02:00
Aliaksandr Valialkin	f8baf29b6e	lib/fs: use unsafe.Slice instead of deprecated reflect.SliceHeader	2024-02-29 17:18:33 +02:00
Aliaksandr Valialkin	7a04f99c72	lib/fastnum: use unsafe.Slice() instead of deprecated reflect.SliceHeader	2024-02-29 17:17:13 +02:00
Aliaksandr Valialkin	a3cf3d7de1	lib/bytesutil: make BenchmarkToUnsafeString and BenchmarkToUnsafeBytes more reliable This is needed for https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5880	2024-02-29 17:11:03 +02:00
helen	8266b77d0e	Optimize TouUnsafeBytes to make it leaner, more standards-compliant and (#5880 ) slightly faster.	2024-02-29 17:10:10 +02:00
XLONG96	a5795f533d	lib/logstorage: avoid panic when parsing regex with stream filter (#5897 )	2024-02-29 15:31:54 +02:00
Aliaksandr Valialkin	04d13f6149	app/{vminsert,vmagent}: follow-up after `67a55b89a4` - Document the ability to read OpenTelemetry data from Amazon Firehose at docs/CHANGELOG.md - Simplify parsing Firehose data. There is no need in trying to optimize the parsing with fastjson and byte slice tricks, since OpenTelemetry protocol is really slooow because of over-engineering. It is better to write clear code for better maintanability in the future. - Move Firehose parser from /lib/protoparser/firehose to lib/protoparser/opentelemetry/firehose, since it is used only by opentelemetry parser. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5893	2024-02-29 14:38:23 +02:00
Andrii Chubatiuk	67a55b89a4	{vmagent,vminsert}: added firehose http destination opentelemetry data ingestion support (#5893 ) Co-authored-by: Andrii Chubatiuk <wachy@Andriis-MBP-2.lan> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-02-29 14:03:24 +02:00
Aliaksandr Valialkin	6f203ebc9f	lib/streamaggr: make the BenchmarkAggregatorsPushByJobAvg closer to production case with long list of labels per sample	2024-02-29 02:39:16 +02:00
Hui Wang	8c33ba537a	chore: add actual request size in error message (#5889 )	2024-02-28 22:33:08 +08:00
Aliaksandr Valialkin	7e1dd8ab9d	lib: consistently use atomic.* types instead of atomic.* functions See `ea9e2b19a5`	2024-02-24 02:07:53 +02:00
Aliaksandr Valialkin	d5ca67e667	lib/backup/actions: expose vm_backups_downloaded_bytes_total metric in order to be consistent with vm_backups_uploaded_bytes_total metric	2024-02-24 01:14:50 +02:00
Aliaksandr Valialkin	906a35bdbb	lib/backup/actions: update vm_backups_uploaded_bytes_total metric along the file upload instead of after the file upload This solves two issues: 1. The vm_backups_uploaded_bytes_total metric will grow more smoothly 2. This prevents from int overflow at metrics.Counter.Add() when uploading files bigger than 2GiB	2024-02-24 01:07:20 +02:00
Aliaksandr Valialkin	ece86cd314	lib/backup/actions: consistently use atomic.* types instead of atomic.* functions See `ea9e2b19a5`	2024-02-24 01:02:21 +02:00
Aliaksandr Valialkin	55f1f24e62	lib/storage: replace the remaining atomic.* functions with atomic.* types for the sake of consistency See `ea9e2b19a5`	2024-02-24 00:53:30 +02:00
Aliaksandr Valialkin	b3d9d36fb3	lib/storage: consistently use atomic.* types instead of atomic.* function calls on ordinary types See `ea9e2b19a5`	2024-02-24 00:15:26 +02:00
Aliaksandr Valialkin	4617dc8bbe	lib/logstorage: consistently use atomic.* types instead of atomic.* functions on regular types See `ea9e2b19a5`	2024-02-23 23:46:13 +02:00
Aliaksandr Valialkin	f81b480905	lib/mergeset: consistently use atomic.* types instead of atomic.* function calls on ordinary types See `ea9e2b19a5`	2024-02-23 23:29:35 +02:00
Aliaksandr Valialkin	275335c181	lib/logstorage: consistently use atomic.* type for refCount and mustDrop fields in datadb and storage structs in the same way as it is used in lib/storage See `ea9e2b19a5` and `a204fd69f1`	2024-02-23 23:04:42 +02:00
Aliaksandr Valialkin	5c89150fc9	lib/mergeset: consistently use atomic.* type for refCount and mustDrop fields in table struct in the same way as it is used in lib/storage See `ea9e2b19a5` and `a204fd69f1`	2024-02-23 22:59:23 +02:00
Aliaksandr Valialkin	a204fd69f1	lib/storage: consistently use atomic.* type for refCount and mustDrop fields in indexDB, table and partition structs See `ea9e2b19a5`	2024-02-23 22:54:59 +02:00
Aliaksandr Valialkin	0f1ea36dc8	lib/storage: convert dedupsDuringMerge from uint64 to atomic.Uint64 This should simplify code maintenance by gradually converting to atomic.* types instead of calling atomic.* functions on int and bool types. See `ea9e2b19a5`	2024-02-23 22:52:00 +02:00
Aliaksandr Valialkin	ea9e2b19a5	lib/{storage,mergeset}: properly fix 'unaligned 64-bit atomic operation' panic on 32-bit architectures The issue has been introduced in `bace9a2501` The improper fix was in the `d4c0615dcd` , since it fixed the issue just by an accident, because Go comiler aligned the rawRowsShards field by 4-byte boundary inside partition struct. The proper fix is to use atomic.Int64 field - this guarantees that the access to this field won't result in unaligned 64-bit atomic operation. See https://github.com/golang/go/issues/50860 and https://github.com/golang/go/issues/19057	2024-02-23 22:27:06 +02:00
Aliaksandr Valialkin	cf94522389	lib/httpserver: return back the default value for -http.connTimeout to 2 minutes It has been appeared that there are VictoriaMetrics users, who rely on the fact that VictoriaMetrics components were closing incoming connections to -httpListenAddr every 2 minutes by default. So let's return back this value by default in order to fix the breaking change made at `d8c1db7953` . See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1304#issuecomment-1961891450 .	2024-02-23 22:03:37 +02:00
hagen1778	c8d1d2ab72	lib/storage: cleanup after `d4c0615dcd` Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-02-23 18:53:55 +01:00
Dmytro Kozlov	d4c0615dcd	lib/storage: fix aligning (#5860 )	2024-02-23 16:37:21 +01:00

1 2 3 4 5 ...

2362 commits