github-mirrors/VictoriaMetrics

mirror of https://github.com/VictoriaMetrics/VictoriaMetrics.git synced 2024-11-21 14:44:00 +00:00

Author	SHA1	Message	Date
Aliaksandr Valialkin	ac3cf3f357	lib/streamaggr: enable time alignment for aggregate flushed to multiples of interval For example, if `interval: 1m`, then data flush occurs at the end of every minute, while `interval: 1h` leads to data flush at the end of every hour. Add `no_align_flush_to_interval` option, which can be used for disabling the alignment.	2024-03-04 05:42:58 +02:00
Aliaksandr Valialkin	138a4d1c2b	lib/streamaggr: ignore the first sample in new time series during staleness_interval seconds after the stream aggregation start for total and increase outputs	2024-03-04 01:49:26 +02:00
Aliaksandr Valialkin	0422ae01ba	lib/streamaggr: flush dedup state and aggregation state in parallel on all the available CPU cores This should reduce the time needed for aggregation state flush on systems with many CPU cores	2024-03-04 01:21:50 +02:00
Aliaksandr Valialkin	3c06b3af92	lib/streamaggr: add a benchmark for flushing dedup state	2024-03-04 01:16:30 +02:00
Aliaksandr Valialkin	9648c88b71	lib/streamaggr: add a benchmark for measuring the performance of aggregator.flush	2024-03-04 00:45:48 +02:00
Aliaksandr Valialkin	54a1c506e3	lib/streamaggr: add a benchmark for de-duplicating of 1M samples	2024-03-04 00:26:59 +02:00
Aliaksandr Valialkin	614d34e539	lib/prompbmarshal: use clear() instead of a loop for clearing tss inside ResetTimeSeries()	2024-03-03 23:40:34 +02:00
Aliaksandr Valialkin	4e65636b44	lib/promutils: optimize LabelsCompressor.Decompress by using a specialized labelsMap struct instead of sync.Map The labelsMap struct employs the fact that label indexes are condensed around 0, so it stores the referred labels in a slice instead of map and uses slice index as label key. This allows increasing the LabelsCompressor.Decompress performance by up to 3x. This also reduces the latency of data flush in stream aggregation.	2024-03-03 23:21:25 +02:00
Aliaksandr Valialkin	28a9e92b5e	lib/streamaggr: huge pile of changes - Reduce memory usage by up to 5x when de-duplicating samples across big number of time series. - Reduce memory usage by up to 5x when aggregating across big number of output time series. - Add lib/promutils.LabelsCompressor, which is going to be used by other VictoriaMetrics components for reducing memory usage for marshaled []prompbmarshal.Label. - Add `dedup_interval` option at aggregation config, which allows setting individual deduplication intervals per each aggregation. - Add `keep_metric_names` option at aggregation config, which allows keeping the original metric names in the output samples. - Add `unique_samples` output, which counts the number of unique sample values. - Add `increase_prometheus` and `total_prometheus` outputs, which ignore the first sample per each newly encountered time series. - Use 64-bit hashes instead of marshaled labels as map keys when calculating `count_series` output. This makes obsolete https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5579 - Expose various metrics, which may help debugging stream aggregation: - vm_streamaggr_dedup_state_size_bytes - the size of data structures responsible for deduplication - vm_streamaggr_dedup_state_items_count - the number of items in the deduplication data structures - vm_streamaggr_labels_compressor_size_bytes - the size of labels compressor data structures - vm_streamaggr_labels_compressor_items_count - the number of entries in the labels compressor - vm_streamaggr_flush_duration_seconds - a histogram, which shows the duration of stream aggregation flushes - vm_streamaggr_dedup_flush_duration_seconds - a histogram, which shows the duration of deduplication flushes - vm_streamaggr_flush_timeouts_total - counter for timed out stream aggregation flushes, which took longer than the configured interval - vm_streamaggr_dedup_flush_timeouts_total - counter for timed out deduplication flushes, which took longer than the configured dedup_interval - Actualize docs/stream-aggregation.md The memory usage reduction increases CPU usage during stream aggregation by up to 30%. This commit is based on https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5850 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5898	2024-03-02 02:42:50 +02:00
Aliaksandr Valialkin	eb8e95516f	lib/streamaggr: allow one second aggregation interval	2024-03-01 21:33:16 +02:00
Aliaksandr Valialkin	cf2e80a869	lib/promrelabel: use clear() function inside CleanLabels()	2024-03-01 21:33:15 +02:00
Aliaksandr Valialkin	c8c2c5f8e5	lib/fs: fix GOOS=windows build after `f8baf29b6e`	2024-03-01 01:46:29 +02:00
Aliaksandr Valialkin	5aa3dfbd20	lib/protoparser/opentelemetry/firehose: verify that the full response is parsed properly in ProcessRequestBody This is a follow-up for `bf9cb84575` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5899	2024-03-01 00:39:10 +02:00
Andrii Chubatiuk	bf9cb84575	opentelemetry: fix firehose message parsing (#5899 ) Co-authored-by: Andrii Chubatiuk <wachy@Andriis-MBP-2.lan>	2024-03-01 00:23:54 +02:00
Aliaksandr Valialkin	6a8dc74ee7	lib/mergeset: use unsafe.Slice and unsafe.String instead of deprecated reflect.SliceHeader with unsafe conversion from slice header to string header	2024-02-29 17:29:33 +02:00
Aliaksandr Valialkin	38e0397ebd	lib/bytesutil: use unsafe.String instead of unsafe conversion of slice header to string header	2024-02-29 17:27:51 +02:00
Aliaksandr Valialkin	e959f54351	lib/fs: properly handle the case when data=nil is passed to mUnmap	2024-02-29 17:26:07 +02:00
Aliaksandr Valialkin	c75bfd5b07	lib/storage: use unsafe.Slice instead of deprecated reflect.SliceHeader	2024-02-29 17:24:34 +02:00
Aliaksandr Valialkin	bb48d416fc	lib/protoparser/csvimport: unse unsafe.Slice instead of deprecated reflect.SliceHeader	2024-02-29 17:19:57 +02:00
Aliaksandr Valialkin	f8baf29b6e	lib/fs: use unsafe.Slice instead of deprecated reflect.SliceHeader	2024-02-29 17:18:33 +02:00
Aliaksandr Valialkin	7a04f99c72	lib/fastnum: use unsafe.Slice() instead of deprecated reflect.SliceHeader	2024-02-29 17:17:13 +02:00
Aliaksandr Valialkin	a3cf3d7de1	lib/bytesutil: make BenchmarkToUnsafeString and BenchmarkToUnsafeBytes more reliable This is needed for https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5880	2024-02-29 17:11:03 +02:00
helen	8266b77d0e	Optimize TouUnsafeBytes to make it leaner, more standards-compliant and (#5880 ) slightly faster.	2024-02-29 17:10:10 +02:00
XLONG96	a5795f533d	lib/logstorage: avoid panic when parsing regex with stream filter (#5897 )	2024-02-29 15:31:54 +02:00
Aliaksandr Valialkin	04d13f6149	app/{vminsert,vmagent}: follow-up after `67a55b89a4` - Document the ability to read OpenTelemetry data from Amazon Firehose at docs/CHANGELOG.md - Simplify parsing Firehose data. There is no need in trying to optimize the parsing with fastjson and byte slice tricks, since OpenTelemetry protocol is really slooow because of over-engineering. It is better to write clear code for better maintanability in the future. - Move Firehose parser from /lib/protoparser/firehose to lib/protoparser/opentelemetry/firehose, since it is used only by opentelemetry parser. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5893	2024-02-29 14:38:23 +02:00
Andrii Chubatiuk	67a55b89a4	{vmagent,vminsert}: added firehose http destination opentelemetry data ingestion support (#5893 ) Co-authored-by: Andrii Chubatiuk <wachy@Andriis-MBP-2.lan> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-02-29 14:03:24 +02:00
Aliaksandr Valialkin	6f203ebc9f	lib/streamaggr: make the BenchmarkAggregatorsPushByJobAvg closer to production case with long list of labels per sample	2024-02-29 02:39:16 +02:00
Hui Wang	8c33ba537a	chore: add actual request size in error message (#5889 )	2024-02-28 22:33:08 +08:00
Aliaksandr Valialkin	7e1dd8ab9d	lib: consistently use atomic.* types instead of atomic.* functions See `ea9e2b19a5`	2024-02-24 02:07:53 +02:00
Aliaksandr Valialkin	d5ca67e667	lib/backup/actions: expose vm_backups_downloaded_bytes_total metric in order to be consistent with vm_backups_uploaded_bytes_total metric	2024-02-24 01:14:50 +02:00
Aliaksandr Valialkin	906a35bdbb	lib/backup/actions: update vm_backups_uploaded_bytes_total metric along the file upload instead of after the file upload This solves two issues: 1. The vm_backups_uploaded_bytes_total metric will grow more smoothly 2. This prevents from int overflow at metrics.Counter.Add() when uploading files bigger than 2GiB	2024-02-24 01:07:20 +02:00
Aliaksandr Valialkin	ece86cd314	lib/backup/actions: consistently use atomic.* types instead of atomic.* functions See `ea9e2b19a5`	2024-02-24 01:02:21 +02:00
Aliaksandr Valialkin	55f1f24e62	lib/storage: replace the remaining atomic.* functions with atomic.* types for the sake of consistency See `ea9e2b19a5`	2024-02-24 00:53:30 +02:00
Aliaksandr Valialkin	b3d9d36fb3	lib/storage: consistently use atomic.* types instead of atomic.* function calls on ordinary types See `ea9e2b19a5`	2024-02-24 00:15:26 +02:00
Aliaksandr Valialkin	4617dc8bbe	lib/logstorage: consistently use atomic.* types instead of atomic.* functions on regular types See `ea9e2b19a5`	2024-02-23 23:46:13 +02:00
Aliaksandr Valialkin	f81b480905	lib/mergeset: consistently use atomic.* types instead of atomic.* function calls on ordinary types See `ea9e2b19a5`	2024-02-23 23:29:35 +02:00
Aliaksandr Valialkin	275335c181	lib/logstorage: consistently use atomic.* type for refCount and mustDrop fields in datadb and storage structs in the same way as it is used in lib/storage See `ea9e2b19a5` and `a204fd69f1`	2024-02-23 23:04:42 +02:00
Aliaksandr Valialkin	5c89150fc9	lib/mergeset: consistently use atomic.* type for refCount and mustDrop fields in table struct in the same way as it is used in lib/storage See `ea9e2b19a5` and `a204fd69f1`	2024-02-23 22:59:23 +02:00
Aliaksandr Valialkin	a204fd69f1	lib/storage: consistently use atomic.* type for refCount and mustDrop fields in indexDB, table and partition structs See `ea9e2b19a5`	2024-02-23 22:54:59 +02:00
Aliaksandr Valialkin	0f1ea36dc8	lib/storage: convert dedupsDuringMerge from uint64 to atomic.Uint64 This should simplify code maintenance by gradually converting to atomic.* types instead of calling atomic.* functions on int and bool types. See `ea9e2b19a5`	2024-02-23 22:52:00 +02:00
Aliaksandr Valialkin	ea9e2b19a5	lib/{storage,mergeset}: properly fix 'unaligned 64-bit atomic operation' panic on 32-bit architectures The issue has been introduced in `bace9a2501` The improper fix was in the `d4c0615dcd` , since it fixed the issue just by an accident, because Go comiler aligned the rawRowsShards field by 4-byte boundary inside partition struct. The proper fix is to use atomic.Int64 field - this guarantees that the access to this field won't result in unaligned 64-bit atomic operation. See https://github.com/golang/go/issues/50860 and https://github.com/golang/go/issues/19057	2024-02-23 22:27:06 +02:00
Aliaksandr Valialkin	cf94522389	lib/httpserver: return back the default value for -http.connTimeout to 2 minutes It has been appeared that there are VictoriaMetrics users, who rely on the fact that VictoriaMetrics components were closing incoming connections to -httpListenAddr every 2 minutes by default. So let's return back this value by default in order to fix the breaking change made at `d8c1db7953` . See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1304#issuecomment-1961891450 .	2024-02-23 22:03:37 +02:00
hagen1778	c8d1d2ab72	lib/storage: cleanup after `d4c0615dcd` Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-02-23 18:53:55 +01:00
Dmytro Kozlov	d4c0615dcd	lib/storage: fix aligning (#5860 )	2024-02-23 16:37:21 +01:00
Aliaksandr Valialkin	9bad52b687	app/vmstorage: deprecate -snapshotCreateTimeout command-line flag Creating snapshot shouldn't time out under normal conditions. The timeout was related to the bug, which has been fixed in `6460475e3b` . Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3551	2024-02-23 04:49:23 +02:00
Aliaksandr Valialkin	f79944532b	lib/storage: do not drop (date, metricID) entries for the date older than 2 days if samples are ingested at this date Previously the (date, metricID) entries for dates older than the last 2 days were removed. This could lead to slow check for the (date, metricID) entry in the indexdb during ingesting historical data (aka backfilling). The issue has been introduced in `431aa16c8d`	2024-02-23 04:06:19 +02:00
Aliaksandr Valialkin	f46eaf92eb	app/vmselect: add -search.maxLabelsAPIDuration and -search.maxLabelsAPISeries options for fine-tuning CPU and RAM usage for /api/v1/series , /api/v1/labels and /api/v1/label/.../values This commit returns back limits for these endpoints, which have been removed at `5d66ee88bd` , since it has been appeared that missing limits result in high CPU usage, while the introduced concurrency limiter results in failed lightweight requests to these endpoints because of timeout when heavyweight requests are executed. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5055	2024-02-23 02:57:16 +02:00
Aliaksandr Valialkin	df7d3c55ed	lib/promutils: hide the math.Round() logic inside ParseTimeMsec() function This should prevent from bugs similar to https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5801 in the future This is a follow-up for `ce3ec3ff2e`	2024-02-23 00:55:32 +02:00
Aliaksandr Valialkin	5934002b57	lib/mergeset: run `go fmt` after `bace9a2501`	2024-02-23 00:53:28 +02:00
Aliaksandr Valialkin	bace9a2501	lib/{mergeset,storage}: convert bufferred items to searchable parts more optimally Do not convert shard items to part when a shard becomes full. Instead, collect multiple full shards and then convert them to a searchable part at once. This reduces the number of searchable parts, which, in turn, should increase query performance, since queries need to scan smaller number of parts.	2024-02-23 00:16:34 +02:00

1 2 3 4 5 ...

2356 commits