VictoriaMetrics

mirror of https://github.com/VictoriaMetrics/VictoriaMetrics.git synced 2024-11-21 14:44:00 +00:00

History

Aliaksandr Valialkin 7a8b92b590 lib/{mergeset,storage}: make background merge more responsive and scalable - Maintain a separate worker pool per each part type (in-memory, file, big and small). Previously a shared pool was used for merging all the part types. A single merge worker could merge parts with mixed types at once. For example, it could merge simultaneously an in-memory part plus a big file part. Such a merge could take hours for big file part. During the duration of this merge the in-memory part was pinned in memory and couldn't be persisted to disk under the configured -inmemoryDataFlushInterval . Another common issue, which could happen when parts with mixed types are merged, is uncontrolled growth of in-memory parts or small parts when all the merge workers were busy with merging big files. Such growth could lead to significant performance degradataion for queries, since every query needs to check ever growing list of parts. This could also slow down the registration of new time series, since VictoriaMetrics searches for the internal series_id in the indexdb for every new time series. The third issue is graceful shutdown duration, which could be very long when a background merge is running on in-memory parts plus big file parts. This merge couldn't be interrupted, since it merges in-memory parts. A separate pool of merge workers per every part type elegantly resolves both issues: - In-memory parts are merged to file-based parts in a timely manner, since the maximum size of in-memory parts is limited. - Long-running merges for big parts do not block merges for in-memory parts and small parts. - Graceful shutdown duration is now limited by the time needed for flushing in-memory parts to files. Merging for file parts is instantly canceled on graceful shutdown now. - Deprecate -smallMergeConcurrency command-line flag, since the new background merge algorithm should automatically self-tune according to the number of available CPU cores. - Deprecate -finalMergeDelay command-line flag, since it wasn't working correctly. It is better to run forced merge when needed - https://docs.victoriametrics.com/#forced-merge - Tune the number of shards for pending rows and items before the data goes to in-memory parts and becomes visible for search. This improves the maximum data ingestion rate and the maximum rate for registration of new time series. This should reduce the duration of data ingestion slowdown in VictoriaMetrics cluster on e.g. re-routing events, when some of vmstorage nodes become temporarily unavailable. - Prevent from possible "sync: WaitGroup misuse" panic on graceful shutdown. This is a follow-up for `fa566c68a6` . Thanks @misutoth to for the inspiration at https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5212 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5190 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3790 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3551 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3337 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3425 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3647 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3641 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/648 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/291		2024-01-26 22:19:52 +01:00
..
block_header.go	lib/mergeset: properly reset bsr.bhIdx after the call to blockStreamReader.readNextBHS()	2022-11-16 21:22:51 +02:00
block_stream_reader.go	lib/{storage,mergeset}: convert InitFromFilePart to MustInitFromFilePart	2023-04-14 15:47:20 -07:00
block_stream_reader_test.go	lib/mergeset: use deterministic random generator in tests	2023-01-23 19:44:10 -08:00
block_stream_writer.go	lib/{storage,mergeset}: convert InitFromFilePart to MustInitFromFilePart	2023-04-14 15:47:20 -07:00
encoding.go	lib/mergeset: remove inmemoryBlock pooling, since it wasn't effecitve	2024-01-26 21:34:22 +01:00
encoding_test.go	lib/mergeset: use deterministic random generator in tests	2023-01-23 19:44:10 -08:00
encoding_timing_test.go	lib/mergeset: fix data race in BenchmarkInmemoryBlockMarshal	2023-01-23 19:44:07 -08:00
filenames.go	lib/mergeset: consistently use OS-independent separator in file paths	2023-03-25 14:34:33 -07:00
inmemory_part.go	lib/{storage,mergeset}: convert InitFromFilePart to MustInitFromFilePart	2023-04-14 15:47:20 -07:00
merge.go	lib/mergeset: make sure that the first and the last items are in the original range after prepareBlock()	2024-01-23 12:59:04 +02:00
merge_test.go	lib/{storage,mergeset}: convert InitFromFilePart to MustInitFromFilePart	2023-04-14 15:47:20 -07:00
metaindex_row.go	all: subsitute ioutil.ReadAll with io.ReadAll	2022-08-22 00:16:04 +03:00
part.go	lib/{storage,mergeset}: convert InitFromFilePart to MustInitFromFilePart	2023-04-14 15:47:20 -07:00
part_header.go	lib/{storage,mergeset}: convert InitFromFilePart to MustInitFromFilePart	2023-04-14 15:47:20 -07:00
part_search.go	lib/mergeset: remove inmemoryBlock pooling, since it wasn't effecitve	2024-01-26 21:34:22 +01:00
part_search_test.go	lib/{storage,mergeset}: convert InitFromFilePart to MustInitFromFilePart	2023-04-14 15:47:20 -07:00
table.go	lib/{mergeset,storage}: make background merge more responsive and scalable	2024-01-26 22:19:52 +01:00
table_search.go	optimized code (#2103 )	2022-01-28 12:10:47 +02:00
table_search_test.go	lib/fs: add MustReadDir() function	2023-04-14 22:11:40 -07:00
table_search_timing_test.go	lib/fs: add MustReadDir() function	2023-04-14 22:11:40 -07:00
table_test.go	lib/mergeset: close and open the table before making snapshots at TestTableCreateSnapshotAt()	2023-05-16 15:32:34 -07:00