github-mirrors/VictoriaMetrics

mirror of https://github.com/VictoriaMetrics/VictoriaMetrics.git synced 2024-11-21 14:44:00 +00:00

Author	SHA1	Message	Date
Roman Khavronenko	909cd04c55	lib/storage: keep sample with the biggest value on timestamp conflict (#3421 ) The change leaves raw sample with the biggest value for identical timestamps per each `-dedup.minScrapeInterval` discrete interval when the deduplication is enabled. ``` benchstat old.txt new.txt name old time/op new time/op delta DeduplicateSamples/minScrapeInterval=1s-10 817ns ± 2% 832ns ± 3% ~ (p=0.052 n=10+10) DeduplicateSamples/minScrapeInterval=2s-10 1.56µs ± 1% 2.12µs ± 0% +35.19% (p=0.000 n=9+7) DeduplicateSamples/minScrapeInterval=5s-10 1.32µs ± 3% 1.65µs ± 2% +25.57% (p=0.000 n=10+10) DeduplicateSamples/minScrapeInterval=10s-10 1.13µs ± 2% 1.50µs ± 1% +32.85% (p=0.000 n=10+10) name old speed new speed delta DeduplicateSamples/minScrapeInterval=1s-10 10.0GB/s ± 2% 9.9GB/s ± 3% ~ (p=0.052 n=10+10) DeduplicateSamples/minScrapeInterval=2s-10 5.24GB/s ± 1% 3.87GB/s ± 0% -26.03% (p=0.000 n=9+7) DeduplicateSamples/minScrapeInterval=5s-10 6.22GB/s ± 3% 4.96GB/s ± 2% -20.37% (p=0.000 n=10+10) DeduplicateSamples/minScrapeInterval=10s-10 7.28GB/s ± 2% 5.48GB/s ± 1% -24.74% (p=0.000 n=10+10) ``` https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3333 Signed-off-by: hagen1778 <roman@victoriametrics.com> Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-12-08 18:18:36 -08:00
Aliaksandr Valialkin	3a25a4b1de	app/{vminsert,vmselect}: speed up TestInitStopNodes()	2022-12-03 23:53:14 -08:00
Zakhar Bessarab	e407e7243a	{app/vmstorage,app/vmselect}: add API to get list of existing tenants (#3348 ) * {app/vmstorage,app/vmselect}: add API to get list of existing tenants * {app/vmstorage,app/vmselect}: add API to get list of existing tenants * app/vmselect: fix error message * {app/vmstorage,app/vmselect}: fix error messages * app/vmselect: change log level for error handling * wip Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2022-11-25 10:32:45 -08:00
Aliaksandr Valialkin	353b137ff0	app/vmselect/netstorage: typo fix after `61736e4a1d`	2022-11-19 00:53:54 +02:00
Aliaksandr Valialkin	97eafbe4a7	app/vmselect: clarify that it isnt recommended setting -replicationFactor at vmselect nodes even if the replication is enabled at vminsert nodes	2022-11-18 14:04:12 +02:00
Aliaksandr Valialkin	61736e4a1d	app/vmselect/netstorage: remove superflouos map lookup at ProcessSearchQuery This should reduce CPU usage a bit during querying	2022-11-18 13:49:59 +02:00
Aliaksandr Valialkin	eb784ff399	app/vmselect/netstorage: emit more useful information in query traces when some of vmstorage nodes return errors or if there is no need to wait for their responses	2022-11-18 13:01:42 +02:00
Aliaksandr Valialkin	fe8d40f12c	app/{vminsert,vmselect}: test initialization with different number of storage nodes Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/3329	2022-11-09 11:48:39 +02:00
Aliaksandr Valialkin	976bbe3677	app/{vminsert,vmselect}: limit the access to storageNodes to getStorageNodesBucket and setStorageNodesBucket functions This makes the code more maintainable and earier to test.	2022-10-28 11:41:55 +03:00
Aliaksandr Valialkin	4f53147ed4	app/{vminsert,vmselect}/netstorage: allow calling Init()+MustStop() in a loop Previously netstorage.MustStop() call didn't free up all the resources, so the subsequent call to nestorage.Init() would panic. This allows writing tests, which call nestorage.Init() + nestorage.MustStop() in a loop.	2022-10-25 14:43:05 +03:00
Aliaksandr Valialkin	43bdd96a6e	app/vmselect: improve performance scalability on multi-CPU systems for `/api/v1/export/...` endpoints	2022-10-01 22:16:07 +03:00
Aliaksandr Valialkin	f0eea5b02d	app/vmselect/netstorage: fix a typo, which leads to incorrect query results in VictoriaMetrics cluster The typo has been introduced in the commit `1a254ea20c` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3067	2022-09-08 13:46:40 +03:00
Aliaksandr Valialkin	9cca3a0a1b	app/vmselect/netstorage: fix potential panic under high load The panic may trigger during data blocks' processing received from vmstorage nodes when some of vmstorage nodes return an error or when `-replicationFactor` is set to values higher than 2 at `vmselect`. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3058	2022-09-02 21:36:15 +03:00
Aliaksandr Valialkin	08b8467e97	app/vmselect/netstorage: make golangci-lint happy by naming the unused padding field as _	2022-08-22 00:32:37 +03:00
Aliaksandr Valialkin	9ddd2699fd	all: remove the remaining bits of io/ioutil The io/ioutil package is deprecated since Go1.16 - see https://tip.golang.org/doc/go1.16#ioutil VictoriaMetrics requires at least Go1.18, so it is time to remove the io/ioutil from source code This is a follow-up for `02ca2342ab`	2022-08-22 00:22:41 +03:00
Aliaksandr Valialkin	87e0d69bf4	app/vmselect/netstorage: fix a bug introduced in `1a254ea20c` The bug results in `duplicate output time series` error because the same time series is added two times into the orderedMetricNames list inside the tmpBlocksFileWrapper.Finalize(). While at it, properly release all the tmpBlocksFile structs on tbf.Finalize() error. Previously only the remaining tbf entries were released. This could result in resource leak.	2022-08-17 14:07:51 +03:00
Aliaksandr Valialkin	1a254ea20c	app/vmselect/netstorage: remove common contention points related to inter-CPU communcations This should improve vmselect performance scalability on systems with many CPU cores. The following tasks were done: - Use separate temporary files for storing the data read from each vmstorage node. This may result in the following potential issues: - Up to N times higher memory usage for performing each query where N is the number of vmstorage nodes known to vmselect. This issue shouldn't increase chances of out of memory errors in most cases, since per-query memory overhead is quite low comparing to the overall vmselect memory usage. - Up to N times higher number of open temporary files where N is the number of vmstorage nodes known to vmselect. This issue should be fixed by increasing the limit on the number of open files. - Use separate counters per each vmstorage node for various stats calculation when reading the data from vmstorage nodes.	2022-08-11 23:22:56 +03:00
Aliaksandr Valialkin	ec3df0b913	app/vmselect/netstorage: improve scalability of blocks processing on systems with multiple CPU cores Previously a single syncwg.WaitGroup was used for tracking the lifetime of processBlock callbacks across all the per-vmstorage goroutines. This could be slow on systems with many CPU cores because of inter-CPU synchronization overhead. Use a separate per-vmstorage sync.WaitGroup instead in order to reduce inter-CPU synchronization overhead. This should imrpove performance for heavy queries over big number of blocks on multi-CPU systems.	2022-08-11 21:37:24 +03:00
Aliaksandr Valialkin	1996e36cf0	app/vmselect/netstorage: prevent from calling processBlocks callback after the exit from ProcessBlocks function This should prevent from panic at multi-level vmselect when the top-level vmselect is configured with -replicationFactor > 1	2022-08-08 13:32:44 +03:00
Aliaksandr Valialkin	2635211bf4	app/vmselect/netstorage: properly detect and log timeout errors when querying vmstorage from vmselect This change is based on https://github.com/VictoriaMetrics/VictoriaMetrics/pull/2937 Thanks to @isodude for the initial pull request.	2022-08-08 00:21:05 +03:00
Aliaksandr Valialkin	43185353bc	app/vmselect/netstorage: cleanup after `92630c1ab4` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2896	2022-08-04 18:34:38 +03:00
Aliaksandr Valialkin	c81d2b4c18	app/vmselect/netstorage: initializes tsw.rowsProcessed before calling tsw.f, since tsw.f can modify r.Timestamps and r.Values lengths	2022-07-30 00:39:14 +03:00
Aliaksandr Valialkin	5ddae2e293	app/vmselect/netstorage: re-use random generator used for series shuffle in Result.RunParallel This should reduce CPU usage needed for rand.Rand initialization	2022-07-30 00:31:00 +03:00
Aliaksandr Valialkin	3d4c312ba2	app/vmselect/netstorage: improve the speed of queries over big number of time series on multi-CPU system Reduce inter-CPU communications when processing the query over big number of time series. This should improve performance for queries over big number of time series on systems with many CPU cores. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2896 Based on `b596ac3745` Thanks to @zqyzyq for the idea.	2022-07-25 09:22:28 +03:00
Aliaksandr Valialkin	fbb403b5c0	app/vmselect/netstorage: optimize mergeSortBlocks() for the worst case when blocks contain interleaved samples	2022-07-12 12:30:24 +03:00
Aliaksandr Valialkin	aee08117e9	app/vmselect/netstorage: add mergeSortBlocks benchmark for the worstcase	2022-07-12 12:26:27 +03:00
Aliaksandr Valialkin	c0af52228a	app/vmselect/netstorage: add benchmarks for mergeSortBlocks This is a follow-up for `743ff84863`	2022-07-11 12:53:46 +03:00
Aliaksandr Valialkin	d442ee4610	app/vmselect/netstorage: optimize mergeSortBlocks function - Use binary search instead of linear scan when locating the run of smallest timestamps in blocks with intersected time ranges. This should improve performance when merging blocks with big number of samples - Skip samples with duplicate timestamps. This should increase query performance in cluster version of VictoriaMetrics with the enabled replication.	2022-07-09 00:35:38 +03:00
Aliaksandr Valialkin	195dccf678	app/vmselect: add ability to query `vmselect` from another `vmselect`	2022-07-06 13:19:45 +03:00
Aliaksandr Valialkin	cdd89d9cc2	app/vmselect: properly generate response for /api/v1/series The response has been broken in `7d5d33fd71`	2022-07-06 12:46:23 +03:00
Aliaksandr Valialkin	270e555f47	lib/vmselectapi: pass maxSuffixes arg to tagValueSuffixes RPC call	2022-07-06 12:46:22 +03:00
Aliaksandr Valialkin	f4df43f7cc	app/vmselect/netstorage: remove unused auth.Token arg	2022-07-06 12:46:21 +03:00
Aliaksandr Valialkin	78eeca6f0d	lib/vmselectapi: rename deleteMetrics to more correct deleteSeries	2022-07-06 12:46:21 +03:00
Aliaksandr Valialkin	daefb64f38	app/vmselect: expose additional histograms at `/metrics` page, which may help get more insights for the query workload This commit is based on https://github.com/VictoriaMetrics/VictoriaMetrics/pull/2792	2022-06-28 20:18:31 +03:00
Aliaksandr Valialkin	7d5d33fd71	lib/storage: return marshaled metric names from SearchMetricNames Previously SearchMetricNames was returning unmarshaled metric names. This wasn't great for vmstorage, which should spend additional CPU time for marshaling the metric names before sending them to vmselect. While at it, remove possible duplicate metric names, which could occur when multiple samples for new time series are ingested via concurrent requests. Also sort the metric names before returning them to the client. This simplifies debugging of the returned metric names across repeated requests to /api/v1/series	2022-06-28 18:16:32 +03:00
Aliaksandr Valialkin	399d4c36ae	app/vmselect: optimize /api/v1/series a bit for time ranges smaller than one day	2022-06-28 12:55:20 +03:00
Aliaksandr Valialkin	a667d339be	app/vmselect/netstorage/netstorage.go: group metrics in order to improve readability a bit	2022-06-27 14:00:24 +03:00
Aliaksandr Valialkin	08de733924	app/vmselect/netstorage: assume the response is full if up to -replicationFactor-1 vmstorage nodes are unavailable This is a follow-up for `ee5c502446` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1767	2022-06-27 12:21:26 +03:00
Aliaksandr Valialkin	bc9d704ef4	app/vmselect/netstorage: remove Get prefix from netstorage functions This makes these function names more consistent with the server side	2022-06-27 00:37:49 +03:00
hagen1778	e40d015e9a	vmselect: make `vm_partial_results_total` consistent Metrics `vm_partial_results_total` and `vm_requests_total` serving the similar purpose, but contain inconsistent set of labels. This change updates `vm_partial_results_total` labels to be consistent with `vm_requests_total`. The change breaks backward compatibility with assumption that `vm_partial_results_total` wasn't widely used, since it is not documented and absent in the alerts and dashboards. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-06-24 13:50:26 +02:00
Nikolay	ee5c502446	app/vmselect: fixes partial response with replicationFactor (#2777 ) * app/vmselect: fixes partial response with replicationFactor Allow partial response if it meets replicationFactor configured at vmselect https://t.me/VictoriaMetrics_ru1/38490 * docs/CHANGELOG.md: document this change Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2022-06-23 20:17:24 +03:00
Aliaksandr Valialkin	dceca7e864	all: remove explicit "xxhash" name when importing github.com/cespare/xxhash/v2 package This is a follow-up for `fe2269b999`	2022-06-21 20:27:30 +03:00
Aliaksandr Valialkin	b28c6febf9	app/{vminsert,vmselect}: add `-vmstorageDialTimeout` command-line flag for tuning the maximum time needed for establishing connections to vmstorage Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/711	2022-06-20 15:17:34 +03:00
Aliaksandr Valialkin	da1d1e83df	app/{vmselect,vmstorage}: properly pass seriesCountByLabelName and seriesCountByFocusLabelValue entries from vmstorage to vmselect	2022-06-16 10:44:29 +03:00
Aliaksandr Valialkin	ee9954082f	app/vmselect/netstorage: properly aggregate seriesCountByLabelName and seriesCountByFocusLabelValue obtained from multiple vmselect nodes at /api/v1/status/tsdb	2022-06-15 16:48:40 +03:00
Aliaksandr Valialkin	45fa9d798d	app/vmselect: accept `focusLabel` query arg at /api/v1/status/tsdb	2022-06-14 18:39:00 +03:00
Aliaksandr Valialkin	61e03f172b	app/vmselect: optimize `/api/v1/labels` and `/api/v1/label/.../values` handlers when `match[]` query arg is passed to them	2022-06-12 14:06:24 +03:00
Aliaksandr Valialkin	4a94cd81ce	app/vmselect: add optional `limit` query arg to `/api/v1/labels` and `/api/v1/label_values` endpoints This arg allows limiting the number of sample values returned from these APIs	2022-06-10 10:24:07 +03:00
Aliaksandr Valialkin	a9ea3fee38	lib/querytracer: make it easier to use by passing trace context message to New and NewChild The context message can be extended by calling Donef. If there is no need to extend the message, then just call Done.	2022-06-08 21:16:12 +03:00
Aliaksandr Valialkin	2b343d8bd0	app: properly collect and merge /api/v1/status/tsdb info from vmstorage nodes The collection has been broken in `f2754c3e90` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2233	2022-06-08 19:26:09 +03:00

1 2 3 4

159 commits