It should be faster to query all the labels and/or all the values instead of querying per-day labels/values on time ranges exceeding maxDaysForPerDaySearch
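Below is a minimal sketch of this dispatch, assuming hypothetical function names (searchLabelNames, getGlobalLabelNames, getPerDayLabelNames) and an illustrative threshold value; it only shows the branching described above, not the real indexdb code.

```go
package main

import "time"

const maxDaysForPerDaySearch = 40 // illustrative threshold; the real value may differ

const msecsPerDay = int64(24 * time.Hour / time.Millisecond)

// TimeRange is a closed [MinTimestamp..MaxTimestamp] range in milliseconds.
type TimeRange struct {
	MinTimestamp int64
	MaxTimestamp int64
}

func searchLabelNames(tr TimeRange) []string {
	days := (tr.MaxTimestamp-tr.MinTimestamp)/msecsPerDay + 1
	if days > maxDaysForPerDaySearch {
		// One scan over the global index beats scanning the per-day
		// index once per day on long time ranges.
		return getGlobalLabelNames()
	}
	return getPerDayLabelNames(tr)
}

func getGlobalLabelNames() []string             { return nil }
func getPerDayLabelNames(tr TimeRange) []string { return nil }

func main() {}
```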
Do not pass filter metricIDs to getMetricIDsForTagFilter, since it turned out that this slows down
the function several-fold when it finds a big number of metricIDs (tens of millions).
The `filter` arg was removed in commit c7ee2fabb8
because it prevented caching the number of matching time series per tag filter (tf).
Now the cache contains the execution duration for each tf, so the `filter` arg shouldn't break such caching.
The `filter` arg breaks the logic for sorting tag filters by the number of matching metrics,
which may result in suboptimal performance during time series search.
Production workloads show that indexdb blocks must be cached unconditionally in order to reduce CPU usage.
This shouldn't increase memory usage too much, since unused blocks are removed from the cache every two minutes.
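A minimal sketch of this scheme, assuming a simplified in-memory cache (all names here are illustrative, not the real indexdb types): blocks are cached unconditionally on access, and entries idle for longer than two minutes are evicted by a background loop.

```go
package main

import (
	"sync"
	"time"
)

type cachedBlock struct {
	data     []byte
	lastUsed time.Time
}

type blockCache struct {
	mu sync.Mutex
	m  map[uint64]*cachedBlock
}

func newBlockCache() *blockCache {
	c := &blockCache{m: make(map[uint64]*cachedBlock)}
	go func() {
		for range time.Tick(2 * time.Minute) {
			c.dropUnused(2 * time.Minute)
		}
	}()
	return c
}

func (c *blockCache) Put(key uint64, data []byte) {
	c.mu.Lock()
	c.m[key] = &cachedBlock{data: data, lastUsed: time.Now()}
	c.mu.Unlock()
}

func (c *blockCache) Get(key uint64) ([]byte, bool) {
	c.mu.Lock()
	defer c.mu.Unlock()
	b, ok := c.m[key]
	if !ok {
		return nil, false
	}
	b.lastUsed = time.Now()
	return b.data, true
}

// dropUnused removes blocks that weren't accessed during maxIdle, which keeps
// memory usage bounded without any conditional caching logic on the hot path.
func (c *blockCache) dropUnused(maxIdle time.Duration) {
	deadline := time.Now().Add(-maxIdle)
	c.mu.Lock()
	for k, b := range c.m {
		if b.lastUsed.Before(deadline) {
			delete(c.m, k)
		}
	}
	c.mu.Unlock()
}

func main() { _ = newBlockCache() }
```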
Remove the code that uses metricID caches for the current and the previous hour during metricID search,
since this code became unused after the per-day inverted index was implemented almost a year ago.
While at it, fix a bug that could prevent finding time series with names containing dots (aka Graphite-like names
such as `foo.bar.baz`).
Previously the maximum cache lifetime was limited to 10 seconds. Now it is extended up to a day.
This should reduce CPU usage in the following cases:
* when querying recently added data with a low churn rate of time series
* when querying historical data
Previously the time spent on inverted index search could exceed the configured `-search.maxQueryDuration`.
This commit stops the inverted index search once the query times out.
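A minimal sketch of bailing out of an index scan once the query deadline passes; the row-processing callback, batch size and error message are illustrative, not the actual code.

```go
package main

import (
	"errors"
	"time"
)

var errDeadlineExceeded = errors.New("query deadline exceeded")

// scanIndexRows walks index rows and aborts once the deadline passes, so the
// time spent in the inverted index cannot exceed -search.maxQueryDuration.
func scanIndexRows(rows [][]byte, deadline time.Time, process func([]byte)) error {
	for i, row := range rows {
		// time.Now() is comparatively expensive, so check the deadline
		// once per batch of rows instead of on every iteration.
		if i%1024 == 0 && time.Now().After(deadline) {
			return errDeadlineExceeded
		}
		process(row)
	}
	return nil
}

func main() {}
```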
This is a follow-up commit after 12b16077c4,
which didn't reset the `tsidCache` in all the required places.
This could result in persistent errors like:
missing metricName by metricID ...; this could be the case after unclean shutdown; deleting the metricID, so it could be re-created next time
Fix this by resetting the cache inside the deleteMetricIDs function.
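A minimal sketch of the fix, with heavily simplified stand-in types (the real tsidCache maps MetricName to TSID inside the storage):

```go
package main

type tsidCache struct{ m map[string]uint64 }

func (c *tsidCache) Reset() { c.m = make(map[string]uint64) }

// Storage is a toy stand-in for the real storage struct.
type Storage struct {
	tsidCache *tsidCache
	deleted   map[uint64]bool
}

func (s *Storage) deleteMetricIDs(metricIDs []uint64) {
	for _, metricID := range metricIDs {
		s.deleted[metricID] = true
	}
	// Reset the cache unconditionally: it may still hold entries pointing
	// at the just-deleted metricIDs, and stale entries would keep producing
	// "missing metricName by metricID" errors on every lookup.
	s.tsidCache.Reset()
}

func main() {}
```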
Add an index for reversed Graphite-like metric names with dots. Use this index during search for filters
like `__name__=~"foo\\.[^.]*\\.bar\\.baz"` that end with a non-empty suffix containing dots, i.e. `.bar.baz` in this case.
This change may "hide" historical time series during queries. The workaround is to append `[.]*` to the regexp label filter,
i.e. `foo\\.[^.]*\\.bar\\.baz` should be substituted with `foo\\.[^.]*\\.bar\\.baz[.]*`.
Newly added index entries can be missing after unclean shutdown, since they may not have been flushed to persistent storage yet.
Log this and delete the corresponding metricID, so it can be re-created next time.
This should reduce the frequency of the following errors:
cannot find tag filter matching less than N time series; either increase -search.maxUniqueTimeseries or use more specific tag filters
more than N time series found on the time range [...]; either increase -search.maxUniqueTimeseries or shrink the time range
This case is possible when the corresponding metricID->metricName entry hasn't propagated to the inverted index yet.
This should fix the following error:
error when searching tsids for tfss [...]: cannot find metricName by metricID 1582417212213420669: EOF
- Sort tag filters in ascending order of the number of matching time series
  in order to apply the most specific filters first (see the sketch below).
- Fall back to metricName search for filters matching a big number of time series
  (usually these are negative filters or regexp filters).
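A minimal sketch of the ordering step; matchCount is a hypothetical per-filter estimate (e.g. the cached per-tf value mentioned in earlier commits), and the filter expressions are illustrative.

```go
package main

import (
	"fmt"
	"sort"
)

type tagFilter struct {
	expr       string
	matchCount int // estimated number of matching time series
}

func main() {
	tfs := []tagFilter{
		{`env!=""`, 50_000_000}, // negative filter: matches almost everything
		{`instance=~"web-.*"`, 120_000},
		{`job="api"`, 4_000},
	}
	// Apply the most specific (cheapest) filters first; filters with huge
	// match counts are candidates for the metricName-search fallback instead.
	sort.Slice(tfs, func(i, j int) bool { return tfs[i].matchCount < tfs[j].matchCount })
	for _, tf := range tfs {
		fmt.Println(tf.expr, tf.matchCount)
	}
}
```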
The metricID->metricName entry can be missing in the indexdb after unclean shutdown
when only a part of the entries for new time series has been written into the indexdb.
Recover from such a situation by removing the broken metricID. A new metricID
will be automatically created for the time series with the given metricName
when a new data point arrives for it.
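A minimal sketch of this recovery path with toy types; the real code sits in the indexdb search path, and the maps here merely stand in for the on-disk index.

```go
package main

import (
	"errors"
	"fmt"
)

var errMissingMetricName = errors.New("missing metricName by metricID")

// Index is a simplified stand-in for indexdb.
type Index struct {
	metricNames map[uint64]string // metricID -> metricName
	deleted     map[uint64]bool
}

func (idx *Index) getMetricName(metricID uint64) (string, error) {
	if name, ok := idx.metricNames[metricID]; ok {
		return name, nil
	}
	// The metricID->metricName entry was lost during unclean shutdown.
	// Drop the broken metricID; a fresh metricID is created automatically
	// when the next data point for the same metricName arrives.
	idx.deleted[metricID] = true
	return "", fmt.Errorf("metricID %d: %w; deleting it, so it can be re-created", metricID, errMissingMetricName)
}

func main() {}
```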
The index for the current day could miss entries for time series that had already stopped receiving
new samples before the per-day index was enabled.
This fixes the issue where queries return empty results during the first hour after
upgrading to v1.29.*
Production workloads show that the index requires ~4KB of RAM per active time series.
This is too much for a high number of active time series, so let's delete this index.
Now queries should fall back to the index for the current day instead of the index
for the recent hour. The query performance for the current day index should be good enough
given the scan speed of 100M rows/sec per CPU core.
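As a rough back-of-the-envelope check (the row count here is an assumption for illustration, not a measured figure): a current-day index with, say, 50 million rows scans in about 50M / 100M = 0.5 seconds on a single CPU core, and proportionally faster when the scan is spread across multiple cores.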
Issues fixed:
- Slow startup times. Now the index is loaded from a cache during startup.
- High memory usage related to superfluous index copies every 10 seconds.
Production load with >10M active time series showed that it could
slow down VictoriaMetrics startup times and could eat
all the memory, leading to OOM.
Remove the in-memory inverted index for recent hours until thorough
testing on production data shows it works OK.
The origin of the error has been detected and documented in the code,
so it is enough to export a counter for such errors as `vm_index_blocks_with_metric_ids_incorrect_order_total`,
so it can be monitored and alerted on at high error rates.
Also export a counter for processed index blocks with metricIDs - `vm_index_blocks_with_metric_ids_processed_total` -
so its rate can be compared to `rate(vm_index_blocks_with_metric_ids_incorrect_order_total)`.
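A minimal sketch of exporting both counters, assuming the github.com/VictoriaMetrics/metrics package used across the codebase; processIndexBlock and the HTTP wiring are illustrative call sites, not the actual code.

```go
package main

import (
	"net/http"

	"github.com/VictoriaMetrics/metrics"
)

var (
	indexBlocksProcessed = metrics.NewCounter(`vm_index_blocks_with_metric_ids_processed_total`)
	indexBlocksBadOrder  = metrics.NewCounter(`vm_index_blocks_with_metric_ids_incorrect_order_total`)
)

// processIndexBlock counts every processed block and separately counts
// blocks whose metricIDs turned out to be incorrectly ordered.
func processIndexBlock(metricIDsSorted bool) {
	indexBlocksProcessed.Inc()
	if !metricIDsSorted {
		indexBlocksBadOrder.Inc()
	}
}

func main() {
	http.HandleFunc("/metrics", func(w http.ResponseWriter, r *http.Request) {
		metrics.WritePrometheus(w, true)
	})
	http.ListenAndServe(":8428", nil)
}
```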
Slow loops may require seeks and expensive regexp matching, while fast loops just scan
all the metricIDs for the given `tag=value` prefix. So these operations must have separate
max loops multipliers.
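A minimal sketch of the split budget; the multiplier values and names are illustrative, not the actual constants from the codebase.

```go
package main

const (
	// Fast loops only scan metricIDs under a tag=value prefix,
	// so they can afford far more iterations.
	maxFastLoopsMultiplier = 32
	// Slow loops perform seeks and regexp matching per entry,
	// so they get a much tighter budget.
	maxSlowLoopsMultiplier = 2
)

// loopsBudget returns the maximum number of index loops allowed for a filter
// expected to touch metricIDsCount entries.
func loopsBudget(metricIDsCount int, slow bool) int {
	if slow {
		return metricIDsCount * maxSlowLoopsMultiplier
	}
	return metricIDsCount * maxFastLoopsMultiplier
}

func main() {}
```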