github-mirrors/VictoriaMetrics

mirror of https://github.com/VictoriaMetrics/VictoriaMetrics.git synced 2025-03-21 15:45:01 +00:00

Author	SHA1	Message	Date
kiriklo	82badc3dd5	app/vmselect/promql: improve performance of parseCache on systems with many CPU cores ### Describe Your Changes Parse cache is a pretty simple implementation of cache. It's just a standard map with mutex. Map with mutex overall has poor performance, plus when the cache overflow occurs, the whole cache locks until 1k elements have been deleted (now it's 10% of 10000 max elements in the cache). To avoid this bottleneck and improve performance of cache on systems with many CPU cores but keep it rather simple, we can implement cache with per bucket locks like it's done in fastcache. The logic and API remain the same. So now each bucket will have a map with approximately 78 elements (with 128 buckets), and overflow will occur now for each bucket, and only 7 elements need to be deleted. Because exec_test.go has about 10k lines of code, it's better to move the cache into a separate file to add tests and benchmarks for it, because now it does not have them. ``` goos: windows goarch: amd64 pkg: github.com/VictoriaMetrics/VictoriaMetrics/app/vmselect/promql cpu: 11th Gen Intel(R) Core(TM) i9-11900K @ 3.50GHz Current cache implementation performance on 8 cores: BenchmarkCachePutNoOverFlow-8 1932 618372 ns/op 253 B/op 0 allocs/op BenchmarkCacheGetNoOverflow-8 6547 211527 ns/op 0 B/op 0 allocs/op BenchmarkCachePutGetNoOverflow-8 1873 621718 ns/op 261 B/op 0 allocs/op BenchmarkCachePutOverflow-8 2262 464328 ns/op 32 B/op 0 allocs/op BenchmarkCachePutGetOverflow-8 1764 655866 ns/op 38 B/op 0 allocs/op New cache implementation performance on 8 cores: BenchmarkCachePutNoOverFlow-8 10408 111412 ns/op 0 B/op 0 allocs/op BenchmarkCacheGetNoOverflow-8 22407 52809 ns/op 0 B/op 0 allocs/op BenchmarkCachePutGetNoOverflow-8 6583 168088 ns/op 0 B/op 0 allocs/op BenchmarkCachePutOverflow-8 9822 117212 ns/op 2 B/op 0 allocs/op BenchmarkCachePutGetOverflow-8 6481 175952 ns/op 3 B/op 0 allocs/op Current cache implementation performance on 16 cores: BenchmarkCachePutNoOverFlow-16 2331 475307 ns/op 218 B/op 0 allocs/op BenchmarkCacheGetNoOverflow-16 6069 196905 ns/op 0 B/op 0 allocs/op BenchmarkCachePutGetNoOverflow-16 1870 644236 ns/op 262 B/op 0 allocs/op BenchmarkCachePutOverflow-16 2296 509279 ns/op 34 B/op 0 allocs/op BenchmarkCachePutGetOverflow-16 1726 671510 ns/op 45 B/op 0 allocs/op New cache implementation performance on 16 cores: BenchmarkCachePutNoOverFlow-16 13549 82413 ns/op 0 B/op 0 allocs/op BenchmarkCacheGetNoOverflow-16 30274 38997 ns/op 0 B/op 0 allocs/op BenchmarkCachePutGetNoOverflow-16 8512 126239 ns/op 0 B/op 0 allocs/op BenchmarkCachePutOverflow-16 13884 88124 ns/op 1 B/op 0 allocs/op BenchmarkCachePutGetOverflow-16 7903 131299 ns/op 3 B/op 0 allocs/op ``` From the benchmarks above, we can see that the new implementation is ~5 times faster than the old one. --------- Co-authored-by: f41gh7 <nik@victoriametrics.com>	2025-01-02 17:43:23 +01:00
Andrii Chubatiuk	a88f896b43	promql: exclude limit_offset from default by metric name sorting (#7402 ) ### Describe Your Changes I don't like this solution, but it works. Other possible solutions described in an issue fixes https://github.com/VictoriaMetrics/VictoriaMetrics/issues/7068 ### Checklist The following checks are mandatory: - [ ] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2024-11-06 15:10:23 +01:00
Aliaksandr Valialkin	c9bb4ddeed	app/vlselect: add /select/logsql/stats_query endpoint, which is going to be used by vmalert Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6942 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6706	2024-09-06 23:06:43 +02:00
Aliaksandr Valialkin	61d794c5e7	app/vmselect/promql: follow-up for `dd0d2c77c8` and `6149adbe10` Use metricsql.IsLikelyInvalid() function for determining whether the given query is likely invalid, e.g. there is high change the query is incorrectly written, so it will return unexpected results. The query is invalid most of the time if it passes something other than series selector into rollup function. For example: - rate(sum(foo)) - rate(foo + bar) - rate(foo > bar) Improtant note: the query is considered valid if it misses the lookbehind window in square brackes inside rollup function, e.g. rate(foo), since this is very convenient MetricsQL extention to PromQL, and this query returns the expected results most of the time. Other unsafe query types can be added in the future into metricsql.IsLikelyInvalid(). TODO: probably, the -search.disableImplicitConversion command-line flag must be set by default in the future releases of VictoriaMetrics. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4338 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6180 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6450	2024-07-03 00:47:10 +02:00
Roman Khavronenko	6149adbe10	app/vmselect/promql: check for ranged vectors in aggr funcs if implicit conversions are disabled (#6450 ) Check for ranged vector arguments in aggregate expressions when `-search.disableImplicitConversion` or `-search.logImplicitConversion` are enabled. For example, `sum(up[5m])` will fail to execute if these flags are set. ### Describe Your Changes Please provide a brief description of the changes you made. Be as specific as possible to help others understand the purpose and impact of your modifications. ### Checklist The following checks are mandatory: - [*] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). --------- Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-06-17 14:21:16 +02:00
Aliaksandr Valialkin	92de6ea340	app/vmselect: use strings.EqualFold instead of strings.ToLower where appropriate Strings.EqualFold doesn't allocate memory contrary to strings.ToLower if the input string contains uppercase chars	2024-05-12 10:20:41 +02:00
Hui Wang	dd0d2c77c8	app/vmselect: implement cmd-line flags `-search.disableImplicitConversions` and `-search.logImplicitConversions` (#6180 ) address https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4338 support disable or log [implicit conversions](https://docs.victoriametrics.com/metricsql/#implicit-query-conversions) for subquery with cmd-line flags `-search.disableImplicitConversion` and `-search.logImplicitConversion` Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2024-04-25 12:54:42 +02:00
Aliaksandr Valialkin	6697da73e5	app: consistently use atomic.* types instead of atomic.* functions See `ea9e2b19a5`	2024-02-24 02:44:24 +02:00
Aliaksandr Valialkin	190a6565ae	app/vmselect/promql: consistently sort results of `a or b` query Previously the order of results returned from `a or b` query could change with each request because the sorting for such query has been disabled in order to satisfy https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4763 . This commit executes `a or b` query as `sortByMetricName(a) or sortByMetricName(b)`. This makes the order of returned time series consistent across requests, while maintaining the requirement from https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4763 , e.g. `b` results are consistently put after `a` results. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5393	2024-01-16 01:30:10 +02:00
Aliaksandr Valialkin	3d3b0e31e0	app/vmselect: add -search.maxResponseSeries command-line flag for limiting the number of time series a single response can return This limit can be used for preventing from high memory usage at Grafana when the response returns too many series. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5372	2023-12-10 00:54:42 +02:00
Aliaksandr Valialkin	da887b49e7	app/vmui: show query execution duration in the header of query input field This should simplify the process of query optimization	2023-11-01 16:43:51 +01:00
Nikolay	1f91f22b5f	app/vmselect: reduce lock contention for heavy aggregation requests (#5119 ) reduce lock contention for heavy aggregation requests previously lock contetion may happen on machine with big number of CPU due to enabled string interning. sync.Map was a choke point for all aggregation requests. Now instead of interning, new string is created. It may increase CPU and memory usage for some cases. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5087	2023-10-10 13:45:20 +02:00
Aliaksandr Valialkin	3b9605dba5	app/vmselect/promql: do not sort `q1 or q2` results This makes sure that `q2` series are returned after `q1` series in the same way as Prometheus does See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4763	2023-09-25 16:14:16 +02:00
Aliaksandr Valialkin	edee262ecc	Makefile: update golangci-lint from v1.51.2 to v1.54.2 See https://github.com/golangci/golangci-lint/releases/tag/v1.54.2	2023-09-01 10:16:42 +02:00
Aliaksandr Valialkin	4cb024d8a3	all: add support for `or` filters in series selectors This commit adds ability to select series matching distinct filters via a single series selector. For example, the following selector selects series with either {env="prod",job="a"} or {env="dev",job="b"} labels: {env="prod",job="a" or env="dev",job="b"} The `or` filter is supported in all the VictoriaMetrics tools now. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3997 Uses https://github.com/VictoriaMetrics/metricsql/pull/14	2023-07-16 00:06:33 -07:00
Aliaksandr Valialkin	2f3ddd4884	app/vmselect/promql: avoid memory allocations and copying from source timeseries to the returned result at timeseriesToResult()	2023-01-09 22:38:59 -08:00
Aliaksandr Valialkin	4f0c11ee93	app/vmselect/promql: intern output series names inside timeseriesToResult() This reduces the number of memory allocations for repeated queries, which return (almost) the same set of time series.	2023-01-09 22:19:56 -08:00
Dmytro Kozlov	b75f1854c5	vmselect/promql: add alphanumeric sort by label (sort_by_label_numeric) (#2982 ) * vmselect/promql: add alphanumeric sort by label (sort_by_label_numeric) * vmselect/promql: fix tests, add documentation * vmselect/promql: update test * vmselect/promql: update for alphanumeric sorting, fix tests * vmselect/promql: remove comments * vmselect/promql: cleanup * vmselect/promql: avoid memory allocations, update functions descriptions * vmselect/promql: make linter happy (remove ineffectual assigment) * vmselect/promql: add test case, fix behavior when strings are equal * vendor: update github.com/VictoriaMetrics/metricsql from v0.44.1 to v0.45.0 this adds support for sort_by_label_numeric and sort_by_label_numeric_desc functions * wip * lib/promscrape: read response body into memory in stream parsing mode before parsing it This reduces scrape duration for targets returning big responses. The response body was already read into memory in stream parsing mode before this change, so this commit shouldn't increase memory usage. * wip Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2022-09-14 17:41:09 +03:00
Aliaksandr Valialkin	4ac79d29ad	app/vmselect: follow-up after `63e0f16062` * Explicitly store a pointer to UserReadableError in the error interface. Previously Go automatically converted the value to a pointer before storing in the error interface. * Add Unwrap() method to UserReadableError, so it can be used transparently with the other code, which calls errors.Is() and errors.As(). * Document the change in docs/CHANGELOG.md	2022-08-15 13:50:16 +03:00
Roman Khavronenko	63e0f16062	vmselect: introduce UserReadableError type of error (#2894 ) When read query fails, VM returns rich error message with all the details. While these details might be useful for debugging specific cases, they're usually too verbose for users. Introducing a new error type `UserReadableError` is supposed to allow to return to user only the most important parts of the error trace. This supposed to improve error readability in web interfaces such as VMUI or Grafana. The full error trace is still logged with the full context and can be found in vmselect logs. Signed-off-by: hagen1778 <roman@victoriametrics.com> Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-08-15 13:38:47 +03:00
Aliaksandr Valialkin	41958ed5dd	all: add initial support for query tracing See https://docs.victoriametrics.com/Single-server-VictoriaMetrics.html#query-tracing Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1403	2022-06-01 02:29:23 +03:00
Aliaksandr Valialkin	ed97908ca9	app/vmselect/promql: rename removeNaNs() to more clear removeEmptySeries()	2022-04-20 19:53:46 +03:00
Aliaksandr Valialkin	0e3de5a0cc	app/vmselect/promql: add `topk_last` and `bottomk_last` functions	2021-09-30 13:22:52 +03:00
Aliaksandr Valialkin	8ed95e82c6	app/vmselect/promql: follow-up after `57b3320478`	2021-09-24 01:24:18 +03:00
Roman Khavronenko	57b3320478	app/vmselect: make sorting for query result similar to Prometheus (#1647 ) * app/vmselect: make sorting for query result similar to Prometheus Updated sorting allows to get the order of series in result similar or equal to what Prometheus returns. The change is needed for compatibility reasons. * Update app/vmselect/promql/exec_test.go Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2021-09-24 01:03:12 +03:00
Aliaksandr Valialkin	83a4db813e	app/vmselect: log slow requests to all the `/api/v1/*` handlers if their execution time exceeds `-search.logSlowQueryDuration`	2021-06-18 19:04:42 +03:00
Aliaksandr Valialkin	d3fa0ccabd	app/vmselect/promql: properly detect aggregate `topk` and `bottomk` aggregate functions in order to disable duplicate sorting Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1189	2021-04-08 00:09:40 +03:00
Aliaksandr Valialkin	1177dca3da	app/vmselect: do not sort series returned from `topk` and `bottomk` functions, since these series are already sorted in user-expected order Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1189	2021-04-07 14:16:08 +03:00
Aliaksandr Valialkin	c79e4a2f90	app/vmselect/promql: remove the limit on the number of time series that can be sorted, since it may confuse users Always sort time series returned from `/api/v1/query` and `/api/v1/query_range` unless `sort_*` function is used at top level of the query.	2021-04-02 15:02:08 +03:00
Aliaksandr Valialkin	2dae0a2c47	app/vmselect: add `round_digits` query arg to `/api/v1/query` and `/api/v1/query_range` handlers for limiting the number of decimal digits after the point	2021-03-15 12:36:33 +02:00
Aliaksandr Valialkin	8629fd8a72	app/vmselect: deprecate `-search.treatDotsAsIsInRegexps` in favor to `{__graphite__="foo.*.bar"}` syntax	2021-02-03 20:36:01 +02:00
Aliaksandr Valialkin	df0309eae0	app/vmselect/promql: simplify defer call for querystats.RegisterQuery	2020-12-27 12:06:04 +02:00
Aliaksandr Valialkin	59183f66d0	app/vmselect: refactor `/api/v1/stats/top_queries` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/907	2020-12-25 16:44:29 +02:00
Nikolay	86630350bf	Adds query stats handler (#945 ) * Adds query stat handler, for query and query_range api, victoriametrics tracks query execution time, stats are expored at /api/v1/status/queries endpoint with topN param https://github.com/VictoriaMetrics/VictoriaMetrics/issues/907 * fixed query stats bugs * improves queryStats tracker * improves query stat * small fix * fix tests * added more tests * fixes 386 tests * naming fixes * adds drop for outdated records	2020-12-25 16:42:05 +02:00
Aliaksandr Valialkin	91a4c279cc	app/vmselect: return `metric` values from `time() cmp_op metric` query when `cmp_op` comparison is true This aligns MetricsQL behavior to Prometheus' one. The issue has been identified at https://promlabs.com/promql-compliance-test-results/2020-12-01/victoriametrics/	2020-12-02 12:09:34 +02:00
Aliaksandr Valialkin	2859a452d4	app/vmselect: add remoteAddr to slow query log in order to improve debuggability This will simplify identifying the client that sends slow queries to VictoriaMetrics.	2020-11-18 20:38:32 +02:00
Aliaksandr Valialkin	348edd92fe	app/vmselect: add `-search.treatDotsAsIsInRegexps` command-line flag for automatic escaping of dots in regexp label filters	2020-11-11 12:39:07 +02:00
Aliaksandr Valialkin	e9f2e2cbc9	app/vmselect/promql: add missing label filters to binary operands before query execution This implements the optimization described at https://utcc.utoronto.ca/~cks/space/blog/sysadmin/PrometheusLabelNonOptimization See also https://github.com/cortexproject/cortex/issues/3253	2020-10-07 21:15:09 +03:00
Aliaksandr Valialkin	68e4f40a72	app/vmselect: properly handle PromQL queries like `scalar1 < metric < scalar2` like Prometheus does This fixes some cases from https://promlabs.com/promql-compliance-test-results-victoriametrics/	2020-08-06 23:21:03 +03:00
Aliaksandr Valialkin	5d0c37bec0	app/vmselect: use warning level instead of info level for logging slow queries that take longer than `-search.logSlowQueryDuration`	2020-08-04 20:25:35 +03:00
Aliaksandr Valialkin	742da690f4	app/vmselect: add `/api/v1/status/active_queries` page with the list of currently running queries This is a follow-up for https://github.com/VictoriaMetrics/VictoriaMetrics/pull/598 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/575	2020-07-08 18:55:38 +03:00
DexterZhang	99f54e44ff	feat(vmselect): add current running query list, add ability for getting the running query info and killing running query for master branch (#598 )	2020-07-08 18:52:55 +03:00
Aliaksandr Valialkin	4e4f57b121	lib/metricsql: move it to a separate repository - github.com/VictoriaMetrics/metrics	2020-04-28 15:28:22 +03:00
Aliaksandr Valialkin	7b1c7051a3	app/vmselect: add `sort_by_label(q, label)` and `sort_by_label_desc(q, label)` functions This is implementation of https://github.com/prometheus/prometheus/pull/1533 for VictoriaMetrics.	2020-02-13 17:01:37 +02:00
Aliaksandr Valialkin	a6c6a2debc	app/vmselect/promql: do not add step to range end, since this hack became obsolete since commit `9e1119dab8`	2020-02-05 21:22:19 +02:00
Aliaksandr Valialkin	680080887d	all: consistently log durations in seconds with millisecond precision This should improve logs readability	2020-01-22 18:28:27 +02:00
Aliaksandr Valialkin	6f67e0b56b	lib/metricsq: add ExpandWithExprs	2019-12-25 22:20:30 +02:00
Aliaksandr Valialkin	1925ee038d	Rename lib/promql to lib/metricsql and apply small fixes	2019-12-25 22:03:59 +02:00
Mike Poindexter	bec62e4e43	Split Extended PromQL parsing to a separate library	2019-12-25 22:03:51 +02:00
Aliaksandr Valialkin	50ae1879c6	app/vmselect/promql: add `histogram` aggregate function, which is useful for building heatmaps from multiple time series	2019-11-24 00:04:25 +02:00

1 2

58 commits