github-mirrors/VictoriaMetrics

mirror of https://github.com/VictoriaMetrics/VictoriaMetrics.git synced 2025-03-21 15:45:01 +00:00

Author	SHA1	Message	Date
Aliaksandr Valialkin	f1eac36a80	lib/logstorage: add `contains_any` and `contains_all` filters - `contains_any` selects logs with fields containing at least one word/phrase from the provided list. The provided list can be generated by a subquery. - `contains_all` selects logs with fields containing all the words and phrases from the provided list. The provided list can be generated by a subquery.	2025-02-22 21:55:58 +01:00
Aliaksandr Valialkin	6afd66dcc8	lib/logstorage: remove needExecuteQuery from filterIn and filterStreamID, since it isn't needed	2025-02-19 01:45:06 +01:00
Aliaksandr Valialkin	5cd7e1cc2f	lib/logstorage: consistently use Query.cloneShallow() for shallow cloning of the original query	2025-02-14 18:55:08 +01:00
Aliaksandr Valialkin	ad6c587494	lib/logstorage: properly propagate extra filters to all the subqueries The purpose of extra filters ( https://docs.victoriametrics.com/victorialogs/querying/#extra-filters ) is to limit the subset of logs, which can be queried. For example, it is expected that all the queries with `extra_filters={tenant=123}` can access only logs, which contain `123` value for the `tenant` field. Previously this wasn't the case, since the provided extra filters weren't applied to subqueries. For example, the following query could be used to select all the logs outside `tenant=123`, for any `extra_filters` arg: * \| union({tenant!=123}) This commit fixes this by propagating extra filters to all the subqueries. While at it, this commit also properly propagates [start, end] time range filter from HTTP querying APIs into all the subqueries, since this is what most users expect. This behaviour can be overriden on per-subquery basis with the `options(ignore_global_time_filter=true)` option - see https://docs.victoriametrics.com/victorialogs/logsql/#query-options Also properly apply apply optimizations across all the subqueries. Previously the optimizations at Query.optimize() function were applied only to the top-level query.	2025-01-24 18:49:25 +01:00
Aliaksandr Valialkin	b620b5cff5	lib/logstorage: add an ability to set query concurrency on a per-query basis This is done via 'options(concurrency=N)' prefix for the query. For example, the following query is executed on at most 4 CPU cores: options(concurrency=4) _time:1d \| count_uniq(user_id) This allows reducing RAM and CPU usage at the cost of longer query execution times, since by default every query is executed in parallel on all the available CPU cores. See https://docs.victoriametrics.com/victorialogs/logsql/#query-options	2025-01-23 02:42:16 +01:00
Aliaksandr Valialkin	42c21ff671	lib/logstorage: always pass the current timestamp to newLexer() Also always initialize Query.timestamp with the timestamp from the lexer. This should avoid potential problems with relative timestamps inside inner queries. For example, the `_time:1h` filter in the following query is correctly executed relative to the current timestamp: foo:in(_time:1h \| keep foo)	2025-01-23 02:42:16 +01:00
Aliaksandr Valialkin	43d615ae87	lib/logstorage: properly pass tenantIDs list to initStreamFilters Previously an empty tenantIDs list was mistakenly passed to initStreamFilters when the query already contained top-level stream filter. See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/8037	2025-01-16 17:45:49 +01:00
Aliaksandr Valialkin	f27e120aeb	lib/logstorage: add `union` pipe, which allows uniting results from multiple queries	2025-01-15 22:22:07 +01:00
Aliaksandr Valialkin	c5949af9e8	lib/logstorage: reduce memory allocations when splitting in(...) values into tokens and calculating hashes for these tokens While at it, reduce memory allocations at Storage.getFieldValuesNoHits and make it more scalable on multi-CPU systems. This improves performance of in(<query>) filter when the <query> returns big number of values.	2024-12-22 13:13:44 +01:00
Aliaksandr Valialkin	dbec34bafc	lib/logstorage: add `facets` pipe for returning the most frequent values across all the log fields seen in the selected logs	2024-12-06 01:24:15 +01:00
Aliaksandr Valialkin	66b2987f49	lib/logstorage: optimize query imeediately after its parsing This eliminates possible bugs related to forgotten Query.Optimize() calls. This also allows removing optimize() function from pipe interface. While at it, drop filterNoop inside filterAnd.	2024-11-08 16:43:54 +01:00
Aliaksandr Valialkin	5a6531b329	lib/logstorage: add an ability to add prefix to resulting query field names in `join` pipe See https://docs.victoriametrics.com/victorialogs/logsql/#join-pipe	2024-11-08 16:43:53 +01:00
Aliaksandr Valialkin	f9e23bf8e3	lib/logstorage: add `join` pipe for joining multiple query results	2024-11-06 18:53:29 +01:00
Aliaksandr Valialkin	1892e357c3	lib/logstorage: consistently use "pHits := m[..]" pattern Consistency improves maintainability of the code a bit.	2024-10-18 02:22:43 +02:00
Aliaksandr Valialkin	508e498ae3	lib/logstorage: follow-up for `72941eac36` - Allow dropping metrics if the query result contains at least a single metric. - Allow copying by(...) fields. - Disallow overriding by(...) fields via `math` pipe. - Allow using `format` pipe in stats query. This is useful for constructing some labels from the existing by(...) fields. - Add more tests. - Remove the check for time range in the query filter according to https://github.com/VictoriaMetrics/VictoriaMetrics/pull/7254/files#r1803405826 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/7254	2024-10-16 19:43:52 +02:00
Hui Wang	72941eac36	victorialogs: add more checks for stats query APIs (#7254 ) 1. Verify if field in [fields pipe](https://docs.victoriametrics.com/victorialogs/logsql/#fields-pipe) exists. If not, it generates a metric with illegal float value "" for prometheus metrics protocol. 2. check if multiple time range filters produce conflicted query time range, for instance: ``` query: _time: 5m \| stats count(), start:2024-10-08T10:00:00.806Z, end: 2024-10-08T12:00:00.806Z, time: 2024-10-10T10:02:59.806Z ``` must give no result due to invalid final time range. --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-10-16 19:25:43 +02:00
Aliaksandr Valialkin	b4b79a4961	lib/logstorage: make a copy of s.partitions slice when performing queries over the selected partitions s.partitions can be changed when new partition is registered or when old partition is dropped. This could lead to data races and panics when s.partitions slice is accessed by concurrently executed queries. The fix is to make a copy of the selected partitions under s.partitionsLock before performing the query.	2024-10-13 22:14:34 +02:00
Aliaksandr Valialkin	4b1611267f	lib/logstorage: properly return surrounding logs outside the selected time range by stream_context pipe Previously only logs inside the selected time range could be returned by stream_context pipe. For example, the following query could return up to 10 surrounding logs only for the last 5 minutes, while most users expect this query should return up to 10 surrounding logs without restrictions on the time range. _time:5m panic \| stream_context before 10 This enables the ability to implement stream context feature at VictoriaLogs web UI: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/7063 . Reduce memory usage when returning stream context over big log streams with millions of entries. The new logic scans over all the log messages for the selected log stream, while keeping in memory only the given number of surrounding logs. Previously all the logs for the given log stream on the selected time range were loaded in memory before selecting the needed surrounding logs. This should help https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6730 . Reduce the scan performance for big log streams by fetching only the requested fields. For example, the following query should be executed much faster than before if logs contain many fields other than _stream, _msg and _time: panic \| stream_context after 30 \| fields _stream, _msg, _time	2024-09-26 17:03:45 +02:00
Aliaksandr Valialkin	4599429f51	lib/logstorage: read timestamps column when it is really needed during query execution Previously timestamps column was read unconditionally on every query. This could significantly slow down queries, which do not need reading this column like in https://github.com/VictoriaMetrics/VictoriaMetrics/issues/7070 .	2024-09-25 19:17:47 +02:00
Aliaksandr Valialkin	0205170409	lib/logstorage: consistently use nsecsPerDay constant and remove nsecPerDay constant	2024-09-06 16:17:04 +02:00
Aliaksandr Valialkin	d4ca651547	lib/logstorage: add `stream_context` pipe, which allows selecting surrounding logs for the matching logs	2024-06-28 19:14:29 +02:00
Aliaksandr Valialkin	87f1c8bd6c	lib/logstorage: work-in-progress	2024-06-27 14:20:43 +02:00
Aliaksandr Valialkin	de7450b7e0	lib/logstorage: work-in-progress	2024-06-24 23:27:12 +02:00
Aliaksandr Valialkin	7229dd8c33	lib/logstorage: work-in-progress	2024-06-20 03:10:08 +02:00
Aliaksandr Valialkin	0aafca29be	lib/logstorage: work-in-progress	2024-05-28 19:29:41 +02:00
Aliaksandr Valialkin	dc55146752	lib/logstorage: work-in-progress	2024-05-25 21:36:16 +02:00
Aliaksandr Valialkin	e2590f0485	lib/logstorage: work-in-progress	2024-05-25 00:30:58 +02:00
Aliaksandr Valialkin	4b458370c1	lib/logstorage: work-in-progress	2024-05-24 03:06:55 +02:00
Aliaksandr Valialkin	22107421eb	lib/logstorage: work-in-progress	2024-05-22 21:01:20 +02:00
Aliaksandr Valialkin	bc4a0b8f37	lib/logstorage: fix golangci-lint warnings	2024-05-20 11:04:12 +02:00
Aliaksandr Valialkin	ad505a7a9a	lib/logstorage: work-in-progress	2024-05-20 04:08:30 +02:00
Aliaksandr Valialkin	0aa19a2837	lib/logstorage: work-in-progress	2024-05-15 04:55:44 +02:00
Aliaksandr Valialkin	cb35e62e04	lib/logstorage: work-in-progress Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6258	2024-05-14 01:49:23 +02:00
hagen1778	17283fab6c	lib/logstorage: make linter happy Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-05-13 15:35:11 +02:00
Aliaksandr Valialkin	9dbd0f9085	lib/logstorage: initial implementation of pipes in LogsQL See https://docs.victoriametrics.com/victorialogs/logsql/#pipes	2024-05-12 16:33:31 +02:00
Aliaksandr Valialkin	918cccaddf	all: fix golangci-lint(revive) warnings after `0c0ed61ce7` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6001	2024-04-02 23:16:29 +03:00
Aliaksandr Valialkin	0514091948	app/vlselect: follow-up for `451d2abf50` - Consistently return the first `limit` log entries if the total size of found log entries doesn't exceed 1Mb. See app/vlselect/logsql/sort_writer.go . Previously random log entries could be returned with each request. - Document the change at docs/VictoriaLogs/CHANGELOG.md - Document the `limit` query arg at docs/VictoriaLogs/querying/README.md - Make the change less intrusive. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5674 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5778	2024-02-18 23:05:51 +02:00
Dmytro Kozlov	451d2abf50	Enable the `limit` query param for the `/select/logsql/query` (#5778 ) * app/vlselect: add limit for logs query * app/vlselect: CHANGELOG.md * app/vlselect: stop search process if limit is reached, update logic, remove default limit * app/vlselect: fix tests * app/vlselect: fix filter tests * app/vlselect: fix tests	2024-02-18 22:58:47 +02:00
noodles2hg	cafd6f08b3	lib/logstorage: proper exit during block search (#5400 )	2024-02-01 12:11:05 +00:00
Aliaksandr Valialkin	cef7a39ba3	lib/logstorage: always check the previous indexBlockHeader for blocks with matching tenantID and/or streamID The previous indexBlockHeader may contain blocks for the matching tenantID and/or streamID, so it must be scanned unconditionally during the search. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5295 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4856 This is a follow-up for `89dcbc2fe7`	2023-11-13 23:13:53 +01:00
XLONG96	89dcbc2fe7	lib/logstorage: fix streamID and tenantID search (#4856 ) (#5295 )	2023-11-13 23:09:39 +01:00
Aliaksandr Valialkin	87b66db47d	app/victoria-logs: initial code release	2023-06-19 22:55:12 -07:00

42 commits