github-mirrors/VictoriaMetrics

mirror of https://github.com/VictoriaMetrics/VictoriaMetrics.git synced 2024-11-21 14:44:00 +00:00

Author	SHA1	Message	Date
Yury Molodov	36a86c3aaf	vmui/logs: fix display of hits chart (#7167 ) ### Describe Your Changes Fixed the display of hits chart in VictoriaLogs. See #7133 ### Checklist The following checks are mandatory: - [ ] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/).	2024-10-18 02:28:23 +02:00
Aliaksandr Valialkin	064b9a6314	docs/VictoriaLogs/CHANGELOG.md: remove "index.html" trailer from the link to docs for the sake of consistency with other links to docs This is a follow-up for `3538869942` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/7252	2024-10-18 02:26:01 +02:00
Aliaksandr Valialkin	2023f017b1	lib/logstorage: optimize performance for queries, which select all the log fields for logs containing hundreds of log fields (aka "wide events") Unpack the full columnsHeader block instead of unpacking meta-information per each individual column when the query, which selects all the columns, is executed. This improves performance when scanning logs with big number of fields.	2024-10-18 02:22:42 +02:00
Aliaksandr Valialkin	78c6fb0883	lib/logstorage: improve performance of `top` and `field_values` pipes on systems with many CPU cores - Parallelize merging of per-CPU results. - Parallelize writing the results to the next pipe.	2024-10-18 02:22:42 +02:00
Aliaksandr Valialkin	c4b2fdff70	lib/logstorage: optimize 'stats by(...)' calculations for by(...) fields with millions of unique values on multi-CPU systems - Parallelize merging of per-CPU `stats by(...)` result shards. - Parallelize writing `stats by(...)` results to the next pipe.	2024-10-18 02:22:41 +02:00
Aliaksandr Valialkin	192c07f76a	lib/logstorage: optimize performance for `top` pipe when it is applied to a field with millions of unique values - Use parallel merge of per-CPU shard results. This improves merge performance on multi-CPU systems. - Use topN heap sort of per-shard results. This improves performance when results contain millions of entries.	2024-10-18 02:21:56 +02:00
Andrii Chubatiuk	3538869942	vlogs: added basic alerts (#7252 ) ### Describe Your Changes Added basic VLogs alerts Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2024-10-17 11:33:06 +02:00
Aliaksandr Valialkin	a72e1155b9	docs/VictoriaLogs/CHANGELOG.md: add missing part of the sentence	2024-10-16 20:22:19 +02:00
Aliaksandr Valialkin	677f1cd1be	docs/VictoriaLogs/CHANGELOG.md: typo fix: refer the correct endpoints for stats results	2024-10-16 20:19:22 +02:00
Aliaksandr Valialkin	91987763d4	docs/VictoriaLogs/CHANGELOG.md: cut v0.36.0-victorialogs release	2024-10-16 20:00:35 +02:00
Aliaksandr Valialkin	508e498ae3	lib/logstorage: follow-up for `72941eac36` - Allow dropping metrics if the query result contains at least a single metric. - Allow copying by(...) fields. - Disallow overriding by(...) fields via `math` pipe. - Allow using `format` pipe in stats query. This is useful for constructing some labels from the existing by(...) fields. - Add more tests. - Remove the check for time range in the query filter according to https://github.com/VictoriaMetrics/VictoriaMetrics/pull/7254/files#r1803405826 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/7254	2024-10-16 19:43:52 +02:00
Hui Wang	72941eac36	victorialogs: add more checks for stats query APIs (#7254) 1. Verify if field in [fields pipe](https://docs.victoriametrics.com/victorialogs/logsql/#fields-pipe) exists. If not, it generates a metric with illegal float value "" for prometheus metrics protocol. 2. Check if multiple time range filters produce a conflicting query time range, for instance: ``` query: _time: 5m | stats count(), start:2024-10-08T10:00:00.806Z, end: 2024-10-08T12:00:00.806Z, time: 2024-10-10T10:02:59.806Z ``` must give no result due to invalid final time range. --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-10-16 19:25:43 +02:00
Aliaksandr Valialkin	202eb429a7	lib/logstorage: refactor storage format to be more efficient for querying wide events It turned out that VictoriaLogs is frequently used for collecting logs with tens of fields. For example, standard Kubernetes setup on top of Filebeat generates more than 20 fields per each log. Such logs are also known as "wide events". The previous storage format was optimized for logs with a few fields. When at least a single field was referenced in the query, then all the meta-information about all the log fields was unpacked and parsed per each scanned block during the query. This could require a lot of additional disk IO and CPU time when logs contain many fields. Resolve this issue by providing a (field -> metainfo_offset) index per each field in every data block. This index allows reading and extracting only the needed metainfo for fields used in the query. This index is stored in columnsHeaderIndexFilename (columns_header_index.bin). This allows increasing performance for queries over wide events by 10x and more. Another issue was that the data for bloom filters and field values across all the log fields except _msg was intermixed in two files - fieldBloomFilename (field_bloom.bin) and fieldValuesFilename (field_values.bin). This could result in huge disk read IO overhead when some small field was referenced in the query, since the Operating System usually reads more data than requested. It reads the data from disk in at least 4KiB blocks (usually the block size is much bigger, in the range 64KiB - 512KiB). So, if a 512-byte bloom filter or values' block is read from the file, then the Operating System reads up to 512KiB of data from disk, which results in 1000x disk read IO overhead.
This overhead isn't visible for recently accessed data, since this data is usually stored in RAM (aka Operating System page cache), but this overhead may become very annoying when performing the query over large volumes of data which isn't present in OS page cache. The solution for this issue is to split bloom filters and field values across multiple shards. This reduces the worst-case disk read IO overhead by at least Nx where N is the number of shards, while the disk read IO overhead is completely removed in the best case when the number of columns doesn't exceed N. Currently the number of shards is 8 - see bloomValuesShardsCount. This solution increases performance for queries over large volumes of newly ingested data by up to 1000x. The new storage format is versioned as v1, while the old storage format is versioned as v0. It is stored in the partHeader.FormatVersion. Parts with the old storage format are converted into parts with the new storage format during background merge. It is possible to force merge by querying /internal/force_merge HTTP endpoint - see https://docs.victoriametrics.com/victorialogs/#forced-merge .	2024-10-16 17:35:07 +02:00
Yury Molodov	86029de0d4	vmui: fix alert display with long messages (#7228 ) ### Describe Your Changes Fix `Alert` component to prevent it from overflowing the screen when displaying long messages. Related issue: #7207 ### Checklist The following checks are mandatory: - [x] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). --------- Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2024-10-15 16:35:57 +02:00
Yury Molodov	6c9772b101	vmui: add the ability to cancel running queries (#7204 ) ### Describe Your Changes - Added functionality to cancel running queries on the Explore Logs and Query pages. - The loader was changed from a spinner to a top bar within the block. This still indicates loading, but solves the issue of the spinner "flickering," especially during graph dragging. Related issue: #7097 https://github.com/user-attachments/assets/98e59aeb-905b-4b9d-bbb2-688223b22a82 ### Checklist The following checks are mandatory: - [ ] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/).	2024-10-15 14:48:40 +02:00
Aliaksandr Valialkin	bac193e50b	app/vlselect: do not show empty fields in query results Empty fields are treated as non-existing fields by VictoriaLogs data model. So there is no sense in returning empty fields in query results, since they may mislead and confuse users.	2024-10-14 23:43:58 +02:00
Aliaksandr Valialkin	3c73dbbacc	app/vlstorage: add support for forced merge via /internal/force_merge HTTP endpoint	2024-10-13 22:20:31 +02:00
Aliaksandr Valialkin	b4b79a4961	lib/logstorage: make a copy of s.partitions slice when performing queries over the selected partitions s.partitions can be changed when new partition is registered or when old partition is dropped. This could lead to data races and panics when s.partitions slice is accessed by concurrently executed queries. The fix is to make a copy of the selected partitions under s.partitionsLock before performing the query.	2024-10-13 22:14:34 +02:00
Aliaksandr Valialkin	867f671cc4	lib/logstorage: make sure that br.bs is non-nil before checking br.bs.bsw.bh.rowsCount there br.bs may be nil when br contains the block with additional filters applied during pipe calculations. For example, `* | count() if (error) errors`.	2024-10-12 20:51:29 +02:00
Aliaksandr Valialkin	252aa792f7	docs/VictoriaLogs: cut v0.35.0 release	2024-10-09 15:55:20 +02:00
Aliaksandr Valialkin	ad5d8097da	app/vlogscli: add -accountID and -projectID command-line flags for querying the given tenants	2024-10-09 12:56:49 +02:00
Aliaksandr Valialkin	e31625e0b2	app/vlogscli: add support for live tailing	2024-10-09 12:30:17 +02:00
Aliaksandr Valialkin	6878982c93	docs/VictoriaLogs/CHANGELOG.md: cut v0.34.0 release	2024-10-08 12:21:19 +02:00
Aliaksandr Valialkin	492190885d	app/vlogscli: add ability to display query results in logfmt, single-line and multi-line json modes	2024-10-07 12:20:06 +02:00
Aliaksandr Valialkin	daad96b3a5	app/vlogscli: return back sorting result fields by name This simplifies locating the needed field when the number of fields per each returned result is big	2024-10-07 10:41:48 +02:00
Aliaksandr Valialkin	596e4de248	app/vlogscli: preserve the original order of fields in the displayed responses	2024-10-05 21:27:32 +02:00
Aliaksandr Valialkin	364f084b43	lib/logstorage: add `len` pipe for calculating byte length of log field values	2024-10-03 18:21:10 +02:00
Aliaksandr Valialkin	234c81754e	docs/VictoriaLogs/CHANGELOG.md: cut v0.33.0-victorialogs release	2024-10-01 13:42:18 +02:00
Aliaksandr Valialkin	a350be48b6	lib/logstorage: do not count dictionary values which have no matching logs in `count_uniq` stats function Create blockResultColumn.forEachDictValue* helper functions for visiting matching dictionary values. These helper functions should prevent from counting dictionary values without matching logs in the future. This is a follow-up for `0c0f013a60` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/7152	2024-10-01 13:34:45 +02:00
Aliaksandr Valialkin	630211cfed	app/vlogscli: add interactive command-line tool for querying VictoriaLogs	2024-10-01 12:23:07 +02:00
Aliaksandr Valialkin	82482fca4b	docs/VictoriaLogs/CHANGELOG.md: cut v0.32.1-victorialogs release	2024-09-30 14:31:17 +02:00
Aliaksandr Valialkin	0c0f013a60	lib/logstorage: skip values with zero hits for 'uniq', 'top' and 'field_values' pipes See https://github.com/VictoriaMetrics/victorialogs-datasource/issues/72#issuecomment-2352078483	2024-09-30 14:15:07 +02:00
Aliaksandr Valialkin	45cfb6b526	docs/VictoriaLogs/CHANGELOG.md: cut v0.32.0-victorialogs	2024-09-29 14:47:31 +02:00
Aliaksandr Valialkin	55eb321f77	lib/logstorage: clear hits slice obtained from encoding.GetUint64s() before updating it with hits for valueTypeDict column encoding.GetUint64s() returns uninitialized slice, which may contain arbitrary values. So values in this slice must be reset to zero before using it for counting hits in `uniq` and `top` pipes.	2024-09-29 10:29:13 +02:00
Aliaksandr Valialkin	0b91452ca4	lib/logstorage: add non-empty `if (...)` condition to automatically generated result names in `stats` pipe This allows executing queries with `stats` pipe, which calculate multiple results with the same functions, but with different `if (...)` conditions. For example: _time:5m | count(), count() if (error) Previously such queries couldn't be executed because the automatically generated name for the second result didn't include `if (error)`, so names for both results were identical - `count(*)`.	2024-09-29 09:51:28 +02:00
Aliaksandr Valialkin	8772aea24b	lib/logstorage: support `order` alias for `sort` pipe Now the following queries are equivalent: _time:5s | sort by (_time) _time:5s | order by (_time) This is needed for convenience, since `order by` is commonly used in other query languages such as SQL.	2024-09-29 09:51:27 +02:00
Aliaksandr Valialkin	806bc2ac58	app/vlinsert: support unix timestamps in seconds and milliseconds in JSON stream data ingestion API	2024-09-28 21:56:50 +02:00
Aliaksandr Valialkin	7d7d7c03bc	app/vlinsert: accept unix timestamp in seconds additionally to milliseconds at ElasticSearch bulk API Timestamps in seconds are sometimes used for data ingestion via ElasticSearch bulk API	2024-09-28 21:19:54 +02:00
Aliaksandr Valialkin	58c69386c7	docs/VictoriaLogs/CHANGELOG.md: cut v0.31.0-victorialogs release	2024-09-27 13:54:17 +02:00
Yury Molodov	8657d03433	vmui/logs: improve graph usability (#7025 ) ### Describe Your Changes - Show the time range in the tooltip when hovering over staircase graphs. - Use bolder lines for staircase graphs. - Increase the number of steps on the staircase graph to 100. - Reduce the maximum width of the tooltip to 1/3 of the screen. - Insert only the label name under the cursor into the query input field when `Ctrl`-clicking the line legend. See [this comment](https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6545#issuecomment-2336805237). ### Checklist The following checks are mandatory: - [ ] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-09-27 13:19:46 +02:00
Aliaksandr Valialkin	09b309a82e	lib/logstorage: allow using `-` instead of `!` as a shorthand for `NOT` operator in LogsQL	2024-09-27 13:14:47 +02:00
Aliaksandr Valialkin	76c1b0b8ea	lib/logstorage: support skipping _stream: prefix for stream filters '_stream:{...}' can be written as '{...}' This simplifies writing queries with stream filters, and makes them more familiar to Loki users.	2024-09-27 13:14:46 +02:00
Yury Molodov	c896bf340d	vmui: add functionality to preserve selected columns (#7037 ) ### Describe Your Changes 1) Changed table settings from a popup to a modal window to simplify future functionality additions. 2) Added functionality to save selected columns when data is modified or the page is reloaded. See #7016. <details> <summary>Example screenshots</summary> <img alt="demo-1" width="600" src="https://github.com/user-attachments/assets/a5d9a910-363c-4931-8b12-18ea8b3d97d8"/> </details> ### Checklist The following checks are mandatory: - [x] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). --------- Co-authored-by: Roman Khavronenko <roman@victoriametrics.com>	2024-09-27 11:52:01 +02:00
Aliaksandr Valialkin	8077585303	docs/VictoriaLogs/CHANGELOG.md: cut v0.30.1-victorialogs release	2024-09-27 11:20:58 +02:00
Aliaksandr Valialkin	9367a9a6a2	lib/logstorage: consistently sort stream contexts belonging to different streams by the minimum time seen in the matching logs This should simplify debugging of stream_context output, since it remains stable over repeated requests.	2024-09-27 11:19:26 +02:00
Aliaksandr Valialkin	31117c66d5	docs/VictoriaLogs/CHANGELOG.md: cut v0.30.0-victorialogs release	2024-09-27 09:18:31 +02:00
Aliaksandr Valialkin	4b1611267f	lib/logstorage: properly return surrounding logs outside the selected time range by stream_context pipe Previously only logs inside the selected time range could be returned by stream_context pipe. For example, the following query could return up to 10 surrounding logs only for the last 5 minutes, while most users expect this query should return up to 10 surrounding logs without restrictions on the time range. _time:5m panic | stream_context before 10 This enables the ability to implement stream context feature at VictoriaLogs web UI: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/7063 . Reduce memory usage when returning stream context over big log streams with millions of entries. The new logic scans over all the log messages for the selected log stream, while keeping in memory only the given number of surrounding logs. Previously all the logs for the given log stream on the selected time range were loaded in memory before selecting the needed surrounding logs. This should help https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6730 . Improve the scan performance for big log streams by fetching only the requested fields. For example, the following query should be executed much faster than before if logs contain many fields other than _stream, _msg and _time: panic | stream_context after 30 | fields _stream, _msg, _time	2024-09-26 17:03:45 +02:00
Aliaksandr Valialkin	037652d5ae	app/vlinsert: support `_time` field without timezone information during data ingestion Use local timezone of the host server in this case. The timezone can be overridden with TZ environment variable if needed. While at it, allow using whitespace instead of T as a delimiter between date and time in the ingested _time field. For example, '2024-09-20 10:20:30' is now accepted during data ingestion. This is valid ISO8601 format, which is used by some log shippers, so it should be supported. This format is also known as SQL datetime format. Also assume local time zone when time without timezone information is passed to querying APIs. Previously such a time was parsed in UTC timezone. Add `Z` to the end of the time string if the old behaviour is preferred. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6721	2024-09-26 12:49:35 +02:00
Aliaksandr Valialkin	7c86835f3c	docs/VictoriaLogs/CHANGELOG.md: typo fix: itentifying -> identifying	2024-09-26 09:41:30 +02:00
Zhu Jiekun	7185fe012b	feature: [victorialogs] drop logs without non-empty _msg field (#7056) ### Describe Your Changes VictoriaLogs allows logs where the `_msg` field is missing or empty. This leads to incorrect search results. See: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6785 This pull request searches for a non-empty `_msg` field before the log entry is added to `LogRows`. New counter `vl_rows_dropped_total{reason="msg_not_exist"}` is introduced. Example log output: ``` 2024-09-23T02:33:19.719Z warn app/vlinsert/insertutils/common_params.go:189 dropping log line without _msg field; [{@timestamp 2024-09-18T13:42:16.600000000Z} {Attributes.array.attribute ["many","values"]} {Attributes.boolean.attribute true} {Attributes.double.attribute 637.704} {Attributes.int.attribute 10} {Attributes.map.attribute.some.map.key some value} {Attributes.string.attribute some string} {Body Example ddddddddddlog record} {Resource.service.name my.service} {Scope.my.scope.attribute some scope attribute} {Scope.name my.library} {Scope.version 1.0.0} {SeverityNumber 10} {SeverityText Information} {SpanId eee19b7ec3c1b174} {TraceFlags 0} {TraceId 5b8efff798038103d269b633813fc60c}] ``` ### Checklist The following checks are mandatory: - [x] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). - [ ] Benchmark for potential performance loss. --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-09-26 09:35:28 +02:00
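
Commit b4b79a4961 in the list above fixes a data race by copying the s.partitions slice under s.partitionsLock before a query runs. The general pattern might look like the minimal sketch below. This is not the actual lib/logstorage code: the `partition` and `storage` types and the `addPartition`, `dropOldest` and `snapshotPartitions` names are hypothetical stand-ins chosen for illustration.

```go
package main

import (
	"fmt"
	"sync"
)

// partition is a hypothetical stand-in for a storage partition.
type partition struct {
	name string
}

// storage is a hypothetical, simplified stand-in for the real storage type.
type storage struct {
	partitionsLock sync.Mutex
	partitions     []*partition
}

// addPartition registers a new partition under the lock.
func (s *storage) addPartition(p *partition) {
	s.partitionsLock.Lock()
	s.partitions = append(s.partitions, p)
	s.partitionsLock.Unlock()
}

// dropOldest removes the first partition in place.
// This overwrites elements of the shared backing array, so readers
// holding the original slice header would observe the change.
func (s *storage) dropOldest() {
	s.partitionsLock.Lock()
	if len(s.partitions) > 0 {
		s.partitions = append(s.partitions[:0], s.partitions[1:]...)
	}
	s.partitionsLock.Unlock()
}

// snapshotPartitions returns a private copy of the partitions slice.
// The copy is made while holding the lock, so later registrations or
// drops cannot change what the caller iterates over.
func (s *storage) snapshotPartitions() []*partition {
	s.partitionsLock.Lock()
	defer s.partitionsLock.Unlock()
	return append([]*partition(nil), s.partitions...)
}

func main() {
	s := &storage{}
	s.addPartition(&partition{name: "2024-10-12"})
	s.addPartition(&partition{name: "2024-10-13"})

	// The query takes its snapshot first.
	snapshot := s.snapshotPartitions()

	// A partition is dropped while the query is still running.
	s.dropOldest()

	// The query still sees both partitions it started with.
	for _, p := range snapshot {
		fmt.Println(p.name)
	}
}
```

The copy costs one allocation per query, but the query can then iterate without holding the lock for its whole duration.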

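Commit 55eb321f77 in the list above notes that a slice obtained from a buffer pool may contain arbitrary values and must be zeroed before it is used for counting hits. The sketch below illustrates that idea only. The `uint64sPool` type and the `countHits` function are hypothetical; they are not the `encoding.GetUint64s()` API or the real `uniq`/`top` pipe code.

```go
package main

import (
	"fmt"
	"sync"
)

// uint64sPool is a hypothetical pool of reusable []uint64 buffers.
type uint64sPool struct {
	mu   sync.Mutex
	bufs [][]uint64
}

// get returns a slice of length n. If a pooled buffer is reused,
// it still holds whatever values the previous user left in it.
func (p *uint64sPool) get(n int) []uint64 {
	p.mu.Lock()
	defer p.mu.Unlock()
	if k := len(p.bufs); k > 0 {
		buf := p.bufs[k-1]
		p.bufs = p.bufs[:k-1]
		if cap(buf) >= n {
			return buf[:n]
		}
	}
	return make([]uint64, n)
}

// put returns buf to the pool without clearing it.
func (p *uint64sPool) put(buf []uint64) {
	p.mu.Lock()
	p.bufs = append(p.bufs, buf)
	p.mu.Unlock()
}

// countHits counts how many rows refer to each dictionary entry.
// dictIdxs holds one dictionary index per row.
func countHits(p *uint64sPool, dictIdxs []int, dictLen int) []uint64 {
	hits := p.get(dictLen)
	// Reset stale values left by a previous user of the buffer.
	for i := range hits {
		hits[i] = 0
	}
	for _, idx := range dictIdxs {
		hits[idx]++
	}
	return hits
}

func main() {
	p := &uint64sPool{}

	first := countHits(p, []int{0, 0, 1}, 3)
	fmt.Println(first) // [2 1 0]
	p.put(first)

	// The buffer is reused here. Without the reset loop the result
	// would be polluted by the counts from the first call.
	second := countHits(p, []int{2}, 3)
	fmt.Println(second) // [0 0 1]
}
```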

190 commits