github-mirrors/VictoriaMetrics

mirror of https://github.com/VictoriaMetrics/VictoriaMetrics.git synced 2024-11-21 14:44:00 +00:00

Author	SHA1	Message	Date
Aliaksandr Valialkin	102e9d4f4e	lib/logstorage: make sure that the number of output (bloom, values) shards is bigger than zero. If the number of output (bloom, values) shards is zero, then this may lead to panic as shown at https://github.com/VictoriaMetrics/VictoriaMetrics/issues/7391 . This panic may happen when parts with only constant fields with distinct values are merged into output part with non-constant fields, which should be written to (bloom, values) shards.	2024-10-30 13:39:28 +01:00
Aliaksandr Valialkin	c963d7d10d	docs/VictoriaLogs/CHANGELOG.md: remove unnededed `with` prefix in front of `rank` at `top` pipe example	2024-10-29 18:27:40 +01:00
Aliaksandr Valialkin	12223cf5d0	docs/VictoriaLogs/CHANGELOG.md: cut v0.38.0 release	2024-10-29 18:08:36 +01:00
Aliaksandr Valialkin	3c06d083ea	lib/logstorage: add an ability to return rank from `top` pipe results	2024-10-29 16:44:45 +01:00
Aliaksandr Valialkin	7a62eefa34	lib/logstorage: dynamically adjust the number of (bloom, values) shards in a part depending on the number of non-const columns This allows reducing the amounts of data, which must be read during queries over logs with big number of fields (aka "wide events"). This, in turn, improves query performance when the data, which needs to be scanned during the query, doesn't fit OS page cache.	2024-10-29 16:44:45 +01:00
Aliaksandr Valialkin	8d968acd0a	lib/logstorage: avoid reading columnsHeader data when `field_values` pipe is applied directly to log filters This improves performance of `field_values` pipe when it is applied to large number of data blocks. This also improves performance of /select/logsql/field_values HTTP API.	2024-10-29 16:44:44 +01:00
Andrii Chubatiuk	7e60afb6fc	app/vlinsert: adds journald ingestion support This commit allows to ingest logs with journald format. https://www.freedesktop.org/software/systemd/man/latest/systemd-journal-remote.service.html related issue: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4618	2024-10-27 20:36:33 +01:00
Yury Molodov	dd89745a34	vmui/logs: fix query and limit update issue (#7294 ) ### Describe Your Changes Fixes issues with incorrect updating of query and limit fields, and resolves the problem where the display tab resets. Related issue: #7279 and #7290 ### Checklist The following checks are mandatory: - [ ] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2024-10-25 09:32:20 +02:00
Aliaksandr Valialkin	51cd3ba02b	docs/VictoriaLogs/CHANGELOG.md: cut v0.37.0-victorialogs release	2024-10-18 02:32:37 +02:00
Yury Molodov	423df09d7d	vmui/logs: add ability to hide hits chart (#7206 ) ### Describe Your Changes Added ability to hide the hits chart - Users can now hide or show the hits chart by clicking the "eye" icon located in the upper-right corner of the chart. - When the chart is hidden, it will stop sending requests to `/select/logsql/hits`. - Upon displaying the chart again, it will automatically refresh. If a relative time range is set, the chart will update according to the time period of the logs currently being displayed. Hits chart visible: ![image](https://github.com/user-attachments/assets/577e877b-6417-4b83-8d84-c55e3d39864a) Hits chart hidden: ![image](https://github.com/user-attachments/assets/068b1143-d140-4d72-8d65-663900124f32) Related issue: #7117 ### Checklist The following checks are mandatory: - [ ] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-10-18 02:30:56 +02:00
Yury Molodov	36a86c3aaf	vmui/logs: fix display of hits chart (#7167 ) ### Describe Your Changes Fixed the display of hits chart in VictoriaLogs. See #7133 ### Checklist The following checks are mandatory: - [ ] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/).	2024-10-18 02:28:23 +02:00
Aliaksandr Valialkin	064b9a6314	docs/VictoriaLogs/CHANGELOG.md: remove "index.html" trailer from the link to docs for the sake of consistency with other links to docs This is a follow-up for `3538869942` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/7252	2024-10-18 02:26:01 +02:00
Aliaksandr Valialkin	2023f017b1	lib/logstorage: optimize performance for queries, which select all the log fields for logs containing hundreds of log fields (aka "wide events") Unpack the full columnsHeader block instead of unpacking meta-information per each individual column when the query, which selects all the columns, is executed. This improves performance when scanning logs with big number of fields.	2024-10-18 02:22:42 +02:00
Aliaksandr Valialkin	78c6fb0883	lib/logstorage: improve performance of `top` and `field_values` pipes on systems with many CPU cores - Parallelize mering of per-CPU results. - Parallelize writing the results to the next pipe.	2024-10-18 02:22:42 +02:00
Aliaksandr Valialkin	c4b2fdff70	lib/logstorage: optimize 'stats by(...)' calculations for by(...) fields with millions of unique values on multi-CPU systems - Parallelize merging of per-CPU `stats by(...)` result shards. - Parallelize writing `stats by(...)` results to the next pipe.	2024-10-18 02:22:41 +02:00
Aliaksandr Valialkin	192c07f76a	lib/logstorage: optimize performance for `top` pipe when it is applied to a field with millions of unique values - Use parallel merge of per-CPU shard results. This improves merge performance on multi-CPU systems. - Use topN heap sort of per-shard results. This improves performance when results contain millions of entries.	2024-10-18 02:21:56 +02:00
Andrii Chubatiuk	3538869942	vlogs: added basic alerts (#7252 ) ### Describe Your Changes Added basic VLogs alerts Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2024-10-17 11:33:06 +02:00
Aliaksandr Valialkin	a72e1155b9	docs/VictoriaLogs/CHANGELOG.md: add missing part of the sentence	2024-10-16 20:22:19 +02:00
Aliaksandr Valialkin	677f1cd1be	docs/VictoriaLogs/CHANGELOG.md: typo fix: refer the correct endpoints for stats results	2024-10-16 20:19:22 +02:00
Aliaksandr Valialkin	91987763d4	docs/VictoriaLogs/CHANGELOG.md: cut v0.36.0-victorialogs release	2024-10-16 20:00:35 +02:00
Aliaksandr Valialkin	508e498ae3	lib/logstorage: follow-up for `72941eac36` - Allow dropping metrics if the query result contains at least a single metric. - Allow copying by(...) fields. - Disallow overriding by(...) fields via `math` pipe. - Allow using `format` pipe in stats query. This is useful for constructing some labels from the existing by(...) fields. - Add more tests. - Remove the check for time range in the query filter according to https://github.com/VictoriaMetrics/VictoriaMetrics/pull/7254/files#r1803405826 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/7254	2024-10-16 19:43:52 +02:00
Hui Wang	72941eac36	victorialogs: add more checks for stats query APIs (#7254 ) 1. Verify if field in [fields pipe](https://docs.victoriametrics.com/victorialogs/logsql/#fields-pipe) exists. If not, it generates a metric with illegal float value "" for prometheus metrics protocol. 2. check if multiple time range filters produce conflicted query time range, for instance: ``` query: _time: 5m \| stats count(), start:2024-10-08T10:00:00.806Z, end: 2024-10-08T12:00:00.806Z, time: 2024-10-10T10:02:59.806Z ``` must give no result due to invalid final time range. --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-10-16 19:25:43 +02:00
Aliaksandr Valialkin	202eb429a7	lib/logstorage: refactor storage format to be more efficient for querying wide events It has been appeared that VictoriaLogs is frequently used for collecting logs with tens of fields. For example, standard Kuberntes setup on top of Filebeat generates more than 20 fields per each log. Such logs are also known as "wide events". The previous storage format was optimized for logs with a few fields. When at least a single field was referenced in the query, then the all the meta-information about all the log fields was unpacked and parsed per each scanned block during the query. This could require a lot of additional disk IO and CPU time when logs contain many fields. Resolve this issue by providing an (field -> metainfo_offset) index per each field in every data block. This index allows reading and extracting only the needed metainfo for fields used in the query. This index is stored in columnsHeaderIndexFilename ( columns_header_index.bin ). This allows increasing performance for queries over wide events by 10x and more. Another issue was that the data for bloom filters and field values across all the log fields except of _msg was intermixed in two files - fieldBloomFilename ( field_bloom.bin ) and fieldValuesFilename ( field_values.bin ). This could result in huge disk read IO overhead when some small field was referred in the query, since the Operating System usually reads more data than requested. It reads the data from disk in at least 4KiB blocks (usually the block size is much bigger in the range 64KiB - 512KiB). So, if 512-byte bloom filter or values' block is read from the file, then the Operating System reads up to 512KiB of data from disk, which results in 1000x disk read IO overhead. This overhead isn't visible for recently accessed data, since this data is usually stored in RAM (aka Operating System page cache), but this overhead may become very annoying when performing the query over large volumes of data which isn't present in OS page cache. The solution for this issue is to split bloom filters and field values across multiple shards. This reduces the worst-case disk read IO overhead by at least Nx where N is the number of shards, while the disk read IO overhead is completely removed in best case when the number of columns doesn't exceed N. Currently the number of shards is 8 - see bloomValuesShardsCount . This solution increases performance for queries over large volumes of newly ingested data by up to 1000x. The new storage format is versioned as v1, while the old storage format is version as v0. It is stored in the partHeader.FormatVersion. Parts with the old storage format are converted into parts with the new storage format during background merge. It is possible to force merge by querying /internal/force_merge HTTP endpoint - see https://docs.victoriametrics.com/victorialogs/#forced-merge .	2024-10-16 17:35:07 +02:00
Yury Molodov	86029de0d4	vmui: fix alert display with long messages (#7228 ) ### Describe Your Changes Fix `Alert` component to prevent it from overflowing the screen when displaying long messages. Related issue: #7207 ### Checklist The following checks are mandatory: - [x] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). --------- Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2024-10-15 16:35:57 +02:00
Yury Molodov	6c9772b101	vmui: add the ability to cancel running queries (#7204 ) ### Describe Your Changes - Added functionality to cancel running queries on the Explore Logs and Query pages. - The loader was changed from a spinner to a top bar within the block. This still indicates loading, but solves the issue of the spinner "flickering," especially during graph dragging. Related issue: #7097 https://github.com/user-attachments/assets/98e59aeb-905b-4b9d-bbb2-688223b22a82 ### Checklist The following checks are mandatory: - [ ] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/).	2024-10-15 14:48:40 +02:00
Aliaksandr Valialkin	bac193e50b	app/vlselect: do not show empty fields in query results Empty fields are treated as non-existing fields by VictoriaLogs data model. So there is no sense in returning empty fields in query results, since they may mislead and confuse users.	2024-10-14 23:43:58 +02:00
Aliaksandr Valialkin	3c73dbbacc	app/vlstorage: add support for forced merge via /internal/force_merge HTTP endpoint	2024-10-13 22:20:31 +02:00
Aliaksandr Valialkin	b4b79a4961	lib/logstorage: make a copy of s.partitions slice when performing queries over the selected partitions s.partitions can be changed when new partition is registered or when old partition is dropped. This could lead to data races and panics when s.partitions slice is accessed by concurrently executed queries. The fix is to make a copy of the selected partitions under s.partitionsLock before performing the query.	2024-10-13 22:14:34 +02:00
Aliaksandr Valialkin	867f671cc4	lib/logstorage: make sure that bs.br is non-nil before checking br.bs.bsw.bh.rowsCount there br.bs may be nil when br contains the block with additional filters applied during pipe calculations. For example, `* \| count() if (error) errors`.	2024-10-12 20:51:29 +02:00
Aliaksandr Valialkin	252aa792f7	docs/VictoriaLogs: cut v0.35.0 release	2024-10-09 15:55:20 +02:00
Aliaksandr Valialkin	ad5d8097da	app/vlogscli: add -accountID and -projectID command-line flags for querying the given tenants	2024-10-09 12:56:49 +02:00
Aliaksandr Valialkin	e31625e0b2	app/vlogscli: add support for live tailing	2024-10-09 12:30:17 +02:00
Aliaksandr Valialkin	6878982c93	docs/VictoriaLogs/CHANGELOG.md: cut v0.34.0 release	2024-10-08 12:21:19 +02:00
Aliaksandr Valialkin	492190885d	app/vlogscli: add ability to display query results in logfmt, single-line and multi-line json modes	2024-10-07 12:20:06 +02:00
Aliaksandr Valialkin	daad96b3a5	app/vlogscli: return back sorting result fields by name This simplifies locating the needed field when the number of fields per each returned result is big	2024-10-07 10:41:48 +02:00
Aliaksandr Valialkin	596e4de248	app/vlogscli: preserve the original order of fields in the displayed responses	2024-10-05 21:27:32 +02:00
Aliaksandr Valialkin	364f084b43	lib/logstorage: add `len` pipe for calculating byte length of log field values	2024-10-03 18:21:10 +02:00
Aliaksandr Valialkin	234c81754e	docs/VictoriaLogs/CHANGELOG.md: cut v0.33.0-victorialogs release	2024-10-01 13:42:18 +02:00
Aliaksandr Valialkin	a350be48b6	lib/logstorage: do not count dictionary values which have no matching logs in `count_uniq` stats function Create blockResultColumn.forEachDictValue* helper functions for visiting matching dictionary values. These helper functions should prevent from counting dictionary values without matching logs in the future. This is a follow-up for `0c0f013a60` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/7152	2024-10-01 13:34:45 +02:00
Aliaksandr Valialkin	630211cfed	app/vlogscli: add interactive command-line tool for querying VictoriaLogs	2024-10-01 12:23:07 +02:00
Aliaksandr Valialkin	82482fca4b	docs/VictoriaLogs/CHANGELOG.md: cut v0.32.1-victorialogs release	2024-09-30 14:31:17 +02:00
Aliaksandr Valialkin	0c0f013a60	lib/logstorage: skip values with zero hits for 'uniq', 'top' and 'field_values' pipes See https://github.com/VictoriaMetrics/victorialogs-datasource/issues/72#issuecomment-2352078483	2024-09-30 14:15:07 +02:00
Aliaksandr Valialkin	45cfb6b526	docs/VictoriaLogs/CHANGELOG.md: cut v0.32.0-victorialogs	2024-09-29 14:47:31 +02:00
Aliaksandr Valialkin	55eb321f77	lib/logstorage: clear hits slice obtained from encoding.GetUint64s() before updating it with hits for valueTypeDict column encoding.GetUint64s() returns uninitialized slice, which may contain arbitrary values. So values in this slice must be reset to zero before using it for counting hits in `uniq` and `top` pipes.	2024-09-29 10:29:13 +02:00
Aliaksandr Valialkin	0b91452ca4	lib/logstorage: add non-empty `if (...)` condition to automatically generated result names in `stats` pipe This allows executing queries with `stats` pipe, which calculate multiple results with the same functions, but with different `if (...)` conditions. For example: _time:5m \| count(), count() if (error) Previously such queries couldn't be executed becasue automatically generated name for the second result didn't include `if (error)`, so names for both results were identical - `count(*)`.	2024-09-29 09:51:28 +02:00
Aliaksandr Valialkin	8772aea24b	lib/logstorage: support `order` alias for `sort` pipe Now the following queries are equivalents: _time:5s \| sort by (_time) _time:5s \| order by (_time) This is needed for convenience, since `order by` is commonly used in other query languages such as SQL.	2024-09-29 09:51:27 +02:00
Aliaksandr Valialkin	806bc2ac58	app/vlinsert: support unix timestamps in seconds and milliseconds in JSON stream data ingestion API	2024-09-28 21:56:50 +02:00
Aliaksandr Valialkin	7d7d7c03bc	app/vlinsert: accept unix timestamp in seconds additionally to milliseconds at ElasticSearch bulk API Timestamps in seconds are sometimes used for data ingestion via ElasticSearch bulk API	2024-09-28 21:19:54 +02:00
Aliaksandr Valialkin	58c69386c7	docs/VictoriaLogs/CHANGELOG.md: cut v0.31.0-victorialogs release	2024-09-27 13:54:17 +02:00
Yury Molodov	8657d03433	vmui/logs: improve graph usability (#7025 ) ### Describe Your Changes - Show the time range in the tooltip when hovering over staircase graphs. - Use bolder lines for staircase graphs. - Increase the number of steps on the staircase graph to 100. - Reduce the maximum width of the tooltip to 1/3 of the screen. - Insert only the label name under the cursor into the query input field when `Ctrl`-clicking the line legend. See [this comment](https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6545#issuecomment-2336805237). ### Checklist The following checks are mandatory: - [ ] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-09-27 13:19:46 +02:00

1 2 3 4

200 commits