Commit graph

15 commits

Author SHA1 Message Date
Aliaksandr Valialkin
001f8969f8
wip 2024-06-03 14:01:05 +02:00
Aliaksandr Valialkin
639b3091b5
wip 2024-05-15 15:46:42 +02:00
Aliaksandr Valialkin
435506b223
wip 2024-05-13 23:44:44 +02:00
Aliaksandr Valialkin
900e558678
wip 2024-05-13 16:45:34 +02:00
Aliaksandr Valialkin
ecd51e48ec
wip 2024-05-13 14:00:33 +02:00
Aliaksandr Valialkin
1e7090cc8e
wip 2024-05-11 01:34:41 +02:00
Aliaksandr Valialkin
3bc01a1ad6
wip 2024-05-11 01:10:07 +02:00
Aliaksandr Valialkin
4c457cf20f
wip 2024-05-11 00:39:12 +02:00
Aliaksandr Valialkin
68dfaa1449
wip 2024-05-10 04:57:40 +02:00
Aliaksandr Valialkin
57afedbfe8
wip 2024-05-10 04:52:38 +02:00
Aliaksandr Valialkin
bc7dfd5ba4
wip 2024-05-05 00:28:01 +02:00
Aliaksandr Valialkin
7fd9d31e90
wip 2024-05-03 12:10:45 +02:00
Aliaksandr Valialkin
77e2d0be60
wip 2024-05-03 11:15:09 +02:00
Aliaksandr Valialkin
8dce4eb189
lib/logstorage: follow-up for 94627113db
- Move uniqueFields from rows to blockStreamMerger struct.
  This allows localizing all the references to uniqueFields inside blockStreamMerger.mustWriteBlock(),
  which should improve readability and maintainability of the code.

- Remove logging of the event when blocks cannot be merged because they contain more than maxColumnsPerBlock columns,
  since the log message didn't suggest a solution for the issue with too many columns.
  I couldn't figure out a proper solution that would be helpful to the end user,
  so I decided to remove the logging until we find one.

This commit also contains the following additional changes:

- It truncates field names longer than 128 chars during logs ingestion.
  This should prevent ingestion of bogus field names.
  It should also prevent overly large columnsHeader blocks,
  which could negatively affect search query performance,
  since columnsHeader is read on every scan of the corresponding data block.

- It limits the maximum length of a const column value to 256.
  Longer values are stored in ordinary columns.
  This helps limit the size of columnsHeader blocks
  and improves search query performance by avoiding
  reading overly long const columns on every scan of the corresponding data block.

- It deduplicates columns with identical names during data ingestion
  and background merging. Previously it was possible to pass columns with duplicate names
  to block.mustInitFromRows(), and they were stored as is in the block.
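
The three ingestion-time rules listed above can be illustrated with a minimal sketch. This is not the code from lib/logstorage: the Field type, the constant names and the helper functions are hypothetical, and only the limits (128 and 256) come from the commit message. The sketch assumes that lengths are measured in bytes and that the first field wins when names collide; the commit message specifies neither.

```go
package main

import "fmt"

// Limits quoted in the commit message. The constant names are hypothetical.
const (
	maxFieldNameLen        = 128
	maxConstColumnValueLen = 256
)

// Field is a hypothetical name/value pair for a single log field.
type Field struct {
	Name  string
	Value string
}

// truncateFieldName cuts names longer than maxFieldNameLen.
// Assumption: the length is measured in bytes, so a multi-byte
// UTF-8 character at the boundary may be split.
func truncateFieldName(name string) string {
	if len(name) > maxFieldNameLen {
		return name[:maxFieldNameLen]
	}
	return name
}

// dedupFields truncates field names and then drops fields whose name
// has already been seen. Assumption: the first occurrence is kept.
func dedupFields(fields []Field) []Field {
	seen := make(map[string]struct{}, len(fields))
	result := make([]Field, 0, len(fields))
	for _, f := range fields {
		f.Name = truncateFieldName(f.Name)
		if _, ok := seen[f.Name]; ok {
			continue
		}
		seen[f.Name] = struct{}{}
		result = append(result, f)
	}
	return result
}

// canStoreAsConstColumn reports whether a value shared by all the rows
// in a block is short enough to be stored as a const column.
// Longer values would go to an ordinary column instead.
func canStoreAsConstColumn(value string) bool {
	return len(value) <= maxConstColumnValueLen
}

func main() {
	fields := []Field{{"level", "info"}, {"level", "warn"}, {"msg", "started"}}
	fmt.Println(dedupFields(fields))
	fmt.Println(canStoreAsConstColumn("localhost"))
}
```

Truncating before deduplicating means that two long names sharing the same 128-byte prefix collapse into one field. Whether the real code orders the two steps this way is not stated in the commit message.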

Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4762
Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/4969
2023-10-02 19:19:08 +02:00
Aliaksandr Valialkin
87b66db47d
app/victoria-logs: initial code release 2023-06-19 22:55:12 -07:00