github-mirrors/VictoriaMetrics

mirror of https://github.com/VictoriaMetrics/VictoriaMetrics.git synced 2024-11-21 14:44:00 +00:00

Author	SHA1	Message	Date
Roman Khavronenko	89b5a6a4d5	vmctl: mention replicationFactor during migration (#4633 ) Addresses https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4624 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-07-14 10:36:46 -07:00
dependabot[bot]	a15a66ee89	build(deps): bump tough-cookie in /app/vmui/packages/vmui (#4603 ) Bumps [tough-cookie](https://github.com/salesforce/tough-cookie) from 4.1.2 to 4.1.3. - [Release notes](https://github.com/salesforce/tough-cookie/releases) - [Changelog](https://github.com/salesforce/tough-cookie/blob/master/CHANGELOG.md) - [Commits](https://github.com/salesforce/tough-cookie/compare/v4.1.2...v4.1.3) --- updated-dependencies: - dependency-name: tough-cookie dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2023-07-13 22:22:53 -07:00
Aliaksandr Valialkin	e1cf962bad	lib/storage: switch from global to per-day index for `MetricName -> TSID` mapping Previously all the newly ingested time series were registered in global `MetricName -> TSID` index. This index was used during data ingestion for locating the TSID (internal series id) for the given canonical metric name (the canonical metric name consists of metric name plus all its labels sorted by label names). The `MetricName -> TSID` index is stored on disk in order to make sure that the data isn't lost on VictoriaMetrics restart or unclean shutdown. The lookup in this index is relatively slow, since VictoriaMetrics needs to read the corresponding data block from disk, unpack it, put the unpacked block into `indexdb/dataBlocks` cache, and then search for the given `MetricName -> TSID` entry there. So VictoriaMetrics uses in-memory cache for speeding up the lookup for active time series. This cache is named `storage/tsid`. If this cache capacity is enough for all the currently ingested active time series, then VictoriaMetrics works fast, since it doesn't need to read the data from disk. VictoriaMetrics starts reading data from `MetricName -> TSID` on-disk index in the following cases: - If `storage/tsid` cache capacity isn't enough for active time series. Then just increase available memory for VictoriaMetrics or reduce the number of active time series ingested into VictoriaMetrics. - If new time series is ingested into VictoriaMetrics. In this case it cannot find the needed entry in the `storage/tsid` cache, so it needs to consult on-disk `MetricName -> TSID` index, since it doesn't know that the index has no the corresponding entry too. This is a typical event under high churn rate, when old time series are constantly substituted with new time series. Reading the data from `MetricName -> TSID` index is slow, so inserts, which lead to reading this index, are counted as slow inserts, and they can be monitored via `vm_slow_row_inserts_total` metric exposed by VictoriaMetrics. Prior to this commit the `MetricName -> TSID` index was global, e.g. it contained entries sorted by `MetricName` for all the time series ever ingested into VictoriaMetrics during the configured -retentionPeriod. This index can become very large under high churn rate and long retention. VictoriaMetrics caches data from this index in `indexdb/dataBlocks` in-memory cache for speeding up index lookups. The `indexdb/dataBlocks` cache may occupy significant share of available memory for storing recently accessed blocks at `MetricName -> TSID` index when searching for newly ingested time series. This commit switches from global `MetricName -> TSID` index to per-day index. This allows significantly reducing the amounts of data, which needs to be cached in `indexdb/dataBlocks`, since now VictoriaMetrics consults only the index for the current day when new time series is ingested into it. The downside of this change is increased indexdb size on disk for workloads without high churn rate, e.g. with static time series, which do no change over time, since now VictoriaMetrics needs to store identical `MetricName -> TSID` entries for static time series for every day. This change removes an optimization for reducing CPU and disk IO spikes at indexdb rotation, since it didn't work correctly - see https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1401 . At the same time the change fixes the issue, which could result in lost access to time series, which stop receving new samples during the first hour after indexdb rotation - see https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2698 The issue with the increased CPU and disk IO usage during indexdb rotation will be addressed in a separate commit according to https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1401#issuecomment-1553488685 This is a follow-up for `1f28b46ae9`	2023-07-13 17:03:50 -07:00
Aliaksandr Valialkin	650af7c5ca	app/vmalert: silence golagci-lint at TestAlertingRule_Template Add a break if gotAlert is nil This removes the following golangci-lint warning: app/vmalert/alerting_test.go:868:8: SA5011(related information): this check suggests that the pointer can be nil (staticcheck) if gotAlert == nil { ^	2023-07-13 12:16:00 -07:00
Dmytro Kozlov	f31ac064f9	app/vmctl: fix panic `--remote-read-filter-time-start` flag not defined (#4605 ) * app/vmctl: fix panic `--remote-read-filter-time-start` flag not defined * app/vmctl: update CHANGELOG.md --------- Co-authored-by: Nikolay <nik@victoriametrics.com>	2023-07-13 12:13:21 -07:00
Roman Khavronenko	fdccb56620	vmalert: check for negative offset for missed rounds (#4628 ) It could happen for low evaluation intervals and irregular delays during execution that evaluation time would get a negative offset. This could result into cumulative discrepancy between the actual time and evaluation time for rules. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-07-13 12:05:52 -07:00
Aliaksandr Valialkin	b07a1c85b9	all: update Go builder from 1.20.5 to 1.20.6 See https://github.com/golang/go/issues?q=milestone%3AGo1.20.6+label%3ACherryPickApproved	2023-07-12 01:00:24 -07:00
Dmytro Kozlov	5c4ca4aea8	app/vmctl: remove undefined flag from the documentation. See: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4552 . (#4606 )	2023-07-10 15:01:54 -07:00
Aliaksandr Valialkin	f65153018b	app/{vmselect,vlselect}: run `make vmui-update vmui-logs-update`	2023-07-09 12:44:04 -07:00
Zakhar Bessarab	ddd918b93c	docs: make `httpAuth.` flags description less ambiguous (#4588 ) docs: make `httpAuth.` flags description less ambiguous Currently, it may confuse users whether `httpAuth.` flags are used by HTTP client or server configuration(see https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4586 for example). Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * docs: fix a typo Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> --------- Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2023-07-09 12:36:14 -07:00
Haleygo	ef8e3eb9b3	vmselect: fix result in Prometheus query when time is small (#4578 ) vmselect: fix result in Prometheus query when time is small Co-authored-by: Roman Khavronenko <roman@victoriametrics.com>	2023-07-09 12:33:29 -07:00
Aliaksandr Valialkin	e1a2404db5	app/vmselect/netstorage: follow-up after `173ccf4333` - Clarify docs about -replicationFactor command-line flag at vmselect - Clarify description for -replicationFactor and -search.skipSlowReplicas command-line flags - Fix the logic for returning responses if -search.skipSlowReplicas command-line flag is enabled. The logic was broken in the `173ccf4333`, so it could return responses only if some of vmstorage nodes return error, while it should return when query results are successfully collected from more than (len(storageNodes) - replicationFactor) vmstorage nodes. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1207 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/711	2023-07-09 11:58:22 -07:00
Haleygo	3c2308fd52	vmalert:fix query request using rfc3339 format (#4577 ) vmalert: consistently use time.RFC3339 format for time in queries Co-authored-by: hagen1778 <roman@victoriametrics.com>	2023-07-09 11:03:10 -07:00
Haleygo	14e242d0b9	vmselect: fix result collect count (#4599 )	2023-07-08 08:21:27 +02:00
Roman Khavronenko	173ccf4333	vmselect: introduce `search.skipSlowReplicas` cmd-line flag (#4538 ) * vmselect: introduce `search.skipSlowReplicas` cmd-line flag vmselect has two logical conditions during request processing when `-replicationFactor` cmd-line flag is set: 1. If at least `len(storageNodes) - replicationFactor` responded, it could skip waiting for the rest of nodes to respond. This could lead to problems described here https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1207. 2. Mark response as partial if less than `len(storageNodes) - replicationFactor` responded without an error. The P1 showed itself error-prone and became the main reason why `-replicationFactor` wasn't recommended to use at vmselect level. However, this optimization could be still very useful in situations when there are slow and fast replicas in cluster. But P2 remains viable and important conditionless. Hiding P1 behind the feature-flag `search.skipSlowReplicas` should make `-replicationFactor` flag usable again. And let users choose whether they want P1 to be respected. Related issues https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1207 https://github.com/VictoriaMetrics/VictoriaMetrics/issues/711 Signed-off-by: hagen1778 <roman@victoriametrics.com> * docs: update changelog Signed-off-by: hagen1778 <roman@victoriametrics.com> --------- Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-07-07 11:50:26 +02:00
Roman Khavronenko	109e55f865	vmalert: allow disabling of `step` param attached to instant queries (#4574 ) vmalert: allow disabling of `step` param attached to instant queries This might be useful for using vmalert with datasources that to not support this param, unlike VictoriaMetrics. See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4573 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-07-06 23:13:56 -07:00
Aliaksandr Valialkin	0107d78639	docs/vmgateway.md: update `-help` output	2023-07-06 23:07:47 -07:00
Aliaksandr Valialkin	ee4280d132	docs/vmbackupmanager.md: update `-help` output	2023-07-06 22:57:31 -07:00
Aliaksandr Valialkin	921d8b36b5	docs/vmrestore.md: update `-help` output	2023-07-06 22:55:26 -07:00
Aliaksandr Valialkin	e1993dadc2	docs/vmbackup.md: update `-help` output	2023-07-06 22:54:15 -07:00
Aliaksandr Valialkin	e35abdd2e4	docs/vmauth.md: update `-help` output	2023-07-06 22:52:48 -07:00
Aliaksandr Valialkin	316abe550d	docs/vmalert.md: update `-help` output	2023-07-06 22:50:47 -07:00
Aliaksandr Valialkin	b9790515e4	docs/vmagent.md: update `-help` output	2023-07-06 22:48:23 -07:00
Roman Khavronenko	bd5abb74fd	vmctl: interrupt explore procedure in influx mode if no numeric fields were found (#4576 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-07-06 22:21:18 -07:00
Sergey	747c39d714	security: update base Alpine image to 3.18.2 to avoid security risks (#4571 ) libcrypto3 and libssl3 in Alpine 3.18.0 have versions `3.1.0-r4` which contains CVE-2023-2650: https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2023-2650 Use ALpine image 3.18.2 which contains fixed versions of libssl3 and libcrypto3: 3.1.1-r0 NB: In Openshift these containers are marked as vulnerabilities because of these CVEs.	2023-07-06 22:12:20 -07:00
Dmytro Kozlov	dd412a3757	app/vmalert: show on UI groups error after reload config (#4543 ) show on UI groups error after reload config https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4076 Co-authored-by: hagen1778 <roman@victoriametrics.com>	2023-07-06 22:11:36 -07:00
Zakhar Bessarab	e42f856b56	app/vmagent/remotewrite: fix error message for auth config (#4545 ) Error message will be present for any auth error, but message claims an error is about OAuth2 configuration which is confusing. Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2023-07-06 22:10:13 -07:00
Yury Molodov	8c190ec8fb	vmui: fix app routing issues (#4408 ) The change focuses on rectifying inconsistencies in the navigation behavior of the application and eliminating issues encountered when manually altering the URL. The key updates include: - Refactoring of the routing mechanism to handle all possible routes and their states. - Enhancement of the React Router usage to ensure a smoother navigation experience. - Handling application state when the URL is manually changed.	2023-07-06 21:58:09 -07:00
Roman Khavronenko	cf433c066a	vmauth: expose latency metrics per user (#4525 ) expose `vmauth_user_request_duration_seconds` and `vmauth_unauthorized_user_request_duration_seconds` summary metrics for measuring requests latency per user. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-07-06 21:55:37 -07:00
Roman Khavronenko	8a15397b5c	vmauth: rm ip filters from non-ent config example (#4526 ) It is impossible to run OS vmauth with the provided config. The example of using ip filters should be only a part of docs. All other examples should work seamlessly with OS version. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-07-06 21:52:51 -07:00
Haleygo	9e49a9e924	vmalert: add `vmalert_remotewrite_sent_duration_seconds_total` metric (#4517 ) add `vmalert_remotewrite_sent_duration_seconds_total` metric	2023-07-06 21:51:31 -07:00
Roman Khavronenko	a677509b38	vmalert: make linter happy (#4509 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-07-06 21:46:22 -07:00
Roman Khavronenko	d5e7ea5ef3	vmalert: update retry policy for pushing data to `-remoteWrite.url` (#4504 ) By default, vmalert will make multiple retry attempts with exponential delay. The total time spent during retry attempts shouldn't exceed `-remoteWrite.retryMaxTime` (default is 30s). When retry time is exceeded vmalert drops the data dedicated for `-remoteWrite.url`. Before, vmalert dropped data after 5 retry attempts with 1s delay between attempts (not configurable). See `-remoteWrite.retryMinInterval` and `-remoteWrite.retryMaxTime` cmd-line flags. Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: Nikolay <nik@victoriametrics.com>	2023-07-06 21:44:18 -07:00
Roman Khavronenko	311a81c7b0	vmalert: properly interrupt remotewrite retries on shutdown (#4505 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-07-06 21:43:04 -07:00
Aliaksandr Valialkin	a9eb2409ea	app/vlstorage: export vl_active_merges and vl_merges_total metrics	2023-07-06 21:38:09 -07:00
Aliaksandr Valialkin	352429486a	Revert "app/vlselect/logsql: use buffered writer in order to save syscalls when sending big amounts of data to clients" This reverts commit `c19048dc13`. Reason for revert: it has been appeared that the net/http.ResponseWriter is already buffered, so there in no need in double bufferring	2023-07-06 21:37:38 -07:00
Aliaksandr Valialkin	19870d42c5	app/vlselect/logsql: use buffered writer in order to save syscalls when sending big amounts of data to clients	2023-07-06 21:37:38 -07:00
Aliaksandr Valialkin	33625610c6	app/vmui/Makefile: consistently use tabs instead of spaces in multi-line Makefile rules	2023-07-06 21:37:38 -07:00
Aliaksandr Valialkin	4b10432435	app/vlselect: handle vmui at /select/vmui path instead of /vmui This simplifies routing at auth proxies such as vmauth to vlselect component, which serves VMUI - just route all the requests, which start with /select/, to vlselect.	2023-07-06 21:36:28 -07:00
Aliaksandr Valialkin	08634ae612	app/vlinsert/jsonline: code prettifying	2023-07-06 21:35:55 -07:00
Aliaksandr Valialkin	772852ff4f	app/vlselect/logsql: properly handle the error from ParseLogMessage	2023-07-06 21:33:22 -07:00
Dmytro Kozlov	caf4743e45	app/victoria-logs: remove header control (#4493 )	2023-07-06 21:33:00 -07:00
Alexander Marshalov	db910dd336	removed debug message from jsonlines handler of victorialogs (#4492 ) Signed-off-by: Alexander Marshalov <_@marshalov.org>	2023-07-06 21:32:38 -07:00
dmitryk-dk	058fbbdb16	app/victoria-logs: add vmui dependecies	2023-07-06 21:32:19 -07:00
Yury Molodov	a04a206cd2	vmui: logs explorer (#4484 ) * feat: add a logs page * app/vixtoria-logs: add handlers for vmui * feat: add group logs * feat: add logs build * app/vixtoria-logs: update make file * app/vixtoria-logs: cleanup make * app/vixtoria-logs: fix description * fix: correct url for logs * fix: save display view in query params * fix: change logo for logs build * app/vixtoria-logs: remove dashboards from vlselect * app/vixtoria-logs: enable user --------- Co-authored-by: dmitryk-dk <kozlovdmitriyy@gmail.com>	2023-07-06 21:31:33 -07:00
Alexander Marshalov	d9d759bc90	jsonline support for data ingestion in vlinsert (#4487 ) added json lines / json stream format for ingestion to vlinsert	2023-07-06 21:30:35 -07:00
Aliaksandr Valialkin	efee71986f	app/vlselect/logsql: sort query results by _time if their summary size doesnt exceed -select.maxSortBufferSize	2023-07-06 21:25:00 -07:00
Roman Khavronenko	4e99bf8c9e	docs/vmalert: specify version requirements for new features (#4480 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-07-06 21:24:38 -07:00
Aliaksandr Valialkin	fd6c2dd02e	docs/VictoriaLogs: change the structure of the docs in order to be more maintainable The change is based on https://github.com/VictoriaMetrics/VictoriaMetrics/pull/4477	2023-07-06 21:22:59 -07:00
Aliaksandr Valialkin	e21b3bceab	app/vlinsert/elasticsearch: allow empty lines in Elasticsearch bulk protocol Empty lines may appear there during debugging and custom client implementation	2023-07-06 21:22:22 -07:00

1 2 3 4 5 ...

2636 commits