github-mirrors/VictoriaMetrics

mirror of https://github.com/VictoriaMetrics/VictoriaMetrics.git synced 2024-11-21 14:44:00 +00:00

Author	SHA1	Message	Date
Aliaksandr Valialkin	b212c9d6f5	vendor: run `make vendor-update`	2024-04-04 01:34:44 +03:00
Aliaksandr Valialkin	967d5496cf	app/vmagent: follow-up for `b3b29ba6ac` - Automatically reload changed TLS root CA pointed by -remoteWrite.tlsCAFile command-line flag - Automatically reload changed TLS root CA configured via oauth2.tsl_config.ca_file option at -promscrape.config - Document the change as a feature instead of a bug at docs/CHANGELOG.md - Simplify the code at lib/promauth, which is responsible for reloading changed TLS root CA files. - Simplify the usage of lib/promauth.Config.NewRoundTripper() - now it accepts the base http.Transport instead of a callback, which can change the internal http.Transport. - Reuse the default tls config if lib/promauth.Config doesn't contain tls-specific configs. This should reduce memory usage a bit when tls isn't used for scraping big number of targets. - Do not re-read TLS root CA files on every processed request. Re-read them once per second. This should reduce CPU usage when scraping big number of targets over https. - Do not store cert.pem and key.pem files in TestTLSConfigWithCertificatesFilesUpdate, since they can be loaded from byte slices via crypto/tls.X509KeyPair(). - Remove obsolete comparisons of string representations for authConfig and proxyAuthConfig at areEqualScrapeConfigs(). Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5725 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5526 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2171	2024-04-04 01:27:35 +03:00
Aliaksandr Valialkin	b958fb1e76	docs/CHANGELOG.md: add - in front of -logInvalidAuthTokens command-line flag in order to be consistent with command-line flag naming Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6029	2024-04-03 20:04:09 +03:00
Zakhar Bessarab	a8acf3767a	vmgateway: add an ability to log invalid auth tokens (#743 ) * app/vmgateway: add an ability to log invalid auth tokens This is useful for debugging to make it easier for user to find issues in token contents. See: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6029 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * docs: add info about new vmgateway flag - add changelog entry - add info about logInvalidAuthTokens flag Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * app/vmgateway/filters/auth: improve reject reason visibility Explicitly return a rejection reason for request when "logInvalidAuthTokens" is enabled. Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> --------- Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> Co-authored-by: Nikolay <nik@victoriametrics.com>	2024-04-03 20:02:50 +03:00
Aliaksandr Valialkin	1de6cd4442	app/vmalert: document that -rule.stripFilePath command-line flag is available only in enterprise version of vmalert	2024-04-03 19:56:57 +03:00
Zakhar Bessarab	f80ac120f3	lib/promscrape/config: fix missing timeout for http client (#6063 ) Follow-up for `b3b29ba6` Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2024-04-03 18:18:48 +02:00
Thomas	93c3be2530	chore(docs): fix vmalertmanager typo (#6056 ) Fixes: #6055 Signed-off-by: Thomas Way <thomas@6f.io> Co-authored-by: Alexander Marshalov <_@marshalov.org>	2024-04-03 11:02:30 +02:00
Github Actions	a51a2bc692	Automatic update operator docs from VictoriaMetrics/operator@92cdca3 (#6052 )	2024-04-03 12:02:41 +04:00
Zakhar Bessarab	b3b29ba6ac	lib/{promauth,promscrape}: automatically refresh root CA certificates after changes on disk (#5725 ) * lib/{promauth,promscrape}: automatically refresh root CA certificates after changes on disk Added a custom `http.RoundTripper` implementation which checks for root CA content changes and updates `tls.Config` used by `http.RoundTripper` after detecting CA change. Client certificate changes are not tracked by this implementation since `tls.Config` already supports passing certificate dynamically by overriding `tls.Config.GetClientCertificate`. This change implements dynamic reload of root CA only for streaming client used for scraping. Blocking client (`fasthttp.HostClient`) does not support using custom transport so can't use this implementation. See: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5526 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * lib/promauth/config: update NewRoundTripper API Update API to allow user to update only parameters required for transport. Add warning log when reloading Root CA failed. Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * lib/promauth/config: fix mutex acquire logic Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * lib/promauth/config: replace RWMutex with regular mutex to simplify the code - remove additional mutex used for getRootCABytes - require callee to use mutex - replace RWMutex with regular mutex Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * lib/promauth/config: refactor - hold the mutex lock to avoid round tripper being re-created twice - move recreation logic into separate func to simplify the code Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> --------- Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> Co-authored-by: Nikolay <nik@victoriametrics.com>	2024-04-03 10:01:43 +02:00
Aliaksandr Valialkin	6910e72c99	docs/CHANGELOG.md: typo fix: resonses -> responses	2024-04-03 03:20:18 +03:00
Aliaksandr Valialkin	fb42380ef3	lib/protoparser/opentelemetry: follow-up after `47892b4a4c` - Rename -opentelemetry.sanitizeMetrics command-line flag to more clear -opentelemetry.usePrometheusNaming - Clarify the description of the change at docs/CHANGELOG.md - Rename promrelabel.SanitizeLabelNameParts to more clear promrelabel.SplitMetricNameToTokens - Properly split metric names at '_' char in promerlabel.SplitMetricNameToTokens. - Add tests for various edge cases for Prometheus metric names' normalization according to the code at `b865505850/pkg/translator/prometheus/normalize_name.go` - Extract the code responsible for Prometheus metric names' normalization into a separate file (santize.go) Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6037 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6035	2024-04-03 02:25:29 +03:00
Aliaksandr Valialkin	3de8656551	app/vmagent/remotewrite: follow-up for `166b97b8d0` and `b6bd9a97a3` - Make the configuration more clear by accepting the list of ignored labels during sharding via a dedicated command-line flag - -remoteWrite.shardByURL.ignoreLabels. This prevents from overloading the meaning of -remoteWrite.shardByURL.labels command-line flag. - Removed superfluous memory allocation per each processed sample if sharding by remote storage is enabled. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5938	2024-04-03 00:54:01 +03:00
Aliaksandr Valialkin	55bd43f28e	docs: follow-up for `ac9c2a796f` Remove description for -search.maxExportDuration and -search.maxStatusRequestDuration command-line flags from the 'Resource usage limits' chapter, since these flags are rarely used for limiting resource usage and they are already documented in the 'List of command-line flags' chapter.	2024-04-02 23:57:37 +03:00
Aliaksandr Valialkin	e4eccd7074	app/vmselect/graphite: follow-up for `23ab865035` - Fix docs for new functions at app/vmselect/graphite/functions.json - Properly drain series lists on errors in aggregateSeriesListsGeneric() and aggregateSeriesList() - Add links to docs for the added functions at docs/CHANGELOG.md Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5809	2024-04-02 23:39:00 +03:00
Aliaksandr Valialkin	918cccaddf	all: fix golangci-lint(revive) warnings after `0c0ed61ce7` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6001	2024-04-02 23:16:29 +03:00
Aliaksandr Valialkin	c3a72b6cdb	lib/storage: consistently use stopCh instead of stop	2024-04-02 21:24:57 +03:00
Aliaksandr Valialkin	be36ceb1cf	app/vmauth: add ability to authorize via any opaque HTTP request header value This can be done via `auth_token` option at -auth.config - see https://docs.victoriametrics.com/vmauth/#auth-config	2024-04-02 21:16:11 +03:00
Aliaksandr Valialkin	21bfb66650	app/vmauth: add ability to read auth tokens from multiple http request headers This is needed for VictoriaMetrics Cloud, where the same token could be passed either via Authorization or via X-Amz-Firehose-Access-Key header - see `4487dac30b (r140500722)` This is a follow-up for `4487dac30b` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6009	2024-04-02 19:29:00 +03:00
Artem Navoiev	9bd3cadce6	app/{vmagent/insert} fix typo in Firehose Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-04-02 17:41:21 +02:00
Aliaksandr Valialkin	4487dac30b	app/vmauth: follow-up for `bc90f4aae6` - Allow specifying only a single HTTP header for reading auth tokens via -httpAuthHeader command-line flag. This is better from security PoV, since this prevents from accidental reading of auth token from undesired HTTP header. By default the -httpAuthHeader equals to Authorization. When it is overridden, then auth token isn't read from Authorization header - it is read only from the specified header. - Document the -httpAuthHeader command-line flag at https://docs.victoriametrics.com/vmauth/#reading-auth-tokens-from-other-http-headers Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6009	2024-04-02 18:35:21 +03:00
Aliaksandr Valialkin	904e95fc69	app/vmagent: simplify code after `509df44d03` - Simplify the code in order to improve its maintenance - Properly pass tenant ID when processing multi-tenant opentelemetry request at vmagent Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6016	2024-04-02 17:58:13 +03:00
Artem Navoiev	76b1fc6ac1	add more delays to verify that this is not a reason for flaky tests Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-04-02 16:22:36 +02:00
Fred Navruzov	daa1326b98	docs/vmanomaly: typos fix (#6047 )	2024-04-01 13:23:44 -07:00
Fred Navruzov	c300ce659f	docs/vmanomaly: v1.12 updates & fixes (#6046 ) * docs/vmanomaly: v1.12.0 & link updates * add autotuned description to model section * - update refs of vmanomaly on enterprise and vmalert pages - add diagrams for model types - update self-monitoring section * - fix typos - remove .index.html from links	2024-04-01 16:41:55 +03:00
Aliaksandr Valialkin	c79bf3925c	Revert "app/vmselect: make vmselect resilient to absence of cache folder (#5987 )" This reverts commit `cb23685681`. Reason for revert: the "fix" may hide programming bugs related to incorrect creation of folders before their use. This may complicate detecting and fixing such bugs in the future. There are the following fixes for the issue https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5985 : - To configure the OS to do not drop data from the system-wide temporary directory (aka /tmp). - To run VictoriaMetrics with -cacheDataPath command-line flag, which points to the directory, which cannot be removed automatically by the OS. The case when the user accidentally deletes the directory with some files created by VictoriaMetrics shouldn't be considered as expected, so VictoriaMetrics shouldn't try resolving this case automatically. It is much better from operation and debuggability PoV is to crash with the clear `directory doesn't exist` error in this case.	2024-03-30 07:29:24 +02:00
Aliaksandr Valialkin	49a6dca2d5	docs/CHANGELOG.md: mention that the bug with improper use of -search.maxExportDuration instead of -search.maxLabelsAPIDuration has been introduced in v1.99.0 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5992 This is a follow-up for `bc79f7196d`	2024-03-30 07:05:09 +02:00
Aliaksandr Valialkin	8f59ca423b	docs/VictoriaLogs/CHANGELOG.md: improve the description of the bugfix from `43b5d8bc7a`, so it can be googled by users	2024-03-30 06:54:48 +02:00
Aliaksandr Valialkin	830b871baf	app/vmagent: properly shutdown when -maxIngestionRate limit is reached The remotewrite.Stop() expects that there are no pending calls to TryPush(). This means that the ingestionRateLimiter.Register() must be unblocked inside TryPush() when calling remotewrite.Stop(). Provide remotewrite.StopIngestionRateLimiter() function for unblocking the rate limiter before calling the remotewrite.Stop(). While at it, move the rate limiter into lib/ratelimiter package, since it has two users. Also move the description of the feature to the correct place at docs/CHANGELOG.md. Also cross-reference -remoteWrite.rateLimit and -maxIngestionRate command-line flags. This is a follow-up for `02bccd1eb9` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5900	2024-03-30 06:43:48 +02:00
Aliaksandr Valialkin	f5848a5c8b	docs/managed-victoriametrics/alerting-vmalert-managed-victoria-metrics.md: user proper image paths according to docs/assets/README.md	2024-03-30 05:09:41 +02:00
Aliaksandr Valialkin	f17248eb3f	docs/managed-victoriametrics: use proper names for the linked images according to docs/assets/README.md This is a follow-up for `db3709c87d` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5989	2024-03-30 05:00:19 +02:00
Aliaksandr Valialkin	4cb70ee9a3	vendor: update github.com/VictoriaMetrics/metrics and github.com/VictoriaMetrics/metricsql to newer versions This is needed for updating broken links to MetricsQL docs: https://github.com/VictoriaMetrics/VictoriaMetrics/wiki/MetricsQL -> https://docs.victoriametrics.com/metricsql/ This is a follow-up for `7e3511ffbd`	2024-03-30 04:44:19 +02:00
Aliaksandr Valialkin	4d71a33cb5	docs/CHANGELOG.md: remove the update notes regarding converting custom HTTP header keys to canonical form Custom HTTP headers are set via net/http.Header.Set or net/http.Header.Add functions. These functions always convert header keys to canonical form. So the change at `b577413d3b` isn't visible to users of VictoriaMetrics components. There is no need in documenting this change at docs/CHANGELOG.md, since it doesn't give any useful information to users. This is a follow-up for `e6dd52b04c`	2024-03-30 04:26:37 +02:00
Zakhar Bessarab	af3922b1df	lib/storage: add ability to use downsampling for the given series filter (#733 ) * lib/storage: add ability to use downsampling for the given series filter Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * docs: add information about downsampling filters Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * docs: fix MetricsQL filter Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * lib/storage/downsampling: treat missing downsampling filter as a bug Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * lib/storage/part_header: verify correctness of downsampling filters when opening partition Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * lib/storage/downsampling: save only appliable rules in part metadata Filter and save only rules which are appliable to partition based on MinTimestamp of stored data. Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * lib/storage/downsampling: update log messages for final dedup Properly specify a reason of re-running deduplication for partition. Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * lib/storage: consistently use MaxTimestamp to determine deduplication/downsampling rules Using MinTimestamp leads to applying downsampling to parts which are only partially covered by downsampling rule. For example, partition covers range [1000-2000]. At t=2100 and rule offset 500 data with t=2100-500 => 1600 must be downsampled. The range check against MinTimestamp evaluates to true even though partition contains range which must not be downsampled - [1600:2000]. Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * Follow-up - Apply the first matching downsampling period if multiple filters match the given time series. This allows fine-tuning the downsampling config for the specific needs. - Take into account downsampling filters during search queries. - Reduce the difference between community and enterprise branches. This should simplify further maintenance of these branches. - Properly parse series filters with colons inside them. - Document the feature at docs/CHANGELOG.md. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4960 --------- Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-03-30 04:12:23 +02:00
Aliaksandr Valialkin	131f357098	lib/storage/table.go: reduce the difference with enterprise branch	2024-03-30 03:22:51 +02:00
Aliaksandr Valialkin	4001ca36b8	lib/storage/partition.go: reduce code difference a bit with enterprise branch	2024-03-30 01:39:27 +02:00
Nikolay	a05303eaa0	lib/storage: adds metrics for downsampling (#382 ) * lib/storage: adds metrics for downsampling vm_downsampling_partitions_scheduled - shows the number of parts, that must be downsampled vm_downsampling_partitions_scheduled_size_bytes - shows total size in bytes for parts, the must be donwsampled These two metrics answer the questions - is downsampling running? how many parts scheduled for downsampling and how many of them currently downsampled? Storage space that it occupies. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2612 * wip Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-03-30 01:11:49 +02:00
hagen1778	e79b05b4ab	docs: update vmalert troubleshooting docs * rm recommendation to keep look-behind window empty, as it is not correct * mention the change of default value for `-search.latencyOffset` Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-03-29 18:00:37 +01:00
hagen1778	2e843a8ed9	docs: follow-up after `623d257faf` `623d257faf` Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-03-29 14:29:02 +01:00
Jiekun	623d257faf	app/vmalert: respect batch size limit for remote write on shutdown (#6039 ) During shutdown period of vmalert, remotewrite client retrieve all pending time series from buffer queue, compose them into 1 batch and execute remote write. This final batch may exceed the limit of -remoteWrite.maxBatchSize, and be rejected by the receiver (gateway, vmcluster or others). This changes ensures that even during shutdown vmalert won't exceed the max batch size limit for remote write destination. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6025	2024-03-29 14:27:50 +01:00
hagen1778	b6bd9a97a3	app/vmagent: follow-up `166b97b8d0` * add tests for sharding function * update flags description * add changelog note `166b97b8d0` Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-03-29 14:08:08 +01:00
Andrii Chubatiuk	47892b4a4c	opentelemetry: added cmd flag to sanitize metric names (#6035 )	2024-03-29 13:51:24 +01:00
Eugene Ma	166b97b8d0	vmagent: support sharding by excluded labels (#5938 ) To horizontally scale streaming aggregation, you might want to deploy a separate hashing tier of vmagents that route to a separate aggregation tier. The hashing tier should shard by all labels except the instance-level labels, to ensure the input metrics are routed correctly to the aggregator instance responsible for those labels. For this to achieve we introduce `remoteWrite.shardByURL.inverseLabels` flag to inverse logic of `remoteWrite.shardByURL.labels` --------- Co-authored-by: Eugene Ma <eugene.ma@airbnb.com> Co-authored-by: Roman Khavronenko <roman@victoriametrics.com>	2024-03-29 13:26:02 +01:00
Dmytro Kozlov	ac9c2a796f	docs: describe timeout query argument (#6020 ) Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2024-03-28 16:17:33 +01:00
Hui Wang	47e7ad2e01	docs: fix golangci-lint check (#6036 ) Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2024-03-28 08:58:27 +01:00
Hui Wang	d7224b2d1c	vmalert: fix sending alert messages (#6028 ) * vmalert: fix sending alert messages 1. fix `endsAt` field in messages that send to alertmanager, previously rule with small interval could never be triggered; 2. fix behavior of `-rule.resendDelay`, before it could prevent sending firing message when rule state is volatile. * docs: update changelog notes Signed-off-by: hagen1778 <roman@victoriametrics.com> --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2024-03-28 08:55:10 +01:00
Aliaksandr Valialkin	77eca6bb37	docs/MetricsQL.md: typo fix: outlier_iqr_over_time(memory_usage_bytes[1h]) triggers when memory_usage_bytes goes outside the usual value range for the last hour, not the last 24 hours This is a follow-up for `ea81f6fc36`	2024-03-27 20:59:43 +02:00
hagen1778	f937439657	docs: fix new line for update notes in CHANGELOG Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-03-27 16:24:16 +01:00
hagen1778	d72b565c03	docs: mention new guide `How to use OpenTelemetry metrics with VictoriaMetrics` in docs Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-03-27 16:23:28 +01:00
Nikolay	f8f4025dca	docs/opentelemetry: adds opentemetry get started guide (#5861 ) Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: Andrii Chubatiuk <andrew.chubatiuk@gmail.com>	2024-03-27 16:04:43 +01:00
Aliaksandr Valialkin	4a359d5f67	lib/storage: follow-up for `76f00cea6b` Store the deadline when the metricID entries must be deleted from indexdb if metricID->metricName entry isn't found after the deadline. This should make the code more clear comparing the the previous version, where the timestamp of the first metricID->metricName lookup miss was stored in missingMetricIDs. Remove the misleading comment about the importance of the order for creating entries in the inverted index when registering new time series. The order doesn't matter, since any subset of the created entries can become visible for search before any other subset after registering in indexdb. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5948 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5959	2024-03-27 11:41:28 +02:00

1 2 3 4 5 ...

8122 commits