VictoriaMetrics/app/vmalert
Nikolay 908a50f79d
app/vmalert: adds idleConnTimeout flags and retry trivial network errors (#6382)
* The "*.idleConnTimeout" flags should reduce the probability of `write: broken
pipe` and `read: connection reset by peer` errors. These errors may occur
if the remote server closes the TCP socket for a connection while it still
exists on the client.
* A single retry for `write: broken pipe` and `read: connection
reset by peer` errors should handle incorrectly configured timeouts at
middleware proxies and mitigate minor network issues.

https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5661

---------

Co-authored-by: Roman Khavronenko <roman@victoriametrics.com>
(cherry picked from commit b97916276f)
2024-06-03 11:52:58 +02:00
config feat: include cluster info in alert CPUThrottlingHigh (#5956) 2024-03-17 20:46:15 +02:00
datasource app/vmalert: adds idleConnTimeout flags and retry trivial network errors (#6382) 2024-06-03 11:52:58 +02:00
deployment app/vmalert: include it into the next release 2020-04-28 00:11:41 +03:00
multiarch all: run apk update && apk upgrade in base Alpine Docker image in order to get all the recent security fixes 2023-02-09 14:03:02 -08:00
notifier vmalert: support any status code from the range 200-299 from alertmanager as successful (#6111) 2024-04-16 09:58:09 +02:00
remoteread app/vmalert: adds idleConnTimeout flags and retry trivial network errors (#6382) 2024-06-03 11:52:58 +02:00
remotewrite app/vmalert: adds idleConnTimeout flags and retry trivial network errors (#6382) 2024-06-03 11:52:58 +02:00
rule app/vmalert/rule: reduce number of allocations for getStaleSeries fn (#6269) 2024-05-16 09:35:51 +02:00
static app/vmalert: fix links with anchors in vmalert's UI (#6146) 2024-04-22 15:05:23 +02:00
templates all: fix golangci-lint(revive) warnings after 0c0ed61ce7 2024-04-03 03:00:45 +03:00
tpl app/vmalert: show on UI groups error after reload config (#4543) 2023-07-06 22:11:36 -07:00
utils app/vmbackup: support client-side TLS configuration for create/delete snapshot API (#5738) 2024-02-08 15:58:34 +01:00
main.go all: replace old https://docs.victoriametrics.com/vmalert.html url with the new one - https://docs.victoriametrics.com/vmalert/ 2024-04-18 01:44:54 +02:00
main_test.go all: use Gauge instead of Counter for *_config_last_reload_successful metrics 2023-12-20 14:25:44 +02:00
Makefile Add build support for loong64 (#6222) 2024-05-10 14:32:05 +02:00
manager.go app/vmalert: fix data race during hot-config reload (#5698) 2024-01-26 22:43:02 +01:00
manager_test.go vmalert-tool: implement unittest (#4789) 2023-10-16 14:12:06 +02:00
README.md all: replace old https://docs.victoriametrics.com/vmalert.html url with the new one - https://docs.victoriametrics.com/vmalert/ 2024-04-18 01:44:54 +02:00
replay.go all: consistently use %w instead of %s in when error is passed to fmt.Errorf() 2023-10-26 09:44:40 +02:00
replay_test.go vmalert-tool: implement unittest (#4789) 2023-10-16 14:12:06 +02:00
web.go all: replace old https://docs.victoriametrics.com/vmalert.html url with the new one - https://docs.victoriametrics.com/vmalert/ 2024-04-18 01:44:54 +02:00
web.qtpl app/vmalert: follow-up after b60dcbe11f 2024-02-20 13:35:02 +01:00
web.qtpl.go app/vmalert: follow-up after b60dcbe11f 2024-02-20 13:35:02 +01:00
web_test.go all: fix golangci-lint(revive) warnings after 0c0ed61ce7 2024-04-03 03:00:45 +03:00
web_types.go app/vmalert: support filtering for /api/v1/rule like Prometheus does (#5787) 2024-02-09 14:36:15 +01:00
web_types_test.go vmalert-tool: implement unittest (#4789) 2023-10-16 14:12:06 +02:00

See vmalert docs at https://docs.victoriametrics.com/vmalert/.

vmalert docs can be edited at docs/vmalert.md.