Starting last Wednesday, 2020-06-10, the number of connection failures with the Termbox SSR services seems to sharply rise to about 300 - 450 per hour.
https://logstash.wikimedia.org/goto/808228af4133ed6e857a168de0c6cd4c
The error message is given as:
Wikibase\View\Termbox\Renderer\TermboxRemoteRenderer: encountered a bad response from the remote renderer
with the content being:
upstream connect error or disconnect/reset before headers. reset reason: connection termination
It is currently unclear what is going on or what the effects are. At the very least there is lots of logspam that started recently.
IRC log of discussion: https://wm-bot.wmflabs.org/browser/index.php?start=06%2F15%2F2020&end=06%2F15%2F2020&display=%23wikimedia-serviceops
Investigate:
- what is going on
- what are the effects
Likely it is the "top-level" mediawiki to termbox requests only that timeout.
Acceptance criteria:
- Investigation results are provided in this task