You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
With 2000 workspaces and sending web terminal traffic to 30% of them (600), we see that coder memory usage shoots up.
The OOM kill did not happen, however, before we also started a concurrent workspace app proxying test. But I suppose web term and workspace apps function on much the same principle.
From the heap, we can observe that a lot of this memory is allocated by WireGuard:
IIRC, single tailnet was supposed to help with this. So I'm wondering if this is a regression?
I don't see how #11789 would affect Wireguard, since Wireguard is just pushing IP (layer 3) packets and HTTP connection caching is TCP (layer 4). Also, the web terminal uses a single websocket, so caching the connection won't help.
I gave https://ghcr.io/coder/coder-preview:main-2.7.1-devel-3d76e1b55 a try, and with 100 web terminal connections it seemed that memory usage had been improved, but after another try with 500 web terminal connections, we ran out of memory again (16 GB limit) with two coder instances. In other words, we went from 2GB used to >16 GB + OOM.
I still believe there's been a regression since we initially tried out the single_tailnet experiment. There we could handle 500 web terminals with ease and only 4 GB of RAM.
With 2000 workspaces and sending web terminal traffic to 30% of them (600), we see that coder memory usage shoots up.
The OOM kill did not happen, however, before we also started a concurrent workspace app proxying test. But I suppose web term and workspace apps function on much the same principle.
From the heap, we can observe that a lot of this memory is allocated by WireGuard:
IIRC, single tailnet was supposed to help with this. So I'm wondering if this is a regression?
// ping @coadler @spikecurtis
The text was updated successfully, but these errors were encountered: