libnet/d/bridge: drop connections to lo mappings, and direct remote connections #49325

akerouanton · 2025-01-22T10:59:38Z

Related to libnet/d/bridge: port mappings: filter by input iface #48721
Related to Revert "libnet/d/bridge: port mappings: filter by input iface" #49310
Close Publishing ports explicitly to private networks should not be accessible from LAN hosts #45610

- What I did

This PR fixes two separate security issues that have a common cause -- lack of proper packet filtering before NATing.

- How I did it

1st and 2nd commit are just a small refactoring, and a new integration for the use-case broken by:

libnet/d/bridge: port mappings: filter by input iface #48721

3rd commit: libnet/d/bridge: port mappings: drop direct-access when gw_mode=nat

When a NAT-based port mapping is created, the daemon adds a DNAT rule in nat-DOCKER to replace the dest addr with the container IP. However, the daemon never sets up rules to filter packets destined directly to the container port. This allows a rogue neighbor (ie. a host that shares a L2 segment with the host) to send packets directly to the container on its container-side exposed port.

For instance, if container port 5000 is mapped to host port 6000, a neighbor could send packets directly to the container on its port 5000.

Since nat-DOCKER mangles the dest addr, and the nat table forbids DROP rules, this change adds a new rule in the raw-PREROUTING chain to filter ingress connections targeting the container's IP address.

This filtering is only done when gw_mode=nat. For the unprotected variant, no filtering is done.

4th commit: libnet/d/bridge: drop remote connections to port mapped on lo

Traditionnally when Linux receives remote packets with daddr set to a loopback address, it reject them as 'martians'. However, when a NAT rule is applied through iptables this doesn't happen. Our current DNAT rule used to map host ports to containers is applied unconditionnally, even for such 'martian' packets.

This means a neighbor host (ie. a host connected to the same L2 segment) can send packets to a port mapped on a loopback address. The purpose of publishing on a loopback address is to make ports inaccessible to remote hosts -- lack of proper filtering defeats that.

This commit adds an iptables rule to the raw-PREROUTING chain to drop packets with a loopback dest address and coming from any interface other than lo.

To accomodate WSL2 mirrored mode, another rule is inserted beforehand to specifically accept packets coming from the loopback0 interface.

- How to verify it

New integration tests.

- Description for the changelog

- Fix a security issue that was allowing remote hosts to connect directly to a container, on one of its published port.
- Fix a security issue that was allowing neighbor hosts to connect to ports mapped on a loopback address.

robmry

Looks good ...

Disallowing direct routing to published ports is quite a bit more drastic than the original change - we'll need to make it clear in release notes. (In this release we were already blocking direct routing to unpublished ports, which were previously open if the filter-FORWARD policy was ACCEPT. So it's an update for that description.)

We'll also need a docs update - this note can go, but we should probably replace it with a description of these new rules. I've not checked for other mentions / places we should mention this.

Nit - typo releasePortBindigs in the first commit comment, might make it less searchable-for. (Also, in the last commit comment, Traditionnally and unconditionnally only have one n.)

integration/networking/port_mapping_linux_test.go

libnetwork/drivers/bridge/setup_ip_tables_linux_test.go

thaJeztah

code looks mostly good to me; left some small suggestions.

not good enough on the networking part of things to say if it's all good though 🙈

integration/networking/port_mapping_linux_test.go

libnetwork/drivers/bridge/setup_ip_tables_linux_test.go

thaJeztah · 2025-01-27T12:48:11Z

libnetwork/drivers/bridge/setup_ip_tables_linux_test.go

+		err := netlink.LinkAdd(iface)
+		assert.NilError(t, err)


Is this something that needs cleaning up afterwards, or not a problem if we don't?

Both times this function is called a disposable network namespace is created first. I'll add a comment stating that it should be used that way.

libnetwork/drivers/bridge/setup_ip_tables_linux_test.go

Signed-off-by: Albin Kerouanton <albinker@gmail.com>

Commit fc7caf9 reverted 433b1f9 as it was introducing a regression, ie. containers couldn't reach ports published on the host using their gateway's IP address or the host IP address. These scenarios are now tested. Signed-off-by: Albin Kerouanton <albinker@gmail.com>

When a NAT-based port mapping is created, the daemon adds a DNAT rule in nat-DOCKER to replace the dest addr with the container IP. However, the daemon never sets up rules to filter packets destined directly to the container port. This allows a rogue neighbor (ie. a host that shares a L2 segment with the host) to send packets directly to the container on its container-side exposed port. For instance, if container port 5000 is mapped to host port 6000, a neighbor could send packets directly to the container on its port 5000. Since nat-DOCKER mangles the dest addr, and the nat table forbids DROP rules, this change adds a new rule in the raw-PREROUTING chain to filter ingress connections targeting the container's IP address. This filtering is only done when gw_mode=nat. For the unprotected variant, no filtering is done. Signed-off-by: Albin Kerouanton <albinker@gmail.com>

Traditionally when Linux receives remote packets with daddr set to a loopback address, it reject them as 'martians'. However, when a NAT rule is applied through iptables this doesn't happen. Our current DNAT rule used to map host ports to containers is applied unconditionally, even for such 'martian' packets. This means a neighbor host (ie. a host connected to the same L2 segment) can send packets to a port mapped on a loopback address. The purpose of publishing on a loopback address is to make ports inaccessible to remote hosts -- lack of proper filtering defeats that. This commit adds an iptables rule to the raw-PREROUTING chain to drop packets with a loopback dest address and coming from any interface other than lo. To accomodate WSL2 mirrored mode, another rule is inserted beforehand to specifically accept packets coming from the loopback0 interface. Signed-off-by: Albin Kerouanton <albinker@gmail.com>

thaJeztah

code LGTM

thaJeztah · 2025-01-28T15:36:12Z

Discussing with @akerouanton and @robmry - more eyes on this could still be useful, but we can make changes in a follow-up where needed; we can probably bring this one in

raesene · 2025-02-20T07:44:10Z

Hi all, can I check as this was identified as a security issue that's been fixed (and I know from reading some linked issues, it's been around a while), are there any plans to assign a CVE for it?

Just thinking that without that people might not realise that it's important to upgrade to this version for security purposes.

akerouanton · 2025-02-20T16:59:43Z

Thanks for asking @raesene.

We're planning to release a blog post soon that should help clarify what we did, what this issue is about, and why we did consider fixing it in v28. At the moment, we're considering this as an hardening improvement, and we're not planning to get a CVE assigned.

akerouanton added status/2-code-review area/security area/networking impact/changelog impact/documentation area/networking/firewalling area/networking/d/bridge area/networking/portmapping labels Jan 22, 2025

akerouanton added this to the 28.0.0 milestone Jan 22, 2025

akerouanton self-assigned this Jan 22, 2025

akerouanton changed the title ~~Fix 45610 v2~~ libnet/d/bridge: drop connections to lo mappings, and direct remote connections Jan 22, 2025

akerouanton force-pushed the fix-45610-v2 branch 3 times, most recently from 03b8006 to ecde2b3 Compare January 23, 2025 22:51

akerouanton requested review from thaJeztah and robmry January 23, 2025 22:56

akerouanton marked this pull request as ready for review January 23, 2025 22:56

akerouanton force-pushed the fix-45610-v2 branch from ecde2b3 to 1f18df4 Compare January 24, 2025 07:14

robmry approved these changes Jan 24, 2025

View reviewed changes

integration/networking/port_mapping_linux_test.go Outdated Show resolved Hide resolved

libnetwork/drivers/bridge/setup_ip_tables_linux_test.go Outdated Show resolved Hide resolved

thaJeztah reviewed Jan 27, 2025

View reviewed changes

akerouanton force-pushed the fix-45610-v2 branch 2 times, most recently from 7a891ad to ecf3873 Compare January 27, 2025 17:41

akerouanton added 4 commits January 27, 2025 18:41

libnet/d/bridge: releasePortBindings: append directly into 'errs'

a7e6d0a

Signed-off-by: Albin Kerouanton <albinker@gmail.com>

akerouanton force-pushed the fix-45610-v2 branch from ecf3873 to d216084 Compare January 27, 2025 17:41

thaJeztah approved these changes Jan 28, 2025

View reviewed changes

thaJeztah merged commit 47dc8d5 into moby:master Jan 28, 2025
147 checks passed

This was referenced Jan 29, 2025

api/types: remove some redundant imports #49355

Merged

Flaky test: TestAccessPublishedPortFromAnotherNetwork #49358

Open

akerouanton mentioned this pull request Jan 16, 2025

[epic] iptables and port publishing improvements for moby 28.0 #48815

Closed

11 tasks

robmry mentioned this pull request Feb 14, 2025

Don't create iptables rules when iptables is disabled #49467

Merged

akerouanton deleted the fix-45610-v2 branch February 20, 2025 09:01

thaJeztah added this to 🔦 Maintainer spotlight May 8, 2025

github-project-automation bot moved this to Up next in 🔦 Maintainer spotlight May 8, 2025

thompson-shaun moved this from Up next to Complete in 🔦 Maintainer spotlight May 8, 2025

antoninbas mentioned this pull request May 14, 2025

Fix Kind job failures on Github after Docker Engine update antrea-io/antrea#7165

Merged

robmry mentioned this pull request May 15, 2025

Drop DOCKER-ISOLATION rules #49981

Merged

dtantsur mentioned this pull request May 16, 2025

All E2E tests are red metal3-io/baremetal-operator#2456

Closed

robmry mentioned this pull request May 28, 2025

Published port to localhost (127.0.0.1) are reachable on the LAN #41872

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

libnet/d/bridge: drop connections to lo mappings, and direct remote connections #49325

libnet/d/bridge: drop connections to lo mappings, and direct remote connections #49325

Uh oh!

akerouanton commented Jan 22, 2025 •

edited

Loading

Uh oh!

robmry left a comment

Uh oh!

Uh oh!

Uh oh!

thaJeztah left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

thaJeztah Jan 27, 2025

Uh oh!

akerouanton Jan 27, 2025

Uh oh!

Uh oh!

Uh oh!

thaJeztah left a comment

Uh oh!

thaJeztah commented Jan 28, 2025

Uh oh!

Uh oh!

raesene commented Feb 20, 2025

Uh oh!

akerouanton commented Feb 20, 2025

Uh oh!

Uh oh!

libnet/d/bridge: drop connections to lo mappings, and direct remote connections #49325

libnet/d/bridge: drop connections to lo mappings, and direct remote connections #49325

Uh oh!

Conversation

akerouanton commented Jan 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

robmry left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

thaJeztah left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

thaJeztah Jan 27, 2025

Choose a reason for hiding this comment

Uh oh!

akerouanton Jan 27, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

thaJeztah left a comment

Choose a reason for hiding this comment

Uh oh!

thaJeztah commented Jan 28, 2025

Uh oh!

Uh oh!

raesene commented Feb 20, 2025

Uh oh!

akerouanton commented Feb 20, 2025

Uh oh!

Uh oh!

akerouanton commented Jan 22, 2025 •

edited

Loading