Skip to content

Commit 128dd17

Browse files
Eric Wongtorvalds
authored andcommitted
epoll: prevent missed events on EPOLL_CTL_MOD
EPOLL_CTL_MOD sets the interest mask before calling f_op->poll() to ensure events are not missed. Since the modifications to the interest mask are not protected by the same lock as ep_poll_callback, we need to ensure the change is visible to other CPUs calling ep_poll_callback. We also need to ensure f_op->poll() has an up-to-date view of past events which occured before we modified the interest mask. So this barrier also pairs with the barrier in wq_has_sleeper(). This should guarantee either ep_poll_callback or f_op->poll() (or both) will notice the readiness of a recently-ready/modified item. This issue was encountered by Andreas Voellmy and Junchang(Jason) Wang in: http://thread.gmane.org/gmane.linux.kernel/1408782/ Signed-off-by: Eric Wong <normalperson@yhbt.net> Cc: Hans Verkuil <hans.verkuil@cisco.com> Cc: Jiri Olsa <jolsa@redhat.com> Cc: Jonathan Corbet <corbet@lwn.net> Cc: Al Viro <viro@zeniv.linux.org.uk> Cc: Davide Libenzi <davidel@xmailserver.org> Cc: Hans de Goede <hdegoede@redhat.com> Cc: Mauro Carvalho Chehab <mchehab@infradead.org> Cc: David Miller <davem@davemloft.net> Cc: Eric Dumazet <eric.dumazet@gmail.com> Cc: Andrew Morton <akpm@linux-foundation.org> Cc: Andreas Voellmy <andreas.voellmy@yale.edu> Tested-by: "Junchang(Jason) Wang" <junchang.wang@yale.edu> Cc: netdev@vger.kernel.org Cc: linux-fsdevel@vger.kernel.org Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
1 parent 4a490b7 commit 128dd17

File tree

1 file changed

+21
-1
lines changed

1 file changed

+21
-1
lines changed

fs/eventpoll.c

Lines changed: 21 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1313,7 +1313,7 @@ static int ep_modify(struct eventpoll *ep, struct epitem *epi, struct epoll_even
13131313
* otherwise we might miss an event that happens between the
13141314
* f_op->poll() call and the new event set registering.
13151315
*/
1316-
epi->event.events = event->events;
1316+
epi->event.events = event->events; /* need barrier below */
13171317
pt._key = event->events;
13181318
epi->event.data = event->data; /* protected by mtx */
13191319
if (epi->event.events & EPOLLWAKEUP) {
@@ -1323,6 +1323,26 @@ static int ep_modify(struct eventpoll *ep, struct epitem *epi, struct epoll_even
13231323
ep_destroy_wakeup_source(epi);
13241324
}
13251325

1326+
/*
1327+
* The following barrier has two effects:
1328+
*
1329+
* 1) Flush epi changes above to other CPUs. This ensures
1330+
* we do not miss events from ep_poll_callback if an
1331+
* event occurs immediately after we call f_op->poll().
1332+
* We need this because we did not take ep->lock while
1333+
* changing epi above (but ep_poll_callback does take
1334+
* ep->lock).
1335+
*
1336+
* 2) We also need to ensure we do not miss _past_ events
1337+
* when calling f_op->poll(). This barrier also
1338+
* pairs with the barrier in wq_has_sleeper (see
1339+
* comments for wq_has_sleeper).
1340+
*
1341+
* This barrier will now guarantee ep_poll_callback or f_op->poll
1342+
* (or both) will notice the readiness of an item.
1343+
*/
1344+
smp_mb();
1345+
13261346
/*
13271347
* Get current event bits. We can safely use the file* here because
13281348
* its usage count has been increased by the caller of this function.

0 commit comments

Comments
 (0)