Commit 8852aac

workqueue: mod_delayed_work_on() shouldn't queue timer on 0 delay
8376fe2 ("workqueue: implement mod_delayed_work[_on]()") implemented mod_delayed_work[_on]() using the improved try_to_grab_pending(). The function is later used, among others, to replace [__]cancel_delayed_work() + queue_delayed_work() combinations.

Unfortunately, a delayed_work item with zero @delay is handled slightly differently by mod_delayed_work_on() compared to queue_delayed_work_on(). The latter skips the timer altogether and directly queues the item using queue_work_on(), while the former schedules a timer which will expire on the closest tick. This means that, when @delay is zero, [__]cancel_delayed_work() + queue_delayed_work_on() makes the target item immediately executable while mod_delayed_work_on() may induce a delay of up to a full tick.

This somewhat subtle difference breaks some of the converted users. For example, block queue plugging uses delayed_work for deferred processing and uses mod_delayed_work_on() when the queue needs to be immediately unplugged. The above problem manifested as a noticeably higher number of context switches under certain circumstances.

The difference in behavior was caused by missing special case handling for 0 delay in mod_delayed_work_on() compared to queue_delayed_work_on(). Joonsoo Kim posted a patch to add it - ("workqueue: optimize mod_delayed_work_on() when @delay == 0")[1]. The patch was queued for 3.8 but it was described as an optimization and I missed that it was a correctness issue.

As both queue_delayed_work_on() and mod_delayed_work_on() use __queue_delayed_work() for queueing, it seems that the better approach is to move the 0 delay special handling to that function instead of duplicating it in mod_delayed_work_on().

Fix the problem by moving 0 delay special case handling from queue_delayed_work_on() to __queue_delayed_work(). This replaces Joonsoo's patch.

[1] http://thread.gmane.org/gmane.linux.kernel/1379011/focus=1379012

Signed-off-by: Tejun Heo <tj@kernel.org>
Reported-and-tested-by: Anders Kaseorg <andersk@MIT.EDU>
Reported-and-tested-by: Zlatko Calusic <zlatko.calusic@iskon.hr>
LKML-Reference: <alpine.DEB.2.00.1211280953350.26602@dr-wily.mit.edu>
LKML-Reference: <50A78AA9.5040904@iskon.hr>
Cc: Joonsoo Kim <js1304@gmail.com>
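
The zero-delay latency gap described in the commit message can be illustrated with a small user-space model. This is only a sketch, not kernel code: TICK_NS (assuming HZ=250), timer_path_ready_ns() and patched_path_ready_ns() are hypothetical names, and the model simply assumes that a timer can fire no earlier than the next tick boundary.

```c
/*
 * Toy user-space model of the timing difference. Hypothetical names
 * throughout; nothing here is a kernel API.
 */
#define TICK_NS 4000000ULL	/* assumed tick length: HZ=250 */

/*
 * Models the pre-patch mod_delayed_work_on() path: a timer is always
 * armed, and even an already-expired timer waits for the next tick.
 */
static unsigned long long timer_path_ready_ns(unsigned long long now_ns,
					      unsigned long delay_ticks)
{
	unsigned long long ticks = delay_ticks ? delay_ticks : 1;

	return (now_ns / TICK_NS + ticks) * TICK_NS;
}

/*
 * Models the patched __queue_delayed_work(): a zero delay bypasses the
 * timer and the work is ready at once; other delays are unchanged.
 */
static unsigned long long patched_path_ready_ns(unsigned long long now_ns,
						unsigned long delay_ticks)
{
	if (!delay_ticks)
		return now_ns;

	return timer_path_ready_ns(now_ns, delay_ticks);
}
```

In this model, a zero-delay request made right at a tick boundary waits a full TICK_NS on the timer path and not at all on the patched path, while nonzero delays behave identically on both.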
1 parent 412d32e commit 8852aac

1 file changed, 11 additions and 3 deletions
kernel/workqueue.c

Lines changed: 11 additions & 3 deletions
@@ -1364,6 +1364,17 @@ static void __queue_delayed_work(int cpu, struct workqueue_struct *wq,
 	BUG_ON(timer_pending(timer));
 	BUG_ON(!list_empty(&work->entry));
 
+	/*
+	 * If @delay is 0, queue @dwork->work immediately.  This is for
+	 * both optimization and correctness.  The earliest @timer can
+	 * expire is on the closest next tick and delayed_work users depend
+	 * on that there's no such delay when @delay is 0.
+	 */
+	if (!delay) {
+		__queue_work(cpu, wq, &dwork->work);
+		return;
+	}
+
 	timer_stats_timer_set_start_info(&dwork->timer);
 
 	/*
@@ -1417,9 +1428,6 @@ bool queue_delayed_work_on(int cpu, struct workqueue_struct *wq,
 	bool ret = false;
 	unsigned long flags;
 
-	if (!delay)
-		return queue_work_on(cpu, wq, &dwork->work);
-
 	/* read the comment in __queue_work() */
 	local_irq_save(flags);
 
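
The structural point of the change, that both entry points funnel through one helper and so a special case placed in the helper covers both, can be sketched in user space as follows. This is a simplified illustration under stated assumptions: struct toy_dwork and the toy_* functions are hypothetical stand-ins, and locking, CPUs and the PENDING bit are reduced to two flags.

```c
#include <stdbool.h>

/* Hypothetical stand-in for struct delayed_work. */
struct toy_dwork {
	bool timer_armed;	/* models a pending timer */
	bool queued;		/* models work already on a worklist */
};

/* Models the patched __queue_delayed_work(): 0 delay skips the timer. */
static void toy_queue_delayed(struct toy_dwork *dw, unsigned long delay)
{
	if (!delay) {
		dw->queued = true;
		return;
	}
	dw->timer_armed = true;
}

/* Models queue_delayed_work_on(): only queues an idle item. */
static bool toy_queue_delayed_work(struct toy_dwork *dw, unsigned long delay)
{
	if (dw->timer_armed || dw->queued)
		return false;
	toy_queue_delayed(dw, delay);
	return true;
}

/*
 * Models mod_delayed_work_on(): steal whatever is pending, then requeue
 * through the same helper. Returns true if the item was pending.
 */
static bool toy_mod_delayed_work(struct toy_dwork *dw, unsigned long delay)
{
	bool was_pending = dw->timer_armed || dw->queued;

	dw->timer_armed = false;
	dw->queued = false;
	toy_queue_delayed(dw, delay);
	return was_pending;
}
```

Because the zero-delay check lives in toy_queue_delayed(), neither caller needs its own copy, which mirrors why the patch removes the check from queue_delayed_work_on() rather than duplicating it in mod_delayed_work_on().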