Skip to content

Commit f319da0

Browse files
Peter ZijlstraIngo Molnar
authored andcommitted
sched: Fix load avg vs cpu-hotplug
Rabik and Paul reported two different issues related to the same few lines of code. Rabik's issue is that the nr_uninterruptible migration code is wrong in that he sees artifacts due to this (Rabik please do expand in more detail). Paul's issue is that this code as it stands relies on us using stop_machine() for unplug, we all would like to remove this assumption so that eventually we can remove this stop_machine() usage altogether. The only reason we'd have to migrate nr_uninterruptible is so that we could use for_each_online_cpu() loops in favour of for_each_possible_cpu() loops, however since nr_uninterruptible() is the only such loop and its using possible lets not bother at all. The problem Rabik sees is (probably) caused by the fact that by migrating nr_uninterruptible we screw rq->calc_load_active for both rqs involved. So don't bother with fancy migration schemes (meaning we now have to keep using for_each_possible_cpu()) and instead fold any nr_active delta after we migrate all tasks away to make sure we don't have any skewed nr_active accounting. Reported-by: Rakib Mullick <rakib.mullick@gmail.com> Reported-by: Paul E. McKenney <paulmck@linux.vnet.ibm.com> Signed-off-by: Peter Zijlstra <a.p.zijlstra@chello.nl> Link: http://lkml.kernel.org/r/1345454817.23018.27.camel@twins Signed-off-by: Ingo Molnar <mingo@kernel.org>
1 parent 5b716ac commit f319da0

File tree

1 file changed

+10
-21
lines changed

1 file changed

+10
-21
lines changed

kernel/sched/core.c

Lines changed: 10 additions & 21 deletions
Original file line numberDiff line numberDiff line change
@@ -5304,27 +5304,17 @@ void idle_task_exit(void)
53045304
}
53055305

53065306
/*
5307-
* While a dead CPU has no uninterruptible tasks queued at this point,
5308-
* it might still have a nonzero ->nr_uninterruptible counter, because
5309-
* for performance reasons the counter is not stricly tracking tasks to
5310-
* their home CPUs. So we just add the counter to another CPU's counter,
5311-
* to keep the global sum constant after CPU-down:
5312-
*/
5313-
static void migrate_nr_uninterruptible(struct rq *rq_src)
5314-
{
5315-
struct rq *rq_dest = cpu_rq(cpumask_any(cpu_active_mask));
5316-
5317-
rq_dest->nr_uninterruptible += rq_src->nr_uninterruptible;
5318-
rq_src->nr_uninterruptible = 0;
5319-
}
5320-
5321-
/*
5322-
* remove the tasks which were accounted by rq from calc_load_tasks.
5307+
* Since this CPU is going 'away' for a while, fold any nr_active delta
5308+
* we might have. Assumes we're called after migrate_tasks() so that the
5309+
* nr_active count is stable.
5310+
*
5311+
* Also see the comment "Global load-average calculations".
53235312
*/
5324-
static void calc_global_load_remove(struct rq *rq)
5313+
static void calc_load_migrate(struct rq *rq)
53255314
{
5326-
atomic_long_sub(rq->calc_load_active, &calc_load_tasks);
5327-
rq->calc_load_active = 0;
5315+
long delta = calc_load_fold_active(rq);
5316+
if (delta)
5317+
atomic_long_add(delta, &calc_load_tasks);
53285318
}
53295319

53305320
/*
@@ -5618,8 +5608,7 @@ migration_call(struct notifier_block *nfb, unsigned long action, void *hcpu)
56185608
BUG_ON(rq->nr_running != 1); /* the migration thread */
56195609
raw_spin_unlock_irqrestore(&rq->lock, flags);
56205610

5621-
migrate_nr_uninterruptible(rq);
5622-
calc_global_load_remove(rq);
5611+
calc_load_migrate(rq);
56235612
break;
56245613
#endif
56255614
}

0 commit comments

Comments
 (0)