linux

mirror of https://github.com/torvalds/linux.git synced 2024-11-19 10:31:48 +00:00

History

Srikar Dronamraju 05b40e0577 sched/fair: Prefer sibiling only if local group is under-utilized If the child domain prefers tasks to go siblings, the local group could end up pulling tasks to itself even if the local group is almost equally loaded as the source group. Lets assume a 4 core,smt==2 machine running 5 thread ebizzy workload. Everytime, local group has capacity and source group has atleast 2 threads, local group tries to pull the task. This causes the threads to constantly move between different cores. This is even more profound if the cores have more threads, like in Power 8, smt 8 mode. Fix this by only allowing local group to pull a task, if the source group has more number of tasks than the local group. Here are the relevant perf stat numbers of a 22 core,smt 8 Power 8 machine. Without patch: Performance counter stats for 'ebizzy -t 22 -S 100' (5 runs): 1,440 context-switches # 0.001 K/sec ( +- 1.26% ) 366 cpu-migrations # 0.000 K/sec ( +- 5.58% ) 3,933 page-faults # 0.002 K/sec ( +- 11.08% ) Performance counter stats for 'ebizzy -t 48 -S 100' (5 runs): 6,287 context-switches # 0.001 K/sec ( +- 3.65% ) 3,776 cpu-migrations # 0.001 K/sec ( +- 4.84% ) 5,702 page-faults # 0.001 K/sec ( +- 9.36% ) Performance counter stats for 'ebizzy -t 96 -S 100' (5 runs): 8,776 context-switches # 0.001 K/sec ( +- 0.73% ) 2,790 cpu-migrations # 0.000 K/sec ( +- 0.98% ) 10,540 page-faults # 0.001 K/sec ( +- 3.12% ) With patch: Performance counter stats for 'ebizzy -t 22 -S 100' (5 runs): 1,133 context-switches # 0.001 K/sec ( +- 4.72% ) 123 cpu-migrations # 0.000 K/sec ( +- 3.42% ) 3,858 page-faults # 0.002 K/sec ( +- 8.52% ) Performance counter stats for 'ebizzy -t 48 -S 100' (5 runs): 2,169 context-switches # 0.000 K/sec ( +- 6.19% ) 189 cpu-migrations # 0.000 K/sec ( +- 12.75% ) 5,917 page-faults # 0.001 K/sec ( +- 8.09% ) Performance counter stats for 'ebizzy -t 96 -S 100' (5 runs): 5,333 context-switches # 0.001 K/sec ( +- 5.91% ) 506 cpu-migrations # 0.000 K/sec ( +- 3.35% ) 10,792 page-faults # 0.001 K/sec ( +- 7.75% ) Which show that in these workloads CPU migrations get reduced significantly. Signed-off-by: Srikar Dronamraju <srikar@linux.vnet.ibm.com> Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Cc: Linus Torvalds <torvalds@linux-foundation.org> Cc: Mike Galbraith <efault@gmx.de> Cc: Peter Zijlstra <peterz@infradead.org> Cc: Thomas Gleixner <tglx@linutronix.de> Cc: Vincent Guittot <vincent.guittot@linaro.org> Link: http://lkml.kernel.org/r/1490205470-10249-1-git-send-email-srikar@linux.vnet.ibm.com Signed-off-by: Ingo Molnar <mingo@kernel.org>		2017-03-27 10:22:26 +02:00
..
autogroup.c	sched/autogroup: Rename auto_group.[ch] to autogroup.[ch]	2017-02-08 09:01:11 +01:00
autogroup.h	sched/headers: Prepare for new header dependencies before moving code to <linux/sched/autogroup.h>	2017-03-02 08:42:28 +01:00
clock.c	sched/headers: Prepare for new header dependencies before moving code to <linux/sched/nmi.h>	2017-03-02 08:42:30 +01:00
completion.c	sched/headers: Prepare for new header dependencies before moving code to <linux/sched/debug.h>	2017-03-02 08:42:34 +01:00
core.c	sched/core: Fix rq lock pinning warning after call balance callbacks	2017-03-23 07:44:51 +01:00
cpuacct.c	sched/cputime: Convert kcpustat to nsecs	2017-02-01 09:13:47 +01:00
cpuacct.h	sched/cpuacct: Simplify the cpuacct code	2016-03-21 11:00:28 +01:00
cpudeadline.c	sched/core: Remove the tsk_cpus_allowed() wrapper	2017-03-02 08:42:24 +01:00
cpudeadline.h	sched/deadline: Split cpudl_set() into cpudl_set() and cpudl_clear()	2016-09-05 13:29:43 +02:00
cpufreq_schedutil.c	cpufreq: schedutil: Pass sg_policy to get_next_freq()	2017-03-05 23:58:48 +01:00
cpufreq.c	cpufreq / sched: Pass flags to cpufreq_update_util()	2016-08-16 22:14:55 +02:00
cpupri.c	sched/core: Remove the tsk_cpus_allowed() wrapper	2017-03-02 08:42:24 +01:00
cpupri.h	sched/cpupri: Remove unnecessary definitions in cpupri.h	2014-11-16 10:58:59 +01:00
cputime.c	sched/headers: Prepare to move cputime functionality from <linux/sched.h> into <linux/sched/cputime.h>	2017-03-02 08:42:39 +01:00
deadline.c	sched/deadline: Use deadline instead of period when calculating overflow	2017-03-16 09:37:38 +01:00
debug.c	sched/headers: Prepare to move the task_lock()/unlock() APIs to <linux/sched/task.h>	2017-03-02 08:42:38 +01:00
fair.c	sched/fair: Prefer sibiling only if local group is under-utilized	2017-03-27 10:22:26 +02:00
features.h	sched/core: Add WARNING for multiple update_rq_clock() calls	2017-03-16 09:46:21 +01:00
idle_task.c	sched/core: Add wrappers for lockdep_(un)pin_lock()	2017-01-14 11:29:30 +01:00
idle.c	sched/headers: Prepare for new header dependencies before moving code to <linux/sched/idle.h>	2017-03-02 08:42:26 +01:00
loadavg.c	sched/loadavg: Use {READ,WRITE}_ONCE() for sample window	2017-03-16 09:21:01 +01:00
Makefile	sched/autogroup: Rename auto_group.[ch] to autogroup.[ch]	2017-02-08 09:01:11 +01:00
rt.c	sched/rt: Add comments describing the RT IPI pull method	2017-03-16 09:41:35 +01:00
sched.h	sched/core: Add {EN,DE}QUEUE_NOCLOCK flags	2017-03-16 09:46:23 +01:00
stats.c	sched: use %*pb[l] to print bitmaps including cpumasks and nodemasks	2015-02-13 21:21:37 -08:00
stats.h	sched/headers: Move cputime functionality from <linux/sched.h> and <linux/cputime.h> into <linux/sched/cputime.h>	2017-03-03 01:45:22 +01:00
stop_task.c	sched/core: Add wrappers for lockdep_(un)pin_lock()	2017-01-14 11:29:30 +01:00
swait.c	sched/headers: Prepare to move signal wakeup & sigpending methods from <linux/sched.h> into <linux/sched/signal.h>	2017-03-02 08:42:32 +01:00
topology.c	sched/topology: Split out scheduler topology code from core.c into topology.c	2017-02-07 10:58:12 +01:00
wait.c	sched/headers: fix up header file dependency on <linux/sched/signal.h>	2017-03-08 10:36:03 -08:00