linux

mirror of https://github.com/torvalds/linux.git synced 2024-11-19 10:31:48 +00:00

History

Vincent Guittot 625ed2bf04 sched/cfs: Make util/load_avg more stable In the current implementation of load/util_avg, we assume that the ongoing time segment has fully elapsed, and util/load_sum is divided by LOAD_AVG_MAX, even if part of the time segment still remains to run. As a consequence, this remaining part is considered as idle time and generates unexpected variations of util_avg of a busy CPU in the range [1002..1024[ whereas util_avg should stay at 1023. In order to keep the metric stable, we should not consider the ongoing time segment when computing load/util_avg but only the segments that have already fully elapsed. But to not consider the current time segment adds unwanted latency in the load/util_avg responsivness especially when the time is scaled instead of the contribution. Instead of waiting for the current time segment to have fully elapsed before accounting it in load/util_avg, we can already account the elapsed part but change the range used to compute load/util_avg accordingly. At the very beginning of a new time segment, the past segments have been decayed and the max value is LOAD_AVG_MAXy. At the very end of the current time segment, the max value becomes: LOAD_AVG_MAXy + 1024(us) (== LOAD_AVG_MAX) In fact, the max value is: LOAD_AVG_MAXy + sa->period_contrib at any time in the time segment. Taking advantage of the fact that: LOAD_AVG_MAXy == LOAD_AVG_MAX-1024 the range becomes [0..LOAD_AVG_MAX-1024+sa->period_contrib]. As the elapsed part is already accounted in load/util_sum, we update the max value according to the current position in the time segment instead of removing its contribution. Suggested-by: Peter Zijlstra <peterz@infradead.org> Signed-off-by: Vincent Guittot <vincent.guittot@linaro.org> Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Cc: Linus Torvalds <torvalds@linux-foundation.org> Cc: Mike Galbraith <efault@gmx.de> Cc: Morten.Rasmussen@arm.com Cc: Thomas Gleixner <tglx@linutronix.de> Cc: bsegall@google.com Cc: dietmar.eggemann@arm.com Cc: pjt@google.com Cc: yuyang.du@intel.com Link: http://lkml.kernel.org/r/1493188076-2767-1-git-send-email-vincent.guittot@linaro.org Signed-off-by: Ingo Molnar <mingo@kernel.org>		2017-05-15 10:15:13 +02:00
..
autogroup.c	sched/autogroup: Rename auto_group.[ch] to autogroup.[ch]	2017-02-08 09:01:11 +01:00
autogroup.h	sched/headers: Prepare for new header dependencies before moving code to <linux/sched/autogroup.h>	2017-03-02 08:42:28 +01:00
clock.c	sched/clock: Fix broken stable to unstable transfer	2017-03-27 10:23:48 +02:00
completion.c	sched/headers: Prepare for new header dependencies before moving code to <linux/sched/debug.h>	2017-03-02 08:42:34 +01:00
core.c	sched/core: Call __schedule() from do_idle() without enabling preemption	2017-05-15 10:09:12 +02:00
cpuacct.c	sched/cputime: Convert kcpustat to nsecs	2017-02-01 09:13:47 +01:00
cpuacct.h	sched/cpuacct: Simplify the cpuacct code	2016-03-21 11:00:28 +01:00
cpudeadline.c	sched/core: Remove the tsk_cpus_allowed() wrapper	2017-03-02 08:42:24 +01:00
cpudeadline.h	sched/deadline: Split cpudl_set() into cpudl_set() and cpudl_clear()	2016-09-05 13:29:43 +02:00
cpufreq_schedutil.c	cpufreq: schedutil: Use policy-dependent transition delays	2017-04-17 18:37:27 +02:00
cpufreq.c	cpufreq / sched: Pass flags to cpufreq_update_util()	2016-08-16 22:14:55 +02:00
cpupri.c	sched/core: Remove the tsk_cpus_allowed() wrapper	2017-03-02 08:42:24 +01:00
cpupri.h	sched/cpupri: Remove unnecessary definitions in cpupri.h	2014-11-16 10:58:59 +01:00
cputime.c	sched/cputime: Fix ksoftirqd cputime accounting regression	2017-04-27 09:08:26 +02:00
deadline.c	sched/deadline: Use deadline instead of period when calculating overflow	2017-03-16 09:37:38 +01:00
debug.c	sched/headers: Prepare to move the task_lock()/unlock() APIs to <linux/sched/task.h>	2017-03-02 08:42:38 +01:00
fair.c	sched/cfs: Make util/load_avg more stable	2017-05-15 10:15:13 +02:00
features.h	sched/core: Add WARNING for multiple update_rq_clock() calls	2017-03-16 09:46:21 +01:00
idle_task.c	sched/core: Add wrappers for lockdep_(un)pin_lock()	2017-01-14 11:29:30 +01:00
idle.c	sched/core: Call __schedule() from do_idle() without enabling preemption	2017-05-15 10:09:12 +02:00
loadavg.c	sched/loadavg: Use {READ,WRITE}_ONCE() for sample window	2017-03-16 09:21:01 +01:00
Makefile	sched/autogroup: Rename auto_group.[ch] to autogroup.[ch]	2017-02-08 09:01:11 +01:00
rt.c	sched/rt: Add comments describing the RT IPI pull method	2017-03-16 09:41:35 +01:00
sched-pelt.h	sched/fair: Move the PELT constants into a generated header	2017-04-14 10:26:37 +02:00
sched.h	sched/core: Call __schedule() from do_idle() without enabling preemption	2017-05-15 10:09:12 +02:00
stats.c	sched: use %*pb[l] to print bitmaps including cpumasks and nodemasks	2015-02-13 21:21:37 -08:00
stats.h	sched/headers: Move cputime functionality from <linux/sched.h> and <linux/cputime.h> into <linux/sched/cputime.h>	2017-03-03 01:45:22 +01:00
stop_task.c	sched/core: Add wrappers for lockdep_(un)pin_lock()	2017-01-14 11:29:30 +01:00
swait.c	sched/headers: Prepare to move signal wakeup & sigpending methods from <linux/sched.h> into <linux/sched/signal.h>	2017-03-02 08:42:32 +01:00
topology.c	sched/topology: Split out scheduler topology code from core.c into topology.c	2017-02-07 10:58:12 +01:00
wait.c	sched/headers: fix up header file dependency on <linux/sched/signal.h>	2017-03-08 10:36:03 -08:00