linux/arch/um/include/sysdep-i386
Jeff Dike 53b173327d [PATCH] uml: fix I/O hang
Fix a UML hang in which everything would just stop until some I/O happened
- a ping, someone whacking the keyboard - at which point everything would
start up again as though nothing had happened.

The cause was gcc reordering some code which absolutely needed to be
executed in the order in the source.  When unblock_signals switches signals
from off to on, it needs to see if any interrupts had happened in the
critical section.  The interrupt handlers check signals_enabled - if it is
zero, then the handler adds a bit to the "pending" bitmask and returns.
unblock_signals checks this mask to see if any signals need to be
delivered.

The crucial part is this:
	signals_enabled = 1;
	save_pending = pending;
	if(save_pending == 0)
		return;
	pending = 0;

Signals are enabled first so that an interrupt arriving between reading
pending and setting it to zero is handled directly rather than recorded in
pending, where the record would be erased.

What happened was that gcc reordered this so that 'save_pending = pending'
came before 'signals_enabled = 1', creating a one-instruction window within
which an interrupt could arrive, set its bit in pending, and have it be
immediately erased.
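
The window can be replayed deterministically in a small user-space sketch.
This is not the UML code: handler(), unblock_signals_reordered() and
demo_lost_interrupt() are hypothetical names, and the interrupt is injected
by an ordinary function call at the point where the reordered load left the
gap. Going by the snippet above, the bit is only erased when some other
interrupt was already pending, since otherwise the function returns before
'pending = 0'; the sketch sets up that case.

```c
#include <assert.h>

/* Hypothetical stand-ins for the UML variables named in the message. */
static int signals_enabled;
static int pending;

/* Simulated interrupt handler: with signals off, it only records the bit. */
static void handler(int bit)
{
	if (!signals_enabled) {
		pending |= bit;
		return;
	}
	/* With signals on, the real handler would service the interrupt here. */
}

/*
 * unblock_signals in the order gcc emitted, with an interrupt injected by
 * hand into the window.  Returns the bits that would be delivered.
 */
static int unblock_signals_reordered(int irq_bit)
{
	int save_pending = pending;	/* load hoisted above the store */

	handler(irq_bit);		/* signals still off: bit goes to pending */
	signals_enabled = 1;
	if (save_pending == 0)
		return 0;
	pending = 0;			/* wipes irq_bit along with the saved bits */
	return save_pending;
}

/* One interrupt (0x1) is already pending when a second (0x2) hits the window. */
static int demo_lost_interrupt(void)
{
	int delivered;

	signals_enabled = 0;
	pending = 0x1;
	delivered = unblock_signals_reordered(0x2);
	assert(pending == 0);		/* no trace of 0x2 is left */
	return delivered;
}
```

demo_lost_interrupt() returns only the bit that was saved before the window
opened; the interrupt that arrived inside the window is neither delivered
nor left in pending.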

When the I/O workload is purely disk-based, the loss of a block device
interrupt stops the entire I/O system because the next block request will
wait for the current one to finish.  Thus the system hangs until something
else causes some I/O to arrive, such as a network packet or console input.

The fix to this particular problem is a memory barrier between enabling
signals and reading the pending signal mask.  An xchg would also probably
work.

Looking over this code for similar problems led me to do a few more
things:

- make signals_enabled and pending volatile so that they don't get cached
  in registers

- add an mb() to the return paths of block_signals and unblock_signals so
  that the modification of signals_enabled doesn't get shuffled into the
  caller in the event that these are inlined in the future.
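
Taken together, the barrier fix and the two follow-up changes might look
roughly like the sketch below.  It is a user-space approximation, not the
patch itself: barrier() here is a compiler-only barrier in GCC/Clang syntax
standing in for the kernel's mb(), and handler(), delivered and
demo_critical_section() are hypothetical names added for illustration.

```c
#include <assert.h>

/* Compiler-only barrier in GCC/Clang syntax, standing in for mb(). */
#define barrier() __asm__ __volatile__("" : : : "memory")

/* volatile, so that neither variable gets cached in a register */
static volatile int signals_enabled;
static volatile int pending;
static int delivered;	/* hypothetical: bits handed to the delivery step */

/* Simulated interrupt handler. */
static void handler(int bit)
{
	if (!signals_enabled) {
		pending |= bit;		/* record it for unblock_signals */
		return;
	}
	delivered |= bit;		/* signals on: service it directly */
}

static void block_signals(void)
{
	signals_enabled = 0;
	barrier();	/* keep the store from drifting into an inlining caller */
}

static void unblock_signals(void)
{
	int save_pending;

	signals_enabled = 1;
	barrier();	/* the fix: the store must precede the load below */
	save_pending = pending;
	if (save_pending == 0) {
		barrier();
		return;
	}
	pending = 0;
	delivered |= save_pending;	/* stand-in for running the handlers */
	barrier();
}

/* One interrupt inside the critical section, one after it. */
static int demo_critical_section(void)
{
	delivered = 0;
	block_signals();
	handler(0x1);			/* recorded in pending */
	assert(pending == 0x1);
	unblock_signals();		/* picks up 0x1 */
	handler(0x2);			/* serviced directly */
	return delivered;
}
```

A compiler barrier is enough to stop the reordering described above, which
was done by gcc rather than by the CPU; whether a full hardware barrier is
also needed depends on the host, which this sketch does not try to model.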

Signed-off-by: Jeff Dike <jdike@addtoit.com>
Cc: Paolo 'Blaisorblade' Giarrusso <blaisorblade@yahoo.it>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>
2006-11-03 12:27:58 -08:00
archsetjmp.h [PATCH] uml: thread creation tidying 2006-09-27 08:26:16 -07:00
barrier.h [PATCH] uml: fix I/O hang 2006-11-03 12:27:58 -08:00
checksum.h [PATCH] uml: sparse cleanups 2006-03-31 12:18:51 -08:00
faultinfo.h [PATCH] uml: S390 preparation, abstract host page fault data 2005-05-05 16:36:36 -07:00
kernel-offsets.h [PATCH] uml: asm offsets duplication removal 2006-10-11 11:14:20 -07:00
ptrace_user.h [PATCH] uml: skas0 - separate kernel address space on stock hosts 2005-07-07 18:23:44 -07:00
ptrace.h [PATCH] uml: clean arch_switch usage 2006-03-31 12:18:52 -08:00
sc.h [PATCH] uml makefiles sanitized 2005-09-29 08:46:26 -07:00
sigcontext.h [PATCH] uml: fix page faults in SKAS3 mode. 2005-09-30 12:41:18 -07:00
skas_ptrace.h [PATCH] uml: S390 preparation, abstract host page fault data 2005-05-05 16:36:36 -07:00
stub.h [PATCH] uml skas0: stop gcc's insanity 2005-12-18 11:19:44 -08:00
syscalls.h [PATCH] unify sys_ptrace prototype 2005-10-30 17:37:20 -08:00
thread.h [PATCH] uml: fix compile failure for TT mode 2005-10-24 08:59:25 -07:00
tls.h [PATCH] uml: check for differences in host support 2006-03-31 12:18:52 -08:00