linux/include
Herbert Xu 4b2f0260c7 [PATCH] nbd: fix TX/RX race condition
Janos Haar of First NetCenter Bt. reported numerous crashes involving the
NBD driver.  With his help, these were tracked down to bogus bio vectors,
which in turn were the result of a race condition between the receive and
transmit routines in the NBD driver.

The bug manifests itself like this:

CPU0				CPU1
do_nbd_request
	add req to queuelist
	nbd_send_request
		send req head
		for each bio
			kmap
			send
				nbd_read_stat
					nbd_find_request
					nbd_end_request
			kunmap

When CPU1 finishes nbd_end_request, the request and all its associated
bios are freed.  So when CPU0 calls kunmap, whose argument is derived from
the last bio, it may crash.
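
The interleaving above can be replayed deterministically in a small
userspace model.  This is only an illustrative sketch: struct model_req and
the model_* functions are hypothetical names that do not exist in the NBD
driver, and the freed flag merely stands in for the request and its bios
being released.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-in for a block request; not the kernel's struct request. */
struct model_req {
	bool queued;	/* visible to the receive path */
	bool freed;	/* set once the receive path has completed the request */
};

/* Receive path: it can only complete a request that it finds on the queue. */
static bool model_receive_reply(struct model_req *req)
{
	assert(req != NULL);
	if (!req->queued)
		return false;	/* nothing to match the reply against */
	req->queued = false;
	req->freed = true;	/* models nbd_end_request releasing the bios */
	return true;
}

/*
 * Replays the steps of the diagram in order.  Returns true if the transmit
 * path's final kunmap step would operate on an already freed request.
 */
static bool model_replay_buggy_order(void)
{
	struct model_req req = { false, false };

	req.queued = true;		/* CPU0: add req to queuelist */
	/* CPU0: send req head, kmap and send the last bio */
	model_receive_reply(&req);	/* CPU1: nbd_read_stat .. nbd_end_request */
	return req.freed;		/* CPU0: kunmap derives its argument from req */
}
```

In this model, model_replay_buggy_order() reports that the request was
already freed when the kunmap step is reached, which corresponds to the
crash described above.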

Under normal circumstances, the race occurs only on the last bio.  However,
if an error is encountered on the remote NBD server (such as an incorrect
magic number in the request), or if there is a bug in the server, it is
possible for nbd_end_request to occur at any time after the request's
addition to the queuelist.

The following patch fixes this problem by making sure that requests are not
added to the queuelist until after their transmission has completed.

In order for the receiving side to be ready for responses involving
requests still being transmitted, the patch introduces the concept of the
active request.

When a response matches the current active request, its processing is
delayed until after the transmission has come to a stop.
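
The active-request idea can be sketched in userspace with POSIX threads.
This is not the patch itself: struct sketch_dev, struct sketch_req, the
sketch_* functions, the integer handle and the use of a mutex with a
condition variable are all assumptions made for illustration.  The sketch
only shows the ordering: a request is queued after its transmission ends,
and a reply that matches the active request waits until then.

```c
#include <assert.h>
#include <pthread.h>
#include <stdbool.h>
#include <stddef.h>

struct sketch_req {			/* hypothetical request */
	int handle;
	bool tx_done;
	struct sketch_req *next;
};

struct sketch_dev {			/* hypothetical device state */
	pthread_mutex_t lock;
	pthread_cond_t tx_idle;		/* signalled when active_req is cleared */
	struct sketch_req *active_req;	/* request currently being transmitted */
	struct sketch_req *queue;	/* fully transmitted, awaiting replies */
};

/* Transmit path, step 1: publish the request as active, but do not queue it. */
static void sketch_tx_begin(struct sketch_dev *dev, struct sketch_req *req)
{
	assert(dev != NULL && req != NULL);
	pthread_mutex_lock(&dev->lock);
	dev->active_req = req;
	pthread_mutex_unlock(&dev->lock);
}

/* Transmit path, step 2: queue the request, clear active_req, wake waiters. */
static void sketch_tx_end(struct sketch_dev *dev, struct sketch_req *req)
{
	pthread_mutex_lock(&dev->lock);
	req->next = dev->queue;
	dev->queue = req;
	dev->active_req = NULL;
	pthread_cond_broadcast(&dev->tx_idle);
	pthread_mutex_unlock(&dev->lock);
}

/* Receive path: wait while the reply matches the active request, then dequeue. */
static struct sketch_req *sketch_find(struct sketch_dev *dev, int handle)
{
	struct sketch_req **pp, *req = NULL;

	pthread_mutex_lock(&dev->lock);
	while (dev->active_req && dev->active_req->handle == handle)
		pthread_cond_wait(&dev->tx_idle, &dev->lock);
	for (pp = &dev->queue; *pp; pp = &(*pp)->next) {
		if ((*pp)->handle == handle) {
			req = *pp;
			*pp = req->next;
			break;
		}
	}
	pthread_mutex_unlock(&dev->lock);
	return req;
}

static void *sketch_rx_thread(void *arg)
{
	struct sketch_req *req = sketch_find(arg, 42);

	/* Only report the request if its transmission had already finished. */
	return (req && req->tx_done) ? req : NULL;
}

/* Returns 0 if the receiver obtained the request only after transmission. */
static int sketch_demo(void)
{
	struct sketch_dev dev = { .active_req = NULL, .queue = NULL };
	struct sketch_req req = { .handle = 42 };
	pthread_t rx;
	void *found = NULL;

	pthread_mutex_init(&dev.lock, NULL);
	pthread_cond_init(&dev.tx_idle, NULL);

	sketch_tx_begin(&dev, &req);
	if (pthread_create(&rx, NULL, sketch_rx_thread, &dev) != 0)
		return -1;
	req.tx_done = true;		/* stands in for sending head and bios */
	sketch_tx_end(&dev, &req);
	pthread_join(rx, &found);

	pthread_cond_destroy(&dev.tx_idle);
	pthread_mutex_destroy(&dev.lock);
	return found == &req ? 0 : -1;
}
```

Whether the receiver thread runs before or after sketch_tx_end, it can
only obtain the request once transmission has finished: either it blocks
on tx_idle while the request is active, or it finds the request already
on the queue.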

This has been tested by Janos and it has been successful in curing this
race condition.

From: Herbert Xu <herbert@gondor.apana.org.au>

  Here is an updated patch which removes the active_req wait in
  nbd_clear_queue and the associated memory barrier.

  I've also clarified this in the comment.

Signed-off-by: Herbert Xu <herbert@gondor.apana.org.au>
Cc: <djani22@dynamicweb.hu>
Cc: Paul Clements <Paul.Clements@SteelEye.com>
Signed-off-by: Herbert Xu <herbert@gondor.apana.org.au>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>
2006-01-06 08:33:20 -08:00
acpi [ACPI] increase owner_id limit to 64 from 32 2005-12-21 04:52:16 -05:00
asm-alpha [FLS64]: generic version 2006-01-03 13:11:06 -08:00
asm-arm Merge git://git.kernel.org/pub/scm/linux/kernel/git/davem/net-2.6 2006-01-04 16:27:41 -08:00
asm-arm26 [FLS64]: generic version 2006-01-03 13:11:06 -08:00
asm-cris [FLS64]: generic version 2006-01-03 13:11:06 -08:00
asm-frv [FLS64]: generic version 2006-01-03 13:11:06 -08:00
asm-generic [FLS64]: generic version 2006-01-03 13:11:06 -08:00
asm-h8300 [FLS64]: generic version 2006-01-03 13:11:06 -08:00
asm-i386 [FLS64]: generic version 2006-01-03 13:11:06 -08:00
asm-ia64 [FLS64]: generic version 2006-01-03 13:11:06 -08:00
asm-m32r [FLS64]: generic version 2006-01-03 13:11:06 -08:00
asm-m68k [FLS64]: generic version 2006-01-03 13:11:06 -08:00
asm-m68knommu [FLS64]: generic version 2006-01-03 13:11:06 -08:00
asm-mips Merge git://git.kernel.org/pub/scm/linux/kernel/git/sam/kbuild 2006-01-04 16:36:52 -08:00
asm-parisc [FLS64]: generic version 2006-01-03 13:11:06 -08:00
asm-powerpc [FLS64]: generic version 2006-01-03 13:11:06 -08:00
asm-ppc [PATCH] ppc: ppc4xx_dma DMA_MODE_{READ,WRITE} fix 2005-12-16 14:43:05 -08:00
asm-s390 [FLS64]: generic version 2006-01-03 13:11:06 -08:00
asm-sh [FLS64]: generic version 2006-01-03 13:11:06 -08:00
asm-sh64 [FLS64]: generic version 2006-01-03 13:11:06 -08:00
asm-sparc [FLS64]: generic version 2006-01-03 13:11:06 -08:00
asm-sparc64 [FLS64]: generic version 2006-01-03 13:11:06 -08:00
asm-um [PATCH] uml: eliminate anonymous union and clean up symlink lossage 2005-11-22 09:13:41 -08:00
asm-v850 [FLS64]: generic version 2006-01-03 13:11:06 -08:00
asm-x86_64 [FLS64]: x86_64 version 2006-01-03 13:11:07 -08:00
asm-xtensa [FLS64]: generic version 2006-01-03 13:11:06 -08:00
keys [PATCH] Keys: Export user-defined keyring operations 2005-10-30 17:37:22 -08:00
linux [PATCH] nbd: fix TX/RX race condition 2006-01-06 08:33:20 -08:00
math-emu
media [PATCH] V4L/DVB: (3086c) Whitespaces cleanups part 4 2005-12-12 08:57:44 -08:00
mtd [MTD] user-abi: Clean up trailing white spaces 2005-11-07 14:43:11 +01:00
net [NETFILTER]: Call POST_ROUTING hook before fragmentation 2006-01-05 12:20:59 -08:00
pcmcia [PATCH] pcmcia: unify attach, EVENT_CARD_INSERTION handlers into one probe callback 2006-01-06 00:03:24 +01:00
rdma [IB] uverbs: have kernel return QP capabilities 2005-11-10 10:22:50 -08:00
rxrpc
scsi Merge git://git.kernel.org/pub/scm/linux/kernel/git/jejb/scsi-misc-2.6 2006-01-04 16:30:12 -08:00
sound [ALSA] Revert the nested-device patch 2006-01-04 10:13:48 +01:00
video