linux

mirror of https://github.com/torvalds/linux.git synced 2024-11-11 06:31:49 +00:00

History

Dave Chinner 6dea405eee xfs: extent size hints can round up extents past MAXEXTLEN This results in BMBT corruption, as seen by this test: # mkfs.xfs -f -d size=40051712b,agcount=4 /dev/vdc .... # mount /dev/vdc /mnt/scratch # xfs_io -ft -c "extsize 16m" -c "falloc 0 30g" -c "bmap -vp" /mnt/scratch/foo which results in this failure on a debug kernel: XFS: Assertion failed: (blockcount & xfs_mask64hi(64-BMBT_BLOCKCOUNT_BITLEN)) == 0, file: fs/xfs/libxfs/xfs_bmap_btree.c, line: 211 .... Call Trace: [<ffffffff814cf0ff>] xfs_bmbt_set_allf+0x8f/0x100 [<ffffffff814cf18d>] xfs_bmbt_set_all+0x1d/0x20 [<ffffffff814f2efe>] xfs_iext_insert+0x9e/0x120 [<ffffffff814c7956>] ? xfs_bmap_add_extent_hole_real+0x1c6/0xc70 [<ffffffff814c7956>] xfs_bmap_add_extent_hole_real+0x1c6/0xc70 [<ffffffff814caaab>] xfs_bmapi_write+0x72b/0xed0 [<ffffffff811c72ac>] ? kmem_cache_alloc+0x15c/0x170 [<ffffffff814fe070>] xfs_alloc_file_space+0x160/0x400 [<ffffffff81ddcc29>] ? down_write+0x29/0x60 [<ffffffff815063eb>] xfs_file_fallocate+0x29b/0x310 [<ffffffff811d2bc8>] ? __sb_start_write+0x58/0x120 [<ffffffff811e3e18>] ? do_vfs_ioctl+0x318/0x570 [<ffffffff811cd680>] vfs_fallocate+0x140/0x260 [<ffffffff811ce6f8>] SyS_fallocate+0x48/0x80 [<ffffffff81ddec09>] system_call_fastpath+0x12/0x17 The tracepoint that indicates the extent that triggered the assert failure is: xfs_iext_insert: idx 0 offset 0 block 16777224 count 2097152 flag 1 Clearly indicating that the extent length is greater than MAXEXTLEN, which is 2097151. A prior trace point shows the allocation was an exact size match and that a length greater than MAXEXTLEN was asked for: xfs_alloc_size_done: agno 1 agbno 8 minlen 2097152 maxlen 2097152 ^^^^^^^ ^^^^^^^ We don't see this problem with extent size hints through the IO path because we can't do single IOs large enough to trigger MAXEXTLEN allocation. fallocate(), OTOH, is not limited in it's allocation sizes and so needs help here. The issue is that the extent size hint alignment is rounding up the extent size past MAXEXTLEN, because xfs_bmapi_write() is not taking into account extent size hints when calculating the maximum extent length to allocate. xfs_bmapi_reserve_delalloc() is already doing this, but direct extent allocation is not. Unfortunately, the calculation in xfs_bmapi_reserve_delalloc() is wrong, and it works only because delayed allocation extents are not limited in size to MAXEXTLEN in the in-core extent tree. hence this calculation does not work for direct allocation, and the delalloc code needs fixing. This may, in fact be the underlying bug that occassionally causes transaction overruns in delayed allocation extent conversion, so now we know it's wrong we should fix it, too. Many thanks to Brian Foster for finding this problem during review of this patch. Hence the fix, after much code reading, is to allow xfs_bmap_extsize_align() to align partial extents when full alignment would extend the alignment past MAXEXTLEN. We can safely do this because all callers have higher layer allocation loops that already handle short allocations, and so will simply run another allocation to cover the remainder of the requested allocation range that we ignored during alignment. The advantage of this approach is that it also removes the need for callers to do anything other than limit their requests to MAXEXTLEN - they don't really need to be aware of extent size hints at all. Signed-off-by: Dave Chinner <dchinner@redhat.com> Reviewed-by: Brian Foster <bfoster@redhat.com> Signed-off-by: Dave Chinner <david@fromorbit.com>		2015-05-29 07:40:06 +10:00
..
9p	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
adfs	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
affs	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
afs	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
autofs4	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
befs	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
bfs	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
btrfs	Merge branch 'for-linus-4.1' of git://git.kernel.org/pub/scm/linux/kernel/git/mason/linux-btrfs	2015-05-01 07:46:21 -07:00
cachefiles	VFS: fs/cachefiles: d_backing_inode() annotations	2015-04-15 15:06:59 -04:00
ceph	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
cifs	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
coda	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
configfs	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
cramfs
debugfs	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
devpts	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
dlm	netlink: make nlmsg_end() and genlmsg_end() void	2015-01-18 01:03:45 -05:00
ecryptfs	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
efivarfs	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
efs	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
exofs	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
exportfs	VFS: (Scripted) Convert S_ISLNK/DIR/REG(dentry->d_inode) to d_is_*(dentry)	2015-02-22 11:38:41 -05:00
ext2	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
ext3	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
ext4	Some miscellaneous bug fixes and some final on-disk and ABI changes	2015-05-03 18:23:53 -07:00
f2fs	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
fat	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
freevxfs	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
fscache
fuse	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
gfs2	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
hfs	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
hfsplus	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
hostfs	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
hpfs	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
hppfs	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
hugetlbfs	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
isofs	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
jbd
jbd2	jbd2: complain about descriptor block checksum errors	2015-01-19 15:59:58 -05:00
jffs2	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
jfs	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
kernfs	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
lockd	nfsd: eliminate NFSD_DEBUG	2015-04-21 16:16:02 -04:00
logfs	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
minix	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
ncpfs	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
nfs	NFS client updates for Linux 4.1	2015-04-26 17:33:59 -07:00
nfs_common
nfsd	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
nilfs2	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
nls
notify	fanotify: fix event filtering with FAN_ONDIR set	2015-03-12 18:46:08 -07:00
ntfs	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
ocfs2	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
omfs	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
openpromfs
overlayfs	ovl: upper fs should not be R/O	2015-03-18 10:29:48 +01:00
proc	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
pstore	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
qnx4
qnx6	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
quota	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
ramfs	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
reiserfs	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
romfs	make new_sync_{read,write}() static	2015-04-11 22:29:40 -04:00
squashfs	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
sysfs	sysfs: Only accept read/write permissions for file attributes	2015-03-25 13:27:57 +01:00
sysv	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
tracefs	tracing: Have mkdir and rmdir be part of tracefs	2015-02-03 12:48:43 -05:00
ubifs	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
udf	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
ufs	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
xfs	xfs: extent size hints can round up extents past MAXEXTLEN	2015-05-29 07:40:06 +10:00
aio.c	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-16 23:27:56 -04:00
anon_inodes.c
attr.c
bad_inode.c	don't bother with most of the bad_file_ops methods	2015-02-20 04:03:58 -05:00
binfmt_aout.c
binfmt_elf_fdpic.c
binfmt_elf.c	mm: fold arch_randomize_brk into ARCH_HAS_ELF_RANDOMIZE	2015-04-14 16:49:05 -07:00
binfmt_em86.c	syscalls: implement execveat() system call	2014-12-13 12:42:51 -08:00
binfmt_flat.c
binfmt_misc.c	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
binfmt_script.c	syscalls: implement execveat() system call	2014-12-13 12:42:51 -08:00
block_dev.c	direct-io: only inc/dec inode->i_dio_count for file systems	2015-04-24 15:45:28 -04:00
buffer.c	page_writeback: clean up mess around cancel_dirty_page()	2015-04-14 16:49:01 -07:00
char_dev.c	fs: introduce f_op->mmap_capabilities for nommu mmap support	2015-01-20 14:02:58 -07:00
compat_binfmt_elf.c
compat_ioctl.c	Bluetooth: bnep: Add support for get bnep features via ioctl	2015-04-03 23:21:34 +02:00
compat.c
coredump.c	coredump: accept any write method	2015-04-11 22:29:39 -04:00
dax.c	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
dcache.c	VFS: Impose ordering on accesses of d_inode and d_flags	2015-04-15 15:05:28 -04:00
dcookies.c
direct-io.c	direct-io: only inc/dec inode->i_dio_count for file systems	2015-04-24 15:45:28 -04:00
drop_caches.c	vmscan: per memory cgroup slab shrinkers	2015-02-12 18:54:09 -08:00
eventfd.c	eventfd: don't take the spinlock in eventfd_poll	2015-02-17 14:34:52 -08:00
eventpoll.c	epoll: optimize setting task running after blocking	2015-02-13 21:21:40 -08:00
exec.c	fs: take i_mutex during prepare_binprm for set[ug]id executables	2015-04-19 13:46:21 -07:00
fcntl.c	vfs: renumber FMODE_NONOTIFY and add to uniqueness check	2015-01-08 15:10:52 -08:00
fhandle.c
file_table.c	->aio_read and ->aio_write removed	2015-04-11 22:29:43 -04:00
file.c	mm: rcu-protected get_mm_exe_file()	2015-04-17 09:04:07 -04:00
filesystems.c
fs_pin.c	fs_pin: Allow for the possibility that m_list or s_list go unused.	2015-04-09 11:39:55 -05:00
fs_struct.c
fs-writeback.c	fs: add dirtytime_expire_seconds sysctl	2015-03-17 12:23:32 -04:00
inode.c	direct-io: only inc/dec inode->i_dio_count for file systems	2015-04-24 15:45:28 -04:00
internal.h	trylock_super(): replacement for grab_super_passive()	2015-02-22 11:38:42 -05:00
ioctl.c	fsioctl.c: make generic_block_fiemap() signal-tolerant	2015-02-10 14:30:30 -08:00
Kconfig	f2fs: relocate Kconfig from misc filesystems	2015-04-10 15:08:35 -07:00
Kconfig.binfmt	mm: split ET_DYN ASLR from mmap ASLR	2015-04-14 16:49:05 -07:00
libfs.c	VFS: fs library helpers: d_inode() annotations	2015-04-15 15:06:58 -04:00
locks.c	proc: show locks in /proc/pid/fdinfo/X	2015-04-17 09:04:12 -04:00
Makefile	This adds the new tracefs file system. This has been in linux-next for	2015-04-14 10:22:29 -07:00
mbcache.c
mount.h	switch the IO-triggering parts of umount to fs_pin	2015-01-25 23:17:29 -05:00
mpage.c
namei.c	RCU pathwalk breakage when running into a symlink overmounting something	2015-04-24 15:52:14 -04:00
namespace.c	mnt: Update detach_mounts to leave mounts connected	2015-04-09 11:39:57 -05:00
no-block.c
nsfs.c	VFS: assorted weird filesystems: d_inode() annotations	2015-04-15 15:06:58 -04:00
open.c	xfs: update for 4.1-rc1	2015-04-24 07:08:41 -07:00
pipe.c	VFS: assorted weird filesystems: d_inode() annotations	2015-04-15 15:06:58 -04:00
pnode.c	mnt: Don't propagate unmounts to locked mounts	2015-04-02 20:34:20 -05:00
pnode.h	mnt: Honor MNT_LOCKED when detaching mounts	2015-04-09 11:39:55 -05:00
posix_acl.c	VFS: assorted d_backing_inode() annotations	2015-04-15 15:06:59 -04:00
proc_namespace.c	vfs: add support for a lazytime mount option	2015-02-05 02:45:00 -05:00
read_write.c	new_sync_write(): discard ->ki_pos unless the return value is positive	2015-04-11 22:29:46 -04:00
readdir.c
select.c	all arches, signal: move restart_block to struct task_struct	2015-02-12 18:54:12 -08:00
seq_file.c	bitmap, cpumask, nodemask: remove dedicated formatting functions	2015-02-13 21:21:39 -08:00
signalfd.c
splice.c	dax: unify ext2/4_{dax,}_file_operations	2015-04-15 16:35:20 -07:00
stack.c
stat.c	VFS: assorted d_backing_inode() annotations	2015-04-15 15:06:59 -04:00
statfs.c
super.c	cleancache: remove limit on the number of cleancache enabled filesystems	2015-04-14 16:49:03 -07:00
sync.c	vfs: add support for a lazytime mount option	2015-02-05 02:45:00 -05:00
timerfd.c
utimes.c
xattr.c