linux

History

Zach Brown 24542bf7ea btrfs: limit fallocate extent reservation to 256MB Very large fallocate requests are cpu bound and result in extents with a repeating pattern of ever decreasing size: $ time fallocate -l 1T file real 0m13.039s ( an excerpt of the extents from btrfs-debug-tree: ) prealloc data disk byte 1536292564992 nr 397312 prealloc data disk byte 1536292962304 nr 196608 prealloc data disk byte 1536293158912 nr 98304 prealloc data disk byte 1536293257216 nr 49152 prealloc data disk byte 1536293306368 nr 24576 prealloc data disk byte 1536293330944 nr 12288 prealloc data disk byte 1536293343232 nr 8192 prealloc data disk byte 1536293351424 nr 4096 prealloc data disk byte 1536293355520 nr 4096 prealloc data disk byte 1536293359616 nr 4096 The excessive cpu use comes from __btrfs_prealloc_file_range() trying to allocate the entire remaining size after each extent is allocated. btrfs_reserve_extent() repeatedly cuts this requested size in half until it gets down to the size that the allocators can return. We limit the problem for now by capping each reservation at 256 meg. The small extents come from a masking bug when decreasing the requested reservation size. The high 32bits are cleared and the remaining low bits might happen to reserve a small size. Fix this by using round_down() which properly casts the mask. After these fixes huge fallocate requests are fast and result in nice large extents: $ time fallocate -l 1T file real 0m0.082s prealloc data disk byte 1112425889792 nr 268435456 prealloc data disk byte 1112694325248 nr 268435456 prealloc data disk byte 1112962760704 nr 268435456 Reported-by: Eric Sandeen <sandeen@redhat.com> Signed-off-by: Zach Brown <zab@redhat.com> Signed-off-by: Chris Mason <chris.mason@fusionio.com>		2013-02-20 14:06:25 -05:00
..
acl.c	Btrfs: skip adding an acl attribute if we don't have to	2012-12-16 20:46:15 -05:00
async-thread.c	Btrfs: call the ordered free operation without any locks held	2012-07-25 16:15:07 -04:00
async-thread.h	btrfs: return void in functions without error conditions	2012-03-22 01:45:34 +01:00
backref.c	Btrfs: merge inode_list in __merge_refs	2012-12-12 17:15:27 -05:00
backref.h	Btrfs: move fs/btrfs/ioctl.h to include/uapi/linux/btrfs.h	2013-02-20 09:37:28 -05:00
btrfs_inode.h	Btrfs: serialize unlocked dio reads with truncate	2013-02-20 12:59:47 -05:00
check-integrity.c	btrfs: define BTRFS_MAGIC as a u64 value	2013-02-20 13:00:01 -05:00
check-integrity.h	Btrfs: add optional integrity check code	2011-12-21 19:14:09 +01:00
compat.h
compression.c	Btrfs: add rw argument to merge_bio_hook()	2013-02-01 11:49:47 -05:00
compression.h	btrfs: return void in functions without error conditions	2012-03-22 01:45:34 +01:00
ctree.c	btrfs: remove cache only arguments from defrag path	2013-02-20 12:59:36 -05:00
ctree.h	Merge branch 'raid56-experimental' into for-linus-3.9	2013-02-20 14:06:05 -05:00
delayed-inode.c	btrfs: remove unused "item" in btrfs_insert_delayed_item()	2013-02-20 12:59:23 -05:00
delayed-inode.h	Btrfs: fix lots of orphan inodes when the space is not enough	2013-02-20 09:36:39 -05:00
delayed-ref.c	Btrfs: make delayed ref lock logic more readable	2013-02-20 09:36:41 -05:00
delayed-ref.h	Merge branch 'raid56-experimental' into for-linus-3.9	2013-02-20 14:06:05 -05:00
dev-replace.c	Btrfs: check the return value of btrfs_start_delalloc_inodes()	2013-02-20 09:37:21 -05:00
dev-replace.h	Btrfs: add new sources for device replace code	2012-12-12 17:15:41 -05:00
dir-item.c	Btrfs: fix hash overflow handling	2012-12-17 14:48:21 -05:00
disk-io.c	Merge branch 'raid56-experimental' into for-linus-3.9	2013-02-20 14:06:05 -05:00
disk-io.h	Btrfs: RAID5 and RAID6	2013-02-01 14:24:23 -05:00
export.c	->encode_fh() API change	2012-05-29 23:28:33 -04:00
export.h
extent_io.c	Merge branch 'raid56-experimental' into for-linus-3.9	2013-02-20 14:06:05 -05:00
extent_io.h	Merge branch 'raid56-experimental' into for-linus-3.9	2013-02-20 14:06:05 -05:00
extent_map.c	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/mason/linux-btrfs	2013-02-08 12:06:46 +11:00
extent_map.h	Btrfs: do not allow logged extents to be merged or removed	2013-01-24 12:49:48 -05:00
extent-tree.c	btrfs: limit fallocate extent reservation to 256MB	2013-02-20 14:06:25 -05:00
file-item.c	Btrfs: extend the checksum item as much as possible	2013-02-20 12:59:37 -05:00
file.c	Merge branch 'master' of git://git.kernel.org/pub/scm/linux/kernel/git/josef/btrfs-next into for-linus-3.9	2013-02-20 14:05:45 -05:00
free-space-cache.c	Merge branch 'raid56-experimental' into for-linus-3.9	2013-02-20 14:06:05 -05:00
free-space-cache.h	btrfs: remove all unused functions	2011-05-06 12:34:03 +02:00
hash.h	btrfs: extended inode refs	2012-10-09 09:14:45 -04:00
inode-item.c	btrfs: extended inode refs	2012-10-09 09:14:45 -04:00
inode-map.c	Btrfs: improve the noflush reservation	2012-12-11 13:31:31 -05:00
inode-map.h	Btrfs: Support reading/writing on disk free ino cache	2011-04-25 16:46:11 +08:00
inode.c	btrfs: limit fallocate extent reservation to 256MB	2013-02-20 14:06:25 -05:00
ioctl.c	Merge branch 'master' of git://git.kernel.org/pub/scm/linux/kernel/git/josef/btrfs-next into for-linus-3.9	2013-02-20 14:05:45 -05:00
Kconfig	Btrfs: select XOR_BLOCKS in Kconfig	2013-02-05 09:55:30 -05:00
locking.c	Btrfs: save us a read_lock	2013-02-20 09:37:17 -05:00
locking.h	btrfs: return void in functions without error conditions	2012-03-22 01:45:34 +01:00
lzo.c	btrfs: remove the second argument of k[un]map_atomic()	2012-03-20 21:48:21 +08:00
Makefile	Btrfs: RAID5 and RAID6	2013-02-01 14:24:23 -05:00
math.h	Btrfs: cleanup duplicated division functions	2012-12-11 13:31:30 -05:00
ordered-data.c	Btrfs: place ordered operations on a per transaction list	2013-02-20 12:59:57 -05:00
ordered-data.h	Merge branch 'master' of git://git.kernel.org/pub/scm/linux/kernel/git/josef/btrfs-next into for-linus-3.9	2013-02-20 14:05:45 -05:00
orphan.c	btrfs: replace many BUG_ONs with proper error handling	2012-03-22 11:52:54 +01:00
print-tree.c	btrfs: add missing break in btrfs_print_leaf()	2013-02-20 12:59:20 -05:00
print-tree.h
qgroup.c	Btrfs: fix missing check before disabling quota	2013-02-20 13:00:07 -05:00
raid56.c	Btrfs: add a plugging callback to raid56 writes	2013-02-01 14:24:24 -05:00
raid56.h	Btrfs: RAID5 and RAID6	2013-02-01 14:24:23 -05:00
rcu-string.h	Btrfs: use rcu to protect device->name	2012-06-14 21:29:16 -04:00
reada.c	Btrfs: introduce GET_READ_MIRRORS functionality for btrfs_map_block()	2012-12-12 17:15:43 -05:00
relocation.c	Btrfs: use wrapper page_offset	2013-02-20 09:36:43 -05:00
root-tree.c	Btrfs: rename root_times_lock to root_item_lock	2012-12-16 20:46:21 -05:00
scrub.c	Merge branch 'raid56-experimental' into for-linus-3.9	2013-02-20 14:06:05 -05:00
send.c	btrfs: add "no file data" flag to btrfs send ioctl	2013-02-20 12:59:39 -05:00
send.h	btrfs: add "no file data" flag to btrfs send ioctl	2013-02-20 12:59:39 -05:00
struct-funcs.c	Btrfs: rewrite BTRFS_SETGET_FUNCS	2012-07-23 16:28:06 -04:00
super.c	Btrfs: fix uncompleted transaction	2013-02-20 13:00:05 -05:00
sysfs.c	btrfs: Remove unused sysfs code	2011-06-17 14:54:18 -04:00
transaction.c	Merge branch 'raid56-experimental' into for-linus-3.9	2013-02-20 14:06:05 -05:00
transaction.h	Btrfs: fix uncompleted transaction	2013-02-20 13:00:05 -05:00
tree-defrag.c	btrfs: remove cache only arguments from defrag path	2013-02-20 12:59:36 -05:00
tree-log.c	btrfs: remove cache only arguments from defrag path	2013-02-20 12:59:36 -05:00
tree-log.h	btrfs: return void in functions without error conditions	2012-03-22 01:45:34 +01:00
ulist.c	Btrfs: make aux field of ulist 64 bit	2012-10-01 15:18:53 -04:00
ulist.h	Btrfs: make aux field of ulist 64 bit	2012-10-01 15:18:53 -04:00
version.h
volumes.c	btrfs: Init io_lock after cloning btrfs device struct	2013-02-20 14:06:20 -05:00
volumes.h	Merge branch 'raid56-experimental' into for-linus-3.9	2013-02-20 14:06:05 -05:00
xattr.c	Btrfs: only log the inode item if we can get away with it	2012-12-16 20:46:21 -05:00
xattr.h
zlib.c	btrfs: fix message printing	2012-10-09 09:19:57 -04:00