git.cascardo.eti.br Git - cascardo/linux.git/commit

author	Liu Bo <bo.li.liu@oracle.com>
	Mon, 27 Aug 2012 16:52:20 +0000 (10:52 -0600)
committer	Chris Mason <chris.mason@fusionio.com>
	Mon, 1 Oct 2012 19:19:05 +0000 (15:19 -0400)
commit	4e2f84e63dc138eca91e89ccbc34f37732ce58f7
tree	31691a22773cf249fc289d8414be62b52d071513	tree \| snapshot
parent	ca7e70f59078046db28501519308c2061b0e7a6f	commit \| diff

Btrfs: improve fsync by filtering extents that we want

This is based on Josef's "Btrfs: turbo charge fsync".

The above Josef's patch performs very good in random sync write test,
because we won't have too much extents to merge.

However, it does not performs good on the test:
dd if=/dev/zero of=foobar bs=4k count=12500 oflag=sync

The reason is when we do sequencial sync write, we need to merge the
current extent just with the previous one, so that we can get accumulated
extents to log:

A(4k) --> AA(8k) --> AAA(12k) --> AAAA(16k) ...

So we'll have to flush more and more checksum into log tree, which is the
bottleneck according to my tests.

But we can avoid this by telling fsync the real extents that are needed
to be logged.

With this, I did the above dd sync write test (size=50m),

         w/o (orig)   w/ (josef's)   w/ (this)
SATA      104KB/s       109KB/s       121KB/s
ramdisk   1.5MB/s       1.5MB/s       10.7MB/s (613%)

Signed-off-by: Liu Bo <bo.li.liu@oracle.com>

fs/btrfs/extent_map.c		diff \| blob \| history
fs/btrfs/extent_map.h		diff \| blob \| history
fs/btrfs/inode.c		diff \| blob \| history
fs/btrfs/tree-log.c		diff \| blob \| history