Eric Lee / smarc-fsl-linux-kernel

14 Jan, 2012

1 commit

1a52bb0b6 Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/sage/ceph-client ... Browse Code »

* 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/sage/ceph-client:
ceph: ensure prealloc_blob is in place when removing xattr
rbd: initialize snap_rwsem in rbd_add()
ceph: enable/disable dentry complete flags via mount option
vfs: export symbol d_find_any_alias()
ceph: always initialize the dentry in open_root_dentry()
libceph: remove useless return value for osd_client __send_request()
ceph: avoid iput() while holding spinlock in ceph_dir_fsync
ceph: avoid useless dget/dput in encode_fh
ceph: dereference pointer after checking for NULL
crush: fix force for non-root TAKE
ceph: remove unnecessary d_fsdata conditional checks
ceph: Use kmemdup rather than duplicating its implementation

Fix up conflicts in fs/ceph/super.c (d_alloc_root() failure handling vs
always initialize the dentry in open_root_dentry)

Linus Torvalds
2012-01-14 02:29:21 +0800

13 Jan, 2012

2 commits

83eb26af0 ceph: ensure prealloc_blob is in place when removing xattr ... Browse Code »

In __ceph_build_xattrs_blob(), if a ceph inode's extended attributes
are marked dirty, all attributes recorded in its rb_tree index are
formatted into a "blob" buffer. The target buffer is recorded in
ceph_inode->i_xattrs.prealloc_blob, and it is expected to exist and
be of sufficient size to hold the attributes.

The extended attributes are marked dirty in two cases: when a new
attribute is added to the inode; or when one is removed. In the
former case work is done to ensure the prealloc_blob buffer is
properly set up, but in the latter it is not.

Change the logic in ceph_removexattr() so it matches what is
done in ceph_setxattr(). Note that this is done in a way that
keeps the two blocks of code nearly identical, in anticipation
of a subsequent patch that encapsulates some of this logic into
one or more helper routines.

Signed-off-by: Alex Elder
Signed-off-by: Sage Weil

Alex Elder
2012-01-13 03:00:51 +0800
a40dc6cc2 ceph: enable/disable dentry complete flags via mount option ... Browse Code »

Enable/disable use of the dentry dir 'complete' flag via a mount option.
This lets the admin control whether ceph uses the dcache to satisfy
negative lookups or readdir when it has the entire directory contents in
its cache.

This is purely a performance optimization; correctness is guaranteed
whether it is enabled or not.

Reviewed-by: Christoph Hellwig
Signed-off-by: Sage Weil

Sage Weil
2012-01-13 03:00:40 +0800

12 Jan, 2012

1 commit

d46cfba53 ceph: always initialize the dentry in open_root_dentry() ... Browse Code »

When open_root_dentry() gets a dentry via d_obtain_alias() it does
not get initialized. If the dentry obtained came from the cache,
this is OK. But if not, the result is an improperly initialized
dentry.

To fix this, call ceph_init_dentry() regardless of which path
produced the dentry. That function returns immediately for a dentry
that is already initialized, it is safe to use either way.

(Credit to Sage, who suggested this fix.)

Signed-off-by: Alex Elder

Alex Elder
2012-01-12 08:28:25 +0800

11 Jan, 2012

4 commits

2ff179e65 ceph: avoid iput() while holding spinlock in ceph_dir_fsync ... Browse Code »

ceph_mdsc_put_request() can call iput(), which can sleep. Don't do that.

Fixes: #1812
Signed-off-by: Sage Weil

Sage Weil
2012-01-11 00:57:02 +0800
ee6b1baf6 ceph: avoid useless dget/dput in encode_fh ... Browse Code »

Nothing we do here sleeps, so just do it under d_lock and avoid the dget/
dput entirely.

Reported-by: Al Viro
Signed-off-by: Sage Weil

Sage Weil
2012-01-11 00:57:00 +0800
b8cd952b5 ceph: dereference pointer after checking for NULL ... Browse Code »

moved dereference after BUG_ON

Signed-off-by: Yehuda Sadeh

Yehuda Sadeh
2012-01-11 00:56:59 +0800
3d8eb7a94 ceph: remove unnecessary d_fsdata conditional checks ... Browse Code »

We now set d_fsdata unconditionally on all dentries prior to setting up
the d_ops, so all of these checks are unnecessary.

Signed-off-by: Sage Weil

Sage Weil
2012-01-11 00:56:56 +0800

10 Jan, 2012

1 commit

3c5184ef1 ceph: d_alloc_root() may fail ... Browse Code »

... and ceph_init_dentry(NULL) will oops

Signed-off-by: Al Viro

Al Viro
2012-01-10 05:36:12 +0800

07 Jan, 2012

1 commit

34c80b1d9 vfs: switch ->show_options() to struct dentry * ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2012-01-07 12:19:54 +0800

04 Jan, 2012

6 commits

5706b27de ceph: propagate umode_t ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:55:16 +0800
dba19c606 get rid of open-coded S_ISREG(), etc. ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:55:12 +0800
1a67aafb5 switch ->mknod() to umode_t ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:54:54 +0800
4acdaf27e switch ->create() to umode_t ... Browse Code »

vfs_create() ignores everything outside of 16bit subset of its
mode argument; switching it to umode_t is obviously equivalent
and it's the only caller of the method

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:54:53 +0800
18bb1db3e switch vfs_mkdir() and ->mkdir() to umode_t ... Browse Code »

vfs_mkdir() gets int, but immediately drops everything that might not
fit into umode_t and that's the only caller of ->mkdir()...

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:54:53 +0800
6b520e056 vfs: fix the stupidity with i_dentry in inode destructors ... Browse Code »

Seeing that just about every destructor got that INIT_LIST_HEAD() copied into
it, there is no point whatsoever keeping this INIT_LIST_HEAD in inode_init_once();
the cost of taking it into inode_init_always() will be negligible for pipes
and sockets and negative for everything else. Not to mention the removal of
boilerplate code from ->destroy_inode() instances...

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:52:40 +0800

30 Dec, 2011

1 commit

a4d46363c ceph: disable use of dcache for readdir etc. ... Browse Code »

Ceph attempts to use the dcache to satisfy negative lookups and readdir
when the entire directory contents are in cache. Disable this behavior
until lingering bugs in this code are shaken out; we'll re-enable these
hooks once things are fully stable.

Signed-off-by: Sage Weil

Sage Weil
2011-12-30 00:05:14 +0800

14 Dec, 2011

2 commits

9d5a09e65 ceph: add missing spin_unlock at ceph_mdsc_build_path() ... Browse Code »

one of the paths was missing spin_unlock

Signed-off-by: Yehuda Sadeh

Yehuda Sadeh
2011-12-14 03:59:53 +0800
6a82c47aa ceph: fix SEEK_CUR, SEEK_SET regression ... Browse Code »

Commit 06222e491e663dac939f04b125c9dc52126a75c4 got the if wrong so that
it always evaluates as true. This is semantically harmless, but makes
SEEK_CUR and SEEK_SET needlessly query the server.

Rewrite the if to explicitly enumerate the cases we DO need a valid i_size
to make this code less fragile.

Reported-by: Roel Kluin
Signed-off-by: Sage Weil

Sage Weil
2011-12-14 01:19:26 +0800

08 Dec, 2011

1 commit

be655596b ceph: use i_ceph_lock instead of i_lock ... Browse Code »

We have been using i_lock to protect all kinds of data structures in the
ceph_inode_info struct, including lists of inodes that we need to iterate
over while avoiding races with inode destruction. That requires grabbing
a reference to the inode with the list lock protected, but igrab() now
takes i_lock to check the inode flags.

Changing the list lock ordering would be a painful process.

However, using a ceph-specific i_ceph_lock in the ceph inode instead of
i_lock is a simple mechanical change and avoids the ordering constraints
imposed by igrab().

Reported-by: Amon Ott
Signed-off-by: Sage Weil

Sage Weil
2011-12-08 02:46:44 +0800

03 Dec, 2011

1 commit

2151937d7 ceph: fix rasize reporting by ceph_show_options ... Browse Code »

Fix typo.

Reported-by: mowang da
Signed-off-by: Sage Weil

Sage Weil
2011-12-03 01:27:54 +0800

12 Nov, 2011

1 commit

774ac21da ceph: initialize root dentry ... Browse Code »

Set up d_fsdata on the root dentry. This fixes a NULL pointer dereference
in ceph_d_prune on umount. It also means we can eventually strip out all
of the conditional checks on d_fsdata because it is now set unconditionally
(prior to setting up the d_ops).

Fix the ceph_d_prune debug print while we're here.

Signed-off-by: Sage Weil

Sage Weil
2011-11-12 01:50:17 +0800

06 Nov, 2011

4 commits

15a2015fb ceph: fix iput race when queueing inode work ... Browse Code »

If we queue a work item that calls iput(), make sure we ihold() before
attempting to queue work. Otherwise our queued work might miraculously run
before we notice the queue_work() succeeded and call ihold(), allowing the
inode to be destroyed.

That is, instead of

if (queue_work(...))
ihold();

we need to do

ihold();
if (!queue_work(...))
iput();

Reported-by: Amon Ott
Signed-off-by: Sage Weil

Sage Weil
2011-11-06 13:06:31 +0800
0c6d4b4e2 ceph/super.c: quiet sparse noise ... Browse Code »

Quiet the sparse noise:

warning: symbol 'create_fs_client' was not declared. Should it be static?
warning: symbol 'destroy_fs_client' was not declared. Should it be static?

Signed-off-by: H Hartley Sweeten
Cc: Sage Weil
ceph-devel@vger.kernel.org
Signed-off-by: Sage Weil

H Hartley Sweeten
2011-11-06 12:10:12 +0800
7fd7d101f ceph/mds_client.c: quiet sparse noise ... Browse Code »

Quiet the following sparse noise:

warning: symbol 'get_nonsnap_parent' was not declared. Should it be static?
warning: symbol 'done_closing_sessions' was not declared. Should it be static?

Local functions don't need external visability. Make them static.

Signed-off-by: H Hartley Sweeten
Cc: Sage Weil
Signed-off-by: Sage Weil

H Hartley Sweeten
2011-11-06 12:10:11 +0800
c6ffe1001 ceph: use new D_COMPLETE dentry flag ... Browse Code »

We used to use a flag on the directory inode to track whether the dcache
contents for a directory were a complete cached copy. Switch to a dentry
flag CEPH_D_COMPLETE that is safely updated by ->d_prune().

Signed-off-by: Sage Weil

Sage Weil
2011-11-06 12:10:10 +0800

04 Nov, 2011

1 commit

b58dc4100 ceph: clear parent D_COMPLETE flag when on dentry prune ... Browse Code »

When the VFS prunes a dentry from the cache, clear the D_COMPLETE flag
on the parent dentry. Do this for the live and snapshotted namespaces. Do
not bother for the .snap dir contents, since we do not cache that.

Signed-off-by: Sage Weil

Sage Weil
2011-11-04 00:23:49 +0800

02 Nov, 2011

1 commit

bfe868486 filesystems: add set_nlink() ... Browse Code »

Replace remaining direct i_nlink updates with a new set_nlink()
updater function.

Signed-off-by: Miklos Szeredi
Tested-by: Toshiyuki Okajima
Signed-off-by: Christoph Hellwig

Miklos Szeredi
2011-11-02 19:53:43 +0800

26 Oct, 2011

11 commits

339573406 libceph: fix double-free of page vector ... Browse Code »

ceph_release_page_vector() kfrees the vector; we shouldn't do it here too.

Reported-by: Jeff Wu
Signed-off-by: Sage Weil

Sage Weil
2011-10-26 07:10:17 +0800
3310f7541 ceph: fix 32-bit ino numbers ... Browse Code »

Fix 32-bit ino generation to not always be 1.

Signed-off-by: Amon Ott

Amon Ott
2011-10-26 07:10:17 +0800
a35eca958 ceph: let the set_layout ioctl set single traits ... Browse Code »

Previously we were validating the passed-in stripe unit, object size,
and stripe count against each other (and not testing most other stuff).
Instead, make sure that the composed previous layout and new values are valid,
and only send the new values to the MDS. This lets users change the
pool without setting the whole layout, for instance.

Signed-off-by: Greg Farnum

Greg Farnum
2011-10-26 07:10:16 +0800
83eaea22b Revert "ceph: don't truncate dirty pages in invalidate work thread" ... Browse Code »

This reverts commit c9af9fb68e01eb2c2165e1bc45cfeeed510c64e6.

We need to block and truncate all pages in order to reliably invalidate
them. Otherwise, we could:

- have some uptodate pages in the cache
- queue an invalidate
- write(2) locks some pages
- invalidate_work skips them
- write(2) only overwrites part of the page
- page now dirty and uptodate
-> partial leakage of invalidated data

It's not entirely clear why we started skipping locked pages in the first
place. I just ran this through fsx and didn't see any problems.

Signed-off-by: Sage Weil

Sage Weil
2011-10-26 07:10:16 +0800
80db8bea6 ceph: replace leading spaces with tabs ... Browse Code »

Trivial formatting fix.

Signed-off-by: Noah Watkins
Signed-off-by: Sage Weil

Noah Watkins
2011-10-26 07:10:16 +0800
b61c27636 libceph: don't complain on msgpool alloc failures ... Browse Code »

The pool allocation failures are masked by the pool; there is no need to
spam the console about them. (That's the whole point of having the pool
in the first place.)

Mark msg allocations whose failure is safely handled as such.

Signed-off-by: Sage Weil

Sage Weil
2011-10-26 07:10:15 +0800
6ab00d465 libceph: create messenger with client ... Browse Code »

This simplifies the init/shutdown paths, and makes client->msgr available
during the rest of the setup process.

Signed-off-by: Sage Weil

Sage Weil
2011-10-26 07:10:15 +0800
6a8ea4706 ceph: document ioctls ... Browse Code »

...after some prodding by Christoph.

Signed-off-by: Sage Weil

Sage Weil
2011-10-26 07:10:15 +0800
0d66a487c ceph: implement (optional) max read size ... Browse Code »

The 'rsize' mount option limits the maximum size of an individual
read(ahead) operation that is sent off to an OSD. This is distinct from
'rasize', which controls the size of the readahead window.

Signed-off-by: Sage Weil

Sage Weil
2011-10-26 07:10:15 +0800
83817e35c ceph: rename rsize -> rasize ... Browse Code »

It controls readahead.

Signed-off-by: Sage Weil

Sage Weil
2011-10-26 07:10:15 +0800
7c272194e ceph: make readpages fully async ... Browse Code »

When we get a ->readpages() aop, submit async reads for all page ranges
in the provided page list. Lock the pages immediately, so that VFS/MM
will block until the reads complete.

Signed-off-by: Sage Weil

Sage Weil
2011-10-26 07:10:14 +0800

10 Sep, 2011

1 commit

0d20fbbe8 Merge branch 'for-linus' of git://ceph.newdream.net/git/ceph-client ... Browse Code »

* 'for-linus' of git://ceph.newdream.net/git/ceph-client:
libceph: fix leak of osd structs during shutdown
ceph: fix memory leak
ceph: fix encoding of ino only (not relative) paths
libceph: fix msgpool

Linus Torvalds
2011-09-10 06:48:34 +0800