Eric Lee / smarc-fsl-linux-kernel

11 Jan, 2012

1 commit

3d8eb7a94 ceph: remove unnecessary d_fsdata conditional checks ... Browse Code »

We now set d_fsdata unconditionally on all dentries prior to setting up
the d_ops, so all of these checks are unnecessary.

Signed-off-by: Sage Weil

Sage Weil
2012-01-11 00:56:56 +0800

14 Dec, 2011

1 commit

9d5a09e65 ceph: add missing spin_unlock at ceph_mdsc_build_path() ... Browse Code »

one of the paths was missing spin_unlock

Signed-off-by: Yehuda Sadeh

Yehuda Sadeh
2011-12-14 03:59:53 +0800

08 Dec, 2011

1 commit

be655596b ceph: use i_ceph_lock instead of i_lock ... Browse Code »

We have been using i_lock to protect all kinds of data structures in the
ceph_inode_info struct, including lists of inodes that we need to iterate
over while avoiding races with inode destruction. That requires grabbing
a reference to the inode with the list lock protected, but igrab() now
takes i_lock to check the inode flags.

Changing the list lock ordering would be a painful process.

However, using a ceph-specific i_ceph_lock in the ceph inode instead of
i_lock is a simple mechanical change and avoids the ordering constraints
imposed by igrab().

Reported-by: Amon Ott
Signed-off-by: Sage Weil

Sage Weil
2011-12-08 02:46:44 +0800

06 Nov, 2011

2 commits

7fd7d101f ceph/mds_client.c: quiet sparse noise ... Browse Code »

Quiet the following sparse noise:

warning: symbol 'get_nonsnap_parent' was not declared. Should it be static?
warning: symbol 'done_closing_sessions' was not declared. Should it be static?

Local functions don't need external visability. Make them static.

Signed-off-by: H Hartley Sweeten
Cc: Sage Weil
Signed-off-by: Sage Weil

H Hartley Sweeten
2011-11-06 12:10:11 +0800
c6ffe1001 ceph: use new D_COMPLETE dentry flag ... Browse Code »

We used to use a flag on the directory inode to track whether the dcache
contents for a directory were a complete cached copy. Switch to a dentry
flag CEPH_D_COMPLETE that is safely updated by ->d_prune().

Signed-off-by: Sage Weil

Sage Weil
2011-11-06 12:10:10 +0800

26 Oct, 2011

1 commit

b61c27636 libceph: don't complain on msgpool alloc failures ... Browse Code »

The pool allocation failures are masked by the pool; there is no need to
spam the console about them. (That's the whole point of having the pool
in the first place.)

Mark msg allocations whose failure is safely handled as such.

Signed-off-by: Sage Weil

Sage Weil
2011-10-26 07:10:15 +0800

16 Aug, 2011

1 commit

795858dbd ceph: fix encoding of ino only (not relative) paths ... Browse Code »

A 'path' consists of a starting ino and relative component. Encode even
when there is no relative component. This is primarily needed by the
NFS reexport code.

Signed-off-by: Sage Weil

Sage Weil
2011-08-16 04:03:56 +0800

27 Jul, 2011

4 commits

d79698da3 ceph: document unlocked d_parent accesses ... Browse Code »

For the most part we don't care about racing with rename when directing
MDS requests; either the old or new parent is fine. Document that, and
do some minor cleanup.

Reviewed-by: Yehuda Sadeh
Signed-off-by: Sage Weil

Sage Weil
2011-07-27 02:31:26 +0800
41b02e1f9 ceph: explicitly reference rename old_dentry parent dir in request ... Browse Code »

We carry a pin on the parent directory for the rename source and dest
dentries. For the source it's r_locked_dir; we need to explicitly
reference the old_dentry parent as well, since the dentry's d_parent may
change between when the request was created and pinned and when it is
freed.

Reviewed-by: Yehuda Sadeh
Signed-off-by: Sage Weil

Sage Weil
2011-07-27 02:31:14 +0800
e5f86dc37 ceph: avoid d_parent in ceph_dentry_hash; fix ceph_encode_fh() hashing bug ... Browse Code »

Have caller pass in a safely-obtained reference to the parent directory
for calculating a dentry's hash valud.

While we're here, simpify the flow through ceph_encode_fh() so that there
is a single exit point and cleanup.

Also fix a bug with the dentry hash calculation: calculate the hash for the
dentry we were given, not its parent.

Reviewed-by: Yehuda Sadeh
Signed-off-by: Sage Weil

Sage Weil
2011-07-27 02:30:55 +0800
2f90b852e ceph: ignore lease mask ... Browse Code »

The lease mask is no longer used (and it changed a while back). Instead,
use a non-zero duration to indicate that there is a lease being issued.

Reviewed-by: Yehuda Sadeh
Signed-off-by: Sage Weil

Sage Weil
2011-07-27 02:28:25 +0800

17 Jul, 2011

1 commit

1b71fe2ef ceph analog of cifs build_path_from_dentry() race fix ... Browse Code »

... unfortunately, cifs bug got copied. Fix is essentially the same.

Signed-off-by: Al Viro

Al Viro
2011-07-17 11:43:58 +0800

25 May, 2011

1 commit

db3540522 ceph: fix cap flush race reentrancy ... Browse Code »

In e9964c10 we change cap flushing to do a delicate dance because some
inodes on the cap_dirty list could be in a migrating state (got EXPORT but
not IMPORT) in which we couldn't actually flush and move from
dirty->flushing, breaking the while (!empty) { process first } loop
structure. It worked for a single sync thread, but was not reentrant and
triggered infinite loops when multiple syncers came along.

Instead, move inodes with dirty to a separate cap_dirty_migrating list
when in the limbo export-but-no-import state, allowing us to go back to
the simple loop structure (which was reentrant). This is cleaner and more
robust.

Audited the cap_dirty users and this looks fine:
list_empty(&ci->i_dirty_item) is still a reliable indicator of whether we
have dirty caps (which list we're on is irrelevant) and list_del_init()
calls still do the right thing.

Signed-off-by: Sage Weil

Sage Weil
2011-05-25 02:52:12 +0800

20 May, 2011

2 commits

1b3669857 libceph: remove unused variable ... Browse Code »

Signed-off-by: Sage Weil

Sage Weil
2011-05-20 02:24:17 +0800
3b6637803 ceph: take reference on mds request r_unsafe_dir ... Browse Code »

We put ourselves on an inode list for the parent directory of metadata
operations so that an fsync on the directory will wait for metadata updates
to commit to disk. We weren't holding a reference to that directory,
however, and under certain workloads (fsstress in this case) the directory
can go away.

Signed-off-by: Sage Weil

Sage Weil
2011-05-20 02:20:07 +0800

12 May, 2011

1 commit

7d8e18a69 ceph: print debug message before put mds session ... Browse Code »

The mds session, s, could be freed during ceph_put_mds_session.
Move dout before ceph_put_mds_session.

Signed-off-by: Henry C Chang
Signed-off-by: Sage Weil

Henry C Chang
2011-05-12 01:44:34 +0800

26 Mar, 2011

1 commit

ef550f6f4 ceph: flush msgr_wq during mds_client shutdown ... Browse Code »

The release method for mds connections uses a backpointer to the
mds_client, so we need to flush the workqueue of any pending work (and
ceph_connection references) prior to freeing the mds_client. This fixes
an oops easily triggered under UML by

while true ; do mount ... ; umount ... ; done

Also fix an outdated comment: the flush in ceph_destroy_client only flushes
OSD connections out. This bug is basically an artifact of the ceph ->
ceph+libceph conversion.

Signed-off-by: Sage Weil

Sage Weil
2011-03-26 04:27:48 +0800

28 Jan, 2011

1 commit

b12ece7d8 Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/sage/ceph-client ... Browse Code »

* 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/sage/ceph-client:
ceph: avoid picking MDS that is not active
ceph: avoid immediate cap check after import
ceph: fix flushing of caps vs cap import
ceph: fix erroneous cap flush to non-auth mds
ceph: fix cap_wanted_delay_{min,max} mount option initialization
ceph: fix xattr rbtree search
ceph: fix getattr on directory when using norbytes

Linus Torvalds
2011-01-28 10:12:58 +0800

26 Jan, 2011

1 commit

d66bbd441 ceph: avoid picking MDS that is not active ... Browse Code »

Ignore replication or auth frag data if it indicates an MDS that is not
active. This can happen if the MDS shuts down and the client has stale
data about the namespace distribution across the MDS cluster. If that's
the case, fall back to directing the request based on the auth cap (which
should always be accurate).

Signed-off-by: Sage Weil

Sage Weil
2011-01-26 00:16:37 +0800

14 Jan, 2011

1 commit

a17031542 Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/sage/ceph-client ... Browse Code »

* 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/sage/ceph-client:
rbd: fix cleanup when trying to mount inexistent image
net/ceph: make ceph_msgr_wq non-reentrant
ceph: fsc->*_wq's aren't used in memory reclaim path
ceph: Always free allocated memory in osdmap_decode()
ceph: Makefile: Remove unnessary code
ceph: associate requests with opening sessions
ceph: drop redundant r_mds field
ceph: implement DIRLAYOUTHASH feature to get dir layout from MDS
ceph: add dir_layout to inode

Linus Torvalds
2011-01-14 02:25:24 +0800

13 Jan, 2011

3 commits

dc69e2e9f ceph: associate requests with opening sessions ... Browse Code »

Associate request with sessions that aren't yep open. This makes the
debugfs mdsc request list more informative.

Signed-off-by: Sage Weil

Sage Weil
2011-01-13 07:15:13 +0800
4af25fdda ceph: drop redundant r_mds field ... Browse Code »

The r_mds field is redundant, since we can find the same information at
r_session->s_mds, and when r_session is NULL then r_mds is meaningless.

Signed-off-by: Sage Weil

Sage Weil
2011-01-13 07:15:13 +0800
14303d20f ceph: implement DIRLAYOUTHASH feature to get dir layout from MDS ... Browse Code »

This implements the DIRLAYOUTHASH protocol feature, which passes the dir
layout over the wire from the MDS. This gives the client knowledge
of the correct hash function to use for mapping dentries among dir
fragments.

Note that if this feature is _not_ present on the client but is on the
MDS, the client may misdirect requests. This will result in a forward
and degrade performance. It may also result in inaccurate NFS filehandle
generation, which will prevent fh resolution when the inode is not present
in the client cache and the parent directories have been fragmented.

Signed-off-by: Sage Weil

Sage Weil
2011-01-13 07:15:13 +0800

07 Jan, 2011

1 commit

b7ab39f63 fs: dcache scale dentry refcount ... Browse Code »

Make d_count non-atomic and protect it with d_lock. This allows us to ensure a
0 refcount dentry remains 0 without dcache_lock. It is also fairly natural when
we start protecting many other dentry members with d_lock.

Signed-off-by: Nick Piggin

Nick Piggin
2011-01-07 14:50:21 +0800

02 Dec, 2010

1 commit

25933abdd ceph: Handle file locks in replies from the MDS. ... Browse Code »

Previously the kernel client incorrectly assumed everything was a directory.

Signed-off-by: Herb Shiu
Acked-by: Greg Farnum
Signed-off-by: Sage Weil

Herb Shiu
2010-12-02 06:22:27 +0800

20 Nov, 2010

1 commit

76db8ac45 Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/sage/ceph-client ... Browse Code »

* 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/sage/ceph-client:
ceph: fix readdir EOVERFLOW on 32-bit archs
ceph: fix frag offset for non-leftmost frags
ceph: fix dangling pointer
ceph: explicitly specify page alignment in network messages
ceph: make page alignment explicit in osd interface
ceph: fix comment, remove extraneous args
ceph: fix update of ctime from MDS
ceph: fix version check on racing inode updates
ceph: fix uid/gid on resent mds requests
ceph: fix rdcache_gen usage and invalidate
ceph: re-request max_size if cap auth changes
ceph: only let auth caps update max_size
ceph: fix open for write on clustered mds
ceph: fix bad pointer dereference in ceph_fill_trace
ceph: fix small seq message skipping
Revert "ceph: update issue_seq on cap grant"

Linus Torvalds
2010-11-20 07:32:22 +0800

18 Nov, 2010

1 commit

451a3c24b BKL: remove extraneous #include <smp_lock.h> ... Browse Code »

The big kernel lock has been removed from all these files at some point,
leaving only the #include.

Remove this too as a cleanup.

Signed-off-by: Arnd Bergmann
Signed-off-by: Linus Torvalds

Arnd Bergmann
2010-11-18 00:59:32 +0800

08 Nov, 2010

1 commit

cb4276cca ceph: fix uid/gid on resent mds requests ... Browse Code »

MDS requests can be rebuilt and resent in non-process context, but were
filling in uid/gid from current_fsuid/gid. Put that information in the
request struct on request setup.

This fixes incorrect (and root) uid/gid getting set for requests that
are forwarded between MDSs, usually due to metadata migrations.

Signed-off-by: Sage Weil

Sage Weil
2010-11-08 23:29:05 +0800

21 Oct, 2010

3 commits

496e59553 ceph: switch from BKL to lock_flocks() ... Browse Code »

Switch from using the BKL explicitly to the new lock_flocks() interface.
Eventually this will turn into a spinlock.

Signed-off-by: Sage Weil

Sage Weil
2010-10-21 06:38:18 +0800
fca4451ac ceph: preallocate flock state without locks held ... Browse Code »

When the lock_kernel() turns into lock_flocks() and a spinlock, we won't
be able to do allocations with the lock held. Preallocate space without
the lock, and retry if the lock state changes out from underneath us.

Signed-off-by: Greg Farnum
Signed-off-by: Sage Weil

Greg Farnum
2010-10-21 06:38:17 +0800
3d14c5d2b ceph: factor out libceph from Ceph file system ... Browse Code »

This factors out protocol and low-level storage parts of ceph into a
separate libceph module living in net/ceph and include/linux/ceph. This
is mostly a matter of moving files around. However, a few key pieces
of the interface change as well:

- ceph_client becomes ceph_fs_client and ceph_client, where the latter
captures the mon and osd clients, and the fs_client gets the mds client
and file system specific pieces.
- Mount option parsing and debugfs setup is correspondingly broken into
two pieces.
- The mon client gets a generic handler callback for otherwise unknown
messages (mds map, in this case).
- The basic supported/required feature bits can be expanded (and are by
ceph_fs_client).

No functional change, aside from some subtle error handling cases that got
cleaned up in the refactoring process.

Signed-off-by: Sage Weil

Yehuda Sadeh
2010-10-21 06:37:28 +0800

12 Sep, 2010

1 commit

3612abbd5 ceph: fix reconnect encoding for old servers ... Browse Code »

Fix the reconnect encoding to encode the cap record when the MDS does not
have the FLOCK capability (i.e., pre v0.22).

Signed-off-by: Sage Weil

Sage Weil
2010-09-12 01:52:47 +0800

27 Aug, 2010

1 commit

e072f8aa3 ceph: don't BUG on ENOMEM during mds reconnect ... Browse Code »

We are in a position to return an error; do that instead.

Signed-off-by: Sage Weil

Sage Weil
2010-08-27 00:26:37 +0800

23 Aug, 2010

2 commits

eb6bb1c5b ceph: direct requests in snapped namespace based on nonsnap parent ... Browse Code »

When making a request in the virtual snapdir or a snapped portion of the
namespace, we should choose the MDS based on the first nonsnap parent (and
its caps). If that is not the best place, we will get forward hints to
find the right MDS in the cluster. This fixes ESTALE errors when using
the .snap directory and namespace with multiple MDSs.

Signed-off-by: Sage Weil

Sage Weil
2010-08-23 06:16:48 +0800
f3c60c591 ceph: fix multiple mds session shutdown ... Browse Code »

The use of a completion when waiting for session shutdown during umount is
inappropriate, given the complexity of the condition. For multiple MDS's,
this resulted in the umount thread spinning, often preventing the session
close message from being processed in some cases.

Switch to a waitqueue and defined a condition helper. This cleans things
up nicely.

Signed-off-by: Sage Weil

Sage Weil
2010-08-23 06:04:43 +0800

04 Aug, 2010

1 commit

213c99ee0 ceph: whitespace cleanup ... Browse Code »

Signed-off-by: Sage Weil

Sage Weil
2010-08-04 01:25:11 +0800

03 Aug, 2010

2 commits

40819f6fb ceph: add flock/fcntl lock support ... Browse Code »

Implement flock inode operation to support advisory file locking. All
lock/unlock operations are synchronous with the MDS. Lock state is
sent when reconnecting to a recovering MDS to restore the shared lock
state.

Signed-off-by: Greg Farnum
Signed-off-by: Sage Weil

Greg Farnum
2010-08-03 07:10:53 +0800
20cb34ae9 ceph: support v2 reconnect encoding ... Browse Code »

Encode either old or v2 encoding of client_reconnect message, depending on
whether the peer has the FLOCK feature bit.

Signed-off-by: Sage Weil

Sage Weil
2010-08-03 06:48:50 +0800

02 Aug, 2010

2 commits

e55b71f80 ceph: handle ESTALE properly; on receipt send to authority if it wasn't ... Browse Code »

Signed-off-by: Greg Farnum
Signed-off-by: Sage Weil

Greg Farnum
2010-08-02 11:11:41 +0800
154f42c2c ceph: connect to export targets on cap export ... Browse Code »

When we get a cap EXPORT message, make sure we are connected to all export
targets to ensure we can handle the matching IMPORT.

Signed-off-by: Sage Weil

Sage Weil
2010-08-02 11:11:41 +0800