Doug / smarc-fsl-linux-kernel | Embedian Git Server

20 Jul, 2011

2 commits

2830ba7f3 ->permission() sanitizing: don't pass flags to generic_permission() ... Browse Code »

redundant; all callers get it duplicated in mask & MAY_NOT_BLOCK and none of
them removes that bit.

Signed-off-by: Al Viro

Al Viro
2011-07-20 13:43:22 +0800
178ea7352 kill check_acl callback of generic_permission() ... Browse Code »

its value depends only on inode and does not change; we might as
well store it in ->i_op->check_acl and be done with that.

Signed-off-by: Al Viro

Al Viro
2011-07-20 13:43:16 +0800

17 Jul, 2011

1 commit

1b71fe2ef ceph analog of cifs build_path_from_dentry() race fix ... Browse Code »

... unfortunately, cifs bug got copied. Fix is essentially the same.

Signed-off-by: Al Viro

Al Viro
2011-07-17 11:43:58 +0800

14 Jun, 2011

2 commits

d7f124f12 ceph: fix sync and dio writes across stripe boundaries ... Browse Code »

We were iterating across stripe boundaries properly, but not moving the
write buffer pointer forward. This caused us to rewrite the same data
after the break. Fix by adjusting the data pointer forward, and
recalculating the io and buffer alignment after the break.

Signed-off-by: Sage Weil

Sage Weil
2011-06-14 07:26:22 +0800
773e9b442 ceph: fix page alignment corrections ... Browse Code »

dd if=/dev/urandom of=/mnt/fs_depot/dd10 bs=500 seek=8388 count=1
dd if=/mnt/fs_depot/dd10 of=/root/dd10out bs=500 skip=8388 count=1

Reported-by: Henry C Chang
Signed-off-by: Sage Weil

Sage Weil
2011-06-14 07:26:10 +0800

08 Jun, 2011

4 commits

0c1f91f27 ceph: unwind canceled flock state ... Browse Code »

If we request a lock and then abort (e.g., ^C), we need to send a matching
unlock request to the MDS to unwind our lock attempt to avoid indefinitely
blocking other clients.

Reported-by: Brian Chrisman
Signed-off-by: Sage Weil

Sage Weil
2011-06-08 12:36:45 +0800
0e98728fa ceph: fix ENOENT logic in striped_read ... Browse Code »

Getting ENOENT is equivalent to reading 0 bytes. Make that correction
before setting up the hit_stripe and was_short flags.

Fixes the following case:
dd if=/dev/zero of=/mnt/fs_depot/dd3 bs=1 seek=1048576 count=0
dd if=/mnt/fs_depot/dd3 of=/root/ddout1 skip=8 bs=500 count=2 iflag=direct

Reported-by: Henry C Chang
Signed-off-by: Sage Weil

Sage Weil
2011-06-08 12:34:16 +0800
c3cd62839 ceph: fix short sync reads from the OSD ... Browse Code »

If we get a short read from the OSD because the object is small, we need to
zero the remainder of the buffer. For O_DIRECT reads, the attempted range
is not trimmed to i_size by the VFS, so we were actually looping
indefinitely.

Fix by trimming by i_size, and the unconditionally zeroing the trailing
range.

Reported-by: Jeff Wu
Signed-off-by: Sage Weil

Sage Weil
2011-06-08 12:34:14 +0800
70b666c3b ceph: use ihold when we already have an inode ref ... Browse Code »

We should use ihold whenever we already have a stable inode ref, even
when we aren't holding i_lock. This avoids adding new and unnecessary
locking dependencies.

Signed-off-by: Sage Weil

Sage Weil
2011-06-08 12:34:11 +0800

25 May, 2011

3 commits

db3540522 ceph: fix cap flush race reentrancy ... Browse Code »

In e9964c10 we change cap flushing to do a delicate dance because some
inodes on the cap_dirty list could be in a migrating state (got EXPORT but
not IMPORT) in which we couldn't actually flush and move from
dirty->flushing, breaking the while (!empty) { process first } loop
structure. It worked for a single sync thread, but was not reentrant and
triggered infinite loops when multiple syncers came along.

Instead, move inodes with dirty to a separate cap_dirty_migrating list
when in the limbo export-but-no-import state, allowing us to go back to
the simple loop structure (which was reentrant). This is cleaner and more
robust.

Audited the cap_dirty users and this looks fine:
list_empty(&ci->i_dirty_item) is still a reliable indicator of whether we
have dirty caps (which list we're on is irrelevant) and list_del_init()
calls still do the right thing.

Signed-off-by: Sage Weil

Sage Weil
2011-05-25 02:52:12 +0800
45e3d3eeb ceph: avoid inode lookup on nfs fh reconnect ... Browse Code »

If we get the inode from the MDS, we have a reference in req; don't do a
fresh lookup.

Signed-off-by: Sage Weil

Sage Weil
2011-05-25 02:52:06 +0800
3c454cf21 ceph: use LOOKUPINO to make unconnected nfs fh more reliable ... Browse Code »

If we are unable to locate an inode by ino, ask the MDS using the new
LOOKUPINO command.

Signed-off-by: Sage Weil

Sage Weil
2011-05-25 02:52:05 +0800

20 May, 2011

7 commits

9d6fcb081 ceph: check return value for start_request in writepages ... Browse Code »

Since we pass the nofail arg, we should never get an error; BUG if we do.
(And fix the function to not return an error if __map_request fails.)

Signed-off-by: Sage Weil

Sage Weil
2011-05-20 02:25:05 +0800
6b4a3b517 ceph: remove useless check ... Browse Code »

rc is only ever 0 or negative in this method.

Signed-off-by: Sage Weil

Sage Weil
2011-05-20 02:25:05 +0800
da39822c6 ceph: fix broken comparison in readdir loop ... Browse Code »

Both off and fi->offset are unsigned, so the difference is always >= 0.
Compare them directly instead of the sign of the difference.

Signed-off-by: Sage Weil

Sage Weil
2011-05-20 02:25:04 +0800
3540303f8 ceph: fix rare potential cap leak ... Browse Code »

If we grab new_cap, retake the lock, and find we already have a cap now
for the given mds, release new_cap.

Signed-off-by: Sage Weil

Sage Weil
2011-05-20 02:25:03 +0800
ae5980830 ceph: use snprintf for dirstat content ... Browse Code »

We allocate a buffer for rstats if the dirstat option is enabled. Use
snprintf.

Signed-off-by: Sage Weil

Sage Weil
2011-05-20 02:25:02 +0800
1b3669857 libceph: remove unused variable ... Browse Code »

Signed-off-by: Sage Weil

Sage Weil
2011-05-20 02:24:17 +0800
3b6637803 ceph: take reference on mds request r_unsafe_dir ... Browse Code »

We put ourselves on an inode list for the parent directory of metadata
operations so that an fsync on the directory will wait for metadata updates
to commit to disk. We weren't holding a reference to that directory,
however, and under certain workloads (fsstress in this case) the directory
can go away.

Signed-off-by: Sage Weil

Sage Weil
2011-05-20 02:20:07 +0800

12 May, 2011

3 commits

d3d0720d4 ceph: do not use i_wrbuffer_ref as refcount for Fb cap ... Browse Code »

We increments i_wrbuffer_ref when taking the Fb cap. This breaks
the dirty page accounting and causes looping in
__ceph_do_pending_vmtruncate, and ceph client hangs.

This bug can be reproduced occasionally by running blogbench.

Add a new field i_wb_ref to inode and dedicate it to Fb reference
counting.

Signed-off-by: Henry C Chang
Signed-off-by: Sage Weil

Henry C Chang
2011-05-12 01:44:48 +0800
a26a185d2 ceph: fix list_add in ceph_put_snap_realm ... Browse Code »

Signed-off-by: Henry C Chang
Signed-off-by: Sage Weil

Henry C Chang
2011-05-12 01:44:36 +0800
7d8e18a69 ceph: print debug message before put mds session ... Browse Code »

The mds session, s, could be freed during ceph_put_mds_session.
Move dout before ceph_put_mds_session.

Signed-off-by: Henry C Chang
Signed-off-by: Sage Weil

Henry C Chang
2011-05-12 01:44:34 +0800

05 May, 2011

1 commit

fca65b4ad ceph: do not call __mark_dirty_inode under i_lock ... Browse Code »

The __mark_dirty_inode helper now takes i_lock as of 250df6ed. Fix the
one ceph callers that held i_lock (__ceph_mark_dirty_caps) to return the
flags value so that the callers can do it outside of i_lock.

Signed-off-by: Sage Weil

Sage Weil
2011-05-05 03:56:45 +0800

04 May, 2011

2 commits

8c71897be ceph: handle ceph_osdc_new_request failure in ceph_writepages_start ... Browse Code »

We should unlock the page and return -ENOMEM if ceph_osdc_new_request
failed.

Signed-off-by: Henry C Chang
Signed-off-by: Sage Weil

Henry C Chang
2011-05-04 00:28:12 +0800
3772d26d8 ceph: use ihold() when i_lock is held ... Browse Code »

See 0444d76ae64fffc7851797fc1b6ebdbb44ac504a.

Signed-off-by: Sage Weil

Sage Weil
2011-05-04 00:28:08 +0800

08 Apr, 2011

1 commit

42933bac1 Merge branch 'for-linus2' of git://git.profusion.mobi/users/lucas/linux-2.6 ... Browse Code »

* 'for-linus2' of git://git.profusion.mobi/users/lucas/linux-2.6:
Fix common misspellings

Linus Torvalds
2011-04-08 02:14:49 +0800

31 Mar, 2011

2 commits

25985edce Fix common misspellings ... Browse Code »

Fixes generated by 'codespell' and manually reviewed.

Signed-off-by: Lucas De Marchi

Lucas De Marchi
2011-03-31 22:26:23 +0800
50f351582 Merge git://git.kernel.org/pub/scm/linux/kernel/git/sage/ceph-client ... Browse Code »

* git://git.kernel.org/pub/scm/linux/kernel/git/sage/ceph-client:
libceph: Create a new key type "ceph".
libceph: Get secret from the kernel keys api when mounting with key=NAME.
ceph: Move secret key parsing earlier.
libceph: fix null dereference when unregistering linger requests
ceph: unlock on error in ceph_osdc_start_request()
ceph: fix possible NULL pointer dereference
ceph: flush msgr_wq during mds_client shutdown

Linus Torvalds
2011-03-31 00:46:09 +0800

30 Mar, 2011

1 commit

8323c3aa7 ceph: Move secret key parsing earlier. ... Browse Code »

This makes the base64 logic be contained in mount option parsing,
and prepares us for replacing the homebew key management with the
kernel key retention service.

Signed-off-by: Tommi Virtanen
Signed-off-by: Sage Weil

Tommi Virtanen
2011-03-30 03:11:16 +0800

29 Mar, 2011

1 commit

0444d76ae fs: don't use igrab() while holding i_lock ... Browse Code »

Fix the incorrect use of igrab() inside the i_lock in NFS and Ceph‥

If we are already holding the i_lock, we have a reference to the
inode so we can safely use ihold() to gain an extra reference. This
avoids hangs due to lock recursion on the i_lock now that the
inode_lock is gone and igrab() uses the i_lock itself.

Signed-off-by: Dave Chinner
Cc: Al Viro
Cc: linux-fsdevel@vger.kernel.org
Cc: Ryan Mallon
Signed-off-by: Linus Torvalds

Dave Chinner
2011-03-29 22:50:34 +0800

26 Mar, 2011

1 commit

ef550f6f4 ceph: flush msgr_wq during mds_client shutdown ... Browse Code »

The release method for mds connections uses a backpointer to the
mds_client, so we need to flush the workqueue of any pending work (and
ceph_connection references) prior to freeing the mds_client. This fixes
an oops easily triggered under UML by

while true ; do mount ... ; umount ... ; done

Also fix an outdated comment: the flush in ceph_destroy_client only flushes
OSD connections out. This bug is basically an artifact of the ceph ->
ceph+libceph conversion.

Signed-off-by: Sage Weil

Sage Weil
2011-03-26 04:27:48 +0800

22 Mar, 2011

6 commits

147851d2d ceph: rename dentry_release -> d_release, fix comment ... Browse Code »

Just for consistency's sake. Fix obsolete comment too.

Signed-off-by: Sage Weil

Sage Weil
2011-03-22 03:24:26 +0800
49bcb9323 ceph: add request to the tail of unsafe write list ... Browse Code »

In sync_write_wait(), we assume that the newest request is at the
tail of unsafe write list. We should maintain the semantics here.

Signed-off-by: Henry C Chang
Signed-off-by: Sage Weil

Henry C Chang
2011-03-22 03:24:25 +0800
78a255654 ceph: remove request from unsafe list if it is canceled/timed out ... Browse Code »

This fixes the list corruption warning like this:

------------[ cut here ]------------
WARNING: at lib/list_debug.c:30 __list_add+0x68/0x81()
Hardware name: X8DTU
list_add corruption. prev->next should be next (ffff880618931250), but was (null). (prev=ffff880c188b9130).
Modules linked in: nfsd lockd nfs_acl auth_rpcgss exportfs ceph libceph libcrc32c sunrpc ipv6 fuse igb i2c_i801 ioatdma i2c_core iTCO_wdt iTCO_vendor_support joydev dca serio_raw usb_storage [last unloaded: scsi_wait_scan]
Pid: 10977, comm: smbd Tainted: G W 2.6.32.23-170.Elaster.xendom0.fc12.x86_64 #1
Call Trace:
[] warn_slowpath_common+0x7c/0x94
[] warn_slowpath_fmt+0x41/0x43
[] __list_add+0x68/0x81
[] ceph_aio_write+0x614/0x8a2 [ceph]
[] do_sync_write+0xe8/0x125
[] ? autoremove_wake_function+0x0/0x39
[] ? selinux_file_permission+0x5c/0xb3
[] ? security_file_permission+0x16/0x18
[] vfs_write+0xae/0x10b
[] sys_pwrite64+0x5a/0x76
[] system_call_fastpath+0x16/0x1b
---[ end trace 08573eb9f07ff6f4 ]---

Signed-off-by: Henry C Chang
Signed-off-by: Sage Weil

Henry C Chang
2011-03-22 03:24:24 +0800
80456f867 ceph: move readahead default to fs/ceph from libceph ... Browse Code »

Signed-off-by: Sage Weil

Sage Weil
2011-03-22 03:24:23 +0800
ad1fee96c ceph: add ino32 mount option ... Browse Code »

The ino32 mount option forces the ceph fs to report 32 bit
ino values. This is useful for 64 bit kernels with 32 bit userspace.

Signed-off-by: Yehuda Sadeh

Yehuda Sadeh
2011-03-22 03:24:22 +0800
21f3b5f1b ceph: remove debugfs debug cruft ... Browse Code »

Whoops!

Signed-off-by: Sage Weil

Sage Weil
2011-03-22 03:24:20 +0800

16 Mar, 2011

1 commit

09adc80c6 ceph: preserve I_COMPLETE across rename ... Browse Code »

d_move puts the renamed dentry at the end of d_subdirs, screwing with our
cached dentry directory offsets. We were just clearing I_COMPLETE to avoid
any possibility of trouble. However, assigning the renamed dentry an
offset at the end of the directory (to match it's new d_subdirs position)
is sufficient to maintain correct behavior and hold onto I_COMPLETE.

This is especially important for workloads like rsync, which renames files
into place. Before, we would lose I_COMPLETE and do MDS lookups for each
file. With this patch we only talk to the MDS on create and rename.

Signed-off-by: Sage Weil

Sage Weil
2011-03-16 00:14:03 +0800

10 Mar, 2011

1 commit

0eb980e31 ceph: fix d_revalidate oopsen on NFS exports ... Browse Code »

can't blindly check nd->flags in ->d_revalidate()

Signed-off-by: Al Viro

Al Viro
2011-03-10 16:44:05 +0800

05 Mar, 2011

1 commit

455cec0ab ceph: no .snap inside of snapped namespace ... Browse Code »

Otherwise you can do things like

# mkdir .snap/foo
# cd .snap/foo/.snap
# ls

Signed-off-by: Sage Weil

Sage Weil
2011-03-05 04:25:09 +0800