Eric Lee / smarc-fsl-linux-kernel

11 Jan, 2012

1 commit

56e925b67 libceph: remove useless return value for osd_client __send_request() ... Browse Code »

Signed-off-by: Sage Weil

Sage Weil
2012-01-11 00:57:03 +0800

12 Nov, 2011

1 commit

224736d91 libceph: Allocate larger oid buffer in request msgs ... Browse Code »

ceph_osd_request struct allocates a 40-byte buffer for object names.
RBD image names can be up to 96 chars long (100 with the .rbd suffix),
which results in the object name for the image being truncated, and a
subsequent map failure.

Increase the oid buffer in request messages, in order to avoid the
truncation.

Signed-off-by: Stratos Psomadakis
Signed-off-by: Sage Weil

Stratos Psomadakis
2011-11-12 01:50:19 +0800

26 Oct, 2011

2 commits

38d6453ca libceph: force resend of osd requests if we skip an osdmap ... Browse Code »

If we skip over one or more map epochs, we need to resend all osd requests
because it is possible they remapped to other servers and then back.

Signed-off-by: Sage Weil

Sage Weil
2011-10-26 07:10:17 +0800
b61c27636 libceph: don't complain on msgpool alloc failures ... Browse Code »

The pool allocation failures are masked by the pool; there is no need to
spam the console about them. (That's the whole point of having the pool
in the first place.)

Mark msg allocations whose failure is safely handled as such.

Signed-off-by: Sage Weil

Sage Weil
2011-10-26 07:10:15 +0800

17 Sep, 2011

1 commit

935b639a0 libceph: fix linger request requeuing ... Browse Code »

The r_req_lru_item list node moves between several lists, and that cycle
is not directly related (and does not begin) with __register_request().
Initialize it in the request constructor, not __register_request(). This
fixes later badness (below) when OSDs restart underneath an rbd mount.

Crashes we've seen due to this include:

[ 213.974288] kernel BUG at net/ceph/messenger.c:2193!

and

[ 144.035274] BUG: unable to handle kernel NULL pointer dereference at 0000000000000048
[ 144.035278] IP: [] con_work+0x1463/0x2ce0 [libceph]

Signed-off-by: Sage Weil

Sage Weil
2011-09-17 02:13:17 +0800

01 Sep, 2011

1 commit

aca420bc5 libceph: fix leak of osd structs during shutdown ... Browse Code »

We want to remove all OSDs, not just those on the idle LRU.

Signed-off-by: Sage Weil

Sage Weil
2011-09-01 06:22:46 +0800

27 Jul, 2011

1 commit

4cf9d5446 libceph: don't time out osd requests that haven't been received ... Browse Code »

Keep track of when an outgoing message is ACKed (i.e., the server fully
received it and, presumably, queued it for processing). Time out OSD
requests only if it's been too long since they've been received.

This prevents timeouts and connection thrashing when the OSDs are simply
busy and are throttling the requests they read off the network.

Reviewed-by: Yehuda Sadeh
Signed-off-by: Sage Weil

Sage Weil
2011-07-27 02:27:24 +0800

14 Jun, 2011

1 commit

9bb0ce2b0 libceph: fix page calculation for non-page-aligned io ... Browse Code »

Set the page count correctly for non-page-aligned IO. We were already
doing this correctly for alignment, but not the page count. Fixes
DIRECT_IO writes from unaligned pages.

Signed-off-by: Sage Weil

Sage Weil
2011-06-14 07:26:17 +0800

08 Jun, 2011

1 commit

258454723 ceph: fix sync vs canceled write ... Browse Code »

If we cancel a write, trigger the safe completions to prevent a sync from
blocking indefinitely in ceph_osdc_sync().

Signed-off-by: Sage Weil

Sage Weil
2011-06-08 12:34:13 +0800

25 May, 2011

1 commit

cd634fb6e libceph: subscribe to osdmap when cluster is full ... Browse Code »

When the cluster is marked full, subscribe to subsequent map updates to
ensure we find out promptly when it is no longer full. This will prevent
us from spewing ENOSPC for (much) longer than necessary.

Signed-off-by: Sage Weil

Sage Weil
2011-05-25 02:52:11 +0800

20 May, 2011

2 commits

9d6fcb081 ceph: check return value for start_request in writepages ... Browse Code »

Since we pass the nofail arg, we should never get an error; BUG if we do.
(And fix the function to not return an error if __map_request fails.)

Signed-off-by: Sage Weil

Sage Weil
2011-05-20 02:25:05 +0800
2dab036b8 libceph: use snprintf for formatting object name ... Browse Code »

Signed-off-by: Sage Weil

Sage Weil
2011-05-20 02:25:02 +0800

04 May, 2011

1 commit

4ad12621e libceph: fix ceph_osdc_alloc_request error checks ... Browse Code »

ceph_osdc_alloc_request returns NULL on failure.

Signed-off-by: Sage Weil

Sage Weil
2011-05-04 00:28:13 +0800

15 Apr, 2011

1 commit

e6d283183 Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/sage/ceph-client ... Browse Code »

* 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/sage/ceph-client:
libceph: fix linger request requeueing

Linus Torvalds
2011-04-15 10:02:55 +0800

08 Apr, 2011

1 commit

42933bac1 Merge branch 'for-linus2' of git://git.profusion.mobi/users/lucas/linux-2.6 ... Browse Code »

* 'for-linus2' of git://git.profusion.mobi/users/lucas/linux-2.6:
Fix common misspellings

Linus Torvalds
2011-04-08 02:14:49 +0800

07 Apr, 2011

1 commit

77f38e0ee libceph: fix linger request requeueing ... Browse Code »

Fix the request transition from linger -> normal request. The key is to
preserve r_osd and requeue on the same OSD. Reregister as a normal request,
add the request to the proper queues, then unregister the linger. Fix the
unregister helper to avoid clearing r_osd (and also simplify the parallel
check in __unregister_request()).

Reported-by: Henry Chang
Signed-off-by: Sage Weil

Sage Weil
2011-04-07 00:09:16 +0800

31 Mar, 2011

1 commit

25985edce Fix common misspellings ... Browse Code »

Fixes generated by 'codespell' and manually reviewed.

Signed-off-by: Lucas De Marchi

Lucas De Marchi
2011-03-31 22:26:23 +0800

30 Mar, 2011

1 commit

fbdb91904 libceph: fix null dereference when unregistering linger requests ... Browse Code »

We should only clear r_osd if we are neither registered as a linger or a
regular request. We may unregister as a linger while still registered as
a regular request (e.g., in reset_osd). Incorrectly clearing r_osd there
leads to a null pointer dereference in __send_request.

Also simplify the parallel check in __unregister_request() where we just
removed r_osd_item and know it's empty.

Signed-off-by: Sage Weil

Sage Weil
2011-03-30 03:11:06 +0800

29 Mar, 2011

1 commit

234af26ff ceph: unlock on error in ceph_osdc_start_request() ... Browse Code »

There was a missing unlock on the error path if __map_request() failed.

Signed-off-by: Dan Carpenter
Signed-off-by: Sage Weil

Dan Carpenter
2011-03-29 23:59:54 +0800

27 Mar, 2011

1 commit

6b0ae4097 ceph: fix possible NULL pointer dereference ... Browse Code »

This patch fixes 'event_work' dereference before it is checked for NULL.

Signed-off-by: Mariusz Kozlowski
Signed-off-by: Sage Weil

Mariusz Kozlowski
2011-03-27 04:41:20 +0800

23 Mar, 2011

1 commit

a40c4f10e libceph: add lingering request and watch/notify event framework ... Browse Code »

Lingering requests are requests that are sent to the OSD normally but
tracked also after we get a successful request. This keeps the OSD
connection open and resends the original request if the object moves to
another OSD. The OSD can then send notification messages back to us
if another client initiates a notify.

This framework will be used by RBD so that the client gets notification
when a snapshot is created by another node or tool.

Signed-off-by: Yehuda Sadeh
Signed-off-by: Sage Weil

Yehuda Sadeh
2011-03-23 02:33:55 +0800

22 Mar, 2011

1 commit

6f6c70067 libceph: fix osd request queuing on osdmap updates ... Browse Code »

If we send a request to osd A, and the request's pg remaps to osd B and
then back to A in quick succession, we need to resend the request to A. The
old code was only calling kick_requests after processing all incremental
maps in a message, so it was very possible to not resend a request that
needed to be resent. This would make the osd eventually time out (at least
with the current default of osd timeouts enabled).

The correct approach is to scan requests on every map incremental. This
patch refactors the kick code in a few ways:
- all requests are either on req_lru (in flight), req_unsent (ready to
send), or req_notarget (currently map to no up osd)
- mapping always done by map_request (previous map_osds)
- if the mapping changes, we requeue. requests are resent only after all
map incrementals are processed.
- some osd reset code is moved out of kick_requests into a separate
function
- the "kick this osd" functionality is moved to kick_osd_requests, as it
is unrelated to scanning for request->pg->osd mapping changes

Signed-off-by: Sage Weil

Sage Weil
2011-03-22 03:24:19 +0800

10 Nov, 2010

2 commits

c5c6b19d4 ceph: explicitly specify page alignment in network messages ... Browse Code »

The alignment used for reading data into or out of pages used to be taken
from the data_off field in the message header. This only worked as long
as the page alignment matched the object offset, breaking direct io to
non-page aligned offsets.

Instead, explicitly specify the page alignment next to the page vector
in the ceph_msg struct, and use that instead of the message header (which
probably shouldn't be trusted). The alloc_msg callback is responsible for
filling in this field properly when it sets up the page vector.

Signed-off-by: Sage Weil

Sage Weil
2010-11-10 04:43:17 +0800
b7495fc2f ceph: make page alignment explicit in osd interface ... Browse Code »

We used to infer alignment of IOs within a page based on the file offset,
which assumed they matched. This broke with direct IO that was not aligned
to pages (e.g., 512-byte aligned IO). We were also trusting the alignment
specified in the OSD reply, which could have been adjusted by the server.

Explicitly specify the page alignment when setting up OSD IO requests.

Signed-off-by: Sage Weil

Sage Weil
2010-11-10 04:43:12 +0800

21 Oct, 2010

1 commit

3d14c5d2b ceph: factor out libceph from Ceph file system ... Browse Code »

This factors out protocol and low-level storage parts of ceph into a
separate libceph module living in net/ceph and include/linux/ceph. This
is mostly a matter of moving files around. However, a few key pieces
of the interface change as well:

- ceph_client becomes ceph_fs_client and ceph_client, where the latter
captures the mon and osd clients, and the fs_client gets the mds client
and file system specific pieces.
- Mount option parsing and debugfs setup is correspondingly broken into
two pieces.
- The mon client gets a generic handler callback for otherwise unknown
messages (mds map, in this case).
- The basic supported/required feature bits can be expanded (and are by
ceph_fs_client).

No functional change, aside from some subtle error handling cases that got
cleaned up in the refactoring process.

Signed-off-by: Sage Weil

Yehuda Sadeh
2010-10-21 06:37:28 +0800