Eric Lee / smarc-fsl-linux-kernel

01 Nov, 2011

1 commit

bc3b2d7fb net: Add export.h for EXPORT_SYMBOL/THIS_MODULE to non-modules

These files are non-modular, but need to export symbols using
the macros now living in export.h -- call out the include so
that things won't break when we remove the implicit presence
of module.h from everywhere.

Signed-off-by: Paul Gortmaker

Paul Gortmaker
2011-11-01 07:30:30 +0800

26 Oct, 2011

3 commits

ee3b56f26 ceph: use kernel DNS resolver

Change ceph_parse_ips to take either names given as
IP addresses or standard hostnames (e.g. localhost).
The DNS lookup is done using the dns_resolver facility
similar to its use in AFS, NFS, and CIFS.

This patch defines CONFIG_CEPH_LIB_USE_DNS_RESOLVER
that controls if this feature is on or off.
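
The distinction between an address literal and a hostname can be sketched in user space as follows. This is an illustration only, not the kernel code: `is_ip_literal` is a hypothetical helper, and it relies on the POSIX `inet_pton` call rather than the kernel's own parsers. A string that fails both checks would be the one handed to a resolver.

```c
#include <arpa/inet.h>
#include <assert.h>
#include <netinet/in.h>
#include <stdbool.h>
#include <stddef.h>
#include <sys/socket.h>

/* Hypothetical helper: true if s parses as an IPv4 or IPv6 literal. */
static bool is_ip_literal(const char *s)
{
	struct in_addr v4;
	struct in6_addr v6;

	assert(s != NULL);
	return inet_pton(AF_INET, s, &v4) == 1 ||
	       inet_pton(AF_INET6, s, &v6) == 1;
}
```

With this helper, `"10.0.0.1"` and `"::1"` count as literals, while `"localhost"` does not and would need a lookup.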

Signed-off-by: Noah Watkins
Signed-off-by: Sage Weil

Noah Watkins
2011-10-26 07:10:16 +0800

f0ed1b7ce libceph: warn on msg allocation failures

Any non-masked msg allocation failure should generate a warning and stack
trace to the console. All of these need to eventually be replaced by
safe preallocation or msgpools.

Signed-off-by: Sage Weil

Sage Weil
2011-10-26 07:10:16 +0800

b61c27636 libceph: don't complain on msgpool alloc failures

The pool allocation failures are masked by the pool; there is no need to
spam the console about them. (That's the whole point of having the pool
in the first place.)

Mark msg allocations whose failure is safely handled as such.

Signed-off-by: Sage Weil

Sage Weil
2011-10-26 07:10:15 +0800

17 Sep, 2011

1 commit

c0d5f9db1 libceph: initialize ack_stamp to avoid unnecessary connection reset

Commit 4cf9d544631c recorded when an outgoing ceph message was ACKed,
in order to avoid unnecessary connection resets when an OSD is busy.

However, ack_stamp is uninitialized, so there is a window between
when the message is sent and when it is ACKed in which handle_timeout()
interprets the uninitialized value as an expired timeout, and resets
the connection unnecessarily.

Close the window by initializing ack_stamp.
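
A minimal user-space model of the window described above, under stated assumptions: `msg_model`, `msg_model_init`, and `request_timed_out` are hypothetical names, and treating `ack_stamp == 0` as "sent but not yet ACKed" is an assumption of this sketch rather than something the commit message states.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Assumption: ack_stamp == 0 means "sent but not yet ACKed". */
struct msg_model {
	unsigned long ack_stamp; /* tick at which the ACK arrived, or 0 */
};

static void msg_model_init(struct msg_model *m)
{
	assert(m != NULL);
	m->ack_stamp = 0; /* the fix: never leave this field uninitialized */
}

static bool request_timed_out(const struct msg_model *m,
			      unsigned long now, unsigned long timeout)
{
	assert(m != NULL);
	if (m->ack_stamp == 0)
		return false; /* not ACKed yet: the clock has not started */
	/* signed difference tolerates tick-counter wraparound */
	return (long)(now - m->ack_stamp) > (long)timeout;
}
```

Without the initialization, `ack_stamp` could hold an arbitrary value that makes the comparison report an expired timeout before any ACK has arrived.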

Signed-off-by: Jim Schutt
Signed-off-by: Sage Weil

Jim Schutt
2011-09-17 00:16:22 +0800

27 Jul, 2011

1 commit

4cf9d5446 libceph: don't time out osd requests that haven't been received

Keep track of when an outgoing message is ACKed (i.e., the server fully
received it and, presumably, queued it for processing). Time out OSD
requests only if it's been too long since they've been received.

This prevents timeouts and connection thrashing when the OSDs are simply
busy and are throttling the requests they read off the network.

Reviewed-by: Yehuda Sadeh
Signed-off-by: Sage Weil

Sage Weil
2011-07-27 02:27:24 +0800

20 May, 2011

5 commits

a2a79609c libceph: add missing breaks in addr_set_port

Signed-off-by: Sage Weil

Sage Weil
2011-05-20 02:25:05 +0800

041778822 libceph: fix TAG_WAIT case

If we get a WAIT as a client something went wrong; error out. And don't
fall through to an unrelated case.

Signed-off-by: Sage Weil

Sage Weil
2011-05-20 02:25:04 +0800

12a2f643b libceph: use snprintf for unknown addrs

Signed-off-by: Sage Weil

Sage Weil
2011-05-20 02:25:03 +0800

e8f54ce16 libceph: fix uninitialized value when no get_authorizer method is set

If there is no get_authorizer method we set the out_kvec to a bogus
pointer. The length is also zero in that case, so it doesn't much matter,
but it's better not to add the empty item in the first place.

Signed-off-by: Sage Weil

Sage Weil
2011-05-20 02:25:02 +0800

0da5d7036 libceph: handle connection reopen race with callbacks

If a connection is closed and/or reopened (ceph_con_close, ceph_con_open)
it can race with a callback. con_work does various state checks for
closed or reopened sockets at the beginning, but drops con->mutex before
making callbacks. We need to check for state bit changes after retaking
the lock to ensure we restart con_work and execute those CLOSED/OPENING
tests or else we may end up operating under stale assumptions.

In Jim's case, this was causing 'bad tag' errors.

There are four cases where we re-take the con->mutex inside con_work: catch
them all and return EAGAIN from try_{read,write} so that we can restart
con_work.
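
The recheck-and-restart pattern can be modelled single-threaded as below. This is a sketch, not the messenger code: all names (`conn_model`, `reopen_cb`, `try_read_model`, `con_work_model`) are hypothetical, the mutex is represented only by comments, and the callback simulates a racing reopen exactly once.

```c
#include <assert.h>
#include <errno.h>
#include <stdbool.h>
#include <stddef.h>

struct conn_model {
	bool opening;        /* set when the connection is reopened */
	bool cb_will_reopen; /* test hook: the callback reopens once */
	int restarts;        /* times the top-of-loop check fired */
	int processed;       /* messages handled under valid state */
};

/* Stand-in for a callback that races with a reopen. */
static void reopen_cb(struct conn_model *con)
{
	if (con->cb_will_reopen) {
		con->cb_will_reopen = false;
		con->opening = true;
	}
}

static int try_read_model(struct conn_model *con)
{
	/* con->mutex would be dropped here */
	reopen_cb(con);
	/* con->mutex retaken: recheck state before continuing */
	if (con->opening)
		return -EAGAIN;
	con->processed++;
	return 0;
}

static void con_work_model(struct conn_model *con)
{
	assert(con != NULL);
	for (;;) {
		if (con->opening) { /* state checks at the top */
			con->opening = false;
			con->restarts++;
		}
		if (try_read_model(con) != -EAGAIN)
			break;
	}
}
```

Returning `-EAGAIN` sends control back through the state checks at the top of the loop, so the work after the callback never runs against a connection that was reopened in the meantime.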

Reported-by: Jim Schutt
Tested-by: Jim Schutt
Signed-off-by: Sage Weil

Sage Weil
2011-05-20 02:21:05 +0800

04 May, 2011

1 commit

ca20892db libceph: fix ceph_msg_new error path

If memory allocation fails, calling ceph_msg_put() will cause a GPF
since some of the ceph_msg fields are not yet initialized.

Fix Bug #970.

Signed-off-by: Henry C Chang
Signed-off-by: Sage Weil

Henry C Chang
2011-05-04 00:28:11 +0800

05 Mar, 2011

3 commits

e00de341f libceph: fix msgr standby handling

The standby logic used to be pretty dependent on the work requeueing
behavior that changed when we switched to WQ_NON_REENTRANT. It was also
very fragile.

Restructure things so that:
- We clear WRITE_PENDING when we set STANDBY. This ensures we will
requeue work when we wake up later.
- con_work backs off if STANDBY is set. There is nothing to do if we are
in standby.
- clear_standby() helper is called by both con_send() and con_keepalive(),
the two actions that can wake us up again. Move the connect_seq++
logic here.
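
The three rules above can be sketched as a small state model. Everything here is illustrative: the struct, the flag values, and the function names other than `clear_standby` are hypothetical, and "queueing work" is reduced to a counter.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

#define WRITE_PENDING (1u << 0)
#define STANDBY       (1u << 1)

struct standby_model {
	unsigned int state;
	unsigned int connect_seq;
	int queued; /* how many times work was queued */
};

static void enter_standby(struct standby_model *c)
{
	c->state &= ~WRITE_PENDING; /* so a later send requeues work */
	c->state |= STANDBY;
}

static void clear_standby(struct standby_model *c)
{
	if (c->state & STANDBY) {
		c->state &= ~STANDBY;
		c->connect_seq++; /* waking up starts a new connect attempt */
	}
}

static void con_send_model(struct standby_model *c)
{
	assert(c != NULL);
	clear_standby(c);
	if (!(c->state & WRITE_PENDING)) {
		c->state |= WRITE_PENDING;
		c->queued++;
	}
}

/* con_work backs off entirely while STANDBY is set. */
static bool con_work_runs(const struct standby_model *c)
{
	return !(c->state & STANDBY);
}
```

Because entering standby clears `WRITE_PENDING`, the next send in this model always queues work again instead of assuming work is already pending.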

Signed-off-by: Sage Weil

Sage Weil
2011-03-05 04:25:05 +0800

e76661d0a libceph: fix msgr keepalive flag

There was some broken keepalive code using a dead variable. Shift to using
the proper bit flag.

Signed-off-by: Sage Weil

Sage Weil
2011-03-05 04:24:31 +0800

60bf8bf88 libceph: fix msgr backoff

With commit f363e45f we replaced a bunch of hacky workqueue mutual
exclusion logic with the WQ_NON_REENTRANT flag. One piece of fallout is
that the exponential backoff breaks in certain cases:

* con_work attempts to connect.
* we get an immediate failure, and the socket state change handler queues
immediate work.
* con_work calls con_fault, we decide to back off, but can't queue delayed
work.

In this case, we add a BACKOFF bit to make con_work reschedule delayed work
next time it runs (which should be immediately).
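
The sequence above can be modelled in user space as follows. This is a sketch under assumptions: the names, the delay constants, and the doubling policy are hypothetical, and the workqueue is reduced to a single "already pending" flag that refuses a second queue request.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

#define BACKOFF    (1u << 0)
#define BASE_DELAY 1ul
#define MAX_DELAY  64ul

struct backoff_model {
	unsigned int flags;
	unsigned long delay;        /* current backoff interval */
	bool work_queued;           /* a work item is already pending */
	unsigned long queued_delay; /* delay of the pending item */
};

/* Like a workqueue: refuses to queue work that is already pending. */
static bool queue_work_model(struct backoff_model *c, unsigned long delay)
{
	if (c->work_queued)
		return false;
	c->work_queued = true;
	c->queued_delay = delay;
	return true;
}

static void con_fault_model(struct backoff_model *c)
{
	assert(c != NULL);
	if (c->delay == 0)
		c->delay = BASE_DELAY;
	else if (c->delay < MAX_DELAY)
		c->delay *= 2;
	if (!queue_work_model(c, c->delay))
		c->flags |= BACKOFF; /* remember to back off on the next run */
}

/* Returns true if con_work should go on to do real work. */
static bool con_work_begin(struct backoff_model *c)
{
	c->work_queued = false; /* this run consumes the pending item */
	if (c->flags & BACKOFF) {
		c->flags &= ~BACKOFF;
		queue_work_model(c, c->delay);
		return false;
	}
	return true;
}
```

When immediate work is already pending, the fault handler cannot queue the delayed retry, so it sets `BACKOFF`; the next run then reschedules itself with the delay instead of reconnecting at once.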

Signed-off-by: Sage Weil

Sage Weil
2011-03-05 04:24:28 +0800

04 Mar, 2011

1 commit

692d20f57 libceph: retry after authorization failure

If we mark the connection CLOSED we will give up trying to reconnect to
this server instance. That is appropriate for things like a protocol
version mismatch that won't change until the server is restarted, at which
point we'll get a new addr and reconnect. An authorization failure like
this is probably due to the server not properly rotating its secret keys,
however, and should be treated as transient so that the normal backoff and
retry behavior kicks in.

Signed-off-by: Sage Weil

Sage Weil
2011-03-04 05:47:40 +0800

26 Jan, 2011

2 commits

42961d233 libceph: fix socket write error handling

Pass errors from writing to the socket up the stack. If we get -EAGAIN,
return 0 from the helper to simplify the callers' checks.

Signed-off-by: Sage Weil

Sage Weil
2011-01-26 00:19:34 +0800

98bdb0aa0 libceph: fix socket read error handling

If we get EAGAIN when trying to read from the socket, it is not an error.
Return 0 from the helper in this case to simplify the error handling cases
in the caller (indirectly, try_read).

Fix try_read to pass any error to its caller (con_work) instead of almost
always returning 0. This lets us respond to things like socket
disconnects.
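
The same convention can be shown with a user-space socket. This is a sketch, not the kernel helper: `recv_nowait` is a hypothetical name, and it uses the `recv` call with `MSG_DONTWAIT` (available on Linux and the BSDs) in place of the in-kernel receive path.

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>
#include <sys/socket.h>
#include <sys/types.h>

/*
 * Hypothetical helper: returns the number of bytes read, 0 if nothing
 * is available yet, or a negative errno value on a real error.
 */
static int recv_nowait(int fd, void *buf, size_t len)
{
	ssize_t r;

	assert(buf != NULL);
	r = recv(fd, buf, len, MSG_DONTWAIT);
	if (r < 0) {
		if (errno == EAGAIN || errno == EWOULDBLOCK)
			return 0;  /* no data yet: not an error */
		return -errno;     /* real errors go up the stack */
	}
	return (int)r;
}
```

A caller then only has to test for a negative return. Note that in this user-space version a return of 0 can also mean the peer closed the connection, which a fuller implementation would need to distinguish.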

Signed-off-by: Sage Weil

Sage Weil
2011-01-26 00:17:48 +0800

13 Jan, 2011

1 commit

f363e45fd net/ceph: make ceph_msgr_wq non-reentrant

The ceph messenger code does a rather complex dance around the multithreaded
workqueue to make sure the same work item isn't executed concurrently
on different CPUs. This restriction can be provided by a workqueue with
WQ_NON_REENTRANT.

Make ceph_msgr_wq a non-reentrant workqueue with the default concurrency
level and remove the QUEUED/BUSY logic.

* This removes backoff handling in con_work() but it couldn't reliably
block execution of con_work() to begin with - queue_con() can be
called after the work started but before BUSY is set. It seems that
it was an optimization for a rather cold path and can be safely
removed.

* The number of concurrent work items is bounded by the number of
connections and connections are independent from each other. With
the default concurrency level, different connections will be
executed independently.

Signed-off-by: Tejun Heo
Cc: Sage Weil
Cc: ceph-devel@vger.kernel.org
Signed-off-by: Sage Weil

Tejun Heo
2011-01-13 07:15:14 +0800

14 Dec, 2010

1 commit

d96c9043d ceph: fix msgr_init error path

create_workqueue() returns NULL on failure.

Signed-off-by: Sage Weil

Sage Weil
2010-12-14 12:30:28 +0800

10 Nov, 2010

1 commit

c5c6b19d4 ceph: explicitly specify page alignment in network messages

The alignment used for reading data into or out of pages used to be taken
from the data_off field in the message header. This only worked as long
as the page alignment matched the object offset, breaking direct io to
non-page aligned offsets.

Instead, explicitly specify the page alignment next to the page vector
in the ceph_msg struct, and use that instead of the message header (which
probably shouldn't be trusted). The alloc_msg callback is responsible for
filling in this field properly when it sets up the page vector.
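
To illustrate why the alignment matters when data is laid out in a page vector, here is a small arithmetic sketch. The helper names and the 4096-byte page size are assumptions of this example, not taken from the patch.

```c
#include <assert.h>
#include <stddef.h>

#define MODEL_PAGE_SIZE 4096u

/*
 * Number of pages touched by len bytes that start at byte offset
 * `alignment` within the first page.
 */
static unsigned int pages_spanned(unsigned int alignment, size_t len)
{
	assert(alignment < MODEL_PAGE_SIZE);
	return (unsigned int)((alignment + len + MODEL_PAGE_SIZE - 1) /
			      MODEL_PAGE_SIZE);
}

/* Bytes that land in the first page of the vector. */
static size_t first_page_bytes(unsigned int alignment, size_t len)
{
	size_t room = MODEL_PAGE_SIZE - alignment;

	return len < room ? len : room;
}
```

For example, 4096 bytes at alignment 0 fit in one page, but the same 4096 bytes at alignment 512 span two pages, with only 3584 bytes in the first. Taking the alignment from the wrong place therefore puts data at the wrong position in the page vector.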

Signed-off-by: Sage Weil

Sage Weil
2010-11-10 04:43:17 +0800

02 Nov, 2010

1 commit

df9f86faf ceph: fix small seq message skipping

If the client gets out of sync with the server message sequence number, we
normally skip low seq messages (ones we already received). The skip code
was also incrementing the expected seq, such that all subsequent messages
also appeared old and got skipped, eventually causing a timeout on the osd
connection. This resulted in some lagging requests and console messages
like

[233480.882885] ceph: skipping osd22 10.138.138.13:6804 seq 2016, expected 2017
[233480.882919] ceph: skipping osd22 10.138.138.13:6804 seq 2017, expected 2018
[233480.882963] ceph: skipping osd22 10.138.138.13:6804 seq 2018, expected 2019
[233480.883488] ceph: skipping osd22 10.138.138.13:6804 seq 2019, expected 2020
[233485.219558] ceph: skipping osd22 10.138.138.13:6804 seq 2020, expected 2021
[233485.906595] ceph: skipping osd22 10.138.138.13:6804 seq 2021, expected 2022
[233490.379536] ceph: skipping osd22 10.138.138.13:6804 seq 2022, expected 2023
[233495.523260] ceph: skipping osd22 10.138.138.13:6804 seq 2023, expected 2024
[233495.923194] ceph: skipping osd22 10.138.138.13:6804 seq 2024, expected 2025
[233500.534614] ceph: tid 6023602 timed out on osd22, will reset osd
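
The intended behaviour, skipping an old message without touching the expected sequence number, can be sketched as below. The names are hypothetical, and this simplified model also accepts gaps in the sequence, which the real messenger may treat differently.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

struct seq_model {
	uint64_t in_seq; /* seq of the last message accepted */
};

/* Returns true if the message is new and should be processed. */
static bool accept_seq(struct seq_model *c, uint64_t seq)
{
	assert(c != NULL);
	if ((int64_t)(seq - c->in_seq) < 1)
		return false; /* already seen: skip, leave in_seq alone */
	c->in_seq = seq;
	return true;
}
```

If the skip branch advanced `in_seq` as well, every following message would again look old, which matches the cascade of "skipping" lines in the log above.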

Reported-by: Theodore Ts'o
Signed-off-by: Sage Weil

Sage Weil
2010-11-02 06:49:23 +0800

21 Oct, 2010

1 commit

3d14c5d2b ceph: factor out libceph from Ceph file system

This factors out protocol and low-level storage parts of ceph into a
separate libceph module living in net/ceph and include/linux/ceph. This
is mostly a matter of moving files around. However, a few key pieces
of the interface change as well:

- ceph_client becomes ceph_fs_client and ceph_client, where the latter
captures the mon and osd clients, and the fs_client gets the mds client
and file system specific pieces.
- Mount option parsing and debugfs setup are correspondingly broken into
two pieces.
- The mon client gets a generic handler callback for otherwise unknown
messages (mds map, in this case).
- The basic supported/required feature bits can be expanded (and are by
ceph_fs_client).

No functional change, aside from some subtle error handling cases that got
cleaned up in the refactoring process.

Signed-off-by: Sage Weil

Yehuda Sadeh
2010-10-21 06:37:28 +0800