Eric Lee / smarc-fsl-linux-kernel

24 Aug, 2020

1 commit

df561f668 treewide: Use fallthrough pseudo-keyword

Replace the existing /* fall through */ comments and their variants with
the new pseudo-keyword macro fallthrough[1]. Also, remove unnecessary
fall-through markings where applicable.

[1] https://www.kernel.org/doc/html/v5.7/process/deprecated.html?highlight=fallthrough#implicit-switch-case-fall-through
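
A minimal userspace sketch of the pseudo-keyword, assuming GCC 7 or later or a recent Clang. The macro below is only a stand-in for the kernel's definition, and steps_from() is a hypothetical example function, not code from the tree:

```c
/*
 * Userspace stand-in for the fallthrough pseudo-keyword.
 * Assumption: the compiler supports the fallthrough statement attribute;
 * otherwise the macro degrades to an empty statement.
 */
#if defined(__has_attribute)
# if __has_attribute(__fallthrough__)
#  define fallthrough __attribute__((__fallthrough__))
# endif
#endif
#ifndef fallthrough
# define fallthrough do {} while (0) /* fallthrough */
#endif

/* Hypothetical example: count how many steps run when entering at a stage. */
static int steps_from(int stage)
{
	int steps = 0;

	switch (stage) {
	case 3:
		steps++;
		fallthrough;	/* deliberate: stage 3 also runs stage 2 */
	case 2:
		steps++;
		fallthrough;	/* deliberate: stage 2 also runs stage 1 */
	case 1:
		steps++;
		break;
	default:
		break;
	}
	return steps;
}
```

Unlike a comment, the macro is a statement the compiler can check, so building with -Wimplicit-fallthrough flags any case that falls through without it.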

Signed-off-by: Gustavo A. R. Silva

Gustavo A. R. Silva
2020-08-24 06:36:59 +0800

29 Jul, 2020

1 commit

76246c921 NFSv4: Use sequence counter with associated spinlock

A sequence counter write side critical section must be protected by some
form of locking to serialize writers. A plain seqcount_t does not
contain the information of which lock must be held when entering a write
side critical section.

Use the new seqcount_spinlock_t data type, which allows a spinlock to be
associated with the sequence counter. This enables lockdep to verify that
the spinlock used for writer serialization is held when the write side
critical section is entered.

If lockdep is disabled this lock association is compiled out and has
neither storage size nor runtime overhead.
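
The idea can be sketched in userspace C11, with the caveat that nothing below is the kernel API: struct seq_spin, the toy spinlock, and the lock_held flag are all hypothetical stand-ins. The assert() in seq_write_begin() plays the role of the lockdep check, and like that check it is compiled out (here, when NDEBUG is defined):

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>

/* Hypothetical analogue of a sequence counter tied to its writer lock. */
struct seq_spin {
	atomic_flag lock;	/* toy spinlock that serializes writers */
	bool lock_held;		/* stand-in for the ownership lockdep tracks */
	atomic_uint seq;	/* even: stable, odd: write in progress */
	atomic_uint a, b;	/* the protected pair of values */
};

static void seq_spin_lock(struct seq_spin *s)
{
	while (atomic_flag_test_and_set(&s->lock))
		;	/* spin until the current holder clears the flag */
	s->lock_held = true;
}

static void seq_spin_unlock(struct seq_spin *s)
{
	s->lock_held = false;
	atomic_flag_clear(&s->lock);
}

static void seq_write_begin(struct seq_spin *s)
{
	assert(s->lock_held);		/* writer must hold the associated lock */
	atomic_fetch_add(&s->seq, 1);	/* counter becomes odd */
}

static void seq_write_end(struct seq_spin *s)
{
	atomic_fetch_add(&s->seq, 1);	/* counter becomes even again */
}

static void seq_update(struct seq_spin *s, unsigned int a, unsigned int b)
{
	seq_spin_lock(s);
	seq_write_begin(s);
	atomic_store(&s->a, a);
	atomic_store(&s->b, b);
	seq_write_end(s);
	seq_spin_unlock(s);
}

/* Readers take no lock; they retry if a write overlapped the read. */
static void seq_read(struct seq_spin *s, unsigned int *a, unsigned int *b)
{
	unsigned int start;

	do {
		start = atomic_load(&s->seq);
		*a = atomic_load(&s->a);
		*b = atomic_load(&s->b);
	} while ((start & 1) || start != atomic_load(&s->seq));
}
```

A caller would initialize the structure with `struct seq_spin s = { .lock = ATOMIC_FLAG_INIT };` and then use seq_update() and seq_read().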

Signed-off-by: Ahmed S. Darwish
Signed-off-by: Peter Zijlstra (Intel)
Link: https://lkml.kernel.org/r/20200720155530.1173732-22-a.darwish@linutronix.de

Ahmed S. Darwish
2020-07-29 22:14:28 +0800

12 May, 2020

1 commit

29fe83997 nfs: fix NULL deference in nfs4_get_valid_delegation

We add the new state to the nfsi->open_states list, making it
potentially visible to other threads, before we've finished initializing
it.

That wasn't a problem when all the readers were also taking the i_lock
(as we do here), but since we switched to RCU, there's now a possibility
that a reader could see the partially initialized state.

Symptoms observed were a crash when another thread called
nfs4_get_valid_delegation() on a NULL inode, resulting in an oops like:

BUG: unable to handle page fault for address: ffffffffffffffb0 ...
RIP: 0010:nfs4_get_valid_delegation+0x6/0x30 [nfsv4] ...
Call Trace:
nfs4_open_prepare+0x80/0x1c0 [nfsv4]
__rpc_execute+0x75/0x390 [sunrpc]
? finish_task_switch+0x75/0x260
rpc_async_schedule+0x29/0x40 [sunrpc]
process_one_work+0x1ad/0x370
worker_thread+0x30/0x390
? create_worker+0x1a0/0x1a0
kthread+0x10c/0x130
? kthread_park+0x80/0x80
ret_from_fork+0x22/0x30
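
The ordering problem described above can be sketched in userspace C11. This is an illustration under stated assumptions, not the kernel fix: the struct names and both functions are hypothetical, writers are assumed to be serialized by a lock (as with i_lock in the message), and a release store stands in for an RCU list insertion:

```c
#include <stdatomic.h>
#include <stddef.h>

struct inode_stub {
	int ino;
};

struct state_stub {
	struct inode_stub *inode;
	struct state_stub *next;
};

/* Hypothetical lock-free list head that readers traverse without a lock. */
static _Atomic(struct state_stub *) open_states;

/*
 * Finish initializing the new state before it becomes reachable.
 * Assumption: callers are serialized by a writer-side lock.
 */
static void publish_state(struct state_stub *st, struct inode_stub *inode)
{
	st->inode = inode;
	st->next = atomic_load_explicit(&open_states, memory_order_relaxed);
	/* Release store: the fields above are visible before the pointer is. */
	atomic_store_explicit(&open_states, st, memory_order_release);
}

/* Lockless reader: returns the first state's inode number, or -1 if empty. */
static int first_state_ino(void)
{
	struct state_stub *st =
		atomic_load_explicit(&open_states, memory_order_acquire);

	return st ? st->inode->ino : -1;
}
```

If the pointer were published before st->inode was set, a concurrent reader could dereference a NULL inode, which matches the symptom in the trace.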

Fixes: 9ae075fdd190 "NFSv4: Convert open state lookup to use RCU"
Reviewed-by: Seiichi Ikarashi
Tested-by: Daisuke Matsuda
Tested-by: Masayoshi Mizuma
Signed-off-by: J. Bruce Fields
Cc: stable@vger.kernel.org # v4.20+
Signed-off-by: Trond Myklebust

J. Bruce Fields
2020-05-12 02:05:58 +0800

16 Mar, 2020

1 commit

b5fdf8418 NFSv4: Add support for CB_RECALL_ANY for flexfiles layouts

When we receive a CB_RECALL_ANY that asks us to return flexfiles
layouts, we iterate through all the layouts and look at whether or
not there are active open file descriptors that might need them
for I/O. If there are no such descriptors, we return the layouts.

Signed-off-by: Trond Myklebust

Trond Myklebust
2020-03-16 20:34:30 +0800

05 Feb, 2020

1 commit

7dc2993a9 NFSv4.0: nfs4_do_fsinfo() should not do implicit lease renewals

Currently, each time nfs4_do_fsinfo() is called it will do an implicit
NFS4 lease renewal, which is not compliant with the NFS4 specification.
This can result in a lease being expired by an NFS server.

Commit 83ca7f5ab31f ("NFS: Avoid PUTROOTFH when managing leases")
introduced implicit client lease renewal in nfs4_do_fsinfo(),
which can cause the NFSv4.0 lease to expire on the server side,
with servers returning NFS4ERR_EXPIRED or NFS4ERR_STALE_CLIENTID.

This can easily be reproduced by frequently unmounting a sub-mount,
then stat'ing it to get it mounted again, which will delay or even
completely prevent the client from sending RENEW operations if no other
NFS operations are issued. Eventually the NFS server will expire the
client's lease and return an error on file access or on the next RENEW.

This can also happen when a sub-mount is automatically unmounted
due to inactivity (after nfs_mountpoint_expiry_timeout), then
mounted again via stat(). This can result in a short window during
which the client's lease has expired on the server but not on the client.
This specific case was observed on production systems.

This patch removes the implicit lease renewal from nfs4_do_fsinfo().

Fixes: 83ca7f5ab31f ("NFS: Avoid PUTROOTFH when managing leases")
Signed-off-by: Robert Milkowski
Signed-off-by: Anna Schumaker

Robert Milkowski
2020-02-05 01:27:55 +0800

04 Feb, 2020

1 commit

b7b7dac68 NFSv4: Try to return the delegation immediately when marked for return on close

Add a routine to return the delegation immediately upon close of the
file if it was marked for return-on-close.

Signed-off-by: Trond Myklebust
Signed-off-by: Anna Schumaker

Trond Myklebust
2020-02-04 05:35:07 +0800

15 Jan, 2020

1 commit

8b98a5324 NFS4: Remove unneeded semicolon

Fixes coccicheck warning:

fs/nfs/nfs4state.c:1138:2-3: Unneeded semicolon
fs/nfs/nfs4proc.c:6862:2-3: Unneeded semicolon
fs/nfs/nfs4proc.c:8629:2-3: Unneeded semicolon

Reported-by: Hulk Robot
Signed-off-by: zhengbin
Signed-off-by: Anna Schumaker

zhengbin
2020-01-15 23:54:31 +0800

18 Nov, 2019

2 commits

21f86d2d6 NFS4: Trace lock reclaims

One of the most frustrating messages our sustaining team sees is
the "Lock reclaim failed!" message. Add some observability in the
client's lock reclaim logic so we can capture better data the
first time a problem occurs.

Signed-off-by: Chuck Lever
Signed-off-by: Trond Myklebust

Chuck Lever
2019-11-18 18:04:32 +0800

511ba52e4 NFS4: Trace state recovery operation

Add a trace point in the main state manager loop to observe state
recovery operation. This helps track down state recovery bugs.

Signed-off-by: Chuck Lever
Signed-off-by: Trond Myklebust

Chuck Lever
2019-11-18 17:58:39 +0800

06 Nov, 2019

1 commit

807ce06c2 Merge branch 'linux-ssc-for-5.5'

Trond Myklebust
2019-11-06 21:55:23 +0800

04 Nov, 2019

1 commit

42c304c34 NFS: nfs_inode_find_state_and_recover() fix stateid matching

In nfs_inode_find_state_and_recover() we want to mark for recovery
only those stateids that match or are older than the supplied
stateid parameter.
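
A hedged sketch of what "match or are older" could look like, assuming a stateid is a 32-bit seqid plus a 12-byte opaque "other" field and that seqids are compared with wraparound-safe signed arithmetic. The type and function names are hypothetical and are not taken from the patch:

```c
#include <stdbool.h>
#include <stdint.h>
#include <string.h>

#define OTHER_SIZE 12	/* assumed size of the opaque part of a stateid */

struct stateid_sketch {
	uint32_t seqid;
	unsigned char other[OTHER_SIZE];
};

/*
 * True if 'candidate' names the same state as 'supplied' and its seqid is
 * equal to or older than the supplied one. The signed difference keeps the
 * comparison meaningful across seqid wraparound (an assumption here).
 */
static bool match_or_older(const struct stateid_sketch *candidate,
			   const struct stateid_sketch *supplied)
{
	if (memcmp(candidate->other, supplied->other, OTHER_SIZE) != 0)
		return false;
	return (int32_t)(candidate->seqid - supplied->seqid) <= 0;
}
```

Under this rule a stateid that is newer than the supplied one is left alone, so state the server has already refreshed is not marked for recovery.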

Signed-off-by: Trond Myklebust

Trond Myklebust
2019-11-04 10:28:46 +0800

10 Oct, 2019

2 commits

0e65a32c8 NFS: handle source server reboot

When the source server reboots after a server-to-server copy was
issued, we need to retry the copy from COPY_NOTIFY. We need to
detect that the source server rebooted and there is a copy waiting
on a destination server and wake it up.

Signed-off-by: Olga Kornievskaia

Olga Kornievskaia
2019-10-10 00:06:19 +0800

0b9018b9c NFS: skip recovery of copy open on dest server

Mark the open created for the source file on the destination
server. Then, if this open goes through recovery, fail the
recovery, as we don't need to be recovering a "fake" open.
We need to fail the ongoing READs and vfs_copy_file_range().

Signed-off-by: Olga Kornievskaia

Olga Kornievskaia
2019-10-10 00:05:56 +0800

21 Sep, 2019

1 commit

0e0cb35b4 NFSv4: Handle NFS4ERR_OLD_STATEID in CLOSE/OPEN_DOWNGRADE

If a CLOSE or OPEN_DOWNGRADE operation receives an NFS4ERR_OLD_STATEID,
then bump the seqid before resending. Ensure we only bump the seqid
by 1.

Signed-off-by: Trond Myklebust
Signed-off-by: Anna Schumaker

Trond Myklebust
2019-09-21 03:56:19 +0800

22 Aug, 2019

1 commit

1e672e364 NFSv4: Fix a memory leak bug

In nfs4_try_migration(), if nfs4_begin_drain_session() fails, the
previously allocated 'page' and 'locations' are not deallocated, leading to
memory leaks. To fix this issue, go to the 'out' label to free 'page' and
'locations' before returning the error.
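
The cleanup pattern described above, sketched as standalone C. Everything here is a hypothetical stand-in: begin_drain_stub() replaces nfs4_begin_drain_session(), the buffer sizes are arbitrary, and the tracked allocator exists only so the absence of a leak can be observed:

```c
#include <errno.h>
#include <stdlib.h>

static int live_allocs;	/* outstanding allocations, for illustration only */

static void *tracked_alloc(size_t size)
{
	void *p = malloc(size);

	if (p)
		live_allocs++;
	return p;
}

static void tracked_free(void *p)
{
	if (p) {
		live_allocs--;
		free(p);
	}
}

/* Hypothetical stand-in for the step that can fail after allocation. */
static int begin_drain_stub(int fail)
{
	return fail ? -EIO : 0;
}

static int try_migration_sketch(int fail_drain)
{
	void *page, *locations;
	int result = -ENOMEM;

	page = tracked_alloc(4096);
	locations = tracked_alloc(256);
	if (page == NULL || locations == NULL)
		goto out;

	result = begin_drain_stub(fail_drain);
	if (result != 0)
		goto out;	/* returning here directly would leak both buffers */

	result = 0;
out:
	tracked_free(locations);
	tracked_free(page);
	return result;
}
```

Routing every exit through the single 'out' label keeps the frees in one place, so a new early-exit path cannot forget them.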

Signed-off-by: Wenwen Wang
Signed-off-by: Anna Schumaker

Wenwen Wang
2019-08-22 04:39:29 +0800

08 Aug, 2019

1 commit

67e7b52d4 NFSv4: Ensure state recovery handles ETIMEDOUT correctly

Ensure that the state recovery code handles ETIMEDOUT correctly,
and also that we set RPC_TASK_TIMEOUT when recovering open state.

Signed-off-by: Trond Myklebust

Trond Myklebust
2019-08-08 00:55:11 +0800

05 Aug, 2019

3 commits

c77e22834 NFSv4: Fix a potential sleep while atomic in nfs4_do_reclaim()

John Hubbard reports seeing the following stack trace:

nfs4_do_reclaim
rcu_read_lock /* we are now in_atomic() and must not sleep */
nfs4_purge_state_owners
nfs4_free_state_owner
nfs4_destroy_seqid_counter
rpc_destroy_wait_queue
cancel_delayed_work_sync
__cancel_work_timer
__flush_work
start_flush_work
might_sleep:
(kernel/workqueue.c:2975: BUG)

The solution is to separate out the freeing of the state owners
from nfs4_purge_state_owners(), and perform that outside the atomic
context.
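
The split described above can be sketched in userspace C. This is an illustration only: the in_atomic flag is a hypothetical stand-in for the RCU read-side section, the list is a plain singly linked list, and free_owner() asserts where the kernel's might_sleep() check would fire:

```c
#include <assert.h>
#include <stdbool.h>
#include <stdlib.h>

struct owner {
	struct owner *next;
};

static bool in_atomic;		/* stand-in for "inside rcu_read_lock()" */
static struct owner *cached;	/* hypothetical list of cached state owners */
static int freed;

static void cache_owner(void)
{
	struct owner *o = malloc(sizeof(*o));

	if (o == NULL)
		abort();
	o->next = cached;
	cached = o;
}

/* Freeing may sleep, so it must never run in atomic context. */
static void free_owner(struct owner *o)
{
	assert(!in_atomic);
	free(o);
	freed++;
}

/* Step 1: only detach the owners; this never sleeps. */
static struct owner *purge_owners(void)
{
	struct owner *head = cached;

	cached = NULL;
	return head;
}

/* Step 2: free the detached owners once it is safe to sleep. */
static void free_owners(struct owner *head)
{
	while (head) {
		struct owner *next = head->next;

		free_owner(head);
		head = next;
	}
}

static void reclaim_sketch(void)
{
	struct owner *doomed;

	in_atomic = true;
	doomed = purge_owners();	/* safe inside the atomic section */
	in_atomic = false;
	free_owners(doomed);		/* performed outside the atomic section */
}
```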

Reported-by: John Hubbard
Fixes: 0aaaf5c424c7f ("NFS: Cache state owners after files are closed")
Signed-off-by: Trond Myklebust

Trond Myklebust
2019-08-05 10:35:40 +0800

c34fae003 NFSv4: When recovering state fails with EAGAIN, retry the same recovery

If the server returns with EAGAIN when we're trying to recover from
a server reboot, we currently delay for 1 second, but then mark the
stateid as needing recovery after the grace period has expired.

Instead, we should just retry the same recovery process immediately
after the 1 second delay. Break out of the loop after 10 retries.
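
A minimal sketch of such a bounded retry loop, assuming "10 retries" means up to ten further attempts after the first one. The function names, the callback signatures, and the stubs are hypothetical; the delay is passed in as a callback so the example does not actually sleep:

```c
#include <errno.h>

#define MAX_RETRIES 10	/* retry budget stated in the commit message */

/*
 * Retry the same recovery step while it reports -EAGAIN, calling delay()
 * between attempts, and give up once the retry budget is used.
 */
static int recover_with_retry(int (*recover)(void *ctx), void *ctx,
			      void (*delay)(void))
{
	int retries = 0;

	for (;;) {
		int status = recover(ctx);

		if (status != -EAGAIN)
			return status;
		if (retries++ >= MAX_RETRIES)
			return -EAGAIN;
		delay();	/* the real code would wait about 1 second */
	}
}

/* Example stubs: fail with -EAGAIN a given number of times, then succeed. */
static int flaky_recover(void *ctx)
{
	int *failures_left = ctx;

	if (*failures_left > 0) {
		(*failures_left)--;
		return -EAGAIN;
	}
	return 0;
}

static int delays;

static void count_delay(void)
{
	delays++;
}
```

With three pending failures the call succeeds after three delays; with more than ten it returns -EAGAIN after ten delays and eleven attempts.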

Fixes: 35a61606a612 ("NFS: Reduce indentation of the switch statement...")
Signed-off-by: Trond Myklebust

Trond Myklebust
2019-08-05 10:35:40 +0800

86dbd08b3 NFSv4: Print an error in the syslog when state is marked as irrecoverable

When error recovery fails due to a fatal error on the server, ensure
we log it in the syslog.

Signed-off-by: Trond Myklebust

Trond Myklebust
2019-08-05 10:35:40 +0800

19 Jul, 2019

1 commit

d9aba2b40 NFSv4: Don't use the zero stateid with layoutget

The NFSv4.1 protocol explicitly forbids us from using the zero stateid
together with layoutget, so when we see that nfs4_select_rw_stateid()
is unable to return a valid delegation, lock or open stateid, then
we should initiate recovery and retry.

Signed-off-by: Trond Myklebust

Trond Myklebust
2019-07-19 02:43:52 +0800

13 Jul, 2019

2 commits

5b596830d nfs4.0: Refetch lease_time after clientid update

RFC 7530 requires us to refetch the lease time attribute once a new
clientID is established. This is already implemented for the
nfs4.1(+) clients by nfs41_init_clientid, which calls
nfs41_finish_session_reset, which calls nfs4_setup_state_renewal.

To make nfs4_setup_state_renewal available for nfs4.0, move it
further to the top of the source file to include it regardless of
CONFIG_NFS_V4_1 and to save a forward declaration.

Call nfs4_setup_state_renewal from nfs4_init_clientid.

Signed-off-by: Donald Buczek
Signed-off-by: Trond Myklebust

Donald Buczek
2019-07-13 23:48:41 +0800

ea51efaa9 nfs4: Rename nfs41_setup_state_renewal

The function nfs41_setup_state_renewal is useful to the nfs 4.0 client
as well, so rename the function to nfs4_setup_state_renewal.

Signed-off-by: Donald Buczek
Signed-off-by: Trond Myklebust

Donald Buczek
2019-07-13 23:48:41 +0800

10 May, 2019

2 commits

8ca017c8c NFSv4: don't mark all open state for recovery when handling recallable state revoked flag

Only delegations and layouts can be recalled, so it shouldn't be
necessary to recover all opens when handling the status bit
SEQ4_STATUS_RECALLABLE_STATE_REVOKED. We'll still wind up calling
nfs41_open_expired() when a TEST_STATEID returns NFS4ERR_DELEG_REVOKED.

Signed-off-by: Scott Mayhew
Reviewed-by: Trond Myklebust
Signed-off-by: Anna Schumaker

Scott Mayhew
2019-05-10 04:26:57 +0800

f02f3755d NFS4: Fix v4.0 client state corruption when mount

A stat command on a soft mount never returns after the server is stopped.

When a new client is allocated, its state is set to
NFS4CLNT_LEASE_EXPIRED.

When the server is stopped, the state manager runs and recovers
according to that state. Because the state is NFS4CLNT_LEASE_EXPIRED, it
drains the slot table and puts other tasks on the wait queue until
the client has recovered. The stat command then hangs.

When server trunking is discovered, the client renews the lease
but checks the client state, which leads to client state corruption.

So we need to call the state manager to recover it when server
IP trunking is detected.

Signed-off-by: ZhangXiaoxu
Cc: stable@vger.kernel.org
Signed-off-by: Anna Schumaker

ZhangXiaoxu
2019-05-10 04:26:05 +0800

21 Feb, 2019

1 commit

302fad7bd NFS: Fix up documentation warnings

Fix up some compiler warnings about function parameters, etc. not being
correctly described or formatted.

Signed-off-by: Trond Myklebust

Trond Myklebust
2019-02-21 04:14:21 +0800

03 Jan, 2019

1 commit

9aeaf8cfc NFSv4.2 fix async copy reboot recovery

The original commit (e4648aa4f98a "NFS recover from destination server
reboot for copies") used memcmp(). It was later changed to use
nfs4_stateid_match_other(), but that function returns the opposite of
memcmp(). As a result, recovery can't find the copy, which leaves the
copy hanging.
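
The polarity mix-up can be illustrated with a small standalone sketch. The names below are hypothetical and the lookup loops are simplified; the only assumption carried over is that the match helper returns true on equality, whereas memcmp() returns 0 on equality:

```c
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

#define OTHER_SIZE 12	/* assumed size of the opaque part of a stateid */

struct copy_stateid {
	unsigned char other[OTHER_SIZE];
};

/* Boolean helper: true when the opaque parts are equal. */
static bool match_other(const struct copy_stateid *a,
			const struct copy_stateid *b)
{
	return memcmp(a->other, b->other, OTHER_SIZE) == 0;
}

/* Buggy: keeps the "skip if non-zero" idiom that was correct for memcmp(). */
static int find_copy_buggy(const struct copy_stateid *copies, size_t n,
			   const struct copy_stateid *want)
{
	for (size_t i = 0; i < n; i++) {
		if (match_other(&copies[i], want))
			continue;	/* skips exactly the entry we want */
		return (int)i;
	}
	return -1;
}

/* Fixed: skip entries that do not match. */
static int find_copy_fixed(const struct copy_stateid *copies, size_t n,
			   const struct copy_stateid *want)
{
	for (size_t i = 0; i < n; i++) {
		if (!match_other(&copies[i], want))
			continue;
		return (int)i;
	}
	return -1;
}
```

With a single pending copy that does match, the buggy lookup returns -1, which corresponds to recovery never finding the copy.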

Fixes: 80f42368868e ("NFSv4: Split out NFS v4.2 copy completion functions")
Fixes: cb7a8384dc02 ("NFS: Split out the body of nfs4_reclaim_open_state")
Signed-off-by: Olga Kornievskaia
Signed-off-by: Anna Schumaker

Olga Kornievskaia
2019-01-03 01:05:19 +0800

20 Dec, 2018

4 commits

a52458b48 NFS/NFSD/SUNRPC: replace generic creds with 'struct cred'.

SUNRPC has two sorts of credentials, both of which appear as
"struct rpc_cred".
There are "generic credentials" which are supplied by clients
such as NFS and passed in 'struct rpc_message' to indicate
which user should be used to authorize the request, and there
are low-level credentials such as AUTH_NULL, AUTH_UNIX, AUTH_GSS
which describe the credential to be sent over the wire.

This patch replaces all the generic credentials by 'struct cred'
pointers - the credential structure used throughout Linux.

For machine credentials, there is a special 'struct cred *' pointer
which is statically allocated and recognized where needed as
having a special meaning. A look-up of a low-level cred will
map this to a machine credential.

Signed-off-by: NeilBrown
Acked-by: J. Bruce Fields
Signed-off-by: Anna Schumaker

NeilBrown
2018-12-20 02:52:46 +0800

5e16923b4 NFS/SUNRPC: don't lookup machine credential until rpcauth_bindcred().

When NFS creates a machine credential, it is a "generic" credential,
not tied to any auth protocol, and is really just a container for
the principal name.
This doesn't get linked to a genuine credential until rpcauth_bindcred()
is called.
The lookup always succeeds, so the various places that test whether the
machine credential is NULL are pointless.

As a step towards getting rid of generic credentials, this patch gets
rid of generic machine credentials. The nfs_client and rpc_client
just hold a pointer to a constant principal name.
When a machine credential is wanted, a special static 'struct rpc_cred'
pointer is used. rpcauth_bindcred() recognizes this, finds the
principal from the client, and binds the correct credential.

Signed-off-by: NeilBrown
Signed-off-by: Anna Schumaker

NeilBrown
2018-12-20 02:52:45 +0800

f15e1e8bc NFSv4: don't require lock for get_renew_cred or get_machine_cred

This lock is no longer necessary.

If nfs4_get_renew_cred() needs to hunt through the open-state
creds for a user cred, it still takes the lock to stabilize
the rbtree, but otherwise there are no races.

Note that this completely removes the lock from nfs4_renew_state().
It appears that the original need for the locking here was removed
long ago, and there is no longer anything to protect.

Signed-off-by: NeilBrown
Signed-off-by: Anna Schumaker

NeilBrown
2018-12-20 02:52:45 +0800

a534ecb01 NFSv4: add cl_root_cred for use when machine cred is not available.

NFSv4 state management tries a root credential when no machine
credential is available, as can happen with kerberos.
It does this by replacing the cl_machine_cred with a root credential.
This means that any user of the machine credential needs to take
a lock while getting a reference to the machine credential, which is
a little cumbersome.

So introduce an explicit cl_root_cred, and never free either
credential until client shutdown. This means that no locking
is needed to reference these credentials. Future patches
will make use of this.

This is only a temporary addition. Both cl_machine_cred and
cl_root_cred will disappear later in the series.

Signed-off-by: NeilBrown
Signed-off-by: Anna Schumaker

NeilBrown
2018-12-20 02:52:45 +0800

20 Nov, 2018

1 commit

aeabb3c96 NFSv4: Fix a NFSv4 state manager deadlock

Fix a deadlock whereby the NFSv4 state manager can get stuck in the
delegation return code, waiting for a layout return to complete in
another thread. If the server reboots before that other thread
completes, then we need to be able to start a second state
manager thread in order to perform recovery.

Signed-off-by: Trond Myklebust

Trond Myklebust
2018-11-20 09:11:45 +0800

13 Nov, 2018

2 commits

a1aa09be2 NFSv4: Ensure that the state manager exits the loop on SIGKILL

Signed-off-by: Trond Myklebust

Trond Myklebust
2018-11-13 05:39:13 +0800

21a446cf1 NFSv4: Don't exit the state manager without clearing NFS4CLNT_MANAGER_RUNNING

If we exit the NFSv4 state manager due to a umount, then we can end up
leaving the NFS4CLNT_MANAGER_RUNNING flag set. If another mount causes
the nfs4_client to be rereferenced before it is destroyed, then we end
up never being able to recover state.

Fixes: 47c2199b6eb5 ("NFSv4.1: Ensure state manager thread dies on last ...")
Signed-off-by: Trond Myklebust
Cc: stable@vger.kernel.org # v4.15+

Trond Myklebust
2018-11-13 05:39:13 +0800

01 Oct, 2018

7 commits

80f423688 NFSv4: Split out NFS v4.2 copy completion functions

The convention in the rest of the code is to have a separate function
for anything that might be ifdef-ed out.

Signed-off-by: Anna Schumaker
Signed-off-by: Trond Myklebust

Anna Schumaker
2018-10-01 03:35:17 +0800

000d3f956 NFS: Reduce indentation of nfs4_recovery_handle_error()

This is to match kernel coding style for switch statements.

Signed-off-by: Anna Schumaker
Signed-off-by: Trond Myklebust

Anna Schumaker
2018-10-01 03:35:17 +0800

35a61606a NFS: Reduce indentation of the switch statement in nfs4_reclaim_open_state()

Most places in the kernel tend to line up cases with the switch to
reduce indentation, so move this over to match that style.
Additionally, I handle the (status >= 0) case in the switch so that we
only "goto restart" from a single place after error handling.

Signed-off-by: Anna Schumaker
Signed-off-by: Trond Myklebust

Anna Schumaker
2018-10-01 03:35:17 +0800

cb7a8384d NFS: Split out the body of nfs4_reclaim_open_state()

Moving all of this into a new function removes the need for cramped
indentation, making the code easier to read overall. I also take
this chance to switch copy recovery over to using
nfs4_stateid_match_other().

Signed-off-by: Anna Schumaker
Signed-off-by: Trond Myklebust

Anna Schumaker
2018-10-01 03:35:17 +0800

ace9fad43 NFSv4: Convert struct nfs4_state to use refcount_t

Signed-off-by: Trond Myklebust

Trond Myklebust
2018-10-01 03:35:17 +0800

9ae075fdd NFSv4: Convert open state lookup to use RCU

Further reduce contention on the inode->i_lock.

Signed-off-by: Trond Myklebust

Trond Myklebust
2018-10-01 03:35:17 +0800

0de43976f NFS: Convert lookups of the open context to RCU

Reduce contention on the inode->i_lock by ensuring that we use RCU
when looking up the NFS open context.

Signed-off-by: Trond Myklebust

Trond Myklebust
2018-10-01 03:35:17 +0800