Eric Lee / smarc-fsl-linux-kernel

12 Nov, 2020

2 commits

11decaf81 NFS: Remove unnecessary inode lock in nfs_fsync_dir()

nfs_inc_stats() is already thread-safe, and there are no other reasons
to hold the inode lock here.

Signed-off-by: Trond Myklebust
Signed-off-by: Anna Schumaker

Trond Myklebust
2020-11-12 23:41:26 +0800
83f2c45e6 NFS: Remove unnecessary inode locking in nfs_llseek_dir()

Remove the contentious inode lock, and instead provide thread safety
using the file->f_lock spinlock.

Signed-off-by: Trond Myklebust
Signed-off-by: Anna Schumaker

Trond Myklebust
2020-11-12 23:41:26 +0800

17 Sep, 2020

1 commit

d33030e2e nfs: Fix security label length not being reset

nfs_readdir_page_filler() iterates over entries in a directory, reusing
the same security label buffer, but does not reset the buffer's length.
This causes decode_attr_security_label() to return -ERANGE if an entry's
security label is longer than the previous one's. This error, in
nfs4_decode_dirent(), only gets passed up as -EAGAIN, which causes another
failed attempt to copy into the buffer. The second error is ignored and
the remaining entries do not show up in ls, that is, they are missing from
the results of the getdents64() syscall.

Reproduce by creating multiple files in NFS and giving one of the later
files a longer security label. ls will not see that file nor any that are
added afterwards, though they will exist on the backend.

In nfs_readdir_page_filler(), reset the security label buffer length before
every reuse.
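
The failure mode can be modelled in userspace. The sketch below is not the
kernel code: struct label, decode_label(), decode_all() and LABEL_MAX are
hypothetical names, and the only assumption carried over from the description
is that the decoder treats the buffer's length field as the space available
and overwrites it with the length of the label it just copied.

```c
#include <errno.h>
#include <stddef.h>
#include <string.h>

#define LABEL_MAX 64	/* hypothetical capacity of the shared buffer */

/* Hypothetical stand-in for the reused security label buffer. */
struct label {
	char data[LABEL_MAX];
	size_t len;	/* on input: space available; on output: bytes used */
};

/* Copy one entry's label; fail with -ERANGE if it does not fit in label->len. */
int decode_label(struct label *label, const char *src)
{
	size_t n = strlen(src);

	if (n > label->len)
		return -ERANGE;
	memcpy(label->data, src, n);
	label->len = n;		/* the length now describes this entry only */
	return 0;
}

/*
 * Decode every entry using one shared buffer and return the number of
 * entries decoded before the first error. When reset_len is non-zero the
 * length is restored to the full capacity before each reuse.
 */
size_t decode_all(const char *const *labels, size_t count, int reset_len)
{
	struct label label = { .len = LABEL_MAX };
	size_t i;

	for (i = 0; i < count; i++) {
		if (reset_len)
			label.len = LABEL_MAX;
		if (decode_label(&label, labels[i]) != 0)
			break;
	}
	return i;
}
```

Without the reset, decoding stops at the first entry whose label is longer
than the previous one; with the reset, every entry is decoded.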

Signed-off-by: Jeffrey Mitchell
Fixes: b4487b935452 ("nfs: Fix getxattr kernel panic and memory overflow")
Signed-off-by: Trond Myklebust

Jeffrey Mitchell
2020-09-17 00:25:14 +0800

24 Aug, 2020

1 commit

df561f668 treewide: Use fallthrough pseudo-keyword

Replace the existing /* fall through */ comments and their variants with
the new pseudo-keyword macro fallthrough[1]. Also, remove unnecessary
fall-through markings where applicable.

[1] https://www.kernel.org/doc/html/v5.7/process/deprecated.html?highlight=fallthrough#implicit-switch-case-fall-through
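
As an illustration, the pseudo-keyword can be approximated in userspace as
shown below. This is a sketch under assumptions: the macro here is a local
approximation rather than a copy of the kernel header, and privilege_count()
is a hypothetical function used only to show the annotation in a switch.

```c
/*
 * Userspace approximation of the pseudo-keyword: expand to the compiler
 * attribute when it is available, otherwise to an empty statement.
 */
#if defined(__has_attribute)
# if __has_attribute(__fallthrough__)
#  define fallthrough __attribute__((__fallthrough__))
# endif
#endif
#ifndef fallthrough
# define fallthrough do {} while (0)	/* fallthrough */
#endif

/* Hypothetical example: count how many privilege levels a role includes. */
int privilege_count(int role)
{
	int count = 0;

	switch (role) {
	case 2:		/* admin: also gets everything below */
		count++;
		fallthrough;
	case 1:		/* user: also gets guest rights */
		count++;
		fallthrough;
	case 0:		/* guest */
		count++;
		break;
	default:	/* unknown role: no privileges */
		break;
	}
	return count;
}
```

Unlike a comment, the statement form is visible to the compiler, so an
unannotated fall-through can be diagnosed.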

Signed-off-by: Gustavo A. R. Silva

Gustavo A. R. Silva
2020-08-24 06:36:59 +0800

14 Jul, 2020

2 commits

72832a245 NFSv4.2: query the extended attribute access bits

RFC 8276 defines separate ACCESS bits for extended attribute checking.
Query them in nfs_do_access and opendata.

Signed-off-by: Frank van der Linden
Signed-off-by: Trond Myklebust

Frank van der Linden
2020-07-14 05:52:45 +0800
d2ae4f8b2 nfs: define nfs_access_get_cached function

The only consumer of nfs_access_get_cached_rcu and nfs_access_cached
calls these static functions in order to first try RCU access, and
then locked access.

Combine them into a single function, and call that. Make this function
available to the rest of the NFS code.

Signed-off-by: Frank van der Linden
Signed-off-by: Trond Myklebust

Frank van der Linden
2020-07-14 05:52:45 +0800

08 Apr, 2020

1 commit

04de788e6 Merge tag 'nfs-for-5.7-1' of git://git.linux-nfs.org/projects/trondmy/linux-nfs

Pull NFS client updates from Trond Myklebust:
"Highlights include:

Stable fixes:
- Fix a page leak in nfs_destroy_unlinked_subrequests()

- Fix use-after-free issues in nfs_pageio_add_request()

- Fix new mount code constant_table array definitions

- finish_automount() requires us to hold 2 refs to the mount record

Features:
- Improve the accuracy of telldir/seekdir by using 64-bit cookies
when possible.

- Allow one RDMA active connection and several zombie connections to
prevent blocking if the remote server is unresponsive.

- Limit the size of the NFS access cache by default

- Reduce the number of references to credentials that are taken by
NFS

- pNFS files and flexfiles drivers now support per-layout segment
COMMIT lists.

- Enable partial-file layout segments in the pNFS/flexfiles driver.

- Add support for CB_RECALL_ANY to the pNFS flexfiles layout type

- pNFS/flexfiles Report NFS4ERR_DELAY and NFS4ERR_GRACE errors from
the DS using the layouterror mechanism.

Bugfixes and cleanups:
- SUNRPC: Fix krb5p regressions

- Don't specify NFS version in "UDP not supported" error

- nfsroot: set tcp as the default transport protocol

- pnfs: Return valid stateids in nfs_layout_find_inode_by_stateid()

- alloc_nfs_open_context() must use the file cred when available

- Fix locking when dereferencing the delegation cred

- Fix memory leaks in O_DIRECT when nfs_get_lock_context() fails

- Various clean ups of the NFS O_DIRECT commit code

- Clean up RDMA connect/disconnect

- Replace zero-length arrays with C99-style flexible arrays"

* tag 'nfs-for-5.7-1' of git://git.linux-nfs.org/projects/trondmy/linux-nfs: (86 commits)
NFS: Clean up process of marking inode stale.
SUNRPC: Don't start a timer on an already queued rpc task
NFS/pnfs: Reference the layout cred in pnfs_prepare_layoutreturn()
NFS/pnfs: Fix dereference of layout cred in pnfs_layoutcommit_inode()
NFS: Beware when dereferencing the delegation cred
NFS: Add a module parameter to set nfs_mountpoint_expiry_timeout
NFS: finish_automount() requires us to hold 2 refs to the mount record
NFS: Fix a few constant_table array definitions
NFS: Try to join page groups before an O_DIRECT retransmission
NFS: Refactor nfs_lock_and_join_requests()
NFS: Reverse the submission order of requests in __nfs_pageio_add_request()
NFS: Clean up nfs_lock_and_join_requests()
NFS: Remove the redundant function nfs_pgio_has_mirroring()
NFS: Fix memory leaks in nfs_pageio_stop_mirroring()
NFS: Fix a request reference leak in nfs_direct_write_clear_reqs()
NFS: Fix use-after-free issues in nfs_pageio_add_request()
NFS: Fix races nfs_page_group_destroy() vs nfs_destroy_unlinked_subrequests()
NFS: Fix a page leak in nfs_destroy_unlinked_subrequests()
NFS: Remove unused FLUSH_SYNC support in nfs_initiate_pgio()
pNFS/flexfiles: Specify the layout segment range in LAYOUTGET
...

Linus Torvalds
2020-04-08 04:51:39 +0800

07 Apr, 2020

1 commit

93ce4af77 NFS: Clean up process of marking inode stale.

Instead of the various open coded calls to set the NFS_INO_STALE bit
and call nfs_zap_caches(), consolidate them into a single function
nfs_set_inode_stale().

Signed-off-by: Trond Myklebust

Trond Myklebust
2020-04-07 01:56:33 +0800

24 Mar, 2020

1 commit

baf5fe761 Merge branch 'for-mingo' of git://git.kernel.org/pub/scm/linux/kernel/git/paulmck/linux-rcu into core/rcu

Pull RCU changes from Paul E. McKenney:

- Make kfree_rcu() use kfree_bulk() for added performance
- RCU updates
- Callback-overload handling updates
- Tasks-RCU KCSAN and sparse updates
- Locking torture test and RCU torture test updates
- Documentation updates
- Miscellaneous fixes

Signed-off-by: Ingo Molnar <mingo@kernel.org>

Ingo Molnar
2020-03-24 17:10:09 +0800

16 Mar, 2020

4 commits

5601cda82 nfs: Replace zero-length array with flexible-array member

The current codebase makes use of the zero-length array language
extension to the C90 standard, but the preferred mechanism to declare
variable-length types such as these is a flexible array member[1][2],
introduced in C99:

struct foo {
        int stuff;
        struct boo array[];
};

By making use of the mechanism above, we will get a compiler warning
in case the flexible array does not occur last in the structure, which
will help us prevent some kind of undefined behavior bugs from being
inadvertently introduced[3] to the codebase from now on.

Also, notice that dynamic memory allocations won't be affected by
this change:

"Flexible array members have incomplete type, and so the sizeof operator
may not be applied. As a quirk of the original implementation of
zero-length arrays, sizeof evaluates to zero."[1]
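
A minimal sketch of how such a structure is typically allocated, assuming a
userspace build. The count field, the body of struct boo and the helper
foo_alloc() are hypothetical additions for illustration and are not part of
the patch.

```c
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>

/* Hypothetical element type for the trailing array. */
struct boo {
	int value;
};

struct foo {
	int stuff;
	size_t count;		/* hypothetical: number of elements in array[] */
	struct boo array[];	/* flexible array member: must be the last field */
};

/*
 * Allocate a struct foo with room for n trailing elements.
 * sizeof(*f) covers only the fixed part of the structure, so the space
 * for the array is added explicitly. Returns NULL on failure.
 */
struct foo *foo_alloc(size_t n)
{
	struct foo *f;

	if (n > (SIZE_MAX - sizeof(*f)) / sizeof(f->array[0]))
		return NULL;	/* the size computation would overflow */
	f = calloc(1, sizeof(*f) + n * sizeof(f->array[0]));
	if (f)
		f->count = n;
	return f;
}
```

The allocation expression is the same one that would be used with a
zero-length array, which is why existing allocations are unaffected.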

This issue was found with the help of Coccinelle.

[1] https://gcc.gnu.org/onlinedocs/gcc/Zero-Length.html
[2] https://github.com/KSPP/linux/issues/21
[3] commit 76497732932f ("cxgb3/l2t: Fix undefined behaviour")

Signed-off-by: Gustavo A. R. Silva
Signed-off-by: Trond Myklebust

Gustavo A. R. Silva
2020-03-16 22:16:26 +0800
eb095c140 NFS:remove redundant call to nfs_do_access

In function nfs_permission:
1. The rcu_read_lock and rcu_read_unlock around nfs_do_access
are unnecessary because the RCU-critical data structure is already
protected in the subsidiary function nfs_access_get_cached_rcu. No other
data structure in nfs_do_access needs rcu_read_lock.

2. Calling nfs_do_access once is enough, because:
2-1. When mask has the MAY_NOT_BLOCK bit set,
the second call to nfs_do_access will not happen.

2-2. When mask does not have the MAY_NOT_BLOCK bit set,
the second call to nfs_do_access will happen if res == -ECHILD, which
means the first nfs_do_access returned after the statement if (!may_block).
The second call to nfs_do_access goes through the same procedure once
again, except that it continues the work after if (!may_block).
That work can be performed by a single call to nfs_do_access
without mangling the mask flag.

Tested on x86_64
Signed-off-by: Zhouyi Zhou
Signed-off-by: Trond Myklebust

Zhouyi Zhou
2020-03-16 22:11:59 +0800
a8b373eef NFS: Limit the size of the access cache by default

Currently, we have no real limit on the access cache size (we set it
to ULONG_MAX). That can lead to credentials getting pinned for a
very long time on lots of files if you have a system with a lot of
memory.

Signed-off-by: Trond Myklebust

Trond Myklebust
2020-03-16 20:34:29 +0800
59e356a96 NFS: Use the 64-bit server readdir cookies when possible

When we're running on a 64-bit architecture and are not running in
32-bit compatibility mode, it is better to use the 64-bit readdir
cookies that are supplied by the server. Doing so improves the accuracy
of telldir()/seekdir(), particularly when the directory is changing,
for instance, when doing 'rm -rf'.

We still fall back to using the 32-bit offsets on 32-bit architectures
and when in compatibility mode.

Signed-off-by: Trond Myklebust

Trond Myklebust
2020-03-16 20:34:28 +0800

21 Feb, 2020

1 commit

9f01eb5d4 nfs: Fix nfs_access_get_cached_rcu() sparse error

This patch fixes the following sparse error:
fs/nfs/dir.c:2353:14: error: incompatible types in comparison expression (different address spaces):
fs/nfs/dir.c:2353:14: struct list_head [noderef] *
fs/nfs/dir.c:2353:14: struct list_head *

Signed-off-by: Madhuparna Bhowmik
Signed-off-by: Paul E. McKenney

Madhuparna Bhowmik
2020-02-21 07:58:21 +0800

13 Feb, 2020

1 commit

efeda80da NFSv4: Fix revalidation of dentries with delegations

If a dentry was not initially looked up while we were holding a
delegation, then we do still need to revalidate that it still holds
the same name. If there are multiple hard links to the same file,
then all the hard links need validation.

Reported-by: Benjamin Coddington
Signed-off-by: Trond Myklebust
Reviewed-by: Benjamin Coddington
Tested-by: Benjamin Coddington
[Anna: Put nfs_unset_verifier_delegated() under CONFIG_NFS_V4]
Signed-off-by: Anna Schumaker

Trond Myklebust
2020-02-13 02:55:25 +0800

10 Feb, 2020

1 commit

a1147b828 NFS: Fix up directory verifier races

In order to avoid having our dentry revalidation race with an update
of the directory on the server, we need to store the verifier before
the RPC calls to LOOKUP and READDIR.

Signed-off-by: Trond Myklebust
Reviewed-by: Benjamin Coddington
Tested-by: Benjamin Coddington
Signed-off-by: Anna Schumaker

Trond Myklebust
2020-02-10 23:38:48 +0800

04 Feb, 2020

6 commits

227823d20 nfs: optimise readdir cache page invalidation

When a large directory is being modified by one client while another
client runs 'ls -l' on the same directory, the cache page invalidation
from nfs_force_use_readdirplus causes the reading client to keep
restarting READDIRPLUS from cookie 0. As a result, the 'ls -l' takes a
very long time to complete, and possibly never completes.

Currently when nfs_force_use_readdirplus is called to switch from
READDIR to READDIRPLUS, it invalidates all the cached pages of the
directory. This cache page invalidation causes the next nfs_readdir
to re-read the directory content from cookie 0.

This patch optimises the cache invalidation in
nfs_force_use_readdirplus by only truncating the cached pages from
the last page index accessed to the end of the file. It also marks the
inode to delay invalidating all the cached pages of the directory
until the initial nfs_readdir of the next 'ls' instance.

Signed-off-by: Dai Ngo
Reviewed-by: Trond Myklebust
[Anna - Fix conflicts with Trond's readdir patches]
[Anna - Remove redundant call to nfs_zap_mapping()]
[Anna - Replace d_inode(file_dentry(desc->file)) with file_inode(desc->file)]
Signed-off-by: Anna Schumaker

Dai Ngo
2020-02-04 23:50:44 +0800
93a6ab7b6 NFS: Switch readdir to using iterate_shared()

Now that the page cache locking is repaired, we should be able to
switch to using iterate_shared() for improved concurrency when
doing readdir().

Signed-off-by: Trond Myklebust
Reviewed-by: Benjamin Coddington
Signed-off-by: Anna Schumaker

Trond Myklebust
2020-02-04 05:37:51 +0800
3803d6721 NFS: Use kmemdup_nul() in nfs_readdir_make_qstr()

The directory strings stored in the readdir cache may be used with
printk(), so it is better to ensure they are nul-terminated.
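
The idea of a "duplicate and nul-terminate" helper can be sketched in
userspace as follows. dup_nul() is a hypothetical name for this model; it is
not the kernel helper and makes no claim about how the kernel implements it.

```c
#include <stdint.h>
#include <stdlib.h>
#include <string.h>

/*
 * Copy len bytes from src, which need not be terminated, into a new
 * buffer and append a trailing '\0' so the result is safe to print as a
 * C string. Returns NULL on failure; the caller frees the result.
 */
char *dup_nul(const char *src, size_t len)
{
	char *p;

	if (len == SIZE_MAX)
		return NULL;	/* len + 1 would overflow */
	p = malloc(len + 1);
	if (!p)
		return NULL;
	memcpy(p, src, len);
	p[len] = '\0';
	return p;
}
```

A plain byte copy of the same length would leave the result unterminated,
which is unsafe for any consumer that expects a C string.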

Signed-off-by: Trond Myklebust
Reviewed-by: Benjamin Coddington
Signed-off-by: Anna Schumaker

Trond Myklebust
2020-02-04 05:37:45 +0800
114de3822 NFS: Directory page cache pages need to be locked when read

When an NFS directory page cache page is removed from the page cache,
its contents are freed through a call to nfs_readdir_clear_array().
To prevent the removal of the page cache entry until after we've
finished reading it, we must take the page lock.

Fixes: 11de3b11e08c ("NFS: Fix a memory leak in nfs_readdir")
Cc: stable@vger.kernel.org # v2.6.37+
Signed-off-by: Trond Myklebust
Reviewed-by: Benjamin Coddington
Signed-off-by: Anna Schumaker

Trond Myklebust
2020-02-04 05:37:17 +0800
4b310319c NFS: Fix memory leaks and corruption in readdir

nfs_readdir_xdr_to_array() must not exit without having initialised
the array, so that the page cache deletion routines can safely
call nfs_readdir_clear_array().
Furthermore, we should ensure that if we exit nfs_readdir_filler()
with an error, we free up any page contents to prevent a leak
if we try to fill the page again.

Fixes: 11de3b11e08c ("NFS: Fix a memory leak in nfs_readdir")
Cc: stable@vger.kernel.org # v2.6.37+
Signed-off-by: Trond Myklebust
Reviewed-by: Benjamin Coddington
Signed-off-by: Anna Schumaker

Trond Myklebust
2020-02-04 05:35:17 +0800
9a206de2e NFS: nfs_access_get_cached_rcu() should use cred_fscmp()

We do not need to have the rcu lookup method fail in the case where
the fsuid/fsgid and supplemental groups match.

Signed-off-by: Trond Myklebust
Signed-off-by: Anna Schumaker

Trond Myklebust
2020-02-04 05:35:07 +0800

25 Jan, 2020

1 commit

f7b37b8b1 NFS: Add softreval behaviour to nfs_lookup_revalidate()

If the server is unavailable, we want to allow the revalidating
lookup to time out, and to default to validating the cached dentry
if the 'softreval' mount option is set.

Signed-off-by: Trond Myklebust
Signed-off-by: Anna Schumaker

Trond Myklebust
2020-01-25 05:51:13 +0800

15 Jan, 2020

2 commits

5c965db86 NFS: Trust cached access if we've already revalidated the inode once

If we've already revalidated the inode once then don't distrust the
access cache unless the NFS_INO_INVALID_ACCESS flag is actually set.

Signed-off-by: Trond Myklebust
Signed-off-by: Anna Schumaker

Trond Myklebust
2020-01-15 23:54:33 +0800
e8194b7dd NFS: Improve tracing of permission calls

On exit from nfs_do_access(), record the mask representing the requested
permissions, as well as the server-supplied set of access rights for
this user.

Signed-off-by: Trond Myklebust
Signed-off-by: Anna Schumaker

Trond Myklebust
2020-01-15 23:54:33 +0800

27 Sep, 2019

1 commit

972a2bf7d Merge tag 'nfs-for-5.4-1' of git://git.linux-nfs.org/projects/anna/linux-nfs

Pull NFS client updates from Anna Schumaker:
"Stable bugfixes:
- Dequeue the request from the receive queue while we're re-encoding
# v4.20+
- Fix buffer handling of GSS MIC without slack # 5.1

Features:
- Increase xprtrdma maximum transport header and slot table sizes
- Add support for nfs4_call_sync() calls using a custom
rpc_task_struct
- Optimize the default readahead size
- Enable pNFS filelayout LAYOUTGET on OPEN

Other bugfixes and cleanups:
- Fix possible null-pointer dereferences and memory leaks
- Various NFS over RDMA cleanups
- Various NFS over RDMA comment updates
- Don't receive TCP data into a reset request buffer
- Don't try to parse incomplete RPC messages
- Fix congestion window race with disconnect
- Clean up pNFS return-on-close error handling
- Fixes for NFS4ERR_OLD_STATEID handling"

* tag 'nfs-for-5.4-1' of git://git.linux-nfs.org/projects/anna/linux-nfs: (53 commits)
pNFS/filelayout: enable LAYOUTGET on OPEN
NFS: Optimise the default readahead size
NFSv4: Handle NFS4ERR_OLD_STATEID in LOCKU
NFSv4: Handle NFS4ERR_OLD_STATEID in CLOSE/OPEN_DOWNGRADE
NFSv4: Fix OPEN_DOWNGRADE error handling
pNFS: Handle NFS4ERR_OLD_STATEID on layoutreturn by bumping the state seqid
NFSv4: Add a helper to increment stateid seqids
NFSv4: Handle RPC level errors in LAYOUTRETURN
NFSv4: Handle NFS4ERR_DELAY correctly in return-on-close
NFSv4: Clean up pNFS return-on-close error handling
pNFS: Ensure we do clear the return-on-close layout stateid on fatal errors
NFS: remove unused check for negative dentry
NFSv3: use nfs_add_or_obtain() to create and reference inodes
NFS: Refactor nfs_instantiate() for dentry referencing callers
SUNRPC: Fix congestion window race with disconnect
SUNRPC: Don't try to parse incomplete RPC messages
SUNRPC: Rename xdr_buf_read_netobj to xdr_buf_read_mic
SUNRPC: Fix buffer handling of GSS MIC without slack
SUNRPC: RPC level errors should always set task->tk_rpc_status
SUNRPC: Don't receive TCP data into a request buffer that has been reset
...

Linus Torvalds
2019-09-27 03:20:14 +0800

21 Sep, 2019

2 commits

581057c83 NFS: remove unused check for negative dentry

This check has been hanging out since we used to have parallel paths to add
dentry in nfs_create(), but that hasn't been the case for some years.

Signed-off-by: Benjamin Coddington
Signed-off-by: Anna Schumaker

Benjamin Coddington
2019-09-21 03:15:24 +0800
406cd9153 NFS: Refactor nfs_instantiate() for dentry referencing callers

Since commit b0c6108ecf64 ("nfs_instantiate(): prevent multiple aliases for
directory inode"), nfs_instantiate() may succeed without actually
instantiating the dentry that was passed in. That can be problematic for
some callers in NFSv3, so this patch breaks things up so we can get the
actual dentry obtained.

Signed-off-by: Benjamin Coddington
Signed-off-by: Anna Schumaker

Benjamin Coddington
2019-09-21 03:15:24 +0800

19 Aug, 2019

1 commit

9821421a2 NFSv4: Fix return value in nfs_finish_open()

If the file turns out to be of the wrong type after opening, we want
to revalidate the path and retry, so return EOPENSTALE rather than
ESTALE.

Signed-off-by: Trond Myklebust

Trond Myklebust
2019-08-19 20:56:04 +0800

13 Jul, 2019

1 commit

db531db95 Revert "NFS: readdirplus optimization by cache mechanism" (memleak)

This reverts commit be4c2d4723a4a637f0d1b4f7c66447141a4b3564.

That commit caused a severe memory leak in nfs_readdir_make_qstr().

When listing a directory with more than 100 files (this is how many
struct nfs_cache_array_entry elements fit in one 4kB page), all
allocated file name strings past those 100 leak.

The root of the leakage is that those string pointers are managed in
pages which are never linked into the page cache.

fs/nfs/dir.c puts pages into the page cache by calling
read_cache_page(); the callback function nfs_readdir_filler() will
then fill the given page struct which was passed to it, which is
already linked in the page cache (by do_read_cache_page() calling
add_to_page_cache_lru()).

Commit be4c2d4723a4 added another (local) array of allocated pages, to
be filled with more data, instead of discarding excess items received
from the NFS server. Those additional pages can be used by the next
nfs_readdir_filler() call (from within the same nfs_readdir() call).

The leak happens when some of those additional pages are never used
(copied to the page cache using copy_highpage()). The pages will be
freed by nfs_readdir_free_pages(), but their contents will not. The
commit did not invoke nfs_readdir_clear_array() (and doing so would
have been dangerous, because it did not track which of those pages
were already copied to the page cache, risking double free bugs).

How to reproduce the leak:

- Use a kernel with CONFIG_SLUB_DEBUG_ON.

- Create a directory on an NFS mount with more than 100 files with
names long enough to use the "kmalloc-32" slab (so we can easily
look up the allocation counts):

for i in `seq 110`; do touch ${i}_0123456789abcdef; done

- Drop all caches:

echo 3 >/proc/sys/vm/drop_caches

- Check the allocation counter:

grep nfs_readdir /sys/kernel/slab/kmalloc-32/alloc_calls
30564391 nfs_readdir_add_to_array+0x73/0xd0 age=534558/4791307/6540952 pid=370-1048386 cpus=0-47 nodes=0-1

- Request a directory listing and check the allocation counters again:

ls
[...]
grep nfs_readdir /sys/kernel/slab/kmalloc-32/alloc_calls
30564511 nfs_readdir_add_to_array+0x73/0xd0 age=207/4792999/6542663 pid=370-1048386 cpus=0-47 nodes=0-1

There are now 120 new allocations.

- Drop all caches and check the counters again:

echo 3 >/proc/sys/vm/drop_caches
grep nfs_readdir /sys/kernel/slab/kmalloc-32/alloc_calls
30564401 nfs_readdir_add_to_array+0x73/0xd0 age=735/4793524/6543176 pid=370-1048386 cpus=0-47 nodes=0-1

110 allocations are gone, but 10 have leaked and will never be freed.
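
The counter arithmetic in the steps above can be written out explicitly. This
is only a sketch: struct alloc_counts and the three helpers are hypothetical
names, and the inputs are the three counter values read from alloc_calls in
the procedure.

```c
/* Allocation counts read from alloc_calls at three points in the procedure. */
struct alloc_counts {
	long before;		/* after the first drop_caches, before ls */
	long after_ls;		/* after the directory listing */
	long after_drop;	/* after the second drop_caches */
};

/* Allocations made while listing the directory. */
long allocs_made(const struct alloc_counts *c)
{
	return c->after_ls - c->before;
}

/* Allocations released when the caches were dropped again. */
long allocs_freed(const struct alloc_counts *c)
{
	return c->after_ls - c->after_drop;
}

/* Allocations that survived the cache drop and are therefore leaked. */
long allocs_leaked(const struct alloc_counts *c)
{
	return c->after_drop - c->before;
}
```

With the values 30564391, 30564511 and 30564401 from the output above, these
give 120 allocations made, 110 freed and 10 leaked.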

Unhelpfully, those allocations are explicitly excluded from KMEMLEAK,
which is why my initial attempts with KMEMLEAK were not successful:

/*
 * Avoid a kmemleak false positive. The pointer to the name is stored
 * in a page cache page which kmemleak does not scan.
 */
kmemleak_not_leak(string->name);

It would be possible to solve this bug without reverting the whole
commit:

- keep track of which pages were not used, and call
nfs_readdir_clear_array() on them, or
- manually link those pages into the page cache

But for now I have decided to just revert the commit, because the real
fix would require complex considerations, risking more dangerous
(crash) bugs, which may seem unsuitable for the stable branches.

Signed-off-by: Max Kellermann
Cc: stable@vger.kernel.org # v5.1+
Signed-off-by: Trond Myklebust

Max Kellermann
2019-07-13 04:01:37 +0800

07 Jul, 2019

1 commit

1c341b777 NFS: Add deferred cache invalidation for close-to-open consistency violations

If the client detects that close-to-open cache consistency has been
violated, and that the file or directory has been changed on the
server, then do a cache invalidation when we're done working with
the file.
The reason we don't do an immediate cache invalidation is that we
want to avoid performance problems due to false positives. Also,
note that we cannot guarantee cache consistency in this situation
even if we do invalidate the cache.

Signed-off-by: Trond Myklebust

Trond Myklebust
2019-07-07 02:54:49 +0800

21 May, 2019

1 commit

457c89965 treewide: Add SPDX license identifier for missed files

Add SPDX license identifiers to all files which:

- Have no license information of any form

- Have EXPORT_.*_SYMBOL_GPL inside which was used in the
initial scan/conversion to ignore the file

These files fall under the project license, GPL v2 only. The resulting SPDX
license identifier is:

GPL-2.0-only

Signed-off-by: Thomas Gleixner
Signed-off-by: Greg Kroah-Hartman

Thomas Gleixner
2019-05-21 16:50:45 +0800

10 May, 2019

1 commit

a46126ccc nfs: pass the correct prototype to read_cache_page

Fix the callbacks NFS passes to read_cache_page to actually have the
proper type expected. Casting around function pointers can easily
hide typing bugs, and defeats control flow protection.
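
The general pattern can be sketched as follows, assuming a generic helper
that takes a callback with an opaque context pointer. All names here
(struct page as defined below, struct fill_ctx, filler_t, read_page_with(),
ctx_filler()) are hypothetical and exist only for this illustration.

```c
/* Hypothetical page and per-call context types used only in this sketch. */
struct page {
	int filled;
};

struct fill_ctx {
	int value;
};

/* The callback type that the generic helper expects. */
typedef int (*filler_t)(void *data, struct page *page);

/* Generic helper: invoke the filler with an opaque context pointer. */
int read_page_with(filler_t filler, void *data, struct page *page)
{
	return filler(data, page);
}

/*
 * The callback is declared with exactly the expected prototype and
 * converts the opaque pointer inside its body, so no function pointer
 * cast is needed at the call site.
 */
int ctx_filler(void *data, struct page *page)
{
	struct fill_ctx *ctx = data;

	page->filled = ctx->value;
	return 0;
}
```

Because the callback matches filler_t exactly, the compiler can check the
call, and indirect-call checking schemes see a consistent function type.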

Signed-off-by: Christoph Hellwig
Signed-off-by: Anna Schumaker

Christoph Hellwig
2019-05-10 04:26:57 +0800

21 Feb, 2019

4 commits

bf211ca1a NFS: Fix typo in comments of nfs_readdir_alloc_pages()

This fixes the typos in the comments of nfs_readdir_alloc_pages(), which
still referred to nfs_readdir_large_page and nfs_readdir_free_pagearray
after those functions had been renamed.

Signed-off-by: Liguang Zhang
Signed-off-by: Trond Myklebust

zhangliguang
2019-02-21 06:33:55 +0800
42f72cf36 NFS: Remove redundant semicolon

This removes a redundant semicolon at the end of a statement.

Fixes: c7944ebb9ce9 ("NFSv4: Fix lookup revalidate of regular files")
Signed-off-by: Liguang Zhang
Signed-off-by: Trond Myklebust

zhangliguang
2019-02-21 06:33:55 +0800
be4c2d472 NFS: readdirplus optimization by cache mechanism

When listing very large directories via NFS, clients may take a long
time to complete. There are three main factors involved:

First of all, ls and practically every other method of listing a
directory, including Python's os.listdir and find, rely on libc readdir().
However, readdir() only reads 32K of directory entries at a time, which
means that if you have a lot of files in the same directory, it is going
to take a very long time to read all the directory entries.

Secondly, libc readdir() reads 32K of directory entries at a time, and in
kernel space the 32K buffer is split into 8 pages. One NFS readdirplus RPC
is issued for each page, which results in many readdirplus RPC calls.

Lastly, one NFS readdirplus RPC asks for 32K of data (filled by nfs_dentry)
to fill one page (filled by dentry); we found that nearly one third of
the data was wasted.

To solve the above problems, a page cache mechanism is introduced. One NFS
readdirplus RPC asks for a large amount of data (more than 32K), which can
fill more than one page, and the cached pages can be used for the next
readdir call. This reduces the number of readdirplus RPC calls and
improves readdirplus performance.

TESTING:
Listing a very large directory (containing 300 thousand files) via NFS:

time ls -l /nfs_mount | wc -l

without the patch:
300001
real 1m53.524s
user 0m2.314s
sys 0m2.599s

with the patch:
300001
real 0m23.487s
user 0m2.305s
sys 0m2.558s

Improved performance: 79.6%
readdirplus rpc calls decrease: 85%
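
For reference, the relative reduction in elapsed time can be computed from
the "real" timings listed above. This is a sketch; to_seconds() and
reduction_percent() are hypothetical helper names, and the formula is the
usual (before - after) / before.

```c
/* Convert a "real" time reported as minutes and seconds into seconds. */
double to_seconds(int minutes, double seconds)
{
	return minutes * 60.0 + seconds;
}

/* Relative reduction in elapsed time, as a percentage of the baseline. */
double reduction_percent(double before, double after)
{
	return (before - after) / before * 100.0;
}
```

Applied to 1m53.524s and 0m23.487s, this formula yields roughly 79.3%, close
to but not exactly the 79.6% quoted, which may have been derived from a
different run or rounding.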

Signed-off-by: Liguang Zhang
Signed-off-by: Trond Myklebust

luanshi
2019-02-21 06:33:55 +0800
302fad7bd NFS: Fix up documentation warnings

Fix up some compiler warnings about function parameters, etc. not being
correctly described or formatted.

Signed-off-by: Trond Myklebust

Trond Myklebust
2019-02-21 04:14:21 +0800

20 Dec, 2018

2 commits

684f39b4c NFS: struct nfs_open_dir_context: convert rpc_cred pointer to cred.

Use the common 'struct cred' to pass credentials for readdir.

Signed-off-by: NeilBrown
Signed-off-by: Anna Schumaker

NeilBrown
2018-12-20 02:52:46 +0800
b68572e07 NFS: change access cache to use 'struct cred'.

Rather than keying the access cache with 'struct rpc_cred',
use 'struct cred'. Then use cred_fscmp() to compare
credentials rather than comparing the raw pointer.

A benefit of this approach is that in the common case we avoid the
rpc_lookup_cred_nonblock() call which can be slow when the cred cache is large.
This also keeps many fewer items pinned in the rpc cred cache, so the
cred cache is less likely to get large.
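
The difference between comparing credentials by pointer and by content can be
illustrated with a simplified userspace model. struct fs_cred, cmp_uint() and
fs_cred_cmp() are hypothetical; the sketch assumes only that the relevant
identity is the fsuid, the fsgid and a sorted list of supplementary groups,
and it is not the kernel's cred_fscmp() implementation.

```c
#include <stddef.h>

/* Hypothetical, simplified model of the filesystem-relevant credential data. */
struct fs_cred {
	unsigned int fsuid;
	unsigned int fsgid;
	size_t ngroups;
	const unsigned int *groups;	/* supplementary groups, assumed sorted */
};

/* Three-way comparison of two unsigned values: returns -1, 0 or 1. */
static int cmp_uint(unsigned int a, unsigned int b)
{
	return (a > b) - (a < b);
}

/*
 * Compare two credentials by content rather than by address.
 * Returns 0 if they are equivalent for filesystem access purposes,
 * and a negative or positive value to give a stable ordering otherwise.
 */
int fs_cred_cmp(const struct fs_cred *a, const struct fs_cred *b)
{
	size_t i;
	int r;

	if (a == b)
		return 0;	/* same object: trivially equal */
	r = cmp_uint(a->fsuid, b->fsuid);
	if (r)
		return r;
	r = cmp_uint(a->fsgid, b->fsgid);
	if (r)
		return r;
	if (a->ngroups != b->ngroups)
		return a->ngroups < b->ngroups ? -1 : 1;
	for (i = 0; i < a->ngroups; i++) {
		r = cmp_uint(a->groups[i], b->groups[i]);
		if (r)
			return r;
	}
	return 0;
}
```

Two distinct objects with the same identity compare equal, so a cache keyed
this way can hit without first looking up a canonical credential pointer.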

Signed-off-by: NeilBrown
Signed-off-by: Anna Schumaker

NeilBrown
2018-12-20 02:52:45 +0800

01 Oct, 2018

1 commit

c7944ebb9 NFSv4: Fix lookup revalidate of regular files

If we're revalidating an existing dentry in order to open a file, we need
to ensure that we check the directory has not changed before we optimise
away the lookup.

Signed-off-by: Trond Myklebust

Trond Myklebust
2018-10-01 03:35:18 +0800