Eric Lee / smarc-fsl-linux-kernel

29 Jul, 2016

1 commit

6784725ab Merge branch 'work.misc' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs

Pull vfs updates from Al Viro:
"Assorted cleanups and fixes.

Probably the most interesting part long-term is ->d_init() - that will
have a bunch of followups in (at least) ceph and lustre, but we'll
need to sort the barrier-related rules before it can get used for
really non-trivial stuff.

Another fun thing is the merge of ->d_iput() callers (dentry_iput()
and dentry_unlink_inode()) and a bunch of ->d_compare() ones (all
except the one in __d_lookup_lru())"

* 'work.misc' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs: (26 commits)
fs/dcache.c: avoid soft-lockup in dput()
vfs: new d_init method
vfs: Update lookup_dcache() comment
bdev: get rid of ->bd_inodes
Remove last traces of ->sync_page
new helper: d_same_name()
dentry_cmp(): use lockless_dereference() instead of smp_read_barrier_depends()
vfs: clean up documentation
vfs: document ->d_real()
vfs: merge .d_select_inode() into .d_real()
unify dentry_iput() and dentry_unlink_inode()
binfmt_misc: ->s_root is not going anywhere
drop redundant ->owner initializations
ufs: get rid of redundant checks
orangefs: constify inode_operations
missed comment updates from ->direct_IO() prototype change
file_inode(f)->i_mapping is f->f_mapping
trim fsnotify hooks a bit
9p: new helper - v9fs_parent_fid()
debugfs: ->d_parent is never NULL or negative
...

Linus Torvalds
2016-07-29 03:59:05 +0800

27 Jul, 2016

2 commits

0e06f5c0d Merge branch 'akpm' (patches from Andrew)

Merge updates from Andrew Morton:

- a few misc bits

- ocfs2

- most(?) of MM

* emailed patches from Andrew Morton: (125 commits)
thp: fix comments of __pmd_trans_huge_lock()
cgroup: remove unnecessary 0 check from css_from_id()
cgroup: fix idr leak for the first cgroup root
mm: memcontrol: fix documentation for compound parameter
mm: memcontrol: remove BUG_ON in uncharge_list
mm: fix build warnings in
mm, thp: convert from optimistic swapin collapsing to conservative
mm, thp: fix comment inconsistency for swapin readahead functions
thp: update Documentation/{vm/transhuge,filesystems/proc}.txt
shmem: split huge pages beyond i_size under memory pressure
thp: introduce CONFIG_TRANSPARENT_HUGE_PAGECACHE
khugepaged: add support of collapse for tmpfs/shmem pages
shmem: make shmem_inode_info::lock irq-safe
khugepaged: move up_read(mmap_sem) out of khugepaged_alloc_page()
thp: extract khugepaged from mm/huge_memory.c
shmem, thp: respect MADV_{NO,}HUGEPAGE for file mappings
shmem: add huge pages support
shmem: get_unmapped_area align huge page
shmem: prepare huge= mount option and sysfs knob
mm, rmap: account shmem thp pages
...

Linus Torvalds
2016-07-27 10:55:54 +0800
8a5c743e3 mm, memcg: use consistent gfp flags during readahead

Vladimir has noticed that we might declare memcg oom even during
readahead because read_pages only uses GFP_KERNEL (with mapping_gfp
restriction) while __do_page_cache_readahead uses
page_cache_alloc_readahead which adds __GFP_NORETRY to prevent
OOMs. This gfp mask discrepancy is really unfortunate and easily
fixable. Drop page_cache_alloc_readahead() which only has one user and
outsource the gfp_mask logic into readahead_gfp_mask and propagate this
mask from __do_page_cache_readahead down to read_pages.

This alone would have only very limited impact as most filesystems are
implementing ->readpages and the common implementation mpage_readpages
does GFP_KERNEL (with mapping_gfp restriction) again. We can tell it to
use readahead_gfp_mask instead as this function is called only during
readahead as well. The same applies to read_cache_pages.

ext4 has its own ext4_mpage_readpages but the path which has pages !=
NULL can use the same gfp mask. Btrfs, cifs, f2fs and orangefs
follow a very similar pattern to mpage_readpages so the same can be
applied to them as well.
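
The mask propagation described above can be modelled in user space. The following is a sketch only: the flag values, the `struct address_space_model` type and every `model_` name are hypothetical stand-ins, and only the idea (compute the readahead mask once from the mapping's mask plus a no-retry flag, then pass it down) is taken from the message.

```c
#include <assert.h>

typedef unsigned int gfp_model_t;

/* Hypothetical flag values, chosen only for this illustration. */
#define MODEL_GFP_KERNEL   0x01u
#define MODEL_GFP_FS       0x02u
#define MODEL_GFP_NORETRY  0x10u

struct address_space_model {
	gfp_model_t gfp_mask;	/* per-mapping allocation restrictions */
};

/* Stand-in for mapping_gfp_mask(): the mask the mapping allows. */
gfp_model_t model_mapping_gfp_mask(const struct address_space_model *m)
{
	return m->gfp_mask;
}

/* One helper computes the readahead mask so that every caller agrees. */
gfp_model_t model_readahead_gfp_mask(const struct address_space_model *m)
{
	return model_mapping_gfp_mask(m) | MODEL_GFP_NORETRY;
}

/* Model of read_pages(): it receives the mask instead of recomputing one. */
gfp_model_t model_read_pages(const struct address_space_model *m, gfp_model_t gfp)
{
	(void)m;
	assert(gfp & MODEL_GFP_NORETRY);	/* readahead must not trigger OOM */
	return gfp;				/* the mask used for allocations */
}

/* Model of __do_page_cache_readahead(): compute once, pass it down. */
gfp_model_t model_do_readahead(const struct address_space_model *m)
{
	gfp_model_t gfp = model_readahead_gfp_mask(m);

	return model_read_pages(m, gfp);
}
```

With a single helper, the allocation in the top-level readahead path and the allocations further down can no longer disagree about the no-retry flag.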

[akpm@linux-foundation.org: coding-style fixes]
[mhocko@suse.com: restrict gfp mask in mpage_alloc]
Link: http://lkml.kernel.org/r/20160610074223.GC32285@dhcp22.suse.cz
Link: http://lkml.kernel.org/r/1465301556-26431-1-git-send-email-mhocko@kernel.org
Signed-off-by: Michal Hocko
Cc: Vladimir Davydov
Cc: Chris Mason
Cc: Steve French
Cc: Theodore Ts'o
Cc: Jan Kara
Cc: Mike Marshall
Cc: Jaegeuk Kim
Cc: Changman Lee
Cc: Chao Yu
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

Michal Hocko
2016-07-27 07:19:19 +0800

06 Jul, 2016

5 commits

78fee0b68 orangefs: fix namespace handling

In orangefs_inode_getxattr(), an fsuid is written to dmesg. The kuid is
converted to a userspace uid via from_kuid(current_user_ns(), [...]), but
since dmesg is global, init_user_ns should be used here instead.

In copy_attributes_from_inode(), op_alloc() and fill_default_sys_attrs(),
upcall structures are populated with uids/gids that have been mapped into
the caller's namespace. However, those upcall structures are read by
another process (the userspace filesystem driver), and that process might
be running in another namespace. This effectively lets any user spoof its
uid and gid as seen by the userspace filesystem driver.

To fix the second issue, I just construct the upcall structures with
init_user_ns uids/gids and require the filesystem server to run in the
init namespace. Since orangefs is full of global state anyway (as the error
message in DUMP_DEVICE_ERROR explains, there can only be one userspace
orangefs filesystem driver at once), that shouldn't be a problem.
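
Why the choice of namespace matters can be illustrated with a small user-space model. This is a sketch under stated assumptions: `struct user_ns_model`, its single-range mapping and all `model_` names are hypothetical, and real user namespaces support several mapping ranges.

```c
#include <assert.h>
#include <stddef.h>

typedef unsigned int uid_model_t;
#define MODEL_INVALID_UID ((uid_model_t)-1)

/* Hypothetical single-range uid map: [kernel_first, kernel_first + count). */
struct user_ns_model {
	uid_model_t ns_first;		/* first uid as seen inside the namespace */
	uid_model_t kernel_first;	/* first kernel-side uid of the range */
	uid_model_t count;		/* number of ids in the range */
};

/* The initial namespace maps every valid id to itself. */
const struct user_ns_model model_init_user_ns = { 0, 0, MODEL_INVALID_UID };

/* Translate a kernel-side uid into the view of the given namespace. */
uid_model_t model_from_kuid(const struct user_ns_model *ns, uid_model_t kuid)
{
	assert(ns != NULL);
	if (kuid < ns->kernel_first || kuid - ns->kernel_first >= ns->count)
		return MODEL_INVALID_UID;
	return ns->ns_first + (kuid - ns->kernel_first);
}
```

In this model a process whose kernel-side uid is 100000 appears as uid 0 when translated through a namespace that maps 100000 onto 0, but as 100000 when translated through the initial namespace. A reader outside that namespace, such as a global log or a server running in the initial namespace, needs the second view.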

[
Why does orangefs even exist in the kernel if everything does upcalls into
userspace? What does orangefs do that couldn't be done with the FUSE
interface? If there is no good answer to those questions, I'd prefer to see
orangefs kicked out of the kernel. Can that be done for something that
shipped in a release?

According to commit f7ab093f74bf ("Orangefs: kernel client part 1"), they
even already have a FUSE daemon, and the only rational reason (apart from
"but most of our users report preferring to use our kernel module instead")
given for not wanting to use FUSE is one "in-the-works" feature that could
probably be integrated into FUSE instead.
]

This patch has been compile-tested.

Signed-off-by: Jann Horn
Signed-off-by: Mike Marshall

Jann Horn
2016-07-06 03:47:43 +0800
3903f1500 Orangefs: allow O_DIRECT in open

Signed-off-by: Mike Marshall

Mike Marshall
2016-07-06 03:47:35 +0800
d373a712c orangefs: Remove useless xattr prefix arguments

Mike,

On Fri, Jun 3, 2016 at 9:44 PM, Mike Marshall wrote:
> We use the return value in this one line you changed, our userspace code gets
> ill when we send it (-ENOMEM +1) as a key length...

ah, my mistake. Here's a fixed version.

Thanks,
Andreas

Signed-off-by: Andreas Gruenbacher
Signed-off-by: Mike Marshall

Andreas Gruenbacher
2016-07-06 03:47:27 +0800
2ce8272a1 orangefs: Remove redundant "trusted." xattr handler

Orangefs has a catch-all xattr handler that effectively does what the
trusted handler does already.

Signed-off-by: Andreas Gruenbacher
Signed-off-by: Mike Marshall

Andreas Gruenbacher
2016-07-06 03:47:22 +0800
972a7344f orangefs: Remove useless defines

The ORANGEFS_XATTR_INDEX_ defines are unused; the ORANGEFS_XATTR_NAME_
defines only obfuscate the code.

Signed-off-by: Andreas Gruenbacher
Signed-off-by: Mike Marshall

Andreas Gruenbacher
2016-07-06 03:47:16 +0800

30 May, 2016

2 commits

6f3fc1070 orangefs: constify inode_operations

Signed-off-by: Al Viro

Al Viro
2016-05-30 07:07:00 +0800
96b0cffba orangefs: don't open-code %pd2

Signed-off-by: Al Viro

Al Viro
2016-05-30 04:22:07 +0800

28 May, 2016

1 commit

593012268 switch xattr_handler->set() to passing dentry and inode separately

preparation for similar switch in ->setxattr() (see the next commit for
rationale).

Signed-off-by: Al Viro

Al Viro
2016-05-28 03:39:43 +0800

03 May, 2016

2 commits

5ecfcb265 orangefs: don't open-code inode_lock/inode_unlock

Signed-off-by: Al Viro

Al Viro
2016-05-03 07:47:23 +0800
84695ffee Merge getxattr prototype change into work.lookups

The rest of work.xattr stuff isn't needed for this branch

Al Viro
2016-05-03 07:45:47 +0800

11 Apr, 2016

1 commit

b296821a7 xattr_handler: pass dentry and inode as separate arguments of ->get()

... and do not assume they are already attached to each other

Signed-off-by: Al Viro

Al Viro
2016-04-11 08:48:24 +0800

10 Apr, 2016

1 commit

675921264 Merge tag 'for-linus-4.6-ofs1' of git://git.kernel.org/pub/scm/linux/kernel/git/hubcap/linux

Pull orangefs fixes from Mike Marshall:
"Orangefs cleanups and a strncpy vulnerability fix.

Cleanups:
- remove an unused variable from orangefs_readdir.
- clean up printk wrapper used for ofs "gossip" debugging.
- clean up truncate ctime and mtime setting in inode.c
- remove a useless null check found by coccinelle.
- optimize some memcpy/memset boilerplate code.
- remove some useless sanity checks from xattr.c

Fix:
- fix a potential strncpy vulnerability"

* tag 'for-linus-4.6-ofs1' of git://git.kernel.org/pub/scm/linux/kernel/git/hubcap/linux:
orangefs: remove unused variable
orangefs: Add KERN_<LEVEL> to gossip_<level> macros
orangefs: strncpy -> strscpy
orangefs: clean up truncate ctime and mtime setting
Orangefs: fix ifnullfree.cocci warnings
Orangefs: optimize boilerplate code.
Orangefs: xattr.c cleanup

Linus Torvalds
2016-04-10 01:33:58 +0800

09 Apr, 2016

7 commits

e56f49814 orangefs: remove unused variable

Signed-off-by: Martin Brandenburg
Signed-off-by: Mike Marshall

Martin Brandenburg
2016-04-09 03:50:44 +0800
1917a6932 orangefs: Add KERN_<LEVEL> to gossip_<level> macros

Emit the logging messages at the appropriate levels.

Miscellanea:

o Change format to fmt
o Use the more common ##__VA_ARGS__
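
The macro shape mentioned here can be sketched in user space. This is an illustration, not the orangefs macros: the `MODEL_KERN_*` prefixes, the `model_gossip*` names and the use of `snprintf` in place of `printk` are all assumptions. The `##__VA_ARGS__` form is a GNU extension that drops the preceding comma when no extra arguments are given.

```c
#include <assert.h>
#include <stdio.h>

/* Hypothetical level prefixes; the kernel's KERN_* strings differ. */
#define MODEL_KERN_ERR   "<err> "
#define MODEL_KERN_DEBUG "<debug> "

/*
 * "level fmt" relies on adjacent string literal concatenation, so fmt
 * must be a string literal. ##__VA_ARGS__ removes the trailing comma
 * when the caller passes no arguments after fmt.
 */
#define model_gossip(buf, size, level, fmt, ...) \
	snprintf(buf, size, level fmt, ##__VA_ARGS__)

#define model_gossip_err(buf, size, fmt, ...) \
	model_gossip(buf, size, MODEL_KERN_ERR, fmt, ##__VA_ARGS__)

#define model_gossip_debug(buf, size, fmt, ...) \
	model_gossip(buf, size, MODEL_KERN_DEBUG, fmt, ##__VA_ARGS__)

/* Format one error message into buf; returns the length snprintf reports. */
int model_report_slot(char *buf, size_t size, int slot)
{
	assert(buf != NULL && size > 0);
	return model_gossip_err(buf, size, "slot %d", slot);
}
```

Each wrapper fixes the level once, so call sites only supply the format string and its arguments.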

Signed-off-by: Joe Perches
Signed-off-by: Mike Marshall

Joe Perches
2016-04-09 02:10:45 +0800
2eacea74c orangefs: strncpy -> strscpy

It would have been possible for a rogue client-core to send in a symlink
target which is not NUL terminated. This returns EIO if the client-core
gives us corrupt data.

Leave debugfs and superblock code as is for now.

Other dcache.c and namei.c strncpy instances are safe because
ORANGEFS_NAME_MAX = NAME_MAX + 1; there is always enough space for a
name plus a NUL byte.
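
The difference a bounded, truncation-reporting copy makes can be shown with a user-space approximation. `model_strscpy()` below only imitates the documented behaviour of `strscpy()` (always NUL-terminate, report truncation as -E2BIG); it is not the kernel implementation, and `model_set_link_target()` is a hypothetical caller.

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>

/*
 * Copy at most size - 1 bytes and always NUL-terminate dst. Returns the
 * number of bytes copied, or -E2BIG if src did not fit. src must be
 * readable for at least size bytes or be NUL-terminated earlier.
 */
long model_strscpy(char *dst, const char *src, size_t size)
{
	size_t i;

	assert(dst != NULL && src != NULL);
	if (size == 0)
		return -E2BIG;
	for (i = 0; i < size - 1 && src[i] != '\0'; i++)
		dst[i] = src[i];
	dst[i] = '\0';
	return src[i] == '\0' ? (long)i : -E2BIG;
}

/* Hypothetical caller: an unterminated symlink target is corrupt data. */
int model_set_link_target(char *dst, size_t dst_size, const char *target)
{
	return model_strscpy(dst, target, dst_size) < 0 ? -EIO : 0;
}
```

Unlike `strncpy`, the destination is terminated even when the source is not, and the caller gets an explicit signal it can turn into an error.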

Signed-off-by: Martin Brandenburg
Signed-off-by: Mike Marshall

Martin Brandenburg
2016-04-09 02:10:34 +0800
f83140c14 orangefs: clean up truncate ctime and mtime setting

The ctime and mtime are always updated on a successful ftruncate and
only updated on a successful truncate where the size changed.

We handle the ``if the size changed'' bit.

This matches FUSE's behavior.

Signed-off-by: Martin Brandenburg
Signed-off-by: Mike Marshall

Martin Brandenburg
2016-04-09 02:10:31 +0800
2fa37fd71 Orangefs: fix ifnullfree.cocci warnings

fs/orangefs/orangefs-debugfs.c:130:2-26: WARNING: NULL check before freeing functions like kfree, debugfs_remove, debugfs_remove_recursive or usb_free_urb is not needed. Maybe consider reorganizing relevant code to avoid passing NULL values.

NULL check before some freeing functions is not needed.

Based on checkpatch warning
"kfree(NULL) is safe this check is probably not required"
and kfreeaddr.cocci by Julia Lawall.
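
The same rule holds for `free()` in user space, which makes for a compact illustration. The struct and function names below are hypothetical and only mirror the kind of cleanup the warning refers to.

```c
#include <assert.h>
#include <stdlib.h>

struct model_debug_state {
	char *help_string;	/* may be NULL if it was never allocated */
};

/* Before: the NULL test is redundant because free(NULL) is a no-op. */
void model_cleanup_checked(struct model_debug_state *s)
{
	assert(s != NULL);
	if (s->help_string)
		free(s->help_string);
	s->help_string = NULL;
}

/* After: same behaviour with one branch less. */
void model_cleanup(struct model_debug_state *s)
{
	assert(s != NULL);
	free(s->help_string);
	s->help_string = NULL;
}
```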

Generated by: scripts/coccinelle/free/ifnullfree.cocci

Signed-off-by: Fengguang Wu
Signed-off-by: Mike Marshall

kbuild test robot
2016-04-09 02:08:38 +0800
a9bb3ba81 Orangefs: optimize boilerplate code.

Suggested by David Binderman.
Of the two patterns below, the former can potentially be a performance
win over the latter.

memcpy(d, s, len);
memset(d+len, c, size-len);

memset(d, c, size);
memcpy(d, s, len);
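
Both orderings leave the buffer in the same state, which a small user-space sketch can confirm. The function names are hypothetical; the difference is that the first variant writes every byte once, while the second writes the first `len` bytes twice.

```c
#include <assert.h>
#include <string.h>

/* Copy the payload, then pad only the remaining tail. */
void model_copy_then_pad(char *d, const char *s, size_t len, size_t size, int c)
{
	assert(len <= size);
	memcpy(d, s, len);
	memset(d + len, c, size - len);
}

/* Pad the whole buffer, then overwrite its head with the payload. */
void model_pad_then_copy(char *d, const char *s, size_t len, size_t size, int c)
{
	assert(len <= size);
	memset(d, c, size);
	memcpy(d, s, len);
}
```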

Signed-off-by: Mike Marshall

Mike Marshall
2016-04-09 02:08:27 +0800
2d09a2ca6 Orangefs: xattr.c cleanup

1. It is nonsense to test for negative size_t, suggested by
David Binderman

2. By the time Orangefs gets called, the vfs has ensured that
name != NULL, and that buffer and size are sane.

Signed-off-by: Mike Marshall

Mike Marshall
2016-04-09 02:08:27 +0800

05 Apr, 2016

2 commits

4a2d057e4 Merge branch 'PAGE_CACHE_SIZE-removal'

Merge PAGE_CACHE_SIZE removal patches from Kirill Shutemov:
"PAGE_CACHE_{SIZE,SHIFT,MASK,ALIGN} macros were introduced a *long* time
ago with the promise that one day it would be possible to implement the
page cache with bigger chunks than PAGE_SIZE.

This promise never materialized, and it is unlikely that it ever will.

Let's stop pretending that pages in page cache are special. They are
not.

The first patch with most changes has been done with coccinelle. The
second is manual fixups on top.

The third patch removes the macro definitions"

[ I was planning to apply this just before rc2, but then I spaced out,
so here it is right _after_ rc2 instead.

As Kirill suggested as a possibility, I could have decided to only
merge the first two patches, and leave the old interfaces for
compatibility, but I'd rather get it all done and any out-of-tree
modules and patches can trivially do the conversion while still also
working with older kernels, so there is little reason to try to
maintain the redundant legacy model. - Linus ]

* PAGE_CACHE_SIZE-removal:
mm: drop PAGE_CACHE_* and page_cache_{get,release} definition
mm, fs: remove remaining PAGE_CACHE_* and page_cache_{get,release} usage
mm, fs: get rid of PAGE_CACHE_* and page_cache_{get,release} macros

Linus Torvalds
2016-04-05 01:50:24 +0800
09cbfeaf1 mm, fs: get rid of PAGE_CACHE_* and page_cache_{get,release} macros

PAGE_CACHE_{SIZE,SHIFT,MASK,ALIGN} macros were introduced a *long* time
ago with the promise that one day it would be possible to implement the
page cache with bigger chunks than PAGE_SIZE.

This promise never materialized, and it is unlikely that it ever will.

We have many places where PAGE_CACHE_SIZE is assumed to be equal to
PAGE_SIZE. And it's a constant source of confusion on whether a
PAGE_CACHE_* or PAGE_* constant should be used in a particular case,
especially on the border between fs and mm.

Globally switching to PAGE_CACHE_SIZE != PAGE_SIZE would cause too much
breakage to be doable.

Let's stop pretending that pages in page cache are special. They are
not.

The changes are pretty straight-forward:

- E << (PAGE_CACHE_SHIFT - PAGE_SHIFT) -> E;

- E >> (PAGE_CACHE_SHIFT - PAGE_SHIFT) -> E;

- PAGE_CACHE_{SIZE,SHIFT,MASK,ALIGN} -> PAGE_{SIZE,SHIFT,MASK,ALIGN};

- page_cache_get() -> get_page();

- page_cache_release() -> put_page();
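
What these substitutions mean for calling code can be sketched with stand-in macros. The `MODEL_` names and the 4 KiB page size are assumptions made for this illustration only; the point is that the old names were plain aliases, so every rewritten expression keeps its value.

```c
#include <assert.h>

/* Hypothetical values: 4 KiB pages, as on many (not all) configurations. */
#define MODEL_PAGE_SHIFT        12
#define MODEL_PAGE_SIZE         (1UL << MODEL_PAGE_SHIFT)
/* The old names as aliases of the new ones. */
#define MODEL_PAGE_CACHE_SHIFT  MODEL_PAGE_SHIFT
#define MODEL_PAGE_CACHE_SIZE   MODEL_PAGE_SIZE

static_assert(MODEL_PAGE_CACHE_SHIFT == MODEL_PAGE_SHIFT, "aliases must agree");

/* Before: page index of a byte offset, written with the old alias. */
unsigned long model_index_old(unsigned long long pos)
{
	return (unsigned long)(pos >> MODEL_PAGE_CACHE_SHIFT);
}

/* After: the same computation with the PAGE_* name. */
unsigned long model_index_new(unsigned long long pos)
{
	return (unsigned long)(pos >> MODEL_PAGE_SHIFT);
}

/* Before: unit conversion between the two sizes, a shift by zero. */
unsigned long model_to_page_units_old(unsigned long index)
{
	return index << (MODEL_PAGE_CACHE_SHIFT - MODEL_PAGE_SHIFT);
}

/* After: the no-op shift is dropped. */
unsigned long model_to_page_units_new(unsigned long index)
{
	return index;
}
```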

This patch contains automated changes generated with coccinelle using
script below. For some reason, coccinelle doesn't patch header files.
I've called spatch for them manually.

The only adjustment after coccinelle is a revert of the changes to the
PAGE_CACHE_ALIGN definition: we are going to drop it later.

There are a few places in the code that coccinelle didn't reach. I'll
fix them manually in a separate patch. Comments and documentation will
also be addressed in a separate patch.

virtual patch

@@
expression E;
@@
- E << (PAGE_CACHE_SHIFT - PAGE_SHIFT)
+ E

@@
expression E;
@@
- E >> (PAGE_CACHE_SHIFT - PAGE_SHIFT)
+ E

@@
@@
- PAGE_CACHE_SHIFT
+ PAGE_SHIFT

@@
@@
- PAGE_CACHE_SIZE
+ PAGE_SIZE

@@
@@
- PAGE_CACHE_MASK
+ PAGE_MASK

@@
expression E;
@@
- PAGE_CACHE_ALIGN(E)
+ PAGE_ALIGN(E)

@@
expression E;
@@
- page_cache_get(E)
+ get_page(E)

@@
expression E;
@@
- page_cache_release(E)
+ put_page(E)

Signed-off-by: Kirill A. Shutemov
Acked-by: Michal Hocko
Signed-off-by: Linus Torvalds

Kirill A. Shutemov
2016-04-05 01:41:08 +0800

01 Apr, 2016

2 commits

878dfd321 orangefs: minimum userspace version is 2.9.3

Version 2.9.4 isn't even released yet.

Signed-off-by: Martin Brandenburg

Martin Brandenburg
2016-04-01 00:06:00 +0800
641bb3246 orangefs: don't put readdir slot twice

This was quite an oversight. After a readdir, the module could not be
unloaded, the number of slots is wrong, and memory near the slot bitmap
is possibly corrupt. Oops.

Signed-off-by: Martin Brandenburg

Martin Brandenburg
2016-04-01 00:06:00 +0800

26 Mar, 2016

8 commits

45996492e orangefs: fix orangefs_superblock locking

* switch orangefs_remount() to taking ORANGEFS_SB(sb) instead of sb
* remove from the list _before_ orangefs_unmount() - request_mutex
in the latter will make sure that nothing observed in the loop in
ORANGEFS_DEV_REMOUNT_ALL handling will get freed until the end
of loop
* on removal, keep the forward pointer and zero the back one. That
way we can drop and regain the spinlock in the loop body (again,
ORANGEFS_DEV_REMOUNT_ALL one) and still be able to get to the
rest of the list.
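
The "keep the forward pointer, zero the back one" removal can be sketched on a plain circular list. This is a user-space model with hypothetical names, not the orangefs code, and it leaves out the locking that the real loop depends on.

```c
#include <assert.h>
#include <stddef.h>

struct model_node {
	struct model_node *next, *prev;
	int id;
};

/* Circular list with a sentinel head, in the style of list_head. */
void model_list_init(struct model_node *head)
{
	head->next = head->prev = head;
}

void model_list_add_tail(struct model_node *n, struct model_node *head)
{
	n->prev = head->prev;
	n->next = head;
	head->prev->next = n;
	head->prev = n;
}

/*
 * Unlink n from its neighbours but keep n->next, so a walker that is
 * standing on n can still reach the rest of the list. n->prev is
 * zeroed to mark the node as removed.
 */
void model_list_del_keep_next(struct model_node *n)
{
	assert(n->prev != NULL);	/* must not already be removed */
	n->prev->next = n->next;
	n->next->prev = n->prev;
	n->prev = NULL;
}

/* A walker can use this to skip entries that were removed meanwhile. */
int model_is_removed(const struct model_node *n)
{
	return n->prev == NULL;
}
```

A walker that dropped the lock while standing on a node can, after retaking it, test `model_is_removed()` and still follow `next` to continue the traversal, provided the removed node has not been freed in the meantime.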

Signed-off-by: Al Viro
Signed-off-by: Mike Marshall

Al Viro
2016-03-26 19:22:00 +0800
6d4c1a30b orangefs: fix do_readv_writev() handling of error halfway through

Error should only be returned if nothing had been read/written.
Otherwise we need to report a short read/write instead.
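
The rule stated here can be sketched as a chunked transfer loop. The callback type, the example callbacks and all `model_` names are hypothetical; the sketch only shows the return-value convention, not the orangefs I/O path.

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>

/* Hypothetical per-chunk transfer: bytes moved, or a negative errno. */
typedef long (*model_chunk_fn)(size_t chunk_index, size_t len);

/*
 * Transfer total bytes in chunks of at most chunk bytes. An error is
 * returned only if nothing was transferred; otherwise the bytes moved
 * so far are reported as a short read/write.
 */
long model_transfer(size_t total, size_t chunk, model_chunk_fn xfer)
{
	size_t done = 0, i = 0;

	assert(chunk > 0 && xfer != NULL);
	while (done < total) {
		size_t len = total - done < chunk ? total - done : chunk;
		long ret = xfer(i++, len);

		if (ret < 0)
			return done ? (long)done : ret;
		if (ret == 0)
			break;		/* nothing more to transfer */
		done += (size_t)ret;
	}
	return (long)done;
}

/* Example callbacks: one fails on chunk 1, the other fails at once. */
long model_fail_on_second(size_t i, size_t len)
{
	return i == 1 ? -EIO : (long)len;
}

long model_fail_at_once(size_t i, size_t len)
{
	(void)i;
	(void)len;
	return -EIO;
}
```

With 10 bytes requested in chunks of 4, a failure on the second chunk yields a short count of 4, while a failure on the first chunk yields the error itself.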

Signed-off-by: Al Viro
Signed-off-by: Mike Marshall

Al Viro
2016-03-26 10:30:54 +0800
524b1d309 orangefs: have ->kill_sb() evict the VFS side of things first

Signed-off-by: Al Viro
Signed-off-by: Mike Marshall

Al Viro
2016-03-26 10:30:54 +0800
177f8fc49 orangefs: sanitize ->llseek()

a) open files can't have NULL inodes
b) it's SEEK_END, not ORANGEFS_SEEK_END; no need to get cute.
c) make_bad_inode() on lseek()?

Signed-off-by: Al Viro
Signed-off-by: Mike Marshall

Al Viro
2016-03-26 10:30:54 +0800
7df240d77 orangefs-bufmap.h: trim unused junk

Signed-off-by: Al Viro
Signed-off-by: Mike Marshall

Al Viro
2016-03-26 10:30:54 +0800
b8a99a8f9 orangefs: saner calling conventions for getting a slot

just have it return the slot number or -E... - the caller checks
the sign anyway
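
The calling convention can be sketched with a small bitmap allocator. Everything below is a hypothetical user-space model: the slot count, the `model_` names and the choice of -EBUSY for "no free slot" are assumptions, and the real code may wait for a slot instead of failing.

```c
#include <assert.h>
#include <errno.h>

#define MODEL_SLOTS 4

struct model_slot_map {
	unsigned int used;	/* bit i set means slot i is taken */
};

/* Return a free slot number (>= 0) or a negative errno; no out-parameter. */
int model_get_slot(struct model_slot_map *m)
{
	int i;

	for (i = 0; i < MODEL_SLOTS; i++) {
		if (!(m->used & (1u << i))) {
			m->used |= 1u << i;
			return i;
		}
	}
	return -EBUSY;
}

/* Release a slot; releasing a slot that is not held is a bug. */
void model_put_slot(struct model_slot_map *m, int slot)
{
	assert(slot >= 0 && slot < MODEL_SLOTS);
	assert(m->used & (1u << slot));
	m->used &= ~(1u << slot);
}
```

A caller writes `slot = model_get_slot(&map); if (slot < 0) return slot;` and needs no separate status variable.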

Signed-off-by: Al Viro
Signed-off-by: Mike Marshall

Al Viro
2016-03-26 10:30:54 +0800
bf6bf606e orangefs_copy_{to,from}_bufmap(): don't pass bufmap pointer

it's always __orangefs_bufmap

Signed-off-by: Al Viro
Signed-off-by: Mike Marshall

Al Viro
2016-03-26 10:30:54 +0800
9f5e2f7f1 orangefs: get rid of readdir_handle_s

no point, really - we couldn't keep those across the calls of
getdents(); it would be too easy to DoS, having all slots exhausted.

Signed-off-by: Al Viro
Signed-off-by: Mike Marshall

Al Viro
2016-03-26 10:30:54 +0800

24 Mar, 2016

6 commits

fecd86aac ornagefs: ensure that truncate has an up to date inode size

Signed-off-by: Martin Brandenburg
Signed-off-by: Mike Marshall

Martin Brandenburg
2016-03-24 05:36:16 +0800
e8da254c4 orangefs: move code which sets i_link to orangefs_inode_getattr

Everything else setting inode->i_ values is in there.

Signed-off-by: Martin Brandenburg
Signed-off-by: Mike Marshall

Martin Brandenburg
2016-03-24 05:36:16 +0800
05d31c5cb orangefs: remove needless wrapper around GFP_KERNEL

Signed-off-by: Martin Brandenburg
Signed-off-by: Mike Marshall

Martin Brandenburg
2016-03-24 05:36:15 +0800
93d53a488 orangefs: remove wrapper around mutex_lock(&inode->i_mutex)

Signed-off-by: Martin Brandenburg
Signed-off-by: Mike Marshall

Martin Brandenburg
2016-03-24 05:36:15 +0800
266626339 orangefs: refactor inode type or link_target change detection

Signed-off-by: Martin Brandenburg
Signed-off-by: Mike Marshall

Martin Brandenburg
2016-03-24 05:36:15 +0800
5859d77e5 orangefs: use new getattr for revalidate and remove old getattr

Signed-off-by: Martin Brandenburg
Signed-off-by: Mike Marshall

Martin Brandenburg
2016-03-24 05:36:15 +0800