Eric Lee / smarc-fsl-linux-kernel

08 Dec, 2010

7 commits

a2ae4cc9a inotify: stop kernel memory leak on file creation failure ... Browse Code »

If inotify_init is unable to allocate a new file for the new inotify
group we leak the new group. This patch drops the reference on the
group on file allocation failure.

Reported-by: Vegard Nossum
cc: stable@kernel.org
Signed-off-by: Eric Paris

Eric Paris
2010-12-08 05:14:22 +0800
09e5f14e5 fanotify: on group destroy allow all waiters to bypass permission check ... Browse Code »

When fanotify_release() is called, there may still be processes waiting for
access permission. Currently only processes for which an event has already been
queued into the groups access list will be woken up. Processes for which no
event has been queued will continue to sleep and thus cause a deadlock when
fsnotify_put_group() is called.
Furthermore there is a race allowing further processes to be waiting on the
access wait queue after wake_up (if they arrive before clear_marks_by_group()
is called).
This patch corrects this by setting a flag to inform processes that the group
is about to be destroyed and thus not to wait for access permission.

[additional changelog from eparis]
Lets think about the 4 relevant code paths from the PoV of the
'operator' 'listener' 'responder' and 'closer'. Where operator is the
process doing an action (like open/read) which could require permission.
Listener is the task (or in this case thread) slated with reading from
the fanotify file descriptor. The 'responder' is the thread responsible
for responding to access requests. 'Closer' is the thread attempting to
close the fanotify file descriptor.

The 'operator' is going to end up in:
fanotify_handle_event()
get_response_from_access()
(THIS BLOCKS WAITING ON USERSPACE)

The 'listener' interesting code path
fanotify_read()
copy_event_to_user()
prepare_for_access_response()
(THIS CREATES AN fanotify_response_event)

The 'responder' code path:
fanotify_write()
process_access_response()
(REMOVE A fanotify_response_event, SET RESPONSE, WAKE UP 'operator')

The 'closer':
fanotify_release()
(SUPPOSED TO CLEAN UP THE REST OF THIS MESS)

What we have today is that in the closer we remove all of the
fanotify_response_events and set a bit so no more response events are
ever created in prepare_for_access_response().

The bug is that we never wake all of the operators up and tell them to
move along. You fix that in fanotify_get_response_from_access(). You
also fix other operators which haven't gotten there yet. So I agree
that's a good fix.
[/additional changelog from eparis]

[remove additional changes to minimize patch size]
[move initialization so it was inside CONFIG_FANOTIFY_PERMISSION]

Signed-off-by: Lino Sanfilippo
Signed-off-by: Eric Paris

Lino Sanfilippo
2010-12-08 05:14:22 +0800
1734dee4e fanotify: Dont allow a mask of 0 if setting or removing a mark ... Browse Code »

In mark_remove_from_mask() we destroy marks that have their event mask cleared.
Thus we should not allow the creation of those marks in the first place.
With this patch we check if the mask given from user is 0 in case of FAN_MARK_ADD.
If so we return an error. Same for FAN_MARK_REMOVE since this does not have any
effect.

Signed-off-by: Lino Sanfilippo
Signed-off-by: Eric Paris

Lino Sanfilippo
2010-12-08 05:14:21 +0800
fa218ab98 fanotify: correct broken ref counting in case adding a mark failed ... Browse Code »

If adding a mount or inode mark failed fanotify_free_mark() is called explicitly.
But at this time the mark has already been put into the destroy list of the
fsnotify_mark kernel thread. If the thread is too slow it will try to decrease
the reference of a mark, that has already been freed by fanotify_free_mark().
(If its fast enough it will only decrease the marks ref counter from 2 to 1 - note
that the counter has been increased to 2 in add_mark() - which has practically no
effect.)

This patch fixes the ref counting by not calling free_mark() explicitly, but
decreasing the ref counter and rely on the fsnotify_mark thread to cleanup in
case adding the mark has failed.

Signed-off-by: Lino Sanfilippo
Signed-off-by: Eric Paris

Lino Sanfilippo
2010-12-08 05:14:21 +0800
b1085ba80 fanotify: if set by user unset FMODE_NONOTIFY before fsnotify_perm() is called ... Browse Code »

Unsetting FMODE_NONOTIFY in fsnotify_open() is too late, since fsnotify_perm()
is called before. If FMODE_NONOTIFY is set fsnotify_perm() will skip permission
checks, so a user can still disable permission checks by setting this flag
in an open() call.
This patch corrects this by unsetting the flag before fsnotify_perm is called.

Signed-off-by: Lino Sanfilippo
Signed-off-by: Eric Paris

Lino Sanfilippo
2010-12-08 05:14:21 +0800
88d60c327 fanotify: remove packed from access response message ... Browse Code »

Since fanotify has decided to be careful about alignment and packing
rather than rely on __attribute__((packed)) for multiarch support.
Since this attribute isn't doing anything on fanotify_response we just
drop it. This does not break API/ABI.

Suggested-by: Tvrtko Ursulin
Signed-off-by: Eric Paris

Eric Paris
2010-12-08 05:14:20 +0800
ecf6f5e7d fanotify: deny permissions when no event was sent ... Browse Code »

If no event was sent to userspace we cannot expect userspace to respond to
permissions requests. Today such requests just hang forever. This patch will
deny any permissions event which was unable to be sent to userspace.

Reported-by: Tvrtko Ursulin
Signed-off-by: Eric Paris

Eric Paris
2010-12-08 05:14:17 +0800

30 Nov, 2010

14 commits

e8a7e48bb Linux 2.6.37-rc4 Browse Code »

Linus Torvalds
2010-11-30 12:42:04 +0800
32e157242 Merge branch 'merge' of git://git.kernel.org/pub/scm/linux/kernel/git/benh/powerpc ... Browse Code »

* 'merge' of git://git.kernel.org/pub/scm/linux/kernel/git/benh/powerpc:
powerpc: Use call_rcu_sched() for pagetables

Linus Torvalds
2010-11-30 12:41:39 +0800
f2e785ed5 powerpc: Use call_rcu_sched() for pagetables ... Browse Code »

PowerPC relies on IRQ-disable to guard against RCU quiecent states,
use the appropriate RCU call version.

Signed-off-by: Peter Zijlstra
Signed-off-by: Benjamin Herrenschmidt

Peter Zijlstra
2010-11-30 07:42:20 +0800
bcb38ceb2 Revert "debug_locks: set oops_in_progress if we will log messages." ... Browse Code »

This reverts commit e0fdace10e75dac67d906213b780ff1b1a4cc360.

On-list discussion seems to suggest that the robustness fixes for printk
make this unnecessary and DaveM has also agreed in person at Kernel Summit
and on list.

The main problem with this code is once we hit a lockdep splat we always
keep oops_in_progress set, the console layer uses oops_in_progress with KMS
to decide when it should be showing the oops and not showing X, so it causes
problems around suspend/resume time when a userspace resume can cause a console
switch away from X, only if oops_in_progress is set (which is what we want
if an oops actually is in progress, but not because we had a lockdep splat
2 days prior).

Cc: David S Miller
Cc: Ingo Molnar
Signed-off-by: Dave Airlie
Signed-off-by: Linus Torvalds

Dave Airlie
2010-11-30 07:18:28 +0800
8f1b1a509 Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/jmorri… ... Browse Code »

…s/security-testing-2.6

* 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/jmorris/security-testing-2.6:
tpm: Autodetect itpm devices

Linus Torvalds
2010-11-30 06:38:06 +0800
a01af8e4a Merge git://git.kernel.org/pub/scm/linux/kernel/git/davem/net-2.6 ... Browse Code »

* git://git.kernel.org/pub/scm/linux/kernel/git/davem/net-2.6: (27 commits)
af_unix: limit recursion level
pch_gbe driver: The wrong of initializer entry
pch_gbe dreiver: chang author
ucc_geth: fix ucc halt problem in half duplex mode
inet: Fix __inet_inherit_port() to correctly increment bsockets and num_owners
ehea: Add some info messages and fix an issue
hso: fix disable_net
NET: wan/x25_asy, move lapb_unregister to x25_asy_close_tty
cxgb4vf: fix setting unicast/multicast addresses ...
net, ppp: Report correct error code if unit allocation failed
DECnet: don't leak uninitialized stack byte
au1000_eth: fix invalid address accessing the MAC enable register
dccp: fix error in updating the GAR
tcp: restrict net.ipv4.tcp_adv_win_scale (#20312)
netns: Don't leak others' openreq-s in proc
Net: ceph: Makefile: Remove unnessary code
vhost/net: fix rcu check usage
econet: fix CVE-2010-3848
econet: fix CVE-2010-3850
econet: disallow NULL remote addr for sendmsg(), fixes CVE-2010-3849
...

Linus Torvalds
2010-11-30 06:36:33 +0800
a9735c81a Merge branch 'omap-fixes-for-linus' of git://git.kernel.org/pub/scm/linux/kernel… ... Browse Code »

…/git/tmlind/linux-omap-2.6

* 'omap-fixes-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tmlind/linux-omap-2.6:
OMAP2+: PM/serial: hold console semaphore while OMAP UARTs are disabled
OMAP: UART: don't resume UARTs that are not enabled.

Linus Torvalds
2010-11-30 06:36:07 +0800
3f0d3d016 tpm: Autodetect itpm devices ... Browse Code »

Some Lenovos have TPMs that require a quirk to function correctly. This can
be autodetected by checking whether the device has a _HID of INTC0102. This
is an invalid PNPid, and as such is discarded by the pnp layer - however
it's still present in the ACPI code, so we can pull it out that way. This
means that the quirk won't be automatically applied on non-ACPI systems,
but without ACPI we don't have any way to identify the chip anyway so I
don't think that's a great concern.

Signed-off-by: Matthew Garrett
Acked-by: Rajiv Andrade
Tested-by: Jiri Kosina
Tested-by: Andy Isaacson
Signed-off-by: James Morris

Matthew Garrett
2010-11-30 06:18:01 +0800
aa3fc5254 Merge git://git.kernel.org/pub/scm/linux/kernel/git/mason/btrfs-unstable ... Browse Code »

* git://git.kernel.org/pub/scm/linux/kernel/git/mason/btrfs-unstable: (24 commits)
Btrfs: don't use migrate page without CONFIG_MIGRATION
Btrfs: deal with DIO bios that span more than one ordered extent
Btrfs: setup blank root and fs_info for mount time
Btrfs: fix fiemap
Btrfs - fix race between btrfs_get_sb() and umount
Btrfs: update inode ctime when using links
Btrfs: make sure new inode size is ok in fallocate
Btrfs: fix typo in fallocate to make it honor actual size
Btrfs: avoid NULL pointer deref in try_release_extent_buffer
Btrfs: make btrfs_add_nondir take parent inode as an argument
Btrfs: hold i_mutex when calling btrfs_log_dentry_safe
Btrfs: use dget_parent where we can UPDATED
Btrfs: fix more ESTALE problems with NFS
Btrfs: handle NFS lookups properly
btrfs: make 1-bit signed fileds unsigned
btrfs: Show device attr correctly for symlinks
btrfs: Set file size correctly in file clone
btrfs: Check if dest_offset is block-size aligned before cloning file
Btrfs: handle the space_cache option properly
btrfs: Fix early enospc because 'unused' calculated with wrong sign.
...

Linus Torvalds
2010-11-30 06:11:08 +0800
555bdaefd Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/bp/bp ... Browse Code »

* 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/bp/bp:
EDAC: Fix typos in Documentation/edac.txt
EDAC, MCE: Fix edac_init_mce_inject error handling
EDAC: Remove deprecated kbuild goal definitions

Linus Torvalds
2010-11-30 06:10:44 +0800
1bfe4eefe Merge git://git.kernel.org/pub/scm/linux/kernel/git/steve/gfs2-2.6-fixes ... Browse Code »

* git://git.kernel.org/pub/scm/linux/kernel/git/steve/gfs2-2.6-fixes:
GFS2: Userland expects quota limit/warn/usage in 512b blocks

Linus Torvalds
2010-11-30 06:10:22 +0800
25888e303 af_unix: limit recursion level ... Browse Code »

Its easy to eat all kernel memory and trigger NMI watchdog, using an
exploit program that queues unix sockets on top of others.

lkml ref : http://lkml.org/lkml/2010/11/25/8

This mechanism is used in applications, one choice we have is to have a
recursion limit.

Other limits might be needed as well (if we queue other types of files),
since the passfd mechanism is currently limited by socket receive queue
sizes only.

Add a recursion_level to unix socket, allowing up to 4 levels.

Each time we send an unix socket through sendfd mechanism, we copy its
recursion level (plus one) to receiver. This recursion level is cleared
when socket receive queue is emptied.

Reported-by: Марк Коренберг
Signed-off-by: Eric Dumazet
Signed-off-by: David S. Miller

Eric Dumazet
2010-11-30 01:45:15 +0800
50a420533 pch_gbe driver: The wrong of initializer entry ... Browse Code »

The wrong of initializer entry was modified.

Signed-off-by: Toshiharu Okada
Reported-by: Dr. David Alan Gilbert
Signed-off-by: David S. Miller

Toshiharu Okada
2010-11-30 00:51:34 +0800
a1dcfcb7f pch_gbe dreiver: chang author ... Browse Code »

This driver's AUTHOR was changed to "Toshiharu Okada" from "Masayuki Ohtake".
I update the Kconfig, renamed "Topcliff" to "EG20T".

Signed-off-by: Toshiharu Okada
Signed-off-by: David S. Miller

Toshiharu Okada
2010-11-30 00:51:33 +0800

29 Nov, 2010

19 commits

5a92bc88c Btrfs: don't use migrate page without CONFIG_MIGRATION ... Browse Code »

Fixes compile error

Signed-off-by: Chris Mason

Chris Mason
2010-11-29 22:49:11 +0800
d830418e4 ucc_geth: fix ucc halt problem in half duplex mode ... Browse Code »

In commit 58933c64(ucc_geth: Fix the wrong the Rx/Tx FIFO size),
the UCC_GETH_UTFTT_INIT is set to 512 based on the recommendation
of the QE Reference Manual. But that will sometimes cause tx halt
while working in half duplex mode.

According to errata draft QE_GENERAL-A003(High Tx Virtual FIFO
threshold size can cause UCC to halt), setting UTFTT less than
[(UTFS x (M - 8)/M) - 128] will prevent this from happening
(M is the minimum buffer size).

The patch changes UTFTT back to 256.

Signed-off-by: Li Yang
Cc: Jean-Denis Boyer
Cc: Andreas Schmitz
Cc: Anton Vorontsov
Signed-off-by: David S. Miller

Yang Li
2010-11-29 10:36:57 +0800
b4ff3c90e inet: Fix __inet_inherit_port() to correctly increment bsockets and num_owners ... Browse Code »

inet sockets corresponding to passive connections are added to the bind hash
using ___inet_inherit_port(). These sockets are later removed from the bind
hash using __inet_put_port(). These two functions are not exactly symmetrical.
__inet_put_port() decrements hashinfo->bsockets and tb->num_owners, whereas
___inet_inherit_port() does not increment them. This results in both of these
going to -ve values.

This patch fixes this by calling inet_bind_hash() from ___inet_inherit_port(),
which does the right thing.

'bsockets' and 'num_owners' were introduced by commit a9d8f9110d7e953c
(inet: Allowing more than 64k connections and heavily optimize bind(0))

Signed-off-by: Nagendra Singh Tomar
Acked-by: Eric Dumazet
Acked-by: Evgeniy Polyakov
Signed-off-by: David S. Miller

Nagendra Tomar
2010-11-29 10:18:44 +0800
5c7e57f7c ehea: Add some info messages and fix an issue ... Browse Code »

This patch adds some debug information about ehea not being able to
allocate enough spaces. Also it correctly updates the amount of available
skb.

Signed-off-by: Breno Leitao
Signed-off-by: David S. Miller

Breno Leitao
2010-11-29 10:15:22 +0800
163cf09c2 Btrfs: deal with DIO bios that span more than one ordered extent ... Browse Code »

The new DIO bio splitting code has problems when the bio
spans more than one ordered extent. This will happen as the
generic DIO code merges our get_blocks calls together into
a bigger single bio.

This fixes things by walking forward in the ordered extent
code finding all the overlapping ordered extents and completing them
all at once.

Signed-off-by: Chris Mason

Chris Mason
2010-11-29 08:56:33 +0800
720836465 Un-inline get_pipe_info() helper function ... Browse Code »

This avoids some include-file hell, and the function isn't really
important enough to be inlined anyway.

Reported-by: Ingo Molnar
Signed-off-by: Linus Torvalds

Linus Torvalds
2010-11-29 08:27:19 +0800
c66fb3479 Export 'get_pipe_info()' to other users ... Browse Code »

And in particular, use it in 'pipe_fcntl()'.

The other pipe functions do not need to use the 'careful' version, since
they are only ever called for things that are already known to be pipes.

The normal read/write/ioctl functions are called through the file
operations structures, so if a file isn't a pipe, they'd never get
called. But pipe_fcntl() is special, and called directly from the
generic fcntl code, and needs to use the same careful function that the
splice code is using.

Cc: Jens Axboe
Cc: Andrew Morton
Cc: Al Viro
Cc: Dave Jones
Signed-off-by: Linus Torvalds

Linus Torvalds
2010-11-29 06:09:57 +0800
71993e62a Rename 'pipe_info()' to 'get_pipe_info()' ... Browse Code »

.. and change it to take the 'file' pointer instead of an inode, since
that's what all users want anyway.

The renaming is preparatory to exporting it to other users. The old
'pipe_info()' name was too generic and is already used elsewhere, so
before making the function public we need to use a more specific name.

Cc: Jens Axboe
Cc: Andrew Morton
Cc: Al Viro
Cc: Dave Jones
Signed-off-by: Linus Torvalds

Linus Torvalds
2010-11-29 05:56:09 +0800
a9e40a249 Merge branch 'perf-fixes-for-linus' of git://git.kernel.org/pub/scm/linux/kernel… ... Browse Code »

…/git/tip/linux-2.6-tip

* 'perf-fixes-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/linux-2.6-tip:
perf: Fix the software context switch counter
perf, x86: Fixup Kconfig deps
x86, perf, nmi: Disable perf if counters are not accessible
perf: Fix inherit vs. context rotation bug

Linus Torvalds
2010-11-29 04:25:02 +0800
75f5d2c9b Merge branch 'fwnet' of git://git.kernel.org/pub/scm/linux/kernel/git/ieee1394/linux1394-2.6 ... Browse Code »

* 'fwnet' of git://git.kernel.org/pub/scm/linux/kernel/git/ieee1394/linux1394-2.6:
firewire: net: throttle TX queue before running out of tlabels
firewire: net: replace lists by counters
firewire: net: fix memory leaks
firewire: net: count stats.tx_packets and stats.tx_bytes

Linus Torvalds
2010-11-29 04:24:20 +0800
8e65c0ece hso: fix disable_net ... Browse Code »

The HSO driver incorrectly creates a serial device instead of a net
device when disable_net is set. It shouldn't create anything for the
network interface.

Signed-off-by: Filip Aben
Reported-by: Piotr Isajew
Reported-by: Johan Hovold
Signed-off-by: David S. Miller

Filip Aben
2010-11-29 03:46:44 +0800
03fe5f3ef NET: wan/x25_asy, move lapb_unregister to x25_asy_close_tty ... Browse Code »

We register lapb when tty is created, but unregister it only when the
device is UP. So move the lapb_unregister to x25_asy_close_tty after
the device is down.

The old behaviour causes ldisc switching to fail each second attempt,
because we noted for us that the device is unused, so we use it the
second time, but labp layer still have it registered, so it fails
obviously.

Signed-off-by: Jiri Slaby
Reported-by: Sergey Lapin
Cc: Andrew Hendry
Tested-by: Sergey Lapin
Tested-by: Mikhail Ulyanov
Signed-off-by: David S. Miller

Jiri Slaby
2010-11-29 03:43:47 +0800
42eb59d3a cxgb4vf: fix setting unicast/multicast addresses ... ... Browse Code »

We were truncating the number of unicast and multicast MAC addresses
supported. Additionally, we were incorrectly computing the MAC Address
hash (a "1 << N" where we needed a "1ULL << N").

Signed-off-by: Casey Leedom
Signed-off-by: David S. Miller

Casey Leedom
2010-11-29 03:40:58 +0800
bcc70bb3a net, ppp: Report correct error code if unit allocation failed ... Browse Code »

Allocating unit from ird might return several error codes
not only -EAGAIN, so it should not be changed and returned
precisely. Same time unit release procedure should be invoked
only if device is unregistering.

Signed-off-by: Cyrill Gorcunov
CC: Paul Mackerras
Signed-off-by: David S. Miller

Cyrill Gorcunov
2010-11-29 03:33:49 +0800
3c6f27bf3 DECnet: don't leak uninitialized stack byte ... Browse Code »

A single uninitialized padding byte is leaked to userspace.

Signed-off-by: Dan Rosenberg
CC: stable
Signed-off-by: David S. Miller

Dan Rosenberg
2010-11-29 03:32:30 +0800
462ca99c2 au1000_eth: fix invalid address accessing the MAC enable register ... Browse Code »

"aup->enable" holds already the address pointing to the MAC enable
register. The bug was introduced by commit d0e7cb:

"au1000-eth: remove volatiles, switch to I/O accessors".

CC: Florian Fainelli
Signed-off-by: Wolfgang Grandegger
Acked-by: Florian Fainelli
Signed-off-by: David S. Miller

Wolfgang Grandegger
2010-11-29 03:31:22 +0800
0ac788702 dccp: fix error in updating the GAR ... Browse Code »

This fixes a bug in updating the Greatest Acknowledgment number Received (GAR):
the current implementation does not track the greatest received value -
lower values in the range AWL..AWH (RFC 4340, 7.5.1) erase higher ones.

Signed-off-by: Gerrit Renker
Signed-off-by: David S. Miller

Gerrit Renker
2010-11-29 03:29:27 +0800
a301e1703 Merge branch 'vhost-net' of git://git.kernel.org/pub/scm/linux/kernel/git/mst/vhost Browse Code »

David S. Miller
2010-11-29 03:27:44 +0800
0147fc058 tcp: restrict net.ipv4.tcp_adv_win_scale (#20312) ... Browse Code »

tcp_win_from_space() does the following:

if (sysctl_tcp_adv_win_scale > (-sysctl_tcp_adv_win_scale);
else
return space - (space >> sysctl_tcp_adv_win_scale);

"space" is int.

As per C99 6.5.7 (3) shifting int for 32 or more bits is
undefined behaviour.

Indeed, if sysctl_tcp_adv_win_scale is exactly 32,
space >> 32 equals space and function returns 0.

Which means we busyloop in tcp_fixup_rcvbuf().

Restrict net.ipv4.tcp_adv_win_scale to [-31, 31].

Fix https://bugzilla.kernel.org/show_bug.cgi?id=20312

Steps to reproduce:

echo 32 >/proc/sys/net/ipv4/tcp_adv_win_scale
wget www.kernel.org
[softlockup]

Signed-off-by: Alexey Dobriyan
Signed-off-by: David S. Miller

Alexey Dobriyan
2010-11-29 02:39:45 +0800