Eric Lee / smarc-fsl-linux-kernel

31 Jul, 2015

12 commits

23d8335d7 tipc: remove implicit message delivery in node_unlock() ... Browse Code »

After the most recent changes, all access calls to a link which
may entail addition of messages to the link's input queue are
postpended by an explicit call to tipc_sk_rcv(), using a reference
to the correct queue.

This means that the potentially hazardous implicit delivery, using
tipc_node_unlock() in combination with a binary flag and a cached
queue pointer, now has become redundant.

This commit removes this implicit delivery mechanism both for regular
data messages and for binding table update messages.

Tested-by: Ying Xue
Signed-off-by: Jon Maloy
Signed-off-by: David S. Miller

Jon Paul Maloy
2015-07-31 08:25:14 +0800
598411d70 tipc: make resetting of links non-atomic ... Browse Code »

In order to facilitate future improvements to the locking structure, we
want to make resetting and establishing of links non-atomic. I.e., the
functions tipc_node_link_up() and tipc_node_link_down() should be called
from outside the node lock context, and grab/release the node lock
themselves. This requires that we can freeze the link state from the
moment it is set to RESETTING or PEER_RESET in one lock context until
it is set to RESET or ESTABLISHING in a later context. The recently
introduced link FSM makes this possible, so we are now ready to introduce
the above change.

This commit implements this.

Tested-by: Ying Xue
Signed-off-by: Jon Maloy
Signed-off-by: David S. Miller

Jon Paul Maloy
2015-07-31 08:25:14 +0800
cf148816a tipc: move received discovery data evaluation inside node.c ... Browse Code »

The node lock is currently grabbed and and released in the function
tipc_disc_rcv() in the file discover.c. As a preparation for the next
commits, we need to move this node lock handling, along with the code
area it is covering, to node.c.

This commit introduces this change.

Tested-by: Ying Xue
Signed-off-by: Jon Maloy
Signed-off-by: David S. Miller

Jon Paul Maloy
2015-07-31 08:25:14 +0800
662921cd0 tipc: merge link->exec_mode and link->state into one FSM ... Browse Code »

Until now, we have been handling link failover and synchronization
by using an additional link state variable, "exec_mode". This variable
is not independent of the link FSM state, something causing a risk of
inconsistencies, apart from the fact that it clutters the code.

The conditions are now in place to define a new link FSM that covers
all existing use cases, including failover and synchronization, and
eliminate the "exec_mode" field altogether. The FSM must also support
non-atomic resetting of links, which will be introduced later.

The new link FSM is shown below, with 7 states and 8 events.
Only events leading to state change are shown as edges.

+------------------------------------+
|RESET_EVT |
| |
| +--------------+
| +-----------------| SYNCHING |-----------------+
| |FAILURE_EVT +--------------+ PEER_RESET_EVT|
| | A | |
| | | | |
| | | | |
| | |SYNCH_ |SYNCH_ |
| | |BEGIN_EVT |END_EVT |
| | | | |
| V | V V
| +-------------+ +--------------+ +------------+
| | RESETTING || PEER_RESET |
| +-------------+ FAILURE_ +--------------+ PEER_ +------------+
| | EVT | A RESET_EVT |
| | | | |
| | | | |
| | +--------------+ | |
| RESET_EVT| |RESET_EVT |ESTABLISH_EVT |
| | | | |
| | | | |
| V V | |
| +-------------+ +--------------+ RESET_EVT|
+--->| RESET |--------->| ESTABLISHING |
Signed-off-by: Jon Maloy
Signed-off-by: David S. Miller

Jon Paul Maloy
2015-07-31 08:25:14 +0800
5045f7b90 tipc: move protocol message sending away from link FSM ... Browse Code »

The implementation of the link FSM currently takes decisions about and
sends out link protocol messages. This is unnecessary, since such
actions are not the result of any link state change, and are even
decided based on non-FSM state information ("silent_intv_cnt").

We now move the sending of unicast link protocol messages to the
function tipc_link_timeout(), and the initial broadcast synchronization
message to tipc_node_link_up(). The latter is done because a link
instance should not need to know whether it is the first or second
link to a destination. Such information is now restricted to and
handled by the link aggregation layer in node.c

Tested-by: Ying Xue
Signed-off-by: Jon Maloy
Signed-off-by: David S. Miller

Jon Paul Maloy
2015-07-31 08:25:14 +0800
6e498158a tipc: move link synch and failover to link aggregation level ... Browse Code »

Link failover and synchronization have until now been handled by the
links themselves, forcing them to have knowledge about and to access
parallel links in order to make the two algorithms work correctly.

In this commit, we move the control part of this functionality to the
link aggregation level in node.c, which is the right location for this.
As a result, the two algorithms become easier to follow, and the link
implementation becomes simpler.

Tested-by: Ying Xue
Signed-off-by: Jon Maloy
Signed-off-by: David S. Miller

Jon Paul Maloy
2015-07-31 08:25:14 +0800
66996b6c4 tipc: extend node FSM ... Browse Code »

In the next commit, we will move link synch/failover orchestration to
the link aggregation level. In order to do this, we first need to extend
the node FSM with two more states, NODE_SYNCHING and NODE_FAILINGOVER,
plus four new events to enter and leave those states.

This commit introduces this change, without yet making use of it.
The node FSM now looks as follows:

+-----------------------------------------+
| PEER_DOWN_EVT|
| |
+------------------------+----------------+ |
|SELF_DOWN_EVT | | |
| | | |
| +-----------+ +-----------+ |
| |NODE_ | |NODE_ | |
| +----------|FAILINGOVER|| SELF_UP_ ||SELF_LEAVING|
+------------+ SELF_ +-----------+ +-----------+ PEER_ +------------+
| DOWN_EVT A A DOWN_EVT |
| | | |
| | | |
| SELF_UP_EVT| |PEER_UP_EVT |
| | | |
| | | |
|PEER_DOWN_EVT +--------------+ SELF_DOWN_EVT|
+------------------->| SELF_DOWN_ |
Signed-off-by: Jon Maloy
Signed-off-by: David S. Miller

Jon Paul Maloy
2015-07-31 08:25:13 +0800
655fb243b tipc: reverse call order for link_reset()->node_link_down() ... Browse Code »

In many cases the call order when a link is reset goes as follows:
tipc_node_xx()->tipc_link_reset()->tipc_node_link_down()

This is not the right order if we want the node to be in control,
so in this commit we change the order to:
tipc_node_xx()->tipc_node_link_down()->tipc_link_reset()

The fact that tipc_link_reset() now is called from only one
location with a well-defined state will also facilitate later
simplifications of tipc_link_reset() and the link FSM.

Tested-by: Ying Xue
Signed-off-by: Jon Maloy
Signed-off-by: David S. Miller

Jon Paul Maloy
2015-07-31 08:25:13 +0800
6144a996a tipc: move all link_reset() calls to link aggregation level ... Browse Code »

In line with our effort to let the node level have full control over
its links, we want to move all link reset calls from link.c to node.c.
Some of the calls can be moved by simply moving the calling function,
when this is the right thing to do. For the remaining calls we use
the now established technique of returning a TIPC_LINK_DOWN_EVT
flag from tipc_link_rcv(), whereafter we perform the reset call when
the call returns.

This change serves as a preparation for the coming commits.

Tested-by: Ying Xue
Signed-off-by: Jon Maloy
Signed-off-by: David S. Miller

Jon Paul Maloy
2015-07-31 08:25:13 +0800
cbeb83ca6 tipc: eliminate function tipc_link_activate() ... Browse Code »

The function tipc_link_activate() is redundant, since it mostly performs
settings that have already been done in a preceding tipc_link_reset().

There are three exceptions to this:
- The actual state change to TIPC_LINK_WORKING. This should anyway be done
in the FSM, and not in a separate function.
- Registration of the link with the bearer. This should be done by the
node, since we don't want the link to have any knowledge about its
specific bearer.
- Call to tipc_node_link_up() for user access registration. With the new
role distribution between link aggregation and link level this becomes
the wrong call order; tipc_node_link_up() should instead be called
directly as a result of a TIPC_LINK_UP event, hence by the node itself.

This commit implements those changes.

Tested-by: Ying Xue
Signed-off-by: Jon Maloy
Signed-off-by: David S. Miller

Jon Paul Maloy
2015-07-31 08:25:13 +0800
29a3060aa Merge branch 'for-upstream' of git://git.kernel.org/pub/scm/linux/kernel/git/blu… ... Browse Code »

…etooth/bluetooth-next

Johan Hedberg says:

====================
pull request: bluetooth-next 2015-07-30

Here's a set of Bluetooth & 802.15.4 patches intended for the 4.3 kernel.

- Cleanups & fixes to mac802154
- Refactoring of Intel Bluetooth HCI driver
- Various coding style fixes to Bluetooth HCI drivers
- Support for Intel Lightning Peak Bluetooth devices
- Generic class code in interface descriptor in btusb to match more HW
- Refactoring of Bluetooth HS code together with a new config option
- Support for BCM4330B1 Broadcom UART controller

Let me know if there are any issues pulling. Thanks.
====================

Signed-off-by: David S. Miller <davem@davemloft.net>

David S. Miller
2015-07-31 07:16:43 +0800
8013d1d7e net/ipv6: add sysctl option accept_ra_min_hop_limit ... Browse Code »

Commit 6fd99094de2b ("ipv6: Don't reduce hop limit for an interface")
disabled accept hop limit from RA if it is smaller than the current hop
limit for security stuff. But this behavior kind of break the RFC definition.

RFC 4861, 6.3.4. Processing Received Router Advertisements
A Router Advertisement field (e.g., Cur Hop Limit, Reachable Time,
and Retrans Timer) may contain a value denoting that it is
unspecified. In such cases, the parameter should be ignored and the
host should continue using whatever value it is already using.

If the received Cur Hop Limit value is non-zero, the host SHOULD set
its CurHopLimit variable to the received value.

So add sysctl option accept_ra_min_hop_limit to let user choose the minimum
hop limit value they can accept from RA. And set default to 1 to meet RFC
standards.

Signed-off-by: Hangbin Liu
Acked-by: YOSHIFUJI Hideaki
Signed-off-by: David S. Miller

Hangbin Liu
2015-07-31 06:56:40 +0800

30 Jul, 2015

15 commits

5857d1dba Bluetooth: 6lowpan: Fix possible race ... Browse Code »

This patch fix a possible race after calling register_netdev. After
calling netdev_register it could be possible that netdev_ops callbacks
use the uninitialized private data of lowpan_dev. By moving the
initialization of this data before netdev_register we can be sure that
initialized private data is be used after netdev_register.

Signed-off-by: Alexander Aring
Signed-off-by: Marcel Holtmann

Alexander Aring
2015-07-30 20:11:36 +0800
c22ff7b4e mac802154: Fix memory corruption with global deferred transmit state. ... Browse Code »

When transmitting a packet via a mac802154 driver that can sleep in
its transmit function, mac802154 defers the call to the driver's
transmit function to a per-device workqueue.

However, mac802154 uses a single global work_struct for this, which
means that if you have more than one registered mac802154 interface
in the system, and you transmit on more than one of them at the same
time, you'll very easily cause memory corruption.

This patch moves the deferred transmit processing state from global
variables to struct ieee802154_local, and this seems to fix the memory
corruption issue.

Signed-off-by: Lennert Buytenhek
Acked-by: Alexander Aring
Signed-off-by: Marcel Holtmann

Lennert Buytenhek
2015-07-30 20:08:55 +0800
54c9ee399 Bluetooth: cmtp: Do not use list_for_each_safe when not needed ... Browse Code »

There is no need to use the safe version of list_for_each here.

Signed-off-by: Christophe JAILLET
Signed-off-by: Marcel Holtmann

Christophe JAILLET
2015-07-30 19:50:35 +0800
df9b89c7e Bluetooth: Move create/accept phy link completed callback to amp.c ... Browse Code »

To avoid amp module hooks from hci_event.c

Signed-off-by: Arron Wang
Signed-off-by: Marcel Holtmann

Arron Wang
2015-07-30 19:37:22 +0800
b3d391400 Bluetooth: Move amp assoc read/write completed callback to amp.c ... Browse Code »

To avoid amp module hooks from hci_event.c

Signed-off-by: Arron Wang
Signed-off-by: Marcel Holtmann

Arron Wang
2015-07-30 19:37:22 +0800
839278823 Bluetooth: Move get info completed callback to a2mp.c ... Browse Code »

To avoid a2mp module hooks from hci_event.c and send
getinfo response operation only required by a2mp module,
we can move this callback to a2mp.c

Signed-off-by: Arron Wang
Signed-off-by: Marcel Holtmann

Arron Wang
2015-07-30 19:37:22 +0800
a77a6a14e Bluetooth: Move high speed specific event under BT_HS option ... Browse Code »

Signed-off-by: Arron Wang
Signed-off-by: Marcel Holtmann

Arron Wang
2015-07-30 19:31:59 +0800
244bc3775 Bluetooth: Add BT_HS config option ... Browse Code »

Move A2MP Module under BT_HS config option and allow
the user have flexible option to choose the feature only
they need

a2mp_discover_amp() & a2mp_channel_create() are a2mp module
entry point for master and slave, and this is dynamic
invoked depends on the userspace or remote request, then
we defined their implementation depends on BT_HS config

Signed-off-by: Arron Wang
Signed-off-by: Marcel Holtmann

Arron Wang
2015-07-30 19:31:59 +0800
dcc38c033 openvswitch: Re-add CONFIG_OPENVSWITCH_VXLAN ... Browse Code »

This readds the config option CONFIG_OPENVSWITCH_VXLAN to avoid a
hard dependency of OVS on VXLAN. It moves the VXLAN config compat
code to vport-vxlan.c and allows compliation as a module.

Fixes: 614732eaa12d ("openvswitch: Use regular VXLAN net_device device")
Fixes: 2661371ace96 ("openvswitch: fix compilation when vxlan is a module")
Cc: Pravin B Shelar
Cc: Nicolas Dichtel
Signed-off-by: Thomas Graf
Acked-by: Pravin B Shelar
Signed-off-by: David S. Miller

Thomas Graf
2015-07-30 14:03:10 +0800
1ce4b2f44 net: pktgen: Remove unused 'allocated_skbs' field ... Browse Code »

Field pktgen_dev.allocated_skbs had been written to, but never read
from. The number of allocated skbs can be deduced anyway, from the total
number of sent packets and the 'clone_skb' param.

Signed-off-by: Bogdan Hamciuc
Signed-off-by: David S. Miller

Bogdan Hamciuc
2015-07-30 14:00:39 +0800
879c7220e net: pktgen: Observe needed_headroom of the device ... Browse Code »

Allocate enough space so as not to force the outgoing net device to do
skb_realloc_headroom().

Signed-off-by: Bogdan Hamciuc
Signed-off-by: David S. Miller

Bogdan Hamciuc
2015-07-30 14:00:39 +0800
92a99bf3b lwtunnel: Make lwtun_encaps[] static ... Browse Code »

Any external user should use the registration API instead of
accessing this directly.

Cc: Roopa Prabhu
Signed-off-by: Thomas Graf
Acked-by: Roopa Prabhu
Signed-off-by: David S. Miller

Thomas Graf
2015-07-30 13:59:39 +0800
877d1f629 net: Set sk_txhash from a random number ... Browse Code »

This patch creates sk_set_txhash and eliminates protocol specific
inet_set_txhash and ip6_set_txhash. sk_set_txhash simply sets a
random number instead of performing flow dissection. sk_set_txash
is also allowed to be called multiple times for the same socket,
we'll need this when redoing the hash for negative routing advice.

Signed-off-by: Tom Herbert
Signed-off-by: David S. Miller

Tom Herbert
2015-07-30 13:44:04 +0800
5a4c35522 tipc: fix bug in broadcast synch message create function ... Browse Code »

In commit d999297c3dbbe7fdd832f7fa4ec84301e170b3e6
("tipc: reduce locking scope during packet reception") we introduced
a new function tipc_build_bcast_sync_msg(), which carries initial
synchronization data between two nodes at first contact and at
re-contact. In this function, we missed to add synchronization data,
with the effect that the broadcast link endpoints will fail to
synchronize correctly at re-contact between a running and a restarted
node. All other cases work as intended.

With this commit, we fix this bug.

Signed-off-by: Jon Maloy
Signed-off-by: David S. Miller

Jon Maloy
2015-07-30 07:48:16 +0800
73d0fcf2f packet: remove handling of tx_ring from prb_shutdown_retire_blk_timer() ... Browse Code »

Follow e8e85cc5eb57 ("packet: remove handling of tx_ring") and remove
the tx_ring parameter from prb_shutdown_retire_blk_timer() as it is only
called with tx_ring = 0.

Signed-off-by: Tobias Klauser
Signed-off-by: David S. Miller

Tobias Klauser
2015-07-30 06:11:12 +0800

27 Jul, 2015

13 commits

e11f40b93 lwtunnel: use kfree_skb() instead of vanilla kfree() ... Browse Code »

kfree_skb() is correct here.

Fixes: ffce41962ef6 ('lwtunnel: support dst output redirect function')
Signed-off-by: Dan Carpenter
Signed-off-by: David S. Miller

Dan Carpenter
2015-07-27 16:24:59 +0800
99d7662a0 tcp: tso: allow deferring under reordering state ... Browse Code »

While doing experiments with reordering resilience, we found
linux senders were not able to send at full speed under reordering,
because every incoming SACK was releasing one MSS.

This patch removes the limitation, as we did for CWR state
in commit a0ea700e409 ("tcp: tso: allow CA_CWR state in
tcp_tso_should_defer()")

Neal Cardwell had a concern about limited transmit so
Yuchung conducted experiments on GFE and found nothing
worth adding an extra check on fast path :

if (icsk->icsk_ca_state == TCP_CA_Disorder &&
tcp_sk(sk)->reordering == sysctl_tcp_reordering)
goto send_now;

Signed-off-by: Eric Dumazet
Signed-off-by: Yuchung Cheng
Cc: Neal Cardwell
Acked-by: Neal Cardwell
Signed-off-by: David S. Miller

Eric Dumazet
2015-07-27 16:23:20 +0800
8d6c31bf5 ipv6: Avoid rt6_probe() taking writer lock in the fast path ... Browse Code »

The patch checks neigh->nud_state before acquiring the writer lock.
Note that rt6_probe() is only used in CONFIG_IPV6_ROUTER_PREF.

40 udpflood processes and a /64 gateway route are used.
The gateway has NUD_PERMANENT. Each of them is run for 30s.
At the end, the total number of finished sendto():

Before: 55M
After: 95M

Signed-off-by: Martin KaFai Lau
Cc: Hannes Frederic Sowa
CC: Julian Anastasov
CC: YOSHIFUJI Hideaki
Signed-off-by: David S. Miller

Martin KaFai Lau
2015-07-27 16:08:25 +0800
990edb428 ipv6: Re-arrange code in rt6_probe() ... Browse Code »

It is a prep work for the next patch to remove write_lock
from rt6_probe().

1. Reduce the number of if(neigh) check. From 4 to 1.
2. Bring the write_(un)lock() closer to the operations that the
lock is protecting.

Hopefully, the above make rt6_probe() more readable.

Signed-off-by: Martin KaFai Lau
Cc: Hannes Frederic Sowa
Cc: Julian Anastasov
Cc: YOSHIFUJI Hideaki
Signed-off-by: David S. Miller

Martin KaFai Lau
2015-07-27 16:08:25 +0800
5a6228a0b lwtunnel: change prototype of lwtunnel_state_get() ... Browse Code »

It saves some lines and simplify a bit the code when the state is returning
by this function. It's also useful to handle a NULL entry.

To avoid too long lines, I've also renamed lwtunnel_state_get() and
lwtunnel_state_put() to lwtstate_get() and lwtstate_put().

CC: Thomas Graf
CC: Roopa Prabhu
Signed-off-by: Nicolas Dichtel
Acked-by: Thomas Graf
Acked-by: Roopa Prabhu
Signed-off-by: David S. Miller

Nicolas Dichtel
2015-07-27 16:02:49 +0800
d94365950 ipv6: copy lwtstate in ip6_rt_copy_init() ... Browse Code »

We need to copy this field (ip6_rt_cache_alloc() and ip6_rt_pcpu_alloc()
use ip6_rt_copy_init() to build a dst).

CC: Thomas Graf
CC: Roopa Prabhu
Fixes: 19e42e451506 ("ipv6: support for fib route lwtunnel encap attributes")
Signed-off-by: Nicolas Dichtel
Acked-by: Thomas Graf
Acked-by: Roopa Prabhu
Signed-off-by: David S. Miller

Nicolas Dichtel
2015-07-27 16:02:49 +0800
6673a9f4e ipv6: use lwtunnel_output6() only if flag redirect is set ... Browse Code »

This function make sense only when LWTUNNEL_STATE_OUTPUT_REDIRECT is set.
The check is already done in IPv4.

CC: Thomas Graf
CC: Roopa Prabhu
Fixes: 74a0f2fe8ed5 ("ipv6: rt6_info output redirect to tunnel output")
Signed-off-by: Nicolas Dichtel
Acked-by: Thomas Graf
Acked-by: Roopa Prabhu
Signed-off-by: David S. Miller

Nicolas Dichtel
2015-07-27 16:00:51 +0800
b469139e8 dev: Spelling fix in comments ... Browse Code »

Fix the following typo
- unchainged -> unchanged

Signed-off-by: Subash Abhinov Kasiviswanathan
Signed-off-by: David S. Miller

subashab@codeaurora.org
2015-07-27 15:54:57 +0800
949f1e39a bridge: mdb: notify on router port add and del ... Browse Code »

Send notifications on router port add and del/expire, re-use the already
existing MDBA_ROUTER and send NEWMDB/DELMDB netlink notifications
respectively.

Signed-off-by: Satish Ashok
Signed-off-by: Nikolay Aleksandrov
Signed-off-by: David S. Miller

Satish Ashok
2015-07-27 12:19:58 +0800
597798e43 openvswitch: Retrieve tunnel metadata when receiving from vport-netdev ... Browse Code »

Retrieve the tunnel metadata for packets received by a net_device and
provide it to ovs_vport_receive() for flow key extraction.

[This hunk was in the GRE patch in the initial series and missed the
cut for the initial submission for merging.]

Fixes: 614732eaa12d ("openvswitch: Use regular VXLAN net_device device")
Signed-off-by: Thomas Graf
Acked-by: Pravin B Shelar
Signed-off-by: David S. Miller

Thomas Graf
2015-07-27 12:19:11 +0800
2661371ac openvswitch: fix compilation when vxlan is a module ... Browse Code »

With CONFIG_VXLAN=m and CONFIG_OPENVSWITCH=y, there was the following
compilation error:
LD init/built-in.o
net/built-in.o: In function `vxlan_tnl_create':
.../net/openvswitch/vport-netdev.c:322: undefined reference to `vxlan_dev_create'
make: *** [vmlinux] Error 1

CC: Thomas Graf
Fixes: 614732eaa12d ("openvswitch: Use regular VXLAN net_device device")
Signed-off-by: Nicolas Dichtel
Acked-by: Thomas Graf
Signed-off-by: David S. Miller

Nicolas Dichtel
2015-07-27 11:56:55 +0800
88f643203 ipv4: be more aggressive when probing alternative gateways ... Browse Code »

Currently, we do not notice if new alternative gateways
are added. We can do it by checking for present neigh
entry. Also, gateways that are currently probed (NUD_INCOMPLETE)
can be skipped from round-robin probing.

Suggested-by: Florian Westphal
Signed-off-by: Julian Anastasov
Signed-off-by: David S. Miller

Julian Anastasov
2015-07-27 11:56:27 +0800
48fb6b554 ipv6: fix crash over flow-based vxlan device ... Browse Code »

Similar check was added in ip_rcv but not in ipv6_rcv.

BUG: unable to handle kernel NULL pointer dereference at (null)
IP: [] ipv6_rcv+0xfa/0x500
Call Trace:
[] ? ip_rcv+0x296/0x400
[] ? packet_rcv+0x52/0x410
[] __netif_receive_skb_core+0x63f/0x9a0
[] ? br_handle_frame_finish+0x580/0x580 [bridge]
[] ? update_rq_clock.part.81+0x1c/0x40
[] __netif_receive_skb+0x18/0x60
[] process_backlog+0x9f/0x150

Fixes: ee122c79d422 (vxlan: Flow based tunneling)
Signed-off-by: Wei-Chun Chao
Acked-by: Thomas Graf
Signed-off-by: David S. Miller

Wei-Chun Chao
2015-07-27 11:54:56 +0800