18 May, 2010

26 commits


17 May, 2010

1 commit


16 May, 2010

13 commits

  • Now we have a set of nested attributes:

    IFLA_VFINFO_LIST (NESTED)
    IFLA_VF_INFO (NESTED)
    IFLA_VF_MAC
    IFLA_VF_VLAN
    IFLA_VF_TX_RATE

    This allows a single set to operate on multiple attributes if desired.
    Among other things, it means a dump can be replayed to set state.

    The current interface has yet to be released, so this seems like
    something to consider for 2.6.34.

    Signed-off-by: Chris Wright
    Signed-off-by: David S. Miller

    Chris Wright
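
The nesting described in this commit can be sketched in user space as a length/type encoding in the style of netlink attributes. This is a minimal illustration under stated assumptions, not the kernel's implementation: the `EX_*` type numbers, the `ex_*` helpers, and the simplified payloads (a bare MAC and two integers) are all hypothetical and do not match the kernel's structures.

```c
#include <stdint.h>
#include <string.h>

/* Hypothetical type numbers for illustration; not the kernel's values. */
enum { EX_VFINFO_LIST = 1, EX_VF_INFO, EX_VF_MAC, EX_VF_VLAN, EX_VF_TX_RATE };

/* Same shape as a netlink attribute header: total length, then type. */
struct ex_attr { uint16_t len; uint16_t type; };

/* Attributes start on 4-byte boundaries. */
static size_t ex_align(size_t n) { return (n + 3) & ~(size_t)3; }

/* Append one leaf attribute and zero its padding; returns the new offset. */
static size_t ex_put(uint8_t *buf, size_t off, uint16_t type,
                     const void *data, uint16_t dlen)
{
    struct ex_attr a = { (uint16_t)(sizeof(a) + dlen), type };
    size_t used = sizeof(a) + dlen;

    memcpy(buf + off, &a, sizeof(a));
    memcpy(buf + off + sizeof(a), data, dlen);
    memset(buf + off + used, 0, ex_align(used) - used);
    return off + ex_align(used);
}

/* Open a nested attribute: write a header and remember where it starts. */
static size_t ex_nest_start(uint8_t *buf, size_t off, uint16_t type,
                            size_t *start)
{
    struct ex_attr a = { 0, type };

    memcpy(buf + off, &a, sizeof(a));
    *start = off;
    return off + sizeof(a);
}

/* Close a nested attribute: patch its length to cover everything inside. */
static void ex_nest_end(uint8_t *buf, size_t off, size_t start)
{
    uint16_t len = (uint16_t)(off - start);

    memcpy(buf + start, &len, sizeof(len));
}

/* One list holding one VF entry that carries MAC, VLAN and TX rate together. */
static size_t ex_build_one_vf(uint8_t *buf)
{
    const uint8_t mac[6] = { 0x02, 0, 0, 0, 0, 0x01 };
    uint32_t vlan = 100, rate = 1000;
    size_t list, info, off = 0;

    off = ex_nest_start(buf, off, EX_VFINFO_LIST, &list);
    off = ex_nest_start(buf, off, EX_VF_INFO, &info);
    off = ex_put(buf, off, EX_VF_MAC, mac, sizeof(mac));
    off = ex_put(buf, off, EX_VF_VLAN, &vlan, sizeof(vlan));
    off = ex_put(buf, off, EX_VF_TX_RATE, &rate, sizeof(rate));
    ex_nest_end(buf, off, info);
    ex_nest_end(buf, off, list);
    return off;
}
```

Because every per-VF setting sits inside one entry of the list, a single message can carry several settings for several VFs, which is what makes replaying a dump as a set request plausible.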
     
  • Synchronize access to the driver's configuration interface.
    Also do not allow configuration changes during online/offline
    transition.

    Signed-off-by: Frank Blaschka
    Signed-off-by: David S. Miller

    Frank Blaschka
     
  • z/OS may activate Optimized Latency Mode (OLM) for a connection
    through an OSA Express3 adapter, which reduces the number of
    allowed concurrent connections if the adapter is used in shared mode.
    Create a meaningful message if activation of an OSA connection fails
    due to an active OLM connection on the shared OSA adapter.

    Signed-off-by: Ursula Braun
    Signed-off-by: Frank Blaschka
    Signed-off-by: David S. Miller

    Ursula Braun
     
  • OSA supports HW TX checksumming in layer 3 mode. Enable this
    feature and remove the software fallback used for TSO. Clean up the
    checksum bits to indicate that OSA can do checksumming only for
    IPv4 TCP and UDP.

    Signed-off-by: Frank Blaschka
    Signed-off-by: David S. Miller

    Frank Blaschka
     
  • The transport may be freed before the ICMP proto unreachable timer
    expires, so we should delete the active ICMP proto unreachable timer
    when the transport is going away.

    Signed-off-by: Wei Yongjun
    Acked-by: Vlad Yasevich
    Signed-off-by: David S. Miller

    Wei Yongjun
     
  • vlan/macvlan start_xmit() can inform the caller of congestion with the
    NET_XMIT_CN return value. This doesn't mean the packet was dropped.
    Increment the normal stat counters instead of tx_dropped.

    Signed-off-by: Eric Dumazet
    Acked-by: Patrick McHardy
    Signed-off-by: David S. Miller

    Eric Dumazet
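
The accounting rule in this commit can be sketched as follows. The `EX_XMIT_*` constants mirror the kernel's NET_XMIT_* codes but are redefined locally so the sketch compiles in user space; `struct ex_stats` and `ex_account_tx()` are hypothetical names, not the driver's code.

```c
/* Local stand-ins mirroring the kernel's NET_XMIT_* return codes. */
enum { EX_XMIT_SUCCESS = 0x00, EX_XMIT_DROP = 0x01, EX_XMIT_CN = 0x02 };

struct ex_stats { unsigned long tx_packets, tx_bytes, tx_dropped; };

/* Congestion notification still means the packet was accepted, so it is
 * counted as transmitted; anything else is counted as a drop. */
static void ex_account_tx(struct ex_stats *st, int ret, unsigned int len)
{
    if (ret == EX_XMIT_SUCCESS || ret == EX_XMIT_CN) {
        st->tx_packets++;
        st->tx_bytes += len;
    } else {
        st->tx_dropped++;
    }
}
```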
     
  • TCP-MD5 sessions have intermittent failures when the route cache is
    invalidated. ip_queue_xmit() has to find a new route and calls
    sk_setup_caps(sk, &rt->u.dst), destroying the

    sk->sk_route_caps &= ~NETIF_F_GSO_MASK

    that MD5 desperately tries to apply all along its path (from
    tcp_transmit_skb(), for example).

    So we send a few bad packets, and everything is fine when
    tcp_transmit_skb() is called again for this socket.

    Since ip_queue_xmit() is at a lower level than TCP-MD5, I chose to use a
    socket field, sk_route_nocaps, containing bits to mask on sk_route_caps.

    Reported-by: Bhaskar Dutta
    Signed-off-by: Eric Dumazet
    Signed-off-by: David S. Miller

    Eric Dumazet
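
The idea of keeping a per-socket mask that survives route changes can be sketched like this. It is a simplified user-space model: the `EX_F_*` bit values, `struct ex_sock`, and both helpers are hypothetical and only imitate the roles of sk_route_caps, sk_route_nocaps and sk_setup_caps() named in the commit.

```c
#include <stdint.h>

/* Hypothetical feature bits; the kernel's NETIF_F_* values differ. */
#define EX_F_SG        (1u << 0)
#define EX_F_CSUM      (1u << 1)
#define EX_F_GSO_MASK  (0xffu << 16)

struct ex_sock {
    uint32_t route_caps;    /* features usable on the current route */
    uint32_t route_nocaps;  /* features this socket must never use */
};

/* Record forbidden capabilities and drop them from the current set. */
static void ex_nocaps_add(struct ex_sock *sk, uint32_t flags)
{
    sk->route_nocaps |= flags;
    sk->route_caps &= ~flags;
}

/* Called when a new route is attached: start from the device features,
 * then re-apply the per-socket mask so it is not lost. */
static void ex_setup_caps(struct ex_sock *sk, uint32_t dev_features)
{
    sk->route_caps = dev_features;
    sk->route_caps &= ~sk->route_nocaps;
}
```

With the mask stored on the socket, the lower layer no longer needs to know why a capability is forbidden; it only has to apply the mask each time it recomputes the capabilities.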
     
  • TCP MD5 support uses percpu data for temporary storage. It currently
    disables preemption so that the same storage cannot be reclaimed by
    another thread on the same CPU.

    We also have to make sure a softirq handler won't try to use the same
    context. Various bug reports demonstrated corruption.

    The fix is to disable both preemption and BH.

    Reported-by: Bhaskar Dutta
    Signed-off-by: Eric Dumazet
    Signed-off-by: David S. Miller

    Eric Dumazet
     
  • With RPS inclusion, skb timestamping is not consistent in the RX path.

    If netif_receive_skb() is used, it's deferred until after RPS dispatch.

    If netif_rx() is used, it's done before RPS dispatch.

    This can give strange tcpdump timestamp results.

    I think timestamping should be done as soon as possible in the receive
    path, to get meaningful values (i.e. timestamps taken at the time the
    packet was delivered by the NIC driver to our stack), even if NAPI can
    already defer timestamping a bit (RPS can help to reduce the gap).

    Tom Herbert prefers to sample timestamps after RPS dispatch. In case
    sampling is expensive (HPET/acpi_pm on x86), this makes sense.

    Let admins switch from one mode to the other using a new
    sysctl, /proc/sys/net/core/netdev_tstamp_prequeue

    Its default value (1) means timestamps are taken as soon as possible,
    before backlog queueing, giving accurate timestamps.

    Setting it to 0 samples timestamps when processing the backlog,
    after RPS dispatch, to lower the load on the pre-RPS CPU.

    Signed-off-by: Eric Dumazet
    Signed-off-by: David S. Miller

    Eric Dumazet
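
The two modes can be modelled in a few lines. This is a sketch under simplifying assumptions: `ex_tstamp_prequeue` stands in for the sysctl value, the clock is passed in as a plain number, and the `ex_*` functions are hypothetical stand-ins for the early receive path and the backlog processing step, not kernel functions.

```c
#include <stdint.h>

struct ex_pkt { uint64_t tstamp; };   /* 0 means "not stamped yet" */

static int ex_tstamp_prequeue = 1;     /* stand-in for the sysctl value */

/* Stamp a packet once; later calls leave the first timestamp alone. */
static void ex_stamp(struct ex_pkt *p, uint64_t now)
{
    if (p->tstamp == 0)
        p->tstamp = now;
}

/* Early receive path, before the packet is queued to its target CPU. */
static void ex_rx_enqueue(struct ex_pkt *p, uint64_t now)
{
    if (ex_tstamp_prequeue)
        ex_stamp(p, now);
}

/* Backlog processing on the target CPU, after RPS dispatch. */
static void ex_rx_backlog(struct ex_pkt *p, uint64_t now)
{
    if (!ex_tstamp_prequeue)
        ex_stamp(p, now);
}
```

In this model a packet received at time 10 and processed from the backlog at time 25 is stamped 10 with the setting at 1 and 25 with the setting at 0, which is the accuracy versus load trade-off the commit describes.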
     
  • I mistakenly had the error path use num_pols to decide how
    many policies we need to drop (cruft from an earlier version of the
    patch set, which did not handle socket policies right).

    This is wrong since normally we do not keep explicit references
    (instead we hold a reference to the cache entry, which holds references
    to the policies). drop_pols is set to num_pols if we are holding the
    references, so use that. Otherwise we eventually BUG_ON inside
    xfrm_policy_destroy due to premature policy deletion.

    Signed-off-by: Timo Teras
    Signed-off-by: David S. Miller

    Timo Teras
     
  • Now there's a null check here and again in the hook. Looking at the
    bridge bits, which are similar, the port structure is rcu_dereferenced
    right away in handle_bridge and passed to the hook. Looks nicer.

    Signed-off-by: Jiri Pirko
    Acked-by: Patrick McHardy
    Signed-off-by: David S. Miller

    Jiri Pirko
     
  • This replaces the PCI DMA state API (include/linux/pci-dma.h) with the
    DMA equivalents, since the PCI DMA state API will become obsolete.

    No functional change.

    For further information about the background:

    http://marc.info/?l=linux-netdev&m=127037540020276&w=2

    Signed-off-by: FUJITA Tomonori
    Acked-by: Stephen Hemminger
    Signed-off-by: David S. Miller

    FUJITA Tomonori
     
  • (Dropped the InfiniBand part because Tetsuo modified the related code;
    I will send a separate patch for it once this is accepted.)

    This patch introduces /proc/sys/net/ipv4/ip_local_reserved_ports which
    allows users to reserve ports for third-party applications.

    The reserved ports will not be used by automatic port assignments
    (e.g. when calling connect() or bind() with port number 0). Explicit
    port allocation behavior is unchanged.

    Signed-off-by: Octavian Purdila
    Signed-off-by: WANG Cong
    Cc: Neil Horman
    Cc: Eric Dumazet
    Cc: Eric W. Biederman
    Signed-off-by: David S. Miller

    Amerigo Wang
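
The behaviour described for reserved ports can be sketched with a bitmap that automatic assignment consults and explicit requests ignore. Everything here is hypothetical illustration: the `ex_*` names, the linear scan, and the range arguments are assumptions, and a real allocator would also skip ports that are already in use.

```c
#include <limits.h>

#define EX_NPORTS 65536
#define EX_BITS   (sizeof(unsigned long) * CHAR_BIT)

/* One bit per port; a set bit marks the port as reserved. */
static unsigned long ex_reserved[EX_NPORTS / EX_BITS];

static void ex_reserve(unsigned int port)
{
    ex_reserved[port / EX_BITS] |= 1UL << (port % EX_BITS);
}

static int ex_is_reserved(unsigned int port)
{
    return (int)((ex_reserved[port / EX_BITS] >> (port % EX_BITS)) & 1UL);
}

/* Automatic assignment (the "port 0" case): scan the local range starting
 * at hint, wrapping around, and skip reserved ports. Returns 0 if every
 * port in the range is reserved. */
static unsigned int ex_pick_port(unsigned int low, unsigned int high,
                                 unsigned int hint)
{
    unsigned int span = high - low + 1;

    for (unsigned int i = 0; i < span; i++) {
        unsigned int port = low + (hint - low + i) % span;

        if (!ex_is_reserved(port))
            return port;
    }
    return 0;
}

/* Explicit requests are not filtered: naming a reserved port still works. */
static unsigned int ex_bind_explicit(unsigned int port)
{
    return port;
}
```

Keeping the check only in the automatic path matches the commit's statement that explicit port allocation behavior is unchanged.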