Eric Lee / smarc-fsl-linux-kernel

14 Jul, 2018

9 commits

d23b27c02 samples/bpf: xdp_redirect_cpu handle parsing of double VLAN tagged packets ... Browse Code »

People noticed that the code match on IEEE 802.1ad (ETH_P_8021AD) ethertype,
and this implies Q-in-Q or double tagged VLANs. Thus, we better parse
the next VLAN header too. It is even marked as a TODO.

This is relevant for real world use-cases, as XDP cpumap redirect can be
used when the NIC RSS hashing is broken. E.g. the ixgbe driver HW cannot
handle double tagged VLAN packets, and places everything into a single
RX queue. Using cpumap redirect, users can redistribute traffic across
CPUs to solve this, which is faster than the network stacks RPS solution.

It is left as an exerise how to distribute the packets across CPUs. It
would be convenient to use the RX hash, but that is not _yet_ exposed
to XDP programs. For now, users can code their own hash, as I've demonstrated
in the Suricata code (where Q-in-Q is handled correctly).

Reported-by: Florian Maury
Reported-by: Marek Majkowski
Signed-off-by: Jesper Dangaard Brouer
Signed-off-by: Daniel Borkmann

Jesper Dangaard Brouer
2018-07-14 06:52:54 +0800
ee15f7cdf Merge branch 'bpf-xdp-driver-and-hw' ... Browse Code »

Jakub Kicinski says:

====================
This set is adding support for loading driver and offload XDP
at the same time. This enables advanced use cases where some
of the work is offloaded to the NIC and some is done by the host.
Separate netlink attributes are added for each mode of operation.
Driver callbacks for offload are cleaned up a little, including
removal of .prog_attached flag.
====================

Acked-by: Alexei Starovoitov
Signed-off-by: Daniel Borkmann

Daniel Borkmann
2018-07-14 03:54:57 +0800
5f4284015 nfp: add support for simultaneous driver and hw XDP ... Browse Code »

Split handling of offloaded and driver programs completely. Since
offloaded programs always come with XDP_FLAGS_HW_MODE set in reality
there could be no sharing, anyway, programs would only be installed
in driver or in hardware. Splitting the handling allows us to install
programs in HW and in driver at the same time.

Signed-off-by: Jakub Kicinski
Reviewed-by: Quentin Monnet
Signed-off-by: Daniel Borkmann

Jakub Kicinski
2018-07-14 02:26:35 +0800
99dadb6e3 selftests/bpf: add test for multiple programs ... Browse Code »

Add tests for having an XDP program attached in the driver and
another one attached in HW simultaneously.

Signed-off-by: Jakub Kicinski
Reviewed-by: Quentin Monnet
Signed-off-by: Daniel Borkmann

Jakub Kicinski
2018-07-14 02:26:35 +0800
799e173d7 netdevsim: add support for simultaneous driver and hw XDP ... Browse Code »

Allow netdevsim to accept driver and offload attachment of XDP
BPF programs at the same time.

Signed-off-by: Jakub Kicinski
Reviewed-by: Quentin Monnet
Signed-off-by: Daniel Borkmann

Jakub Kicinski
2018-07-14 02:26:35 +0800
a25717d2b xdp: support simultaneous driver and hw XDP attachment ... Browse Code »

Split the query of HW-attached program from the software one.
Introduce new .ndo_bpf command to query HW-attached program.
This will allow drivers to install different programs in HW
and SW at the same time. Netlink can now also carry multiple
programs on dump (in which case mode will be set to
XDP_ATTACHED_MULTI and user has to check per-attachment point
attributes, IFLA_XDP_PROG_ID will not be present). We reuse
IFLA_XDP_PROG_ID skb space for second mode, so rtnl_xdp_size()
doesn't need to be updated.

Note that the installation side is still not there, since all
drivers currently reject installing more than one program at
the time.

Signed-off-by: Jakub Kicinski
Reviewed-by: Quentin Monnet
Signed-off-by: Daniel Borkmann

Jakub Kicinski
2018-07-14 02:26:35 +0800
05296620f xdp: factor out common program/flags handling from drivers ... Browse Code »

Basic operations drivers perform during xdp setup and query can
be moved to helpers in the core. Encapsulate program and flags
into a structure and add helpers. Note that the structure is
intended as the "main" program information source in the driver.
Most drivers will additionally place the program pointer in their
fast path or ring structures.

The helpers don't have a huge impact now, but they will
decrease the code duplication when programs can be installed
in HW and driver at the same time. Encapsulating the basic
operations in helpers will hopefully also reduce the number
of changes to drivers which adopt them.

Helpers could really be static inline, but they depend on
definition of struct netdev_bpf which means they'd have
to be placed in netdevice.h, an already 4500 line header.

Signed-off-by: Jakub Kicinski
Reviewed-by: Quentin Monnet
Signed-off-by: Daniel Borkmann

Jakub Kicinski
2018-07-14 02:26:35 +0800
6b8675897 xdp: don't make drivers report attachment mode ... Browse Code »

prog_attached of struct netdev_bpf should have been superseded
by simply setting prog_id long time ago, but we kept it around
to allow offloading drivers to communicate attachment mode (drv
vs hw). Subsequently drivers were also allowed to report back
attachment flags (prog_flags), and since nowadays only programs
attached will XDP_FLAGS_HW_MODE can get offloaded, we can tell
the attachment mode from the flags driver reports. Remove
prog_attached member.

Signed-off-by: Jakub Kicinski
Reviewed-by: Quentin Monnet
Signed-off-by: Daniel Borkmann

Jakub Kicinski
2018-07-14 02:26:35 +0800
4f91da26c xdp: add per mode attributes for attached programs ... Browse Code »

In preparation for support of simultaneous driver and hardware XDP
support add per-mode attributes. The catch-all IFLA_XDP_PROG_ID
will still be reported, but user space can now also access the
program ID in a new IFLA_XDP__PROG_ID attribute.

Signed-off-by: Jakub Kicinski
Reviewed-by: Quentin Monnet
Signed-off-by: Daniel Borkmann

Jakub Kicinski
2018-07-14 02:26:35 +0800

13 Jul, 2018

24 commits

9c48b1d11 Merge branch 'bpf-arm-jit-improvements' ... Browse Code »

Russell King says:

====================
Four further jit compiler improves for 32-bit ARM.
====================

Signed-off-by: Daniel Borkmann

Daniel Borkmann
2018-07-13 21:26:42 +0800
b18bea2a4 ARM: net: bpf: improve 64-bit ALU implementation ... Browse Code »

Improbe the 64-bit ALU implementation from:

movw r8, #65532
movt r8, #65535
movw r9, #65535
movt r9, #65535
ldr r7, [fp, #-44]
adds r7, r7, r8
str r7, [fp, #-44]
ldr r7, [fp, #-40]
adc r7, r7, r9
str r7, [fp, #-40]

to:

movw r8, #65532
movt r8, #65535
movw r9, #65535
movt r9, #65535
ldrd r6, [fp, #-44]
adds r6, r6, r8
adc r7, r7, r9
strd r6, [fp, #-44]

Signed-off-by: Russell King
Signed-off-by: Daniel Borkmann

Russell King
2018-07-13 21:26:42 +0800
c5eae6925 ARM: net: bpf: improve 64-bit store implementation ... Browse Code »

Improve the 64-bit store implementation from:

ldr r6, [fp, #-8]
str r8, [r6]
ldr r6, [fp, #-8]
mov r7, #4
add r7, r6, r7
str r9, [r7]

to:

ldr r6, [fp, #-8]
str r8, [r6]
str r9, [r6, #4]

We leave the store as two separate STR instructions rather than using
STRD as the store may not be aligned, and STR can handle misalignment.

Signed-off-by: Russell King
Signed-off-by: Daniel Borkmann

Russell King
2018-07-13 21:26:42 +0800
077513b89 ARM: net: bpf: improve 64-bit sign-extended immediate load ... Browse Code »

Improve the 64-bit sign-extended immediate from:

mov r6, #1
str r6, [fp, #-52] ; 0xffffffcc
mov r6, #0
str r6, [fp, #-48] ; 0xffffffd0

to:

mov r6, #1
mov r7, #0
strd r6, [fp, #-52] ; 0xffffffcc

Signed-off-by: Russell King
Signed-off-by: Daniel Borkmann

Russell King
2018-07-13 21:26:41 +0800
f9ff5018c ARM: net: bpf: improve 64-bit load immediate implementation ... Browse Code »

Rather than writing each 32-bit half of the 64-bit immediate value
separately when the register is on the stack:

movw r6, #45056 ; 0xb000
movt r6, #60979 ; 0xee33
str r6, [fp, #-44] ; 0xffffffd4
mov r6, #0
str r6, [fp, #-40] ; 0xffffffd8

arrange to use the double-word store when available instead:

movw r6, #45056 ; 0xb000
movt r6, #60979 ; 0xee33
mov r7, #0
strd r6, [fp, #-44] ; 0xffffffd4

Signed-off-by: Russell King
Signed-off-by: Daniel Borkmann

Russell King
2018-07-13 21:26:41 +0800
6fd066604 Merge branch 'bpf-arm-jit-improvements' ... Browse Code »

Russell King says:

====================
This series improves the ARM BPF JIT compiler by:

- enumerating the stack layout rather than using constants that happen
to be multiples of four
- rejig the BPF "register" accesses to use negative numbers instead of
positive, which could be confused with register numbers in the bpf2a32
array.
- since we maintain the ARM FP register as a pointer to the top of our
scratch space (or, with frame pointers enabled, a valid ARM frame
pointer register), we can access our scratch space using FP, which is
constant across all BPF programs, including tail-called programs.
- use immediate forms of ARM instructions where possible, rather than
first loading the immediate into an ARM register.
- use load-with-shift instruction rather than seperate shift instruction
followed by load
- avoid reloading index and array in the tail-call code
- use double-word load/store instructions where available

Version 2:

- Fix ARMv5 test pointed out by Olof
- Fix build error found by 0-day (adding an additional patch)
====================

Signed-off-by: Daniel Borkmann

Daniel Borkmann
2018-07-13 02:45:24 +0800
8c9602d38 ARM: net: bpf: use double-word load/stores where available ... Browse Code »

Use double-word load and stores where support for this instruction is
supported by the CPU architecture.

Signed-off-by: Russell King
Signed-off-by: Daniel Borkmann

Russell King
2018-07-13 02:45:23 +0800
bef8968df ARM: net: bpf: always use odd/even register pair ... Browse Code »

Always use an odd/even register pair for our 64-bit registers, so that
we're able to use the double-word load/store instructions in the future.

Signed-off-by: Russell King
Signed-off-by: Daniel Borkmann

Russell King
2018-07-13 02:45:23 +0800
b50452299 ARM: net: bpf: avoid reloading 'array' ... Browse Code »

Rearranging the order of the initial tail call code a little allows is
to avoid reloading the 'array' pointer.

Signed-off-by: Russell King
Signed-off-by: Daniel Borkmann

Russell King
2018-07-13 02:45:23 +0800
aaffd2f5c ARM: net: bpf: avoid reloading 'index' ... Browse Code »

Avoid reloading 'index' after we have validated it - it remains in
tmp2[1] up to the point that we begin the code to index the pointer
array, so with a little rearrangement of the registers, we can use
the already loaded value.

Signed-off-by: Russell King
Signed-off-by: Daniel Borkmann

Russell King
2018-07-13 02:45:23 +0800
2b6958ef1 ARM: net: bpf: use ldr instructions with shifted rm register ... Browse Code »

Rather than pre-shifting the rm register for the ldr in the tail call,
shift it in the load instruction. This eliminates one unnecessary
instruction.

Signed-off-by: Russell King
Signed-off-by: Daniel Borkmann

Russell King
2018-07-13 02:45:23 +0800
828e2b90e ARM: net: bpf: use immediate forms of instructions where possible ... Browse Code »

Rather than moving constants to a register and then using them in a
subsequent instruction, use them directly in the desired instruction
cutting out the "middle" register. This removes two instructions from
the tail call code path.

Signed-off-by: Russell King
Signed-off-by: Daniel Borkmann

Russell King
2018-07-13 02:45:23 +0800
1ca3b17b7 ARM: net: bpf: imm12 constant conversion ... Browse Code »

Provide a version of the imm8m() function that the compiler can optimise
when used with a constant expression.

Signed-off-by: Russell King
Signed-off-by: Daniel Borkmann

Russell King
2018-07-13 02:45:23 +0800
96cced4e7 ARM: net: bpf: access eBPF scratch space using ARM FP register ... Browse Code »

Access the eBPF scratch space using the frame pointer rather than our
stack pointer, as the offsets from the ARM frame pointer are constant
across all eBPF programs.

Since we no longer reference the scratch space registers from the stack
pointer, this simplifies emit_push_r64() as it no longer needs to know
how many words are pushed onto the stack.

Signed-off-by: Russell King
Signed-off-by: Daniel Borkmann

Russell King
2018-07-13 02:45:22 +0800
a6eccac50 ARM: net: bpf: 64-bit accessor functions for BPF registers ... Browse Code »

Provide a couple of 64-bit register accessors, and use them where
appropriate

Signed-off-by: Russell King
Signed-off-by: Daniel Borkmann

Russell King
2018-07-13 02:45:22 +0800
7a9870256 ARM: net: bpf: provide accessor functions for BPF registers ... Browse Code »

Many of the code paths need to have knowledge about whether a register
is stacked or in a CPU register. Move this decision making to a pair
of helper functions instead of having it scattered throughout the
code.

Signed-off-by: Russell King
Signed-off-by: Daniel Borkmann

Russell King
2018-07-13 02:45:22 +0800
47b9c3bf4 ARM: net: bpf: remove is_on_stack() and sstk/dstk ... Browse Code »

The decision about whether a BPF register is on the stack or in a CPU
register is detected at the top BPF insn processing level, and then
percolated throughout the remainder of the code. Since we now use
negative register values to represent stacked registers, we can detect
where a BPF register is stored without restoring to carrying this
additional metadata through all code paths.

Signed-off-by: Russell King
Signed-off-by: Daniel Borkmann

Russell King
2018-07-13 02:45:22 +0800
1c35ba122 ARM: net: bpf: use negative numbers for stacked registers ... Browse Code »

Use negative numbers for eBPF registers that live on the stack.

Signed-off-by: Russell King
Signed-off-by: Daniel Borkmann

Russell King
2018-07-13 02:45:22 +0800
a8ef95a03 ARM: net: bpf: provide load/store ops with negative immediates ... Browse Code »

Provide a set of load/store opcode generators that work with negative
immediates as well as positive ones.

Signed-off-by: Russell King
Signed-off-by: Daniel Borkmann

Russell King
2018-07-13 02:45:22 +0800
d449ceb11 ARM: net: bpf: enumerate the JIT scratch stack layout ... Browse Code »

Enumerate the contents of the JIT scratch stack layout used for storing
some of the JITs 64-bit registers, tail call counter and AX register.

Signed-off-by: Russell King
Signed-off-by: Daniel Borkmann

Russell King
2018-07-13 02:45:22 +0800
b103cbe0d Merge branch 'bpf-helper-man-install' ... Browse Code »

Quentin Monnet says:

====================
The three patches in this series are related to the documentation for eBPF
helpers. The first patch brings minor formatting edits to the documentation
in include/uapi/linux/bpf.h, and the second one updates the related header
file under tools/.

The third patch adds a Makefile under tools/bpf for generating the
documentation (man pages) about eBPF helpers. The targets defined in this
file can also be called from the bpftool directory (please refer to
relevant commit logs for details).
====================

Signed-off-by: Daniel Borkmann

Daniel Borkmann
2018-07-13 00:55:54 +0800
86f7d85ce tools: bpf: build and install man page for eBPF helpers from bpftool/ ... Browse Code »

Provide a new Makefile.helpers in tools/bpf, in order to build and
install the man page for eBPF helpers. This Makefile is also included in
the one used to build bpftool documentation, so that it can be called
either on its own (cd tools/bpf && make -f Makefile.helpers) or from
bpftool directory (cd tools/bpf/bpftool && make doc, or
cd tools/bpf/bpftool/Documentation && make helpers).

Makefile.helpers is not added directly to bpftool to avoid changing its
Makefile too much (helpers are not 100% directly related with bpftool).
But the possibility to build the page from bpftool directory makes us
able to package the helpers man page with bpftool, and to install it
along with bpftool documentation, so that the doc for helpers becomes
easily available to developers through the "man" program.

Cc: linux-man@vger.kernel.org
Suggested-by: Daniel Borkmann
Signed-off-by: Quentin Monnet
Reviewed-by: Jakub Kicinski
Signed-off-by: Daniel Borkmann

Quentin Monnet
2018-07-13 00:55:53 +0800
9b8ca3795 tools: bpf: synchronise BPF UAPI header with tools ... Browse Code »

Update with latest changes from include/uapi/linux/bpf.h header.

Signed-off-by: Quentin Monnet
Reviewed-by: Jakub Kicinski
Signed-off-by: Daniel Borkmann

Quentin Monnet
2018-07-13 00:55:53 +0800
2bae79d2d bpf: fix documentation for eBPF helpers ... Browse Code »

Minor formatting edits for eBPF helpers documentation, including blank
lines removal, fix of item list for return values in bpf_fib_lookup(),
and missing prefix on bpf_skb_load_bytes_relative().

Signed-off-by: Quentin Monnet
Reviewed-by: Jakub Kicinski
Signed-off-by: Daniel Borkmann

Quentin Monnet
2018-07-13 00:55:53 +0800

12 Jul, 2018

7 commits

671dffa7d Merge branch 'bpf-bpftool-improved-prog-load' ... Browse Code »

Jakub Kicinski says:

====================
This series starts with two minor clean ups to test_offload.py
selftest script.

The next 11 patches extend the abilities of bpftool prog load
beyond the simple cgroup use cases. Three new parameters are
added:

- type - allows specifying program type, independent of how
code sections are named;
- map - allows reusing existing maps, instead of creating a new
map on every program load;
- dev - offload/binding to a device.

A number of changes to libbpf is required to accomplish the task.
The section - program type logic mapping is exposed. We should
probably aim to use the libbpf program section naming everywhere.
For reuse of maps we need to allow users to set FD for bpf map
object in libbpf.

Examples

Load program my_xdp.o and pin it as /sys/fs/bpf/my_xdp, for xdp
program type:

$ bpftool prog load my_xdp.o /sys/fs/bpf/my_xdp \
type xdp

As above but for offload:

$ bpftool prog load my_xdp.o /sys/fs/bpf/my_xdp \
type xdp \
dev netdevsim0

Load program my_maps.o, but for the first map reuse map id 17,
and for the map called "other_map" reuse pinned map /sys/fs/bpf/map0:

$ bpftool prog load my_maps.o /sys/fs/bpf/prog \
map idx 0 id 17 \
map name other_map pinned /sys/fs/bpf/map0

v3:
- fix return codes in patch 5;
- rename libbpf_prog_type_by_string() -> libbpf_prog_type_by_name();
- fold file path into xattr in patch 8;
- add patch 10;
- use dup3() in patch 12;
- depend on fd value in patch 12;
- close old fd in patch 12.
v2:
- add compat for reallocarray().
====================

Signed-off-by: Daniel Borkmann

Daniel Borkmann
2018-07-12 04:13:35 +0800
3ff5a4dc5 tools: bpftool: allow reuse of maps with bpftool prog load ... Browse Code »

Add map parameter to prog load which will allow reuse of existing
maps instead of creating new ones.

We need feature detection and compat code for reallocarray, since
it's not available in many libc versions.

Signed-off-by: Jakub Kicinski
Reviewed-by: Quentin Monnet
Acked-by: Alexei Starovoitov
Signed-off-by: Daniel Borkmann

Jakub Kicinski
2018-07-12 04:13:34 +0800
26736eb9a tools: libbpf: allow map reuse ... Browse Code »

More advanced applications may want to only replace programs without
destroying associated maps. Allow libbpf users to achieve that.
Instead of always creating all of the maps at load time, expose to
users an API to reconstruct the map object from already existing
map.

The map parameters are read from the kernel and replace the parameters
of the ELF map. libbpf does not restrict the map replacement, i.e.
the reused map does not have to be compatible with the ELF map
definition. We relay on the verifier for checking the compatibility
between maps and programs. The ELF map definition is completely
overwritten by the information read from the kernel, to make sure
libbpf's view of map object corresponds to the actual map.

Signed-off-by: Jakub Kicinski
Reviewed-by: Quentin Monnet
Acked-by: Andrey Ignatov
Signed-off-by: Daniel Borkmann

Jakub Kicinski
2018-07-12 04:13:34 +0800
531b014e7 tools: bpf: make use of reallocarray ... Browse Code »

reallocarray() is a safer variant of realloc which checks for
multiplication overflow in case of array allocation. Since it's
not available in Glibc < 2.26 import kernel's overflow.h and
add a static inline implementation when needed. Use feature
detection to probe for existence of reallocarray.

Signed-off-by: Jakub Kicinski
Reviewed-by: Quentin Monnet
Reviewed-by: Jiong Wang
Signed-off-by: Daniel Borkmann

Jakub Kicinski
2018-07-12 04:13:34 +0800
8d13406c0 tools: libbpf: move library error code into a separate file ... Browse Code »

libbpf_strerror() depends on XSI-compliant (POSIX) version of
strerror_r(), which prevents us from using GNU-extensions in
libbpf.c, like reallocarray() or dup3(). Move error printing
code into a separate file to allow it to continue using POSIX
strerror_r().

No functional changes.

Signed-off-by: Jakub Kicinski
Signed-off-by: Daniel Borkmann

Jakub Kicinski
2018-07-12 04:13:34 +0800
c8406848b tools: bpftool: reimplement bpf_prog_load() for prog load ... Browse Code »

bpf_prog_load() is a very useful helper but it doesn't give us full
flexibility of modifying the BPF objects before loading. Open code
bpf_prog_load() in bpftool so we can add extra logic in following
commits.

Signed-off-by: Jakub Kicinski
Reviewed-by: Quentin Monnet
Acked-by: Alexei Starovoitov
Signed-off-by: Daniel Borkmann

Jakub Kicinski
2018-07-12 04:13:34 +0800
07f2d4eac tools: libbpf: add extended attributes version of bpf_object__open() ... Browse Code »

Similarly to bpf_prog_load() users of bpf_object__open() may need
to specify the expected program type. Program type is needed at
open to avoid the kernel version check for program types which don't
require it.

Signed-off-by: Jakub Kicinski
Reviewed-by: Quentin Monnet
Acked-by: Andrey Ignatov
Signed-off-by: Daniel Borkmann

Jakub Kicinski
2018-07-12 04:13:34 +0800