Blame view

Documentation/networking/strparser.rst 8.5 KB
060d9d3e1   Mauro Carvalho Chehab   docs: networking:...
1
2
3
  .. SPDX-License-Identifier: GPL-2.0
  
  =========================
bbb03029a   Tom Herbert   strparser: Genera...
4
  Stream Parser (strparser)
060d9d3e1   Mauro Carvalho Chehab   docs: networking:...
5
  =========================
bbb03029a   Tom Herbert   strparser: Genera...
6
7
8
  
  Introduction
  ============
adcce4d5d   Tom Herbert   strparser: Docume...
9
10
  
  The stream parser (strparser) is a utility that parses messages of an
bbb03029a   Tom Herbert   strparser: Genera...
11
  application layer protocol running over a data stream. The stream
adcce4d5d   Tom Herbert   strparser: Docume...
12
13
14
15
  parser works in conjunction with an upper layer in the kernel to provide
  kernel support for application layer messages. For instance, Kernel
  Connection Multiplexor (KCM) uses the Stream Parser to parse messages
  using a BPF program.
bbb03029a   Tom Herbert   strparser: Genera...
16
17
18
19
20
21
22
23
24
25
26
  The strparser works in one of two modes: receive callback or general
  mode.
  
  In receive callback mode, the strparser is called from the data_ready
  callback of a TCP socket. Messages are parsed and delivered as they are
  received on the socket.
  
  In general mode, a sequence of skbs are fed to strparser from an
  outside source. Message are parsed and delivered as the sequence is
  processed. This modes allows strparser to be applied to arbitrary
  streams of data.
adcce4d5d   Tom Herbert   strparser: Docume...
27
  Interface
bbb03029a   Tom Herbert   strparser: Genera...
28
  =========
adcce4d5d   Tom Herbert   strparser: Docume...
29
30
  
  The API includes a context structure, a set of callbacks, utility
bbb03029a   Tom Herbert   strparser: Genera...
31
32
33
34
  functions, and a data_ready function for receive callback mode. The
  callbacks include a parse_msg function that is called to perform
  parsing (e.g.  BPF parsing in case of KCM), and a rcv_msg function
  that is called when a full message has been completed.
adcce4d5d   Tom Herbert   strparser: Docume...
35

bbb03029a   Tom Herbert   strparser: Genera...
36
37
  Functions
  =========
adcce4d5d   Tom Herbert   strparser: Docume...
38

060d9d3e1   Mauro Carvalho Chehab   docs: networking:...
39
40
41
42
       ::
  
  	strp_init(struct strparser *strp, struct sock *sk,
  		const struct strp_callbacks *cb)
adcce4d5d   Tom Herbert   strparser: Docume...
43

bbb03029a   Tom Herbert   strparser: Genera...
44
45
46
47
48
       Called to initialize a stream parser. strp is a struct of type
       strparser that is allocated by the upper layer. sk is the TCP
       socket associated with the stream parser for use with receive
       callback mode; in general mode this is set to NULL. Callbacks
       are called by the stream parser (the callbacks are listed below).
060d9d3e1   Mauro Carvalho Chehab   docs: networking:...
49
50
51
       ::
  
  	void strp_pause(struct strparser *strp)
bbb03029a   Tom Herbert   strparser: Genera...
52
53
54
  
       Temporarily pause a stream parser. Message parsing is suspended
       and no new messages are delivered to the upper layer.
060d9d3e1   Mauro Carvalho Chehab   docs: networking:...
55
56
57
       ::
  
  	void strp_unpause(struct strparser *strp)
bbb03029a   Tom Herbert   strparser: Genera...
58
59
  
       Unpause a paused stream parser.
060d9d3e1   Mauro Carvalho Chehab   docs: networking:...
60
61
62
       ::
  
  	void strp_stop(struct strparser *strp);
bbb03029a   Tom Herbert   strparser: Genera...
63
64
65
66
67
  
       strp_stop is called to completely stop stream parser operations.
       This is called internally when the stream parser encounters an
       error, and it is called from the upper layer to stop parsing
       operations.
060d9d3e1   Mauro Carvalho Chehab   docs: networking:...
68
69
70
       ::
  
  	void strp_done(struct strparser *strp);
bbb03029a   Tom Herbert   strparser: Genera...
71
72
73
74
  
       strp_done is called to release any resources held by the stream
       parser instance. This must be called after the stream processor
       has been stopped.
060d9d3e1   Mauro Carvalho Chehab   docs: networking:...
75
76
77
78
79
       ::
  
  	int strp_process(struct strparser *strp, struct sk_buff *orig_skb,
  			 unsigned int orig_offset, size_t orig_len,
  			 size_t max_msg_size, long timeo)
bbb03029a   Tom Herbert   strparser: Genera...
80
81
82
83
84
85
  
      strp_process is called in general mode for a stream parser to
      parse an sk_buff. The number of bytes processed or a negative
      error number is returned. Note that strp_process does not
      consume the sk_buff. max_msg_size is maximum size the stream
      parser will parse. timeo is timeout for completing a message.
060d9d3e1   Mauro Carvalho Chehab   docs: networking:...
86
87
88
      ::
  
  	void strp_data_ready(struct strparser *strp);
bbb03029a   Tom Herbert   strparser: Genera...
89
90
91
92
93
94
  
      The upper layer calls strp_tcp_data_ready when data is ready on
      the lower socket for strparser to process. This should be called
      from a data_ready callback that is set on the socket. Note that
      maximum messages size is the limit of the receive socket
      buffer and message timeout is the receive timeout for the socket.
060d9d3e1   Mauro Carvalho Chehab   docs: networking:...
95
96
97
      ::
  
  	void strp_check_rcv(struct strparser *strp);
bbb03029a   Tom Herbert   strparser: Genera...
98
99
100
101
  
      strp_check_rcv is called to check for new messages on the socket.
      This is normally called at initialization of a stream parser
      instance or after strp_unpause.
adcce4d5d   Tom Herbert   strparser: Docume...
102
103
  
  Callbacks
bbb03029a   Tom Herbert   strparser: Genera...
104
  =========
adcce4d5d   Tom Herbert   strparser: Docume...
105

bbb03029a   Tom Herbert   strparser: Genera...
106
  There are six callbacks:
adcce4d5d   Tom Herbert   strparser: Docume...
107

060d9d3e1   Mauro Carvalho Chehab   docs: networking:...
108
109
110
      ::
  
  	int (*parse_msg)(struct strparser *strp, struct sk_buff *skb);
adcce4d5d   Tom Herbert   strparser: Docume...
111
112
113
114
  
      parse_msg is called to determine the length of the next message
      in the stream. The upper layer must implement this function. It
      should parse the sk_buff as containing the headers for the
bbb03029a   Tom Herbert   strparser: Genera...
115
      next application layer message in the stream.
adcce4d5d   Tom Herbert   strparser: Docume...
116

bbb03029a   Tom Herbert   strparser: Genera...
117
      The skb->cb in the input skb is a struct strp_msg. Only
adcce4d5d   Tom Herbert   strparser: Docume...
118
119
120
121
      the offset field is relevant in parse_msg and gives the offset
      where the message starts in the skb.
  
      The return values of this function are:
060d9d3e1   Mauro Carvalho Chehab   docs: networking:...
122
123
124
125
126
127
128
129
130
131
      =========    ===========================================================
      >0           indicates length of successfully parsed message
      0            indicates more data must be received to parse the message
      -ESTRPIPE    current message should not be processed by the
  		 kernel, return control of the socket to userspace which
  		 can proceed to read the messages itself
      other < 0    Error in parsing, give control back to userspace
  		 assuming that synchronization is lost and the stream
  		 is unrecoverable (application expected to close TCP socket)
      =========    ===========================================================
adcce4d5d   Tom Herbert   strparser: Docume...
132
133
  
      In the case that an error is returned (return value is less than
bbb03029a   Tom Herbert   strparser: Genera...
134
135
136
137
138
      zero) and the parser is in receive callback mode, then it will set
      the error on TCP socket and wake it up. If parse_msg returned
      -ESTRPIPE and the stream parser had previously read some bytes for
      the current message, then the error set on the attached socket is
      ENODATA since the stream is unrecoverable in that case.
060d9d3e1   Mauro Carvalho Chehab   docs: networking:...
139
140
141
      ::
  
  	void (*lock)(struct strparser *strp)
bbb03029a   Tom Herbert   strparser: Genera...
142
143
144
145
146
147
  
      The lock callback is called to lock the strp structure when
      the strparser is performing an asynchronous operation (such as
      processing a timeout). In receive callback mode the default
      function is to lock_sock for the associated socket. In general
      mode the callback must be set appropriately.
060d9d3e1   Mauro Carvalho Chehab   docs: networking:...
148
149
150
      ::
  
  	void (*unlock)(struct strparser *strp)
bbb03029a   Tom Herbert   strparser: Genera...
151
152
153
154
155
  
      The unlock callback is called to release the lock obtained
      by the lock callback. In receive callback mode the default
      function is release_sock for the associated socket. In general
      mode the callback must be set appropriately.
adcce4d5d   Tom Herbert   strparser: Docume...
156

060d9d3e1   Mauro Carvalho Chehab   docs: networking:...
157
158
159
      ::
  
  	void (*rcv_msg)(struct strparser *strp, struct sk_buff *skb);
adcce4d5d   Tom Herbert   strparser: Docume...
160
161
162
163
  
      rcv_msg is called when a full message has been received and
      is queued. The callee must consume the sk_buff; it can
      call strp_pause to prevent any further messages from being
bbb03029a   Tom Herbert   strparser: Genera...
164
      received in rcv_msg (see strp_pause above). This callback
adcce4d5d   Tom Herbert   strparser: Docume...
165
      must be set.
bbb03029a   Tom Herbert   strparser: Genera...
166
      The skb->cb in the input skb is a struct strp_msg. This
adcce4d5d   Tom Herbert   strparser: Docume...
167
168
169
170
      struct contains two fields: offset and full_len. Offset is
      where the message starts in the skb, and full_len is the
      the length of the message. skb->len - offset may be greater
      then full_len since strparser does not trim the skb.
060d9d3e1   Mauro Carvalho Chehab   docs: networking:...
171
172
173
      ::
  
  	int (*read_sock_done)(struct strparser *strp, int err);
adcce4d5d   Tom Herbert   strparser: Docume...
174
175
  
       read_sock_done is called when the stream parser is done reading
bbb03029a   Tom Herbert   strparser: Genera...
176
177
178
179
       the TCP socket in receive callback mode. The stream parser may
       read multiple messages in a loop and this function allows cleanup
       to occur when exiting the loop. If the callback is not set (NULL
       in strp_init) a default function is used.
adcce4d5d   Tom Herbert   strparser: Docume...
180

060d9d3e1   Mauro Carvalho Chehab   docs: networking:...
181
182
183
       ::
  
  	void (*abort_parser)(struct strparser *strp, int err);
adcce4d5d   Tom Herbert   strparser: Docume...
184
185
  
       This function is called when stream parser encounters an error
bbb03029a   Tom Herbert   strparser: Genera...
186
187
188
189
       in parsing. The default function stops the stream parser and
       sets the error in the socket if the parser is in receive callback
       mode. The default function can be changed by setting the callback
       to non-NULL in strp_init.
adcce4d5d   Tom Herbert   strparser: Docume...
190

bbb03029a   Tom Herbert   strparser: Genera...
191
192
  Statistics
  ==========
adcce4d5d   Tom Herbert   strparser: Docume...
193

bbb03029a   Tom Herbert   strparser: Genera...
194
195
196
197
198
  Various counters are kept for each stream parser instance. These are in
  the strp_stats structure. strp_aggr_stats is a convenience structure for
  accumulating statistics for multiple stream parser instances.
  save_strp_stats and aggregate_strp_stats are helper functions to save
  and aggregate statistics.
adcce4d5d   Tom Herbert   strparser: Docume...
199

bbb03029a   Tom Herbert   strparser: Genera...
200
201
  Message assembly limits
  =======================
adcce4d5d   Tom Herbert   strparser: Docume...
202

bbb03029a   Tom Herbert   strparser: Genera...
203
204
  The stream parser provide mechanisms to limit the resources consumed by
  message assembly.
adcce4d5d   Tom Herbert   strparser: Docume...
205

bbb03029a   Tom Herbert   strparser: Genera...
206
207
208
209
210
211
  A timer is set when assembly starts for a new message. In receive
  callback mode the message timeout is taken from rcvtime for the
  associated TCP socket. In general mode, the timeout is passed as an
  argument in strp_process. If the timer fires before assembly completes
  the stream parser is aborted and the ETIMEDOUT error is set on the TCP
  socket if in receive callback mode.
adcce4d5d   Tom Herbert   strparser: Docume...
212

bbb03029a   Tom Herbert   strparser: Genera...
213
214
215
216
217
218
  In receive callback mode, message length is limited to the receive
  buffer size of the associated TCP socket. If the length returned by
  parse_msg is greater than the socket buffer size then the stream parser
  is aborted with EMSGSIZE error set on the TCP socket. Note that this
  makes the maximum size of receive skbuffs for a socket with a stream
  parser to be 2*sk_rcvbuf of the TCP socket.
adcce4d5d   Tom Herbert   strparser: Docume...
219

bbb03029a   Tom Herbert   strparser: Genera...
220
221
  In general mode the message length limit is passed in as an argument
  to strp_process.
adcce4d5d   Tom Herbert   strparser: Docume...
222

bbb03029a   Tom Herbert   strparser: Genera...
223
224
  Author
  ======
adcce4d5d   Tom Herbert   strparser: Docume...
225

bbb03029a   Tom Herbert   strparser: Genera...
226
  Tom Herbert (tom@quantonium.net)