Doug / smarc-fsl-linux-kernel | Embedian Git Server

23 Jun, 2006

1 commit

fadd8fbd1 [PATCH] support for panic at OOM

This patch adds panic_on_oom sysctl under sys.vm.

When sysctl vm.panic_on_oom = 1, the kernel panics instead of killing rogue
processes. If vm.panic_on_oom is 0, the kernel will do oom_kill() in
the same way as it does today. Of course, the default value is 0 and only
root can modify it.

In general, oom_killer works well and kills rogue processes, so the whole
system can survive. But there are environments where a panic is preferable
to killing some processes.
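
A minimal sketch of how a userspace tool might inspect this knob, assuming the usual mapping of dotted sysctl names onto paths under /proc/sys. The helper names (sysctl_path, read_sysctl_int, oom_action) are hypothetical and not part of the patch. Treating any nonzero value as "panic" is also an assumption, since the message only describes 0 and 1.

```python
from pathlib import Path


def sysctl_path(name: str, root: str = "/proc/sys") -> Path:
    """Map a dotted sysctl name such as 'vm.panic_on_oom' to its procfs path."""
    return Path(root).joinpath(*name.split("."))


def read_sysctl_int(name: str, root: str = "/proc/sys") -> int:
    """Read an integer sysctl value; `root` is overridable for testing."""
    return int(sysctl_path(name, root).read_text().strip())


def oom_action(panic_on_oom: int) -> str:
    """Model of the policy described above: 0 -> oom_kill(), 1 -> panic.

    Assumption: any other nonzero value is treated like 1.
    """
    return "panic" if panic_on_oom else "oom_kill"
```

On a kernel carrying this patch, oom_action(read_sysctl_int("vm.panic_on_oom")) would report which of the two behaviours is configured.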

Signed-off-by: KAMEZAWA Hiroyuki
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

KAMEZAWA Hiroyuki
2006-06-23 22:42:47 +0800

18 Jun, 2006

3 commits

35089bb20 [TCP]: Add tcp_slow_start_after_idle sysctl.

A lot of people have asked for a way to disable tcp_cwnd_restart(),
and it seems reasonable to add a sysctl to do that.

Signed-off-by: David S. Miller

David S. Miller
2006-06-18 12:30:53 +0800

39a27a35c [NETFILTER]: conntrack: add sysctl to disable checksumming

Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Patrick McHardy
2006-06-18 12:28:57 +0800

959378258 [I/OAT]: Add a sysctl for tuning the I/OAT offloaded I/O threshold

Any socket recv of less than this amount will not be offloaded.

Signed-off-by: Chris Leech
Signed-off-by: David S. Miller

Chris Leech
2006-06-18 12:25:54 +0800

21 Mar, 2006

10 commits

15d99e02b [TCP]: sysctl to allow TCP window > 32767 sans wscale

Back in the dark ages, we had to be conservative and only allow 15-bit
window fields if the window scale option was not negotiated. Some
ancient stacks used a signed 16-bit quantity for the window field of
the TCP header and would get confused.

Those days are long gone, so we can use the full 16 bits by default
now.

There is a sysctl added so that we can still interact with such old
stacks.
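
The arithmetic behind the 15-bit versus 16-bit limit can be sketched as follows. This is an illustrative model only: the function and the parameter name workaround_signed_windows are assumptions chosen for the example, not identifiers taken from the commit message.

```python
def max_window_field(wscale_negotiated: bool, workaround_signed_windows: bool) -> int:
    """Largest value placed in the 16-bit TCP window field.

    Without window scaling, a peer that stores the field in a signed
    16-bit integer misreads anything above 2**15 - 1, so the workaround
    clamps to 15 bits. Otherwise the full unsigned 16-bit range is used.
    """
    if not wscale_negotiated and workaround_signed_windows:
        return (1 << 15) - 1  # 32767
    return (1 << 16) - 1      # 65535
```

With the workaround off, an unscaled connection can advertise up to 65535 bytes, double the old conservative limit.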

Signed-off-by: Rick Jones
Signed-off-by: David S. Miller

Rick Jones
2006-03-21 14:40:29 +0800

abd596a4b [IPV4] ARP: Allow acceptance of unsolicited ARP via netdevice sysctl.

Signed-off-by: Neil Horman
Signed-off-by: David S. Miller

Neil Horman
2006-03-21 14:39:47 +0800

e55d912f5 [DCCP] feat: Introduce sysctls for the default features

[root@qemu ~]# for a in /proc/sys/net/dccp/default/* ; do echo $a ; cat $a ; done
/proc/sys/net/dccp/default/ack_ratio
2
/proc/sys/net/dccp/default/rx_ccid
3
/proc/sys/net/dccp/default/send_ackvec
1
/proc/sys/net/dccp/default/send_ndp
1
/proc/sys/net/dccp/default/seq_window
100
/proc/sys/net/dccp/default/tx_ccid
3
[root@qemu ~]#

So if wanting to test ccid3 as the tx CCID one can just do:

[root@qemu ~]# echo 3 > /proc/sys/net/dccp/default/tx_ccid
[root@qemu ~]# echo 2 > /proc/sys/net/dccp/default/rx_ccid
[root@qemu ~]# cat /proc/sys/net/dccp/default/[tr]x_ccid
2
3
[root@qemu ~]#

Of course we also need the setsockopt for each app to tell its preferences, but
for testing or defining something other than CCID2 as the default for apps that
don't explicitly set their preference, the sysctl interface is handy.

Signed-off-by: Arnaldo Carvalho de Melo
Signed-off-by: David S. Miller

Arnaldo Carvalho de Melo
2006-03-21 11:25:02 +0800

f8cd54884 [IPSEC]: Sync series - core changes

This patch provides the core functionality needed for sync events
for ipsec. Derived work of Krisztian KOVACS

Signed-off-by: Jamal Hadi Salim
Signed-off-by: David S. Miller

Jamal Hadi Salim
2006-03-21 11:15:11 +0800

5d424d5a6 [TCP]: MTU probing

Implementation of packetization layer path mtu discovery for TCP, based on
the internet-draft currently found at
.

Signed-off-by: John Heffner
Signed-off-by: David S. Miller

John Heffner
2006-03-21 09:53:41 +0800

09c884d4c [IPV6]: ROUTE: Add accept_ra_rt_info_max_plen sysctl.

Signed-off-by: YOSHIFUJI Hideaki
Signed-off-by: David S. Miller

YOSHIFUJI Hideaki
2006-03-21 09:07:03 +0800

52e163563 [IPV6]: ROUTE: Add router_probe_interval sysctl.

Signed-off-by: YOSHIFUJI Hideaki
Signed-off-by: David S. Miller

YOSHIFUJI Hideaki
2006-03-21 09:05:47 +0800

930d6ff2e [IPV6]: ROUTE: Add accept_ra_rtr_pref sysctl.

Signed-off-by: YOSHIFUJI Hideaki
Signed-off-by: David S. Miller

YOSHIFUJI Hideaki
2006-03-21 09:05:30 +0800

c4fd30eb1 [IPV6]: ADDRCONF: Add accept_ra_pinfo sysctl.

This controls whether we accept Prefix Information in RAs.

Signed-off-by: YOSHIFUJI Hideaki
Signed-off-by: David S. Miller

YOSHIFUJI Hideaki
2006-03-21 08:55:26 +0800

65f5c7c11 [IPV6]: ROUTE: Add accept_ra_defrtr sysctl.

This controls whether we accept default router information
in RAs.

Signed-off-by: YOSHIFUJI Hideaki
Signed-off-by: David S. Miller

YOSHIFUJI Hideaki
2006-03-21 08:55:08 +0800

01 Mar, 2006

1 commit

d2b176ed8 [IA64] sysctl option to silence unaligned trap warnings

Allow sysadmin to disable all warnings about userland apps
making unaligned accesses by using:
# echo 1 > /proc/sys/kernel/ignore-unaligned-usertrap
rather than having to use prctl on a process-by-process basis.

Default behaviour leaves the warnings enabled.

Signed-off-by: Jes Sorensen
Signed-off-by: Tony Luck

Jes Sorensen
2006-03-01 01:42:23 +0800

21 Feb, 2006

1 commit

c255d844d [PATCH] suspend-to-ram: allow video options to be set at runtime

Currently, acpi video options can only be set on the kernel command line. That's
a little inflexible; I'd like a userland s2ram application that just works, and
modifying the kernel command line according to a whitelist is not fun. It is better
to just allow the s2ram application to set video options just before suspend
(according to the whitelist).

This implements a sysctl to allow setting suspend video options without reboot.

(akpm: Documentation updates for this new sysctl are pending..)

Signed-off-by: Pavel Machek
Cc: "Brown, Len"
Cc: "Antonino A. Daplas"
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

Pavel Machek
2006-02-21 12:00:10 +0800

02 Feb, 2006

1 commit

2a11ff06d [PATCH] zone_reclaim: configurable off node allocation period.

Currently the zone_reclaim code has a fixed window of 30 seconds of off node
allocations should a local zone have no unused pagecache pages left. Reclaim
will be attempted again after this timeout period to avoid repeated useless
scans for memory. This is also useful to establish sufficiently large off
node allocation chunks to relieve the local node.

It may be beneficial to adjust that time period for some special situations.
For example if memory use was exceeding node capacity one may want to give up
for longer periods of time. If memory spikes intermittently then one may want
to shorten the time period to reduce the number of off node allocations.
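
The timeout logic described above can be modelled roughly like this. It is a sketch under stated assumptions, not the kernel code: the class ZoneReclaimGate and its method names are hypothetical, and time is passed in explicitly so the behaviour is easy to follow.

```python
from typing import Optional


class ZoneReclaimGate:
    """After a failed local reclaim, allow off-node allocation for a window."""

    def __init__(self, interval_s: float = 30.0) -> None:
        # 30 seconds is the previously fixed window; the patch makes it tunable.
        self.interval_s = interval_s
        self.last_failure: Optional[float] = None

    def record_failure(self, now: float) -> None:
        """Remember when the local zone last had nothing left to reclaim."""
        self.last_failure = now

    def should_try_reclaim(self, now: float) -> bool:
        """Scan the local zone again only once the window has elapsed."""
        if self.last_failure is None:
            return True
        return now - self.last_failure >= self.interval_s
```

A longer interval means fewer useless scans when the node is persistently over capacity. A shorter one returns to local allocation sooner after a brief spike.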

This patch allows just that....

Signed-off-by: Christoph Lameter
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

Christoph Lameter
2006-02-02 00:53:16 +0800

19 Jan, 2006

1 commit

1743660b9 [PATCH] Zone reclaim: proc override

proc support for zone reclaim

This patch creates a proc entry /proc/sys/vm/zone_reclaim_mode that may be
used to override the automatic determination of the zone reclaim made on
bootup.

Signed-off-by: Christoph Lameter
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

Christoph Lameter
2006-01-19 11:20:17 +0800

09 Jan, 2006

2 commits

8ad4b1fb8 [PATCH] Make high and batch sizes of per_cpu_pagelists configurable

Recently there has been a lot of traffic on the right values for batch and
high water marks for per_cpu_pagelists. This patch makes these two
variables configurable through the /proc interface.

A new tunable /proc/sys/vm/percpu_pagelist_fraction is added. This entry
controls the fraction of pages at most in each zone that are allocated for
each per cpu page list. The min value for this is 8. It means that we
don't allow more than 1/8th of pages in each zone to be allocated in any
single per_cpu_pagelist.

The batch value of each per cpu pagelist is also updated as a result. It
is set to pcp->high/4. The upper limit of batch is (PAGE_SHIFT * 8).
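
As a worked example of the arithmetic above, the following sketch derives high and batch from a zone size and the fraction. The function name pcp_limits, the default page_shift of 12 (4 KiB pages) and the lower bound of 1 on batch are assumptions made for illustration.

```python
MIN_FRACTION = 8  # minimum accepted value, per the description above


def pcp_limits(zone_pages: int, fraction: int, page_shift: int = 12) -> tuple:
    """Return (high, batch) for one per-cpu page list.

    high  = zone_pages / fraction
    batch = high / 4, capped at page_shift * 8 (and at least 1, an assumption)
    """
    if fraction < MIN_FRACTION:
        raise ValueError("fraction must be at least %d" % MIN_FRACTION)
    high = zone_pages // fraction
    batch = min(max(1, high // 4), page_shift * 8)
    return high, batch
```

For a 1 GiB zone of 4 KiB pages (262144 pages) and a fraction of 8, this gives a high mark of 32768 pages with batch capped at 96.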

Signed-off-by: Rohit Seth
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

Rohit Seth
2006-01-09 12:12:40 +0800

9d0243bca [PATCH] drop-pagecache

Add /proc/sys/vm/drop_caches. When written to, this will cause the kernel to
discard as much pagecache and/or reclaimable slab objects as it can. This
operation requires root permissions.

It won't drop dirty data, so the user should run `sync' first.
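
A hedged sketch of the "sync first, then write" sequence follows. The numeric values (1 for pagecache, 2 for slab objects, 3 for both) come from the kernel's sysctl documentation rather than from this commit message, and the function name and path parameter are hypothetical conveniences for the example.

```python
import os

DROP_PAGECACHE = 1  # value per kernel sysctl documentation, not this message
DROP_SLAB = 2       # likewise


def drop_caches(pagecache: bool = True, slab: bool = True,
                path: str = "/proc/sys/vm/drop_caches") -> int:
    """Flush dirty data, then ask the kernel to discard clean caches.

    Writing to the real path requires root. Returns the value written.
    """
    value = (DROP_PAGECACHE if pagecache else 0) | (DROP_SLAB if slab else 0)
    if value == 0:
        raise ValueError("nothing selected to drop")
    os.sync()  # dirty data is not dropped, so write it back first
    with open(path, "w") as f:
        f.write("%d\n" % value)
    return value
```

Calling drop_caches() between benchmark runs is one way to get the consistent starting state the message describes.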

Caveats:

a) Holds inode_lock for exorbitant amounts of time.

b) Needs to be taught about NUMA nodes: propagate these all the way through
so the discarding can be controlled on a per-node basis.

This is a debugging feature: useful for getting consistent results between
filesystem benchmarks. We could possibly put it under a config option, but
it's less than 300 bytes.

Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

Andrew Morton
2006-01-09 12:12:40 +0800

05 Jan, 2006

2 commits

db9edfd7e Merge git://git.kernel.org/pub/scm/linux/kernel/git/gregkh/driver-2.6

Trivial manual merge fixup for usb_find_interface clashes.

Linus Torvalds
2006-01-05 10:44:12 +0800

312c004d3 [PATCH] driver core: replace "hotplug" by "uevent"

Leave the overloaded "hotplug" word to subsystems which are handling
real devices. The driver core does not "plug" anything, it just exports
the state to userspace and generates events.

Signed-off-by: Kay Sievers
Signed-off-by: Greg Kroah-Hartman

Kay Sievers
2006-01-05 08:18:08 +0800

04 Jan, 2006

1 commit

89cee8b1c [IPV4]: Safer reassembly

Another spin of Herbert Xu's "safer ip reassembly" patch
for 2.6.16.

(The original patch is here:
http://marc.theaimsgroup.com/?l=linux-netdev&m=112281936522415&w=2
and my only contribution is to have tested it.)

This patch (optionally) does additional checks before accepting IP
fragments, which can greatly reduce the possibility of reassembling
fragments which originated from different IP datagrams.

Signed-off-by: Herbert Xu
Signed-off-by: Arthur Kepner
Signed-off-by: David S. Miller

Herbert Xu
2006-01-04 05:10:31 +0800

06 Dec, 2005

1 commit

1f12bcc9d [DECNET]: add memory buffer settings

The patch (originally from Steve) simply adds memory buffer settings to
DECnet similar to those in TCP.

Signed-off-by: Patrick Caulfield
Signed-off-by: David S. Miller

Steven Whitehouse
2005-12-06 05:42:06 +0800

16 Nov, 2005

1 commit

d4ed803c5 [PATCH] Make sysctl.h (again) usable from userspace

Make sysctl.h (again) useable from userspace

Signed-off-by: Harald Welte
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

Harald Welte
2005-11-16 00:59:18 +0800

12 Nov, 2005

1 commit

049b3ff5a [SCTP]: Include ulpevents in socket receive buffer accounting.

Also introduces a sysctl option to configure the receive buffer
accounting policy to be either at socket or association level.
Default is all the associations on the same socket share the
receive buffer.

Signed-off-by: Neil Horman
Signed-off-by: Sridhar Samudrala
Signed-off-by: David S. Miller

Neil Horman
2005-11-12 08:08:24 +0800

11 Nov, 2005

1 commit

9772efb97 [TCP]: Appropriate Byte Count support

This is an updated version of the RFC3465 ABC patch originally
for Linux 2.6.11-rc4 by Yee-Ting Li. ABC is a way of counting
bytes ack'd rather than packets when updating congestion control.

The original ABC described in the RFC applied to a Reno style
algorithm. For advanced congestion control there is little
change after leaving slow start.
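
The byte-counting rule can be sketched as below. This follows the RFC 3465 description in bytes, not the patch itself, which is not shown here. The function name, argument names and the default limit of 2 segments per ACK are assumptions taken from the RFC's recommendations.

```python
def abc_on_ack(cwnd: int, ssthresh: int, bytes_acked: int,
               newly_acked: int, smss: int, limit: int = 2) -> tuple:
    """Apply one ACK under Appropriate Byte Counting.

    All quantities are in bytes. Returns the updated (cwnd, bytes_acked).
    """
    if cwnd < ssthresh:
        # Slow start: grow by the bytes acknowledged, at most limit * SMSS.
        cwnd += min(newly_acked, limit * smss)
    else:
        # Congestion avoidance: count bytes until a full window is acked,
        # then grow the window by one full-sized segment.
        bytes_acked += newly_acked
        if bytes_acked >= cwnd:
            bytes_acked -= cwnd
            cwnd += smss
    return cwnd, bytes_acked
```

Counting bytes rather than ACK packets means a receiver that delays or stretches its ACKs does not slow the sender's window growth.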

Signed-off-by: Stephen Hemminger
Signed-off-by: David S. Miller

Stephen Hemminger
2005-11-11 09:09:53 +0800

10 Nov, 2005

1 commit

9fb9cbb10 [NETFILTER]: Add nf_conntrack subsystem.

The existing connection tracking subsystem in netfilter can only
handle ipv4. There were basically two choices present to add
connection tracking support for ipv6. We could either duplicate all
of the ipv4 connection tracking code into an ipv6 counterpart, or (the
choice taken by these patches) we could design a generic layer that
could handle both ipv4 and ipv6 and thus requiring only one sub-protocol
(TCP, UDP, etc.) connection tracking helper module to be written.

In fact nf_conntrack is capable of working with any layer 3
protocol.

The existing ipv4 specific conntrack code could also not deal
with the peculiarities of doing connection tracking on ipv6,
which is also cured here. For example, these issues include:

1) ICMPv6 handling, which is used for neighbour discovery in
ipv6 thus some messages such as these should not participate
in connection tracking since effectively they are like ARP
messages

2) fragmentation must be handled differently in ipv6, because
the simplistic "defrag, connection track and NAT, refrag"
(which the existing ipv4 connection tracking does) approach simply
isn't feasible in ipv6

3) ipv6 extension header parsing must occur at the correct spots
before and after connection tracking decisions, and there were
no provisions for this in the existing connection tracking
design

4) ipv6 has no need for stateful NAT

The ipv4 specific conntrack layer is kept around, until all of
the ipv4 specific conntrack helpers are ported over to nf_conntrack
and it is feature complete. Once that occurs, the old conntrack
stuff will get placed into the feature-removal-schedule and we will
fully kill it off 6 months later.

Signed-off-by: Yasuyuki Kozakai
Signed-off-by: Harald Welte
Signed-off-by: Arnaldo Carvalho de Melo

Yasuyuki Kozakai
2005-11-10 08:38:16 +0800

09 Nov, 2005

1 commit

330d57fb9 [PATCH] Fix sysctl unregistration oops (CVE-2005-2709)

You could open the /proc/sys/net/ipv4/conf// file, then
wait for interface to go away, try to grab as much memory as possible in
hope to hit the (kfreed) ctl_table. Then fill it with pointers to your
function. Then do read from file you've opened and if you are lucky,
you'll get it called as ->proc_handler() in kernel mode.

So this is at least an Oops and possibly more. It does depend on an
interface going away though, so less of a security risk than it would
otherwise be.

Signed-off-by: Greg Kroah-Hartman
Signed-off-by: Linus Torvalds

Al Viro
2005-11-09 09:57:30 +0800

22 Sep, 2005

1 commit

590232a71 [LLC]: Add sysctl support for the LLC timeouts

Signed-off-by: Jochen Friedrich
Signed-off-by: Arnaldo Carvalho de Melo

Arnaldo Carvalho de Melo
2005-09-22 15:30:44 +0800

13 Sep, 2005

1 commit

e21ce8c7c [NETROM]: Implement G8PZT Circuit reset for NET/ROM

NET/ROM is lacking a connection reset like TCP's RST flag, which at times
may result in a connection having to slowly time out instead of just being
reset. An earlier attempt to reset the connection by sending a
NR_CONNACK | NR_CHOKE_FLAG transport was unacceptable as it did result in
crashes of BPQ systems. An alternative approach of introducing a new
transport type 7 (NR_RESET) was implemented several years ago in
Paula Jayne Dowie G8PZT's Xrouter.

Implement NR_RESET for Linux's NET/ROM but like any messing with the state
engine consider this experimental for now and thus control it by a sysctl
(net.netrom.reset) which for the time being defaults to off.

Signed-off-by: Ralf Baechle DL5RB
Signed-off-by: David S. Miller

Ralf Baechle
2005-09-13 05:27:37 +0800

08 Sep, 2005

1 commit

8c702e162 [PATCH] ipmi poweroff: fix chassis control

The IPMI power control function proc_write_chassctrl was badly written: it
directly used userspace pointers, it assumed that strings were NUL
terminated, and it used the evil sscanf function. This converts over to
using the sysctl interface for this data and changes the semantics to be a
little more logical.

Signed-off-by: Corey Minyard
Cc:
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

Corey Minyard
2005-09-08 07:57:49 +0800

28 Jul, 2005

1 commit

951f22d5b [PATCH] s390: spin lock retry

Split spin lock and r/w lock implementation into a single try which is done
inline and an out of line function that repeatedly tries to get the lock
before doing the cpu_relax(). Add a system control to set the number of
retries before a cpu is yielded.

The reason for the spin lock retry is that the diagnose 0x44 that is used to
give up the virtual cpu is quite expensive. For spin locks that are held only
for a short period of time, the cost of the diagnose outweighs the savings
for spin locks that are held for a longer time. The default retry count is
1000.
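
The split between a single inline attempt and an out-of-line retry loop can be illustrated with a userspace model. This is only a sketch: it uses threading.Lock in place of the s390 lock word, and the yield_cpu callback stands in for the expensive diagnose 0x44. All names here are hypothetical.

```python
import threading
import time


def spin_lock(lock: threading.Lock, retries: int = 1000,
              yield_cpu=lambda: time.sleep(0)) -> int:
    """Acquire `lock`, returning how many times the CPU was yielded."""
    # Fast path: one try, which the real code does inline.
    if lock.acquire(blocking=False):
        return 0
    # Slow path: retry `retries` times before each (costly) yield.
    yields = 0
    while True:
        for _ in range(retries):
            if lock.acquire(blocking=False):
                return yields
        yield_cpu()
        yields += 1
```

A higher retry count trades some busy spinning for fewer yields, which pays off when locks are typically held only briefly.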

Signed-off-by: Martin Schwidefsky
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

Martin Schwidefsky
2005-07-28 07:26:04 +0800

14 Jul, 2005

1 commit

0399cb08c [PATCH] inotify: move sysctl

This moves the inotify sysctl knobs to "/proc/sys/fs/inotify" from
"/proc/sys/fs". Also some related cleanup.

Signed-off-by: Robert Love
Signed-off-by: Linus Torvalds

Robert Love
2005-07-14 02:09:31 +0800

13 Jul, 2005

1 commit

0eeca2830 [PATCH] inotify

inotify is intended to correct the deficiencies of dnotify, particularly
its inability to scale and its terrible user interface:

* dnotify requires the opening of one fd per each directory
that you intend to watch. This quickly results in too many
open files and pins removable media, preventing unmount.
* dnotify is directory-based. You only learn about changes to
directories. Sure, a change to a file in a directory affects
the directory, but you are then forced to keep a cache of
stat structures.
* dnotify's interface to user-space is awful. Signals?

inotify provides a more usable, simple, powerful solution to file change
notification:

* inotify's interface is a system call that returns a fd, not SIGIO.
You get a single fd, which is select()-able.
* inotify has an event that says "the filesystem that the item
you were watching is on was unmounted."
* inotify can watch directories or files.

Inotify is currently used by Beagle (a desktop search infrastructure),
Gamin (a FAM replacement), and other projects.

See Documentation/filesystems/inotify.txt.

Signed-off-by: Robert Love
Cc: John McCutchan
Cc: Christoph Hellwig
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

Robert Love
2005-07-13 11:38:38 +0800

29 Jun, 2005

1 commit

2f85a4296 [SCTP] Make init & delayed sack timeouts configurable by user.

Signed-off-by: Vlad Yasevich
Signed-off-by: Sridhar Samudrala
Signed-off-by: David S. Miller

Vlad Yasevich
2005-06-29 04:24:23 +0800

24 Jun, 2005

3 commits

51b0bdedb [NET]: Separate two usages of netdev_max_backlog.

Separate out the two uses of netdev_max_backlog. One controls the
upper bound on packets processed per softirq, the new name for this is
netdev_budget; the other controls the limit on packets queued via
netif_rx.
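
The two limits play different roles, which a small queue model may make clearer. This is an illustrative sketch rather than the kernel's data path: the Backlog class is hypothetical, and its method names merely echo the kernel terms used above.

```python
from collections import deque


class Backlog:
    """Toy receive queue with separate enqueue and processing limits."""

    def __init__(self, max_backlog: int, budget: int) -> None:
        self.max_backlog = max_backlog  # cap on packets queued via netif_rx
        self.budget = budget            # cap on packets handled per softirq
        self.queue = deque()
        self.dropped = 0

    def netif_rx(self, pkt) -> bool:
        """Queue a packet, or drop it when the backlog is full."""
        if len(self.queue) >= self.max_backlog:
            self.dropped += 1
            return False
        self.queue.append(pkt)
        return True

    def softirq(self) -> list:
        """Process at most `budget` packets in one run."""
        done = []
        while self.queue and len(done) < self.budget:
            done.append(self.queue.popleft())
        return done
```

Before the split, one number governed both behaviours, so raising the queue limit also lengthened each softirq run.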

Increase the max_backlog default to account for faster processors.

Signed-off-by: Stephen Hemminger
Signed-off-by: David S. Miller

Stephen Hemminger
2005-06-24 11:14:40 +0800

317a76f9a [TCP]: Add pluggable congestion control algorithm infrastructure.

Allow TCP to have multiple pluggable congestion control algorithms.
Algorithms are defined by a set of operations and can be built in
or modules. The legacy "new RENO" algorithm is used as a starting
point and fallback.
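
The idea of algorithms "defined by a set of operations" with Reno as the fallback can be sketched as a small registry. Everything here is a simplified model under stated assumptions: the CongestionOps type, the two operations chosen, and the per-round-trip Reno growth rule are illustrative, not the kernel's actual interface.

```python
from dataclasses import dataclass
from typing import Callable, Dict


@dataclass(frozen=True)
class CongestionOps:
    """One pluggable algorithm: a name plus its operations (in packets)."""
    name: str
    ssthresh: Callable[[int], int]         # cwnd -> new slow start threshold
    cong_avoid: Callable[[int, int], int]  # (cwnd, ssthresh) -> new cwnd


def reno_ssthresh(cwnd: int) -> int:
    """Halve the window on loss, but never go below two packets."""
    return max(cwnd // 2, 2)


def reno_cong_avoid(cwnd: int, ssthresh: int) -> int:
    """Per round trip: double in slow start, then add one packet."""
    return min(cwnd * 2, ssthresh) if cwnd < ssthresh else cwnd + 1


RENO = CongestionOps("reno", reno_ssthresh, reno_cong_avoid)
_registry: Dict[str, CongestionOps] = {RENO.name: RENO}


def register(ops: CongestionOps) -> None:
    """Add an algorithm, as a built-in or a loaded module would."""
    if ops.name in _registry:
        raise ValueError("already registered: " + ops.name)
    _registry[ops.name] = ops


def select(name: str) -> CongestionOps:
    """Look up an algorithm by name, falling back to Reno."""
    return _registry.get(name, RENO)
```

Keeping Reno permanently registered means a lookup for an unknown or unloaded algorithm still yields working operations.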

Signed-off-by: Stephen Hemminger
Signed-off-by: David S. Miller

Stephen Hemminger
2005-06-24 03:19:55 +0800

d6e711448 [PATCH] setuid core dump

Add a new `suid_dumpable' sysctl:

This value can be used to query and set the core dump mode for setuid
or otherwise protected/tainted binaries. The modes are

0 - (default) - traditional behaviour. Any process which has changed
privilege levels or is execute only will not be dumped

1 - (debug) - all processes dump core when possible. The core dump is
owned by the current user and no security is applied. This is intended
for system debugging situations only. Ptrace is unchecked.

2 - (suidsafe) - any binary which normally would not be dumped is dumped
readable by root only. This allows the end user to remove such a dump but
not access it directly. For security reasons core dumps in this mode will
not overwrite one another or other files. This mode is appropriate when
administrators are attempting to debug problems in a normal environment.
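
The three modes listed above can be summarised as a decision function. This is a sketch of the described policy only. The names are hypothetical, and `privileged` stands for "has changed privilege levels or is execute only".

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class DumpDecision:
    dump: bool
    readable_by: Optional[str] = None  # "user" or "root" when dump is True


def core_dump_decision(mode: int, privileged: bool) -> DumpDecision:
    """Map a suid_dumpable mode to the core dump behaviour described above."""
    if not privileged:
        return DumpDecision(True, "user")  # ordinary processes dump as usual
    if mode == 0:
        return DumpDecision(False)         # default: no dump
    if mode == 1:
        return DumpDecision(True, "user")  # debug: no security applied
    if mode == 2:
        return DumpDecision(True, "root")  # suidsafe: readable by root only
    raise ValueError("unknown suid_dumpable mode: %d" % mode)
```

Only mode 1 exposes a privileged process's memory to the invoking user, which is why the description reserves it for debugging.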

(akpm:

> > +EXPORT_SYMBOL(suid_dumpable);
>
> EXPORT_SYMBOL_GPL?

No problem to me.

> > if (current->euid == current->uid && current->egid == current->gid)
> > current->mm->dumpable = 1;
>
> Should this be SUID_DUMP_USER?

Actually the feedback I had from last time was that the SUID_ defines
should go because it's clearer to follow the numbers. They can go
everywhere (and there are lots of places where dumpable is tested/used
as a bool in untouched code)

> Maybe this should be renamed to `dump_policy' or something. Doing that
> would help us catch any code which isn't using the #defines, too.

Fair comment. The patch was designed to be easy to maintain for Red Hat
rather than for merging. Changing that field would create a gigantic
diff because it is used all over the place.

)

Signed-off-by: Alan Cox
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

Alan Cox
2005-06-24 00:45:26 +0800

14 Jun, 2005

1 commit

1c2fb7f93 [IPV4]: Sysctl configurable icmp error source address.

This patch allows you to change the source address of icmp error
messages. It applies cleanly to 2.6.11.11 and retains the default
behaviour.

In the old (default) behaviour icmp error messages are sent with the ip
of the exiting interface.

With the new behaviour (when the sysctl variable is toggled on), it will send
the message with the ip of the interface that received the packet that
caused the icmp error. This is the behaviour network administrators will
expect from a router. It makes debugging complicated network layouts
much easier. Also, all 'vendor routers' I know of have the latter
behaviour.
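
The choice between the two behaviours reduces to picking one of two addresses. The sketch below only illustrates that selection: the function and parameter names are hypothetical, and the addresses in the usage note are documentation examples.

```python
import ipaddress


def icmp_error_source(ingress_ip: str, egress_ip: str,
                      use_inbound_ifaddr: bool = False) -> str:
    """Pick the source address for an ICMP error message.

    Off (default): address of the interface the error leaves through.
    On: address of the interface that received the offending packet.
    """
    chosen = ingress_ip if use_inbound_ifaddr else egress_ip
    return str(ipaddress.ip_address(chosen))  # also validates the address
```

For a router that received the packet on 10.0.0.1 and replies via 192.0.2.1, the default yields 192.0.2.1, and enabling the toggle yields 10.0.0.1.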

Signed-off-by: David S. Miller

J. Simonetti
2005-06-14 06:19:03 +0800