Doug / smarc-fsl-linux-kernel | Embedian Git Server

10 Jul, 2008

1 commit

9cabcdbd4 [GFS2] Replace rgrp "recent list" with mru list ... Browse Code »

This patch removes the "recent list" which is used during allocation
and replaces it with the (already existing) mru list used during
deletion. The "recent list" was not a true mru list leading to a number
of inefficiencies including a "next" function which made scanning the
list an order N^2 operation wrt to the number of list elements.

This should increase allocation performance with large numbers of rgrps.
Its also a useful preparation and cleanup before some further changes
which are planned in this area.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2008-07-10 22:54:12 +0800

07 Jul, 2008

2 commits

209806aba [GFS2] Allow local DF locks when holding a cached EX glock ... Browse Code »

We already allow local SH locks while we hold a cached EX glock, so here
we allow DF locks as well. This works only because we rely on the VFS's
invalidation for locally cached data, and because if we hold an EX lock,
then we know that no other node can be caching data relating to this
file.

It dramatically speeds up initial writes to O_DIRECT files since we fall
back to buffered I/O for this and would otherwise bounce between DF and
EX modes on each and every write call. The lessons to be learned from
that are to ensure that (for the time being anyway) O_DIRECT files are
preallocated and that they are written to using reasonably large I/O
sizes. Even so this change fixes that corner case nicely

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2008-07-07 17:07:28 +0800
265d529ce [GFS2] Fix delayed demote race ... Browse Code »

There is a race in the delayed demote code where it does the wrong thing
if a demotion to UN has occurred for other reasons before the delay has
expired. This patch adds an assert to catch that condition as well as
fixing the root cause by adding an additional check for the UN state.

Signed-off-by: Steven Whitehouse
Cc: Bob Peterson

Steven Whitehouse
2008-07-07 17:02:36 +0800

03 Jul, 2008

1 commit

f58ba8891 [GFS2] don't call permission() ... Browse Code »

GFS2 calls permission() to verify permissions after locks on the files
have been taken.

For this it's sufficient to call gfs2_permission() instead. This
results in the following changes:

- IS_RDONLY() check is not performed
- IS_IMMUTABLE() check is not performed
- devcgroup_inode_permission() is not called
- security_inode_permission() is not called

IS_RDONLY() should be unnecessary anyway, as the per-mount read-only
flag should provide protection against read-only remounts during
operations. do_gfs2_set_flags() has been fixed to perform
mnt_want_write()/mnt_drop_write() to protect against remounting
read-only.

IS_IMMUTABLE has been added to gfs2_permission()

Repeating the security checks seems to be pointless, as they don't
normally change, and if they do, it's independent of the filesystem
state.

Signed-off-by: Miklos Szeredi
Signed-off-by: Steven Whitehouse

Miklos Szeredi
2008-07-03 17:22:01 +0800

27 Jun, 2008

12 commits

f17172e00 [GFS2] Fix module building ... Browse Code »

Two lines missed from the previous patch.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2008-06-27 16:40:57 +0800
9f1585cb0 [GFS2] Glock documentation ... Browse Code »

This patch adds a file describing the internals of GFS2's glock
abstraction.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2008-06-27 16:39:53 +0800
31fcba00f [GFS2] Remove all_list from lock_dlm ... Browse Code »

I discovered that we had a list onto which every lock_dlm
lock was being put. Its only function was to discover whether
we'd got any locks left after umount. Since there was already
a counter for that purpose as well, I removed the list. The
saving is sizeof(struct list_head) per glock - well worth
having.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2008-06-27 16:39:50 +0800
b2cad26cf [GFS2] Remove obsolete conversion deadlock avoidance code ... Browse Code »

This is only used by GFS1 so can be removed.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2008-06-27 16:39:47 +0800
1bdad6063 [GFS2] Remove remote lock dropping code ... Browse Code »

There are several reasons why this is undesirable:

1. It never happens during normal operation anyway
2. If it does happen it causes performance to be very, very poor
3. It isn't likely to solve the original problem (memory shortage
on remote DLM node) it was supposed to solve
4. It uses a bunch of arbitrary constants which are unlikely to be
correct for any particular situation and for which the tuning seems
to be a black art.
5. In an N node cluster, only 1/N of the dropped locked will actually
contribute to solving the problem on average.

So all in all we are better off without it. This also makes merging
the lock_dlm module into GFS2 a bit easier.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2008-06-27 16:39:44 +0800
9171f5a99 [GFS2] kernel panic mounting volume ... Browse Code »

This patch fixes Red Hat bugzilla bug 450156.

This started with a not-too-improbable mount failure because the
locking protocol was never set back to its proper "lock_dlm" after the
system was rebooted in the middle of a gfs2_fsck. That left a
(purposely) invalid locking protocol in the superblock, which caused an
error when the file system was mounted the next time.

When there's an error mounting, vfs calls DQUOT_OFF, which calls
vfs_quota_off which calls gfs2_sync_fs. Next, gfs2_sync_fs calls
gfs2_log_flush passing s_fs_info. But due to the error, s_fs_info
had been previously set to NULL, and so we have the kernel oops.

My solution in this patch is to test for the NULL value before passing
it. I tested this patch and it fixes the problem.

Signed-off-by: Bob Peterson
Signed-off-by: Steven Whitehouse

Bob Peterson
2008-06-27 16:39:41 +0800
01b7c7ae8 [GFS2] Revise readpage locking ... Browse Code »

The previous attempt to fix the locking in readpage failed due
to the use of a "try lock" which resulted in occasional high
cpu usage during testing (due to repeated tries) and also it
did not resolve all the ordering problems wrt the transaction
lock (although it did solve all the inode lock ordering problems).

This patch avoids the problem by unlocking the page and getting the
locks in the correct order. This means that we have to retest the
page to ensure that it hasn't changed when we relock the page.

This now passes the tests which were previously failing.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2008-06-27 16:39:37 +0800
802747372 [GFS2] Fix ordering of args for list_add ... Browse Code »

The patch to remove lock_nolock managed to get the arguments
of this list_add backwards. This fixes it.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2008-06-27 16:39:34 +0800
2d81afb87 [GFS2] trivial sparse lock annotations ... Browse Code »

Annotate the &sdp->sd_log_lock.

Signed-off-by: Harvey Harrison
Signed-off-by: Steven Whitehouse

Harvey Harrison
2008-06-27 16:39:31 +0800
048bca223 [GFS2] No lock_nolock ... Browse Code »

This patch merges the lock_nolock module into GFS2 itself. As well as removing
some of the overhead of the module, it also means that its now impossible to
build GFS2 without a lock module (which would be a pointless thing to do
anyway).

We also plan to merge lock_dlm into GFS2 in the future, but that is a more
tricky task, and will therefore be a separate patch.

Signed-off-by: Steven Whitehouse
Cc: David Teigland

Steven Whitehouse
2008-06-27 16:39:28 +0800
f3c9d38a2 [GFS2] Fix ordering bug in lock_dlm ... Browse Code »

This looks like a lot of change, but in fact its not. Mostly its
things moving from one file to another. The change is just that
instead of queuing lock completions and callbacks from the DLM
we now pass them directly to GFS2.

This gives us a net loss of two list heads per glock (a fair
saving in memory) plus a reduction in the latency of delivering
the messages to GFS2, plus we now have one thread fewer as well.
There was a bug where callbacks and completions could be delivered
in the wrong order due to this unnecessary queuing which is fixed
by this patch.

Signed-off-by: Steven Whitehouse
Cc: Bob Peterson

Steven Whitehouse
2008-06-27 16:39:25 +0800
6802e3400 [GFS2] Clean up the glock core ... Browse Code »

This patch implements a number of cleanups to the core of the
GFS2 glock code. As a result a lot of code is removed. It looks
like a really big change, but actually a large part of this patch
is either removing or moving existing code.

There are some new bits too though, such as the new run_queue()
function which is considerably streamlined. Highlights of this
patch include:

o Fixes a cluster coherency bug during SH -> EX lock conversions
o Removes the "glmutex" code in favour of a single bit lock
o Removes the ->go_xmote_bh() for inodes since it was duplicating
->go_lock()
o We now only use the ->lm_lock() function for both locks and
unlocks (i.e. unlock is a lock with target mode LM_ST_UNLOCKED)
o The fast path is considerably shortly, giving performance gains
especially with lock_nolock
o The glock_workqueue is now used for all the callbacks from the DLM
which allows us to simplify the lock_dlm module (see following patch)
o The way is now open to make further changes such as eliminating the two
threads (gfs2_glockd and gfs2_scand) in favour of a more efficient
scheme.

This patch has undergone extensive testing with various test suites
so it should be pretty stable by now.

Signed-off-by: Steven Whitehouse
Cc: Bob Peterson

Steven Whitehouse
2008-06-27 16:39:22 +0800

25 Jun, 2008

17 commits

543cf4cb3 Linux 2.6.26-rc8 Browse Code »

Linus Torvalds
2008-06-25 09:58:20 +0800
bd8c540fe Merge branch 'release' of git://git.kernel.org/pub/scm/linux/kernel/git/aegl/linux-2.6 ... Browse Code »

* 'release' of git://git.kernel.org/pub/scm/linux/kernel/git/aegl/linux-2.6:
[IA64] Eliminate NULL test after alloc_bootmem in iosapic_alloc_rte()
[IA64] Handle count==0 in sn2_ptc_proc_write()
[IA64] Fix boot failure on ia64/sn2

Linus Torvalds
2008-06-25 09:12:33 +0800
035cfc61a Merge git://git.kernel.org/pub/scm/linux/kernel/git/steve/gfs2-2.6-fixes ... Browse Code »

* git://git.kernel.org/pub/scm/linux/kernel/git/steve/gfs2-2.6-fixes:
[GFS2] fix gfs2 block allocation (cleaned up)
[GFS2] BUG: unable to handle kernel paging request at ffff81002690e000

Linus Torvalds
2008-06-25 09:09:47 +0800
919c0d14a Merge branch 'kvm-updates-2.6.26' of git://git.kernel.org/pub/scm/linux/kernel/git/avi/kvm ... Browse Code »

* 'kvm-updates-2.6.26' of git://git.kernel.org/pub/scm/linux/kernel/git/avi/kvm:
KVM: Remove now unused structs from kvm_para.h
x86: KVM guest: Use the paravirt clocksource structs and functions
KVM: Make kvm host use the paravirt clocksource structs
x86: Make xen use the paravirt clocksource structs and functions
x86: Add structs and functions for paravirt clocksource
KVM: VMX: Fix host msr corruption with preemption enabled
KVM: ioapic: fix lost interrupt when changing a device's irq
KVM: MMU: Fix oops on guest userspace access to guest pagetable
KVM: MMU: large page update_pte issue with non-PAE 32-bit guests (resend)
KVM: MMU: Fix rmap_write_protect() hugepage iteration bug
KVM: close timer injection race window in __vcpu_run
KVM: Fix race between timer migration and vcpu migration

Linus Torvalds
2008-06-25 09:09:06 +0800
de08341a0 Merge git://git.kernel.org/pub/scm/linux/kernel/git/wim/linux-2.6-watchdog ... Browse Code »

* git://git.kernel.org/pub/scm/linux/kernel/git/wim/linux-2.6-watchdog:
Revert "[WATCHDOG] hpwdt: Add CFLAGS to get driver working"

Linus Torvalds
2008-06-25 02:23:35 +0800
9bf8a943a Merge branch 'x86-fixes-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/… ... Browse Code »

…git/tip/linux-2.6-tip

* 'x86-fixes-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/linux-2.6-tip:
xen: remove support for non-PAE 32-bit

Linus Torvalds
2008-06-25 02:21:47 +0800
3b968b7c1 Merge branch 'for_linus' of git://git.kernel.org/pub/scm/linux/kernel/git/jwessel/linux-2.6-kgdb ... Browse Code »

* 'for_linus' of git://git.kernel.org/pub/scm/linux/kernel/git/jwessel/linux-2.6-kgdb:
kgdb: sparse fix
kgdb: documentation update - remove kgdboe

Linus Torvalds
2008-06-25 02:20:59 +0800
ea7b44c8e enable bus mastering on i915 at resume time ... Browse Code »

On 9xx chips, bus mastering needs to be enabled at resume time for much of the
chip to function. With this patch, vblank interrupts will work as expected
on resume, along with other chip functions. Fixes kernel bugzilla #10844.

Signed-off-by: Jie Luo
Signed-off-by: Jesse Barnes
Signed-off-by: Linus Torvalds

Jie Luo
2008-06-25 02:17:25 +0800
6b1ed9086 KVM: Remove now unused structs from kvm_para.h ... Browse Code »

The kvm_* structs are obsoleted by the pvclock_* ones.
Now all users have been switched over and the old structs
can be dropped.

Signed-off-by: Gerd Hoffmann
Signed-off-by: Avi Kivity

Gerd Hoffmann
2008-06-25 02:02:33 +0800
f6e16d5ad x86: KVM guest: Use the paravirt clocksource structs and functions ... Browse Code »

This patch updates the kvm host code to use the pvclock structs
and functions, thereby making it compatible with Xen.

The patch also fixes an initialization bug: on SMP systems the
per-cpu has two different locations early at boot and after CPU
bringup. kvmclock must take that in account when registering the
physical address within the host.

Signed-off-by: Gerd Hoffmann
Signed-off-by: Avi Kivity

Gerd Hoffmann
2008-06-25 02:02:33 +0800
50d0a0f98 KVM: Make kvm host use the paravirt clocksource structs ... Browse Code »

This patch updates the kvm host code to use the pvclock structs.
It also makes the paravirt clock compatible with Xen.

Signed-off-by: Gerd Hoffmann
Signed-off-by: Avi Kivity

Gerd Hoffmann
2008-06-25 02:02:32 +0800
1c7b67f75 x86: Make xen use the paravirt clocksource structs and functions ... Browse Code »

This patch updates the xen guest to use the pvclock structs
and helper functions.

Signed-off-by: Gerd Hoffmann
Acked-by: Jeremy Fitzhardinge
Signed-off-by: Avi Kivity

Gerd Hoffmann
2008-06-25 02:02:32 +0800
7af192c95 x86: Add structs and functions for paravirt clocksource ... Browse Code »

This patch adds structs for the paravirt clocksource ABI
used by both xen and kvm (pvclock-abi.h).

It also adds some helper functions to read system time and
wall clock time from a paravirtual clocksource (pvclock.[ch]).
They are based on the xen code. They are enabled using
CONFIG_PARAVIRT_CLOCK.

Subsequent patches of this series will put the code in use.

Signed-off-by: Gerd Hoffmann
Acked-by: Jeremy Fitzhardinge
Signed-off-by: Avi Kivity

Gerd Hoffmann
2008-06-25 02:02:31 +0800
5af4e7a0b [GFS2] fix gfs2 block allocation (cleaned up) ... Browse Code »

This patch fixes bz 450641.

This patch changes the computation for zero_metapath_length(), which it
renames to metapath_branch_start(). When you are extending the metadata
tree, The indirect blocks that point to the new data block must either
diverge from the existing tree either at the inode, or at the first
indirect block. They can diverge at the first indirect block because the
inode has room for 483 pointers while the indirect blocks have room for
509 pointers, so when the tree is grown, there is some free space in the
first indirect block. What metapath_branch_start() now computes is the
height where the first indirect block for the new data block is located.
It can either be 1 (if the indirect block diverges from the inode) or 2
(if it diverges from the first indirect block).

Signed-off-by: Benjamin Marzinski
Signed-off-by: Steven Whitehouse

Benjamin Marzinski
2008-06-25 02:02:28 +0800
e2569b7e5 [IA64] Eliminate NULL test after alloc_bootmem in iosapic_alloc_rte() ... Browse Code »

As noted by Akinobu Mita alloc_bootmem and related functions never return
NULL and always return a zeroed region of memory. Thus a NULL test or
memset after calls to these functions is unnecessary.

Signed-off-by: Julia Lawall
Signed-off-by: Tony Luck

Julia Lawall
2008-06-25 01:28:55 +0800
8097110d1 [IA64] Handle count==0 in sn2_ptc_proc_write() ... Browse Code »

The fix applied in e0c6d97c65e0784aade7e97b9411f245a6c543e7
"security hole in sn2_ptc_proc_write" didn't take into account
the case where count==0 (which results in a buffer underrun
when adding the trailing '\0'). Thanks to Andi Kleen for
pointing this out.

Signed-off-by: Cliff Wickman
Signed-off-by: Tony Luck

Cliff Wickman
2008-06-25 01:20:06 +0800
2826f8c0f [IA64] Fix boot failure on ia64/sn2 ... Browse Code »

Call check_sal_cache_flush() after platform_setup() as
check_sal_cache_flush() now relies on being able to call platform
vector code.

Problem was introduced by: 3463a93def55c309f3c0d0a8aaf216be3be42d64
"Update check_sal_cache_flush to use platform_send_ipi()"

Signed-off-by: Jes Sorensen
Tested-by: Alex Chiang:
Signed-off-by: Tony Luck

Jes Sorensen
2008-06-25 01:16:27 +0800

24 Jun, 2008

7 commits

aabdc3b8c kgdb: sparse fix ... Browse Code »

- Fix warning reported by sparse
kernel/kgdb.c:1502:6: warning: symbol 'kgdb_console_write' was not declared.
Should it be static?

Signed-off-by: Jason Wessel

Jason Wessel
2008-06-24 23:52:55 +0800
a606b5e24 kgdb: documentation update - remove kgdboe ... Browse Code »

kgdboe is not presently included kgdb, and there should be no
references to it.

Also fix the tcp port terminal connection example.

Signed-off-by: Jason Wessel

Jason Wessel
2008-06-24 23:52:55 +0800
284991439 xen: remove support for non-PAE 32-bit ... Browse Code »

Non-PAE operation has been deprecated in Xen for a while, and is
rarely tested or used. xen-unstable has now officially dropped
non-PAE support. Since Xen/pvops' non-PAE support has also been
broken for a while, we may as well completely drop it altogether.

Signed-off-by: Jeremy Fitzhardinge
Signed-off-by: Ingo Molnar
Signed-off-by: Thomas Gleixner
Signed-off-by: Ingo Molnar

Jeremy Fitzhardinge
2008-06-24 23:00:55 +0800
17c15da00 [GFS2] BUG: unable to handle kernel paging request at ffff81002690e000 ... Browse Code »

This patch fixes bugzilla bug bz448866: gfs2: BUG: unable to
handle kernel paging request at ffff81002690e000.

Signed-off-by: Bob Peterson
Signed-off-by: Steven Whitehouse

Bob Peterson
2008-06-24 21:17:45 +0800
63842cccb Revert "[WATCHDOG] hpwdt: Add CFLAGS to get driver working" ... Browse Code »

After Linus fixed the inline assembly, the CFLAGS option is not
needed anymore.

Signed-off-by: Thomas Mingarelli
Signed-off-by: Wim Van Sebroeck

Wim Van Sebroeck
2008-06-24 21:09:26 +0800
a9b21b622 KVM: VMX: Fix host msr corruption with preemption enabled ... Browse Code »

Switching msrs can occur either synchronously as a result of calls to
the msr management functions (usually in response to the guest touching
virtualized msrs), or asynchronously when preempting a kvm thread that has
guest state loaded. If we're unlucky enough to have the two at the same
time, host msrs are corrupted and the machine goes kaput on the next syscall.

Most easily triggered by Windows Server 2008, as it does a lot of msr
switching during bootup.

Signed-off-by: Avi Kivity

Avi Kivity
2008-06-24 17:26:17 +0800
4fa6b9c5d KVM: ioapic: fix lost interrupt when changing a device's irq ... Browse Code »

The ioapic acknowledge path translates interrupt vectors to irqs. It
currently uses a first match algorithm, stopping when it finds the first
redirection table entry containing the vector. That fails however if the
guest changes the irq to a different line, leaving the old redirection table
entry in place (though masked). Result is interrupts not making it to the
guest.

Fix by always scanning the entire redirection table.

Signed-off-by: Avi Kivity

Avi Kivity
2008-06-24 17:23:55 +0800