16 Jun, 2009

1 commit


12 Jun, 2009

1 commit


06 May, 2009

1 commit

  • The only references in the kernel to the .text.sched section are in
    recordmcount.pl. Since the code it has is intended to be example code
    it should refer to real kernel sections. So change it to .sched.text
    instead.

    [ Impact: consistency in comments ]

    Signed-off-by: Tim Abbott
    LKML-Reference:
    Acked-by: Sam Ravnborg
    Signed-off-by: Steven Rostedt

    Tim Abbott
     

19 Jan, 2009

1 commit

  • Impact: fix failure of dynamic function tracer selftest

    In a course of development, a developer does several makes on their
    kernel. Sometimes, the make might do something abnormal. In the
    case of running the recordmcount.pl script on an object twice,
    the script will duplicate all the calls to mcount in the __mcount_loc
    section.

    On boot up, the dynamic function tracer is careful when it modifies
    code, and performs several consistency checks. One is to not modify
    the call site if it is not what it expects it to be. If a function
    call site is listed twice, the first entry will convert the site
    to a nop, and the second will fail because it expected to see a
    call to mcount, but instead it sees a nop. Thus, the function tracer
    is disabled.

    Eric Sesterhenn reported seeing:

    [ 1.055440] ftrace: converting mcount calls to 0f 1f 44 00 00
    [ 1.055568] ftrace: allocating 29418 entries in 116 pages
    [ 1.061000] ------------[ cut here ]------------
    [ 1.061000] WARNING: at kernel/trace/ftrace.c:441

    [...]

    [ 1.060000] ---[ end trace 4eaa2a86a8e2da23 ]---
    [ 1.060000] ftrace failed to modify [] check_corruption+0x3/0x2d
    [ 1.060000] actual: 0f:1f:44:00:00

    This warning shows that check_corruption+0x3 already had a nop in
    its place (0x0f1f440000). After compiling another kernel the problem
    went away.

    Later Eric Paris notice the same type of issue. Luckily, he saved
    the vmlinux file that caused it. In the file we found a bunch of
    duplicate mcount call site records, which lead us to the script.

    Perhaps this problem only happens to people named Eric.

    This patch changes the script to test if the __mcount_loc already
    exists in the object file, and if it does, it will print out
    an error message and kill the compile.

    Reported-by: Eric Sesterhenn
    Reported-by: Eric Paris
    Signed-off-by: Steven Rostedt
    Signed-off-by: Ingo Molnar

    Steven Rostedt
     

14 Jan, 2009

3 commits


12 Dec, 2008

1 commit

  • Impact: let the function-graph-tracer be aware of the irq entrypoints

    Add a new .irqentry.text section to store the irq entrypoints functions
    inside the same section. This way, the tracer will be able to signal
    an interrupts triggering on output by recognizing these entrypoints.

    Also, make this section recordable for dynamic tracing.

    Signed-off-by: Frederic Weisbecker
    Signed-off-by: Ingo Molnar

    Frederic Weisbecker
     

26 Nov, 2008

1 commit

  • Impact: widen the scope of recordmcount.pl

    Besides .text section, there are three .text sections that won't
    be freed after kernel booting. They are: .sched.text, .spinlock.text
    and .kprobes.text, which contain functions we can trace. But the last
    section ".kprobes.text" is particular, which has been marked as "notrace",
    we ignore it. Thus we add other two sections.

    Signed-off-by: Liming Wang
    Acked-by: Steven Rostedt
    Signed-off-by: Ingo Molnar

    Liming Wang
     

23 Nov, 2008

5 commits

  • Impact: extend scripts/recordmcount.pl to ARM

    Arm uses %progbits instead of @progbits and requires only 4 byte alignment.

    [ Thanks to Sam Ravnborg for mentioning that ARM uses %progbits ]

    Signed-off-by: Jim Radford
    Signed-off-by: Steven Rostedt
    Signed-off-by: Ingo Molnar

    Jim Radford
     
  • Impact: extend scripts/recordmcount.pl with default alignment for SH

    Set $alignment=2 for the sh architecture so that a ".align 2" directive
    will be emitted for all __mcount_loc sections. Fix a whitspace error
    while I'm here (converted spaces to tabs).

    Signed-off-by: Matt Fleming
    Signed-off-by: Steven Rostedt
    Signed-off-by: Ingo Molnar

    Matt Fleming
     
  • Impact: cleanup of recordmcount.pl

    Now that more architectures are being ported to the MCOUNT_RECORD
    method, there is no reason to have each declare their own arch
    specific variable if most of them share the same value. This patch
    creates a set of default values for the arch specific variables
    based off of i386.

    Signed-off-by: Steven Rostedt
    Signed-off-by: Ingo Molnar

    Steven Rostedt
     
  • Impact: Add PowerPC port to recordmcount.pl script

    This patch updates the recordmcount.pl script to process
    PowerPC.

    Signed-off-by: Steven Rostedt
    Signed-off-by: Ingo Molnar

    Steven Rostedt
     
  • First cut at dynamic ftrace support.

    [
    Steven Rostedt - only updated the recordmcount.pl file.
    There are updates for PowerPC that will conflict with this,
    and we need to base off of these changes.
    ]

    Signed-off-by: Matt Fleming
    Signed-off-by: Paul Mundt
    Signed-off-by: Steven Rostedt
    Signed-off-by: Ingo Molnar

    Matt Fleming
     

08 Nov, 2008

1 commit

  • Impact: add alignment option for recordmcount.pl script

    Align the __mcount_loc sections so that architectures with strict
    alignment requirements need not worry about performing unaligned
    accesses.

    This fixes an issue where I was seeing unaligned accesses, which are not
    supported on our architecture (the results of an unaligned access are
    undefined).

    Signed-off-by: Matt Fleming
    Signed-off-by: Steven Rostedt
    Signed-off-by: Ingo Molnar

    Matt Fleming
     

23 Oct, 2008

2 commits

  • The text section stays in memory without ever leaving. With the exception
    of modules, but modules know how to handle that case. With the dynamic
    ftrace tracer, we need to make sure that it does not try to modify code
    that no longer exists. The only safe section is .text.

    This patch changes the recordmcount script to only record the mcount calls
    in the .text sections.

    Signed-off-by: Steven Rostedt
    Signed-off-by: Ingo Molnar

    Steven Rostedt
     
  • The recordmcount script requires that the actual arch is passed in.
    This works well when ARCH=i386 or ARCH=x86_64 but does not handle the
    case of ARCH=x86.

    This patch adds a parameter to the function to pass in the number of
    bits of the architecture. So that it can determine if x86 should be
    run for x86_64 or i386 archs.

    Signed-off-by: Steven Rostedt
    Signed-off-by: Ingo Molnar

    Steven Rostedt
     

14 Oct, 2008

6 commits

  • The warning messages about old objcopy and local functions spam the
    user quite drastically. Remove the warning until we can find a nicer
    way of tell the user to upgrade their objcopy.

    Signed-off-by: Steven Rostedt
    Cc: Stephen Rothwell
    Signed-off-by: Ingo Molnar

    Steven Rostedt
     
  • The --globalize-symbols option came out in objcopy version 2.17.
    If the kernel is being compiled on a system with a lower version of
    objcopy, then we can not use the globalize / localize trick to
    link to symbols pointing to local functions.

    This patch tests the version of objcopy and will only use the trick
    if the version is greater than or equal to 2.17. Otherwise, if an
    object has only local functions within a section, it will give a
    nice warning and recommend the user to upgrade their objcopy.

    Leaving the symbols unrecorded is not that big of a deal, since the
    mcount record method changes the actual mcount code to be a simple
    "ret" without recording registers or anything.

    Reported-by: Stephen Rothwell
    Signed-off-by: Steven Rostedt
    Signed-off-by: Ingo Molnar

    Steven Rostedt
     
  • During tests and checks, I've discovered that there were failures to
    convert mcount callers into nops. Looking deeper into these failures,
    code that was attempted to be changed was not an mcount caller.
    The current code only updates if the code being changed is what it expects,
    but I still investigate any time there is a failure.

    What was happening is that a weak symbol was being used as a reference
    for other mcount callers. That weak symbol was also referenced elsewhere
    so the offsets were using the strong symbol and not the function symbol
    that it was referenced from.

    This patch changes the setting up of the mcount_loc section to search
    for a global function that is not weak. It will pick a local over a weak
    but if only a weak is found in a section, a warning is printed and the
    mcount location is not recorded (just to be safe).

    Signed-off-by: Steven Rostedt
    Signed-off-by: Ingo Molnar

    Steven Rostedt
     
  • I'm trying to keep all the arch changes in recordmcount.pl in one place.
    I moved your code into that area, by adding the flags to the commands
    that were passed in.

    Signed-off-by: Steven Rostedt
    Signed-off-by: Ingo Molnar

    Steven Rostedt
     
  • hack around:

    ld: Relocatable linking with relocations from format elf32-i386 (init/.tmp_gl_calibrate.o) to format elf64-x86-64 (init/.tmp_mx_calibrate.o) i CC arch/x86/mm/extable.o
    objcopy: 'init/.tmp_mx_calibrate.o': No such file
    rm: cannot remove `init/.tmp_mx_calibrate.o': No such file or directory
    ld: Relocatable linking with relocations from format elf32-i386 (arch/x86/mm/extable.o) to format elf64-x86-64 (arch/x86/mm/.tmp_mx_extable.o) is not supported
    mv: cannot stat `arch/x86/mm/.tmp_mx_extable.o': No such file or directory
    ld: Relocatable linking with relocations from format elf32-i386 (arch/x86/mm/fault.o) to format elf64-x86-64 (arch/x86/mm/.tmp_mx_fault.o) is not supported

    Signed-off-by: Ingo Molnar

    Ingo Molnar
     
  • This patch creates a section in the kernel called "__mcount_loc".
    This will hold a list of pointers to the mcount relocation for
    each call site of mcount.

    For example:

    objdump -dr init/main.o
    [...]
    Disassembly of section .text:

    0000000000000000 :
    0: 55 push %rbp
    [...]
    000000000000017b :
    17b: 55 push %rbp
    17c: 48 89 e5 mov %rsp,%rbp
    17f: 53 push %rbx
    180: 48 83 ec 08 sub $0x8,%rsp
    184: e8 00 00 00 00 callq 189
    185: R_X86_64_PC32 mcount+0xfffffffffffffffc
    [...]

    We will add a section to point to each function call.

    .section __mcount_loc,"a",@progbits
    [...]
    .quad .text + 0x185
    [...]

    The offset to of the mcount call site in init_post is an offset from
    the start of the section, and not the start of the function init_post.
    The mcount relocation is at the call site 0x185 from the start of the
    .text section.

    .text + 0x185 == init_post + 0xa

    We need a way to add this __mcount_loc section in a way that we do not
    lose the relocations after final link. The .text section here will
    be attached to all other .text sections after final link and the
    offsets will be meaningless. We need to keep track of where these
    .text sections are.

    To do this, we use the start of the first function in the section.
    do_one_initcall. We can make a tmp.s file with this function as a reference
    to the start of the .text section.

    .section __mcount_loc,"a",@progbits
    [...]
    .quad do_one_initcall + 0x185
    [...]

    Then we can compile the tmp.s into a tmp.o

    gcc -c tmp.s -o tmp.o

    And link it into back into main.o.

    ld -r main.o tmp.o -o tmp_main.o
    mv tmp_main.o main.o

    But we have a problem. What happens if the first function in a section
    is not exported, and is a static function. The linker will not let
    the tmp.o use it. This case exists in main.o as well.

    Disassembly of section .init.text:

    0000000000000000 :
    0: 55 push %rbp
    1: 48 89 e5 mov %rsp,%rbp
    4: e8 00 00 00 00 callq 9
    5: R_X86_64_PC32 mcount+0xfffffffffffffffc

    The first function in .init.text is a static function.

    00000000000000a8 t __setup_set_reset_devices
    000000000000105f t __setup_str_set_reset_devices
    0000000000000000 t set_reset_devices

    The lowercase 't' means that set_reset_devices is local and is not exported.
    If we simply try to link the tmp.o with the set_reset_devices we end
    up with two symbols: one local and one global.

    .section __mcount_loc,"a",@progbits
    .quad set_reset_devices + 0x10

    00000000000000a8 t __setup_set_reset_devices
    000000000000105f t __setup_str_set_reset_devices
    0000000000000000 t set_reset_devices
    U set_reset_devices

    We still have an undefined reference to set_reset_devices, and if we try
    to compile the kernel, we will end up with an undefined reference to
    set_reset_devices, or even worst, it could be exported someplace else,
    and then we will have a reference to the wrong location.

    To handle this case, we make an intermediate step using objcopy.
    We convert set_reset_devices into a global exported symbol before linking
    it with tmp.o and set it back afterwards.

    00000000000000a8 t __setup_set_reset_devices
    000000000000105f t __setup_str_set_reset_devices
    0000000000000000 T set_reset_devices

    00000000000000a8 t __setup_set_reset_devices
    000000000000105f t __setup_str_set_reset_devices
    0000000000000000 T set_reset_devices

    00000000000000a8 t __setup_set_reset_devices
    000000000000105f t __setup_str_set_reset_devices
    0000000000000000 t set_reset_devices

    Now we have a section in main.o called __mcount_loc that we can place
    somewhere in the kernel using vmlinux.ld.S and access it to convert
    all these locations that call mcount into nops before starting SMP
    and thus, eliminating the need to do this with kstop_machine.

    Note, A well documented perl script (scripts/recordmcount.pl) is used
    to do all this in one location.

    Signed-off-by: Steven Rostedt
    Signed-off-by: Ingo Molnar

    Steven Rostedt