17 Nov, 2011

1 commit

  • People keep asking how to get the preempt count, irq, and need resched info
    and we keep telling them to enable the latency format. Some developers think
    that traces without this info are completely useless, and for a lot of tasks
    they are.

    The first option was to enable the latency trace as the default format, but
    the header for the latency format is pretty useless for most tracers, and it
    also reports the timestamp in straight microseconds from the time the trace
    started. This is sometimes more difficult to read than the default trace,
    which shows seconds from the start of boot up.

    Latency format:

    # tracer: nop
    #
    # nop latency trace v1.1.5 on 3.2.0-rc1-test+
    # --------------------------------------------------------------------
    # latency: 0 us, #159771/64234230, CPU#1 | (M:preempt VP:0, KP:0, SP:0 HP:0 #P:4)
    # -----------------
    # | task: <idle>-0 (uid:0 nice:0 policy:0 rt_prio:0)
    # -----------------
    #
    # _------=> CPU#
    # / _-----=> irqs-off
    # | / _----=> need-resched
    # || / _---=> hardirq/softirq
    # ||| / _--=> preempt-depth
    # |||| / delay
    # cmd pid ||||| time | caller
    # \ / ||||| \ | /
    migratio-6 0...2 41778231us+: rcu_note_context_switch

    The new format:

    # _-----=> irqs-off
    # / _----=> need-resched
    # | / _---=> hardirq/softirq
    # || / _--=> preempt-depth
    # ||| / delay
    # TASK-PID CPU# |||| TIMESTAMP FUNCTION
    # | | | |||| | |
    <idle>-0 [000] d..2 49.309305: cpuidle_get_driver
    <idle>-0 [000] d..2 49.309307: mwait_idle
    <idle>-0 [000] d..2 49.309309: need_resched
    <idle>-0 [000] d..2 49.309310: test_ti_thread_flag
    <idle>-0 [000] d..2 49.309312: trace_power_start.constprop.13
    <idle>-0 [000] d..2 49.309313: trace_cpu_idle
    <idle>-0 [000] d..2 49.309315: need_resched

    default format:

    <idle>-0 [000] 49.309305: cpuidle_get_driver
    <idle>-0 [000] 49.309307: mwait_idle
    <idle>-0 [000] 49.309309: need_resched
    <idle>-0 [000] 49.309310: test_ti_thread_flag
    <idle>-0 [000] 49.309312: trace_power_start.constprop.13
    <idle>-0 [000] 49.309313: trace_cpu_idle
    <idle>-0 [000] 49.309315: need_resched
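
    The four-character field in the new format ("d..2" above) packs the
    irqs-off, need-resched, hardirq/softirq and preempt-depth columns into one
    character each. A minimal userspace sketch of how such a field could be
    composed, using hypothetical helper names rather than the kernel code:

    #include <stdio.h>

    /* Hypothetical helper: build a "d..2"-style field from raw state. */
    static void format_irq_info(char out[5], int irqs_off, int need_resched,
                                int in_hardirq, int in_softirq, int preempt_depth)
    {
        out[0] = irqs_off ? 'd' : '.';
        out[1] = need_resched ? 'N' : '.';
        out[2] = in_hardirq ? (in_softirq ? 'H' : 'h')
                            : (in_softirq ? 's' : '.');
        out[3] = (preempt_depth > 0 && preempt_depth < 10)
                     ? '0' + preempt_depth : '.';
        out[4] = '\0';
    }

    int main(void)
    {
        char buf[5];

        format_irq_info(buf, 1, 0, 0, 0, 2);
        printf("%s\n", buf);                 /* prints "d..2" */
        return 0;
    }
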
    Signed-off-by: Steven Rostedt

    Steven Rostedt
     

15 Jul, 2011

1 commit

  • Currently the stack trace per event in ftrace is only 8 frames. This can
    be quite limiting and sometimes useless, especially when the "ignore frames"
    value is wrong and we also use up stack frames for the event processing
    itself.

    Change this to be dynamic by adding a percpu buffer that we can
    write a large stack frame into and then copy into the ring buffer.

    Interrupts and NMIs that come in while another event is being processed
    will only get to use the 8-frame stack. That should be enough, as the task
    that was interrupted will have the full stack frame anyway.
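
    A rough userspace analog of the idea, assuming glibc's backtrace() and
    hypothetical names (not the kernel implementation): capture into a large
    per-thread scratch buffer first, then copy only the entries that were
    actually used into the record.

    #include <execinfo.h>
    #include <string.h>
    #include <stdio.h>

    #define SCRATCH_DEPTH 128   /* large scratch area, analog of the percpu buffer */
    #define FALLBACK_DEPTH 8    /* what a fixed 8-entry record would hold */

    static __thread void *scratch[SCRATCH_DEPTH];

    /* Capture into the big scratch buffer, then copy only what was used. */
    static int capture_stack(void **record, int max_entries)
    {
        int n = backtrace(scratch, SCRATCH_DEPTH);

        if (n > max_entries)
            n = max_entries;
        memcpy(record, scratch, n * sizeof(void *));
        return n;
    }

    int main(void)
    {
        void *record[SCRATCH_DEPTH];
        int n = capture_stack(record, SCRATCH_DEPTH);

        printf("captured %d frames (a fixed record would hold %d)\n",
               n, FALLBACK_DEPTH);
        return 0;
    }
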

    Requested-by: Thomas Gleixner
    Signed-off-by: Steven Rostedt

    Steven Rostedt
     

26 May, 2011

1 commit


05 Apr, 2011

1 commit

  • Running the following commands:

    # enable the binary option
    echo 1 > ./options/bin
    # disable context info option
    echo 0 > ./options/context-info
    # tracing only events
    echo 1 > ./events/enable
    cat trace_pipe

    plus forcing the system to generate many tracing events, causes a lockup
    (in non-preemptive kernels) inside the tracing_read_pipe function.

    The issue is also easily reproduced by running ltp stress test.
    (ftrace_stress_test.sh)

    The reasons are:
    - the bin/hex/raw output functions for events are set to the
    trace_nop_print function, which prints nothing and
    returns the TRACE_TYPE_HANDLED value
    - the LOST EVENT trace does not handle trace_seq overflow

    These reasons cause the while loop in the tracing_read_pipe
    function to never break.

    The attached patch fixes handling of the lost event trace, and
    changes trace_nop_print to print minimal info, which is needed
    for correct tracing_read_pipe processing.
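
    A tiny sketch of the failure mode, with hypothetical names: if the
    per-event printer reports the event as handled but never advances the
    output buffer, a read loop that waits for the buffer to fill will spin
    forever on a busy stream.

    #include <stdio.h>

    enum print_ret { HANDLED, UNHANDLED };

    /* Analog of a no-op printer: consumes the event, emits nothing. */
    static enum print_ret nop_print(char *buf, int *len)
    {
        (void)buf;
        (void)len;               /* *len is never advanced */
        return HANDLED;
    }

    int main(void)
    {
        char buf[64];
        int len = 0, events = 1000;

        /* Analog of the read loop: keep printing until the buffer has data. */
        while (len == 0 && events > 0) {   /* without the "events" guard this never ends */
            nop_print(buf, &len);
            events--;
        }
        printf("consumed all events, produced %d bytes\n", len);
        return 0;
    }
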

    v2 changes:
    - omit the cond_resched changes in favor of the trace_nop_print changes
    - changed WARN to WARN_ONCE and added info to be able
    to find out the culprit

    v3 changes:
    - make more accurate patch comment

    Signed-off-by: Jiri Olsa
    LKML-Reference:
    Signed-off-by: Steven Rostedt

    Jiri Olsa
     

10 Mar, 2011

2 commits

  • Formatting change only to improve code readability. No code changes except to
    introduce intermediate variables.

    Signed-off-by: David Sharp
    LKML-Reference:

    [ Keep variable declarations and assignment separate ]

    Signed-off-by: Steven Rostedt

    David Sharp
     
  • The lock_depth field in the event headers was added as a temporary
    data point to help in removing the BKL. Now that the BKL has pretty
    much been removed, we can remove this field.

    This in turn changes the header from 12 bytes to 8 bytes,
    removing the 4 byte buffer that gcc would insert if the first field
    in the data load was 8 bytes in size.
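
    The padding being removed here can be reproduced with a small standalone
    check (hypothetical struct names, not the real event headers):

    #include <stdio.h>
    #include <stdint.h>

    /* Hypothetical layouts, only to show the padding gcc inserts. */
    struct header12 { uint32_t a, b, c; };   /* 12 bytes: header with lock_depth */
    struct header8  { uint32_t a, b; };      /*  8 bytes: lock_depth removed     */

    struct event_old { struct header12 hdr; uint64_t payload; };
    struct event_new { struct header8  hdr; uint64_t payload; };

    int main(void)
    {
        /* On x86_64: 24 (12 + 4 padding + 8) versus 16 (8 + 8). */
        printf("old event: %zu bytes\n", sizeof(struct event_old));
        printf("new event: %zu bytes\n", sizeof(struct event_new));
        return 0;
    }
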

    Signed-off-by: Steven Rostedt

    Steven Rostedt
     

23 Jul, 2010

1 commit


21 Jul, 2010

1 commit

  • __print_flags() and __print_symbolic() use a percpu trace_seq:

    1) Its memory is allocated at compile time, so it wastes memory if we don't use tracing.
    2) It is percpu data, so it wastes more memory on multi-cpu systems.
    3) It disables preemption when it executes its core routine
    "trace_seq_printf(s, "%s: ", #call);" and introduces latency.

    So we move this trace_seq to struct trace_iterator.

    Signed-off-by: Lai Jiangshan
    LKML-Reference:
    Signed-off-by: Steven Rostedt

    Lai Jiangshan
     

20 Jul, 2010

1 commit

  • The special trace type was only used by sysprof. Let's remove it now
    that the sysprof ftrace plugin has been dropped.

    Signed-off-by: Frederic Weisbecker
    Acked-by: Soeren Sandmann
    Cc: Peter Zijlstra
    Cc: Ingo Molnar
    Cc: Steven Rostedt
    Cc: Li Zefan

    Frederic Weisbecker
     

28 May, 2010

1 commit

  • …git/tip/linux-2.6-tip

    * 'perf-core-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/linux-2.6-tip: (61 commits)
    tracing: Add __used annotation to event variable
    perf, trace: Fix !x86 build bug
    perf report: Support multiple events on the TUI
    perf annotate: Fix up usage of the build id cache
    x86/mmiotrace: Remove redundant instruction prefix checks
    perf annotate: Add TUI interface
    perf tui: Remove annotate from popup menu after failure
    perf report: Don't start the TUI if -D is used
    perf: Fix getline undeclared
    perf: Optimize perf_tp_event_match()
    perf: Remove more code from the fastpath
    perf: Optimize the !vmalloc backed buffer
    perf: Optimize perf_output_copy()
    perf: Fix wakeup storm for RO mmap()s
    perf-record: Share per-cpu buffers
    perf-record: Remove -M
    perf: Ensure that IOC_OUTPUT isn't used to create multi-writer buffers
    perf, trace: Optimize tracepoints by using per-tracepoint-per-cpu hlist to track events
    perf, trace: Optimize tracepoints by removing IRQ-disable from perf/tracepoint interaction
    perf tui: Allow disabling the TUI on a per command basis in ~/.perfconfig
    ...

    Linus Torvalds
     

21 May, 2010

1 commit

  • * git://git.kernel.org/pub/scm/linux/kernel/git/jejb/scsi-misc-2.6: (182 commits)
    [SCSI] aacraid: add an ifdef'd device delete case instead of taking the device offline
    [SCSI] aacraid: prohibit access to array container space
    [SCSI] aacraid: add support for handling ATA pass-through commands.
    [SCSI] aacraid: expose physical devices for models with newer firmware
    [SCSI] aacraid: respond automatically to volumes added by config tool
    [SCSI] fcoe: fix fcoe module ref counting
    [SCSI] libfcoe: FIP Keep-Alive messages for VPorts are sent with incorrect port_id and wwn
    [SCSI] libfcoe: Fix incorrect MAC address clearing
    [SCSI] fcoe: fix a circular locking issue with rtnl and sysfs mutex
    [SCSI] libfc: Move the port_id into lport
    [SCSI] fcoe: move link speed checking into its own routine
    [SCSI] libfc: Remove extra pointer check
    [SCSI] libfc: Remove unused fc_get_host_port_type
    [SCSI] fcoe: fixes wrong error exit in fcoe_create
    [SCSI] libfc: set seq_id for incoming sequence
    [SCSI] qla2xxx: Updates to ISP82xx support.
    [SCSI] qla2xxx: Optionally disable target reset.
    [SCSI] qla2xxx: ensure flash operation and host reset via sg_reset are mutually exclusive
    [SCSI] qla2xxx: Silence bogus warning by gcc for wrap and did.
    [SCSI] qla2xxx: T10 DIF support added.
    ...

    Linus Torvalds
     

15 May, 2010

1 commit

  • Multiple events may use the same method to print their data.
    Instead of having all events have a pointer to their print functions,
    the trace_event structure now points to a trace_event_functions structure
    that will hold the way to print out the event.

    The event itself is now passed to the print function to let the print
    function know what kind of event it should print.

    This opens the door to consolidating the way several events print
    their output.

    text data bss dec hex filename
    4913961 1088356 861512 6863829 68bbd5 vmlinux.orig
    4900382 1048964 861512 6810858 67ecea vmlinux.init
    4900446 1049028 861512 6810986 67ed6a vmlinux.preprint

    This change slightly increases the size but is needed for the next change.

    v3: Fix the branch tracer events to handle this change.

    v2: Fix the new function graph tracer event calls to handle this change.
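
    A minimal sketch of the resulting shape, with hypothetical names rather
    than the real trace_event definitions: events point at a shared table of
    print callbacks, and the event is passed to the callback so one
    implementation can serve several event types.

    #include <stdio.h>

    struct sample_event;   /* analog of struct trace_event */

    /* Analog of trace_event_functions: one shared table of output methods. */
    struct sample_event_functions {
        void (*print_trace)(struct sample_event *ev);
        void (*print_raw)(struct sample_event *ev);
    };

    struct sample_event {
        const char *name;
        int id;
        struct sample_event_functions *funcs;   /* events share one table */
    };

    /* One callback can print several event types because the event is passed in. */
    static void generic_print_trace(struct sample_event *ev)
    {
        printf("%s: id=%d\n", ev->name, ev->id);
    }

    static void generic_print_raw(struct sample_event *ev)
    {
        printf("raw %d\n", ev->id);
    }

    static struct sample_event_functions generic_funcs = {
        .print_trace = generic_print_trace,
        .print_raw   = generic_print_raw,
    };

    int main(void)
    {
        struct sample_event a = { "sched_switch_demo", 1, &generic_funcs };
        struct sample_event b = { "sched_wakeup_demo", 2, &generic_funcs };

        a.funcs->print_trace(&a);
        b.funcs->print_trace(&b);
        return 0;
    }
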

    Acked-by: Mathieu Desnoyers
    Acked-by: Masami Hiramatsu
    Acked-by: Frederic Weisbecker
    Signed-off-by: Steven Rostedt

    Steven Rostedt
     

06 May, 2010

1 commit


01 May, 2010

2 commits


10 Dec, 2009

2 commits

  • The trace_seq buffer might fill up, and right now one needs to check the
    return value of each printf into the buffer to detect that.

    Instead, have the buffer keep track of whether it is full or not, and
    reject more input if it is full or would have overflowed with an input
    that wasn't added.
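
    A minimal sketch of the idea, with hypothetical names (the kernel structure
    is trace_seq): the buffer latches a "full" flag on the first input that
    does not fit and silently rejects everything after it.

    #include <stdio.h>
    #include <stdarg.h>

    struct seq_buf {
        char data[32];
        unsigned int len;
        int full;                /* latched once an input does not fit */
    };

    static int seq_buf_printf(struct seq_buf *s, const char *fmt, ...)
    {
        va_list ap;
        int ret;

        if (s->full)
            return 0;            /* already overflowed: reject further input */

        va_start(ap, fmt);
        ret = vsnprintf(s->data + s->len, sizeof(s->data) - s->len, fmt, ap);
        va_end(ap);

        if (ret < 0 || (unsigned int)ret >= sizeof(s->data) - s->len) {
            s->data[s->len] = '\0';   /* drop the partial write */
            s->full = 1;
            return 0;
        }
        s->len += ret;
        return 1;
    }

    int main(void)
    {
        struct seq_buf s = { .len = 0, .full = 0 };

        seq_buf_printf(&s, "short entry ");
        seq_buf_printf(&s, "an entry that is far too long to fit in the buffer");
        seq_buf_printf(&s, "x");               /* rejected: buffer is full */
        printf("len=%u full=%d: \"%s\"\n", s.len, s.full, s.data);
        return 0;
    }
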

    Cc: Lai Jiangshan
    Signed-off-by: Johannes Berg
    Signed-off-by: Steven Rostedt

    Johannes Berg
     
  • If the seq_read fills the buffer it will call s_start again on the next
    iteration with the same position. This causes a problem with the
    function_graph tracer because it consumes the iteration in order to
    determine leaf functions.

    What happens is that the iterator stores the entry, and the function
    graph plugin will look at the next entry. If that next entry is a return
    of the same function and task, then the function is a leaf and the
    function_graph plugin calls ring_buffer_read which moves the ring buffer
    iterator forward (the trace iterator still points to the function start
    entry).

    The copying of the trace_seq to the seq_file buffer will fail if the
    seq_file buffer is full. The seq_read will not show this entry.
    The next read by userspace will cause seq_read to again call s_start
    which will reuse the trace iterator entry (the function start entry).
    But the function return entry was already consumed. The function graph
    plugin will think that this entry is a nested function and not a leaf.

    To solve this, the trace code now checks the return status of the
    seq_printf (trace_print_seq). If the writing to the seq_file buffer
    fails, we set a flag in the iterator (leftover) and we do not reset
    the trace_seq buffer. On the next call to s_start, we check the leftover
    flag, and if it is set, we just reuse the trace_seq buffer and do not
    call into the plugin print functions.
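
    A compact userspace sketch of the pattern, with hypothetical names: when
    the staged record does not fit in the destination, keep it as leftover and
    emit it on the next call instead of asking the producer for a new record.

    #include <stdio.h>
    #include <string.h>

    static const char *records[] = { "funcA entry", "funcA return", "funcB entry" };
    static unsigned int next_record;

    static char staged[64];
    static int leftover;         /* staged record not yet written out */

    /* Copy one record into the destination; keep it staged if it does not fit. */
    static int read_one(char *dst, size_t dst_len)
    {
        if (!leftover) {
            if (next_record >= sizeof(records) / sizeof(records[0]))
                return 0;                     /* nothing left to produce */
            snprintf(staged, sizeof(staged), "%s\n", records[next_record++]);
        }
        if (strlen(staged) >= dst_len) {
            leftover = 1;                     /* retry the same record next call */
            return -1;
        }
        strcpy(dst, staged);
        leftover = 0;
        return 1;
    }

    int main(void)
    {
        char small[8], big[64];

        if (read_one(small, sizeof(small)) < 0)   /* does not fit, stays staged */
            printf("destination full, record kept for the next call\n");
        while (read_one(big, sizeof(big)) > 0)    /* nothing was lost */
            printf("%s", big);
        return 0;
    }
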

    Before this patch:

    2) | fput() {
    2) | __fput() {
    2) 0.550 us | inotify_inode_queue_event();
    2) | __fsnotify_parent() {
    2) 0.540 us | inotify_dentry_parent_queue_event();

    After the patch:

    2) | fput() {
    2) | __fput() {
    2) 0.550 us | inotify_inode_queue_event();
    2) 0.548 us | __fsnotify_parent();
    2) 0.540 us | inotify_dentry_parent_queue_event();

    [
    Updated the patch to fix a missing return 0 from the trace_print_seq()
    stub when CONFIG_TRACING is disabled.

    Reported-by: Ingo Molnar
    ]

    Reported-by: Jiri Olsa
    Cc: Frederic Weisbecker
    Signed-off-by: Steven Rostedt

    Steven Rostedt
     

24 Oct, 2009

1 commit

  • trace_seq_printf() return value is a little ambiguous. It
    currently returns the length of the space available in the
    buffer. printf usually returns the amount written. This is not
    adequate here, because:

    trace_seq_printf(s, "");

    is perfectly legal, and returning 0 would indicate that it
    failed.

    We can always see the amount written by looking at the before
    and after values of s->len. This is not quite the same use as
    printf. We only care if the string was successfully written to
    the buffer or not.

    Make trace_seq_printf() return 0 if the trace oversizes the
    buffer's free space, 1 otherwise.
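
    A small sketch of the chosen semantics, with hypothetical names: return 1
    whenever the text fits (even an empty string) and 0 only on overflow,
    rather than returning a printf-style length.

    #include <stdio.h>
    #include <string.h>

    /* Returning "bytes written" instead would make a successful "" call look
     * like a failure, which is exactly the ambiguity described above. */
    static int seq_printf_ok(char *buf, size_t size, size_t *len, const char *text)
    {
        size_t n = strlen(text);

        if (*len + n >= size)
            return 0;                 /* does not fit: report failure */
        memcpy(buf + *len, text, n + 1);
        *len += n;
        return 1;                     /* fits, even when text is "" */
    }

    int main(void)
    {
        char buf[16];
        size_t len = 0;

        printf("%d\n", seq_printf_ok(buf, sizeof(buf), &len, ""));        /* 1 */
        printf("%d\n", seq_printf_ok(buf, sizeof(buf), &len, "hello "));  /* 1 */
        printf("%d\n", seq_printf_ok(buf, sizeof(buf), &len, "this is too long")); /* 0 */
        printf("\"%s\"\n", buf);
        return 0;
    }
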

    Signed-off-by: Jiri Olsa
    Signed-off-by: Steven Rostedt
    Cc: Frederic Weisbecker
    LKML-Reference:
    Signed-off-by: Ingo Molnar

    Jiri Olsa
     

08 Oct, 2009

1 commit


06 Oct, 2009

1 commit

  • The state char variable S should be reassigned if S == 0.

    We are missing the state of the task that is going to sleep for the
    context switch events (in the raw mode).

    Fortunately the problem arises with the sched_switch/wake_up
    tracers, not the sched trace events.

    The former are legacy now. But still, that was buggy.

    Signed-off-by: Hiroshi Shimamoto
    Cc: Steven Rostedt
    Acked-by: Frederic Weisbecker
    LKML-Reference:
    Signed-off-by: Ingo Molnar

    Hiroshi Shimamoto
     

12 Sep, 2009

2 commits

  • Both trace_output.c and trace_function_graph.c do basically the same
    thing to handle the printing of the latency-format. This patch moves
    the code into one function that both can use.

    Signed-off-by: Steven Rostedt

    Steven Rostedt
     
  • This patch adds the lock depth of the big kernel lock to the generic
    entry header. This way we can see the depth of the lock and help
    in removing the BKL.

    Example:

    # _------=> CPU#
    # / _-----=> irqs-off
    # | / _----=> need-resched
    # || / _---=> hardirq/softirq
    # ||| / _--=> preempt-depth
    # |||| /_--=> lock-depth
    # |||||/ delay
    # cmd pid |||||| time | caller
    # \ / |||||| \ | /
    <idle>-0 2.N..3 5902255250us+: lock_acquire: read rcu_read_lock
    <idle>-0 2.N..3 5902255253us+: lock_release: rcu_read_lock
    <idle>-0 2dN..3 5902255257us+: lock_acquire: xtime_lock
    <idle>-0 2dN..4 5902255259us : lock_acquire: clocksource_lock
    <idle>-0 2dN..4 5902255261us+: lock_release: clocksource_lock

    Signed-off-by: Steven Rostedt

    Steven Rostedt
     

11 Sep, 2009

1 commit


02 Jul, 2009

1 commit

  • We will lose something if trace_seq->buffer[0] is 0, because the copy length
    is calculated by strlen() in seq_puts(), so use seq_write() instead of
    seq_puts().

    Here is an example, after a reboot:

    # echo kmemtrace > current_tracer
    # echo 0 > options/kmem_minimalistic
    # cat trace
    # tracer: kmemtrace
    #
    #

    Nothing is exported, because the first byte of trace_seq->buffer[ ]
    is KMEMTRACE_USER_ALLOC.

    (The value of KMEMTRACE_USER_ALLOC is zero; see
    kmemtrace_print_alloc_user() in kernel/trace/kmemtrace.c.)
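
    The effect can be reproduced in plain C (hypothetical buffer contents, not
    the real kmemtrace record): a strlen()-based copy stops at the leading zero
    byte, while an explicit-length write keeps everything.

    #include <stdio.h>
    #include <string.h>

    int main(void)
    {
        /* First byte is 0, like KMEMTRACE_USER_ALLOC at the start of the record. */
        char record[8] = { 0, 'd', 'a', 't', 'a', 1, 2, 3 };
        char out[8];

        size_t by_strlen = strlen(record);       /* 0: everything after is "lost" */
        memcpy(out, record, sizeof(record));     /* explicit length keeps it all  */

        printf("strlen-based copy length: %zu\n", by_strlen);
        printf("explicit-length copy byte[1]: %c\n", out[1]);   /* 'd' survived */
        return 0;
    }
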

    Signed-off-by: Xiao Guangrong
    Acked-by: Frederic Weisbecker
    Acked-by: Pekka Enberg
    Acked-by: Eduard - Gabriel Munteanu
    Cc: Steven Rostedt
    LKML-Reference:
    Signed-off-by: Ingo Molnar

    Xiao Guangrong
     

10 Jun, 2009

2 commits


03 Jun, 2009

4 commits

  • The current method of printing out a stack trace is to add a new line
    and print out the trace:

    yum-updatesd-3120 [002] 573.691303:
    => do_softirq
    => irq_exit
    => smp_apic_timer_interrupt
    => apic_timer_interrupt

    This looks a bit awkward, and if we have both stack and user stack traces
    running, it would be nice to have a title to tell them apart, although
    it is easy to tell by the output.

    This patch adds an annotation to the start of the stack traces:

    init-1 [003] 929.304979: <stack trace>
    => user_path_at
    => vfs_fstatat
    => vfs_stat
    => sys_newstat
    => system_call_fastpath

    cat-3459 [002] 1016.824040: <user stack trace>
    =>
    =>
    =>

    Signed-off-by: Steven Rostedt

    Steven Rostedt
     
  • Here is an updated patch to include the extra call to
    trace_seq_init() as requested. This is vs. the latest
    -tip tree and fixes the use of multiple __print_flags
    and __print_symbolic in a single tracer. Also tested
    to ensure it's working now:

    mount.gfs2-2534 [000] 235.850587: gfs2_glock_queue: 8.7 glock 1:2 dequeue PR
    mount.gfs2-2534 [000] 235.850591: gfs2_demote_rq: 8.7 glock 1:0 demote EX to NL flags:DI
    mount.gfs2-2534 [000] 235.850591: gfs2_glock_queue: 8.7 glock 1:0 dequeue EX
    glock_workqueue-2529 [000] 235.850666: gfs2_glock_state_change: 8.7 glock 1:0 state EX => NL tgt:NL dmt:NL flags:lDpI
    glock_workqueue-2529 [000] 235.850672: gfs2_glock_put: 8.7 glock 1:0 state NL => IV flags:I

    Signed-off-by: Steven Whitehouse
    LKML-Reference:
    Signed-off-by: Steven Rostedt

    Steven Whitehouse
     
  • According to "events/ftrace/user_stack/format", fix the output of
    the user stack.

    before fix:

    sh-1073 [000] 31.137561:

    after fix:

    sh-1072 [000] 37.039329:
    =>
    =>
    =>

    Signed-off-by: walimis
    LKML-Reference:
    Signed-off-by: Steven Rostedt

    walimis
     
  • According to "events/ftrace/kernel_stack/format", the output format of
    the kernel stack should use "=>" instead of "<=".

    sh-1072 [000] 26.957752:
    => sys_clone
    => syscall_call
    sh-1075 [000] 39.792713: sched_switch: task sh:1075 [120] (R) ==> sh:1076 [120]
    sh-1075 [000] 39.792722:
    => schedule
    => preempt_schedule
    => wake_up_new_task
    => do_fork
    => sys_clone
    => syscall_call

    Signed-off-by: walimis
    LKML-Reference:
    Signed-off-by: Steven Rostedt

    walimis
     

02 Jun, 2009

1 commit


27 May, 2009

2 commits

  • This patch adds __print_symbolic, which is similar to __print_flags but
    works for an enumeration type instead. That is, there is only a one-to-one
    mapping between the values and the symbols. When a match is made, it is
    printed; otherwise the hex value is output.
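
    A small sketch of the described behavior with a hypothetical table (not the
    kernel macro): each value maps to exactly one name, and anything without a
    match falls back to hex.

    #include <stdio.h>

    struct sym_name { unsigned long value; const char *name; };

    static void print_symbolic(unsigned long value,
                               const struct sym_name *syms, int count)
    {
        int i;

        for (i = 0; i < count; i++) {
            if (syms[i].value == value) {        /* one-to-one match */
                printf("%s", syms[i].name);
                return;
            }
        }
        printf("0x%lx", value);                  /* no match: fall back to hex */
    }

    int main(void)
    {
        static const struct sym_name state[] = {
            { 0, "RUNNING" }, { 1, "INTERRUPTIBLE" }, { 2, "UNINTERRUPTIBLE" },
        };

        print_symbolic(1, state, 3);  printf("\n");    /* INTERRUPTIBLE */
        print_symbolic(7, state, 3);  printf("\n");    /* 0x7 */
        return 0;
    }
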

    [ Impact: add interface for showing symbol names in events ]

    Signed-off-by: Steven Rostedt
    Signed-off-by: Frederic Weisbecker

    Steven Rostedt
     
  • Developers have been asking for the ability in the ftrace event tracer
    to display names of bits in a flags variable.

    Instead of printing out c2, it would be easier to read FOO|BAR|GOO,
    assuming that FOO is bit 1, BAR is bit 6 and GOO is bit 7.

    Some examples where this would be useful are the state flags in a context
    switch, kmalloc flags, and even permission flags when accessing files.
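
    A short sketch of the behavior, including the mask matching and custom
    delimiter mentioned in the notes below, using hypothetical flag names:

    #include <stdio.h>

    struct flag_name { unsigned long mask; const char *name; };

    /* Print names for all mask matches, joined by delim; leftover bits as hex. */
    static void print_flags(unsigned long flags, const char *delim,
                            const struct flag_name *names, int count)
    {
        const char *sep = "";
        int i;

        for (i = 0; i < count; i++) {
            if (names[i].mask && (flags & names[i].mask) == names[i].mask) {
                printf("%s%s", sep, names[i].name);
                flags &= ~names[i].mask;
                sep = delim;
            }
        }
        if (flags)
            printf("%s0x%lx", sep, flags);       /* unknown leftover bits */
    }

    int main(void)
    {
        static const struct flag_name demo[] = {
            { 0x02, "FOO" }, { 0x40, "BAR" }, { 0x80, "GOO" },
        };
        static const struct flag_name perm[] = {
            { 0x4, "r" }, { 0x2, "w" }, { 0x1, "x" },
        };

        print_flags(0xc2, "|", demo, 3);  printf("\n");   /* FOO|BAR|GOO */
        print_flags(0x7, "", perm, 3);    printf("\n");   /* rwx         */
        return 0;
    }
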

    [
    v2 changes include:

    Frederic Weisbecker's idea of using a mask instead of bits,
    so we can output GFP_KERNEL instead of GFP_WAIT|GFP_IO|GFP_FS.

    Li Zefan's idea of allowing the caller of __print_flags to add their
    own delimiter (or no delimiter), so that for file permissions we can
    get rwx instead of r|w|x.
    ]

    [
    v3 changes:

    Christoph Hellwig's idea of using an array instead of va_args.
    ]

    [ Impact: better displaying of flags in trace output ]

    Signed-off-by: Steven Rostedt
    Signed-off-by: Frederic Weisbecker

    Steven Rostedt
     

26 May, 2009

1 commit

  • I found that there is nothing to protect event_hash in
    ftrace_find_event(). RCU protects the event hashlist
    but not the event itself while we use it after its extraction
    through ftrace_find_event().

    This lack of proper locking opens a race
    window between any event dereferencing and module removal.

    Eg:

    --Task A--

    print_trace_line(trace) {
      event = find_ftrace_event(trace)

    --Task B--

    trace_module_remove_events(mod) {
      list_trace_events_module(ev, mod) {
        unregister_ftrace_event(ev->event) {
          hlist_del(ev->event->node)
          list_del(....)
        }
      }
    }
    |--> module removed, the event has been dropped

    --Task A--

    event->print(trace); // Dereferencing freed memory

    If the event retrieved belongs to a module and this module
    is concurrently removed, we may end up dereferencing data
    from a freed module.

    RCU could solve this, but it would add latency to the kernel and
    forbid tracer output callbacks from calling any sleepable code.
    So this fix converts 'trace_event_mutex' to a read/write semaphore,
    and adds trace_event_read_lock() to protect ftrace_find_event().

    [ Impact: fix possible freed memory dereference in ftrace ]
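
    A rough pthreads sketch of the locking pattern with hypothetical names:
    readers hold a read lock across both the lookup and the use of the event,
    so unregistering (under the write lock) cannot free it out from under them.

    #include <pthread.h>
    #include <stdio.h>
    #include <stdlib.h>

    static pthread_rwlock_t event_lock = PTHREAD_RWLOCK_INITIALIZER;

    struct demo_event { void (*print)(void); };

    static struct demo_event *registered_event;

    static void demo_print(void) { printf("event printed\n"); }

    /* Reader: lookup and use are both covered by the read lock. */
    static void print_trace_line(void)
    {
        pthread_rwlock_rdlock(&event_lock);
        if (registered_event)
            registered_event->print();       /* safe: cannot be freed here */
        pthread_rwlock_unlock(&event_lock);
    }

    /* Writer: unregistering waits until no reader is using the event. */
    static void unregister_event(void)
    {
        pthread_rwlock_wrlock(&event_lock);
        free(registered_event);
        registered_event = NULL;
        pthread_rwlock_unlock(&event_lock);
    }

    int main(void)
    {
        registered_event = malloc(sizeof(*registered_event));
        registered_event->print = demo_print;

        print_trace_line();
        unregister_event();
        print_trace_line();      /* event gone, nothing is dereferenced */
        return 0;
    }
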

    Signed-off-by: Lai Jiangshan
    Acked-by: Steven Rostedt
    LKML-Reference:
    Signed-off-by: Frederic Weisbecker

    Lai Jiangshan
     

15 May, 2009

1 commit

  • The stack tracer stores eight entries in the ring buffer when an event
    traces the stack. The output prints all eight entries regardless of
    how many were actually recorded.

    This patch breaks out of the loop when a null entry is discovered.

    [ Impact: only print the stack that is recorded ]

    Signed-off-by: Steven Rostedt

    Steven Rostedt
     

06 May, 2009

1 commit

  • This compiler warning:

    CC kernel/trace/trace_output.o
    kernel/trace/trace_output.c: In function ‘register_ftrace_event’:
    kernel/trace/trace_output.c:544: warning: ‘list’ may be used uninitialized in this function

    is wrong, as 'list' is always initialized, but GCC (4.3.2) does not
    recognize this relationship properly.

    Work around the warning by initializing the variable to NULL.

    [ Impact: fix false positive compiler warning ]

    Signed-off-by: Jaswinder Singh Rajput
    Acked-by: Steven Rostedt
    LKML-Reference:
    Signed-off-by: Ingo Molnar

    Jaswinder Singh Rajput
     

25 Apr, 2009

1 commit

  • With modules being able to add trace events, and the max trace event
    counter being 16 bits (65536), we can overflow the counter easily
    with a simple while loop that adds and removes modules containing
    trace events.

    This patch links together the registered trace events and on overflow
    searches for available trace event ids. It will still fail if
    over 65536 events are registered, but considering that a typical
    kernel only has 22000 functions, 65000 events should be sufficient.

    Reported-by: Li Zefan
    Signed-off-by: Steven Rostedt

    Steven Rostedt
     

24 Apr, 2009

1 commit

  • With the new event tracing registration, we must increase the number
    of events that can be registered. Currently the type field is only
    one byte, which leaves us only 256 possible events.

    Since we do not save the CPU number in the tracer anymore (it is determined
    by the per cpu ring buffer that is used) we have an extra byte to use.

    This patch increases the size of type from 1 byte (256 events) to
    2 bytes (65,536 events).

    It also adds a WARN_ON_ONCE if we exceed that limit.

    [ Impact: allow more than 255 events ]

    Signed-off-by: Steven Rostedt

    Steven Rostedt
     

15 Apr, 2009

1 commit


07 Apr, 2009

1 commit