Loading...
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 | Using the Linux Kernel Markers Mathieu Desnoyers This document introduces Linux Kernel Markers and their use. It provides examples of how to insert markers in the kernel and connect probe functions to them and provides some examples of probe functions. * Purpose of markers A marker placed in code provides a hook to call a function (probe) that you can provide at runtime. A marker can be "on" (a probe is connected to it) or "off" (no probe is attached). When a marker is "off" it has no effect, except for adding a tiny time penalty (checking a condition for a branch) and space penalty (adding a few bytes for the function call at the end of the instrumented function and adds a data structure in a separate section). When a marker is "on", the function you provide is called each time the marker is executed, in the execution context of the caller. When the function provided ends its execution, it returns to the caller (continuing from the marker site). You can put markers at important locations in the code. Markers are lightweight hooks that can pass an arbitrary number of parameters, described in a printk-like format string, to the attached probe function. They can be used for tracing and performance accounting. * Usage In order to use the macro trace_mark, you should include linux/marker.h. #include <linux/marker.h> And, trace_mark(subsystem_event, "myint %d mystring %s", someint, somestring); Where : - subsystem_event is an identifier unique to your event - subsystem is the name of your subsystem. - event is the name of the event to mark. - "myint %d mystring %s" is the formatted string for the serializer. "myint" and "mystring" are repectively the field names associated with the first and second parameter. - someint is an integer. - somestring is a char pointer. Connecting a function (probe) to a marker is done by providing a probe (function to call) for the specific marker through marker_probe_register() and can be activated by calling marker_arm(). Marker deactivation can be done by calling marker_disarm() as many times as marker_arm() has been called. Removing a probe is done through marker_probe_unregister(); it will disarm the probe. marker_synchronize_unregister() must be called between probe unregistration and the first occurrence of - the end of module exit function, to make sure there is no caller left using the probe; - the free of any resource used by the probes, to make sure the probes wont be accessing invalid data. This, and the fact that preemption is disabled around the probe call, make sure that probe removal and module unload are safe. See the "Probe example" section below for a sample probe module. The marker mechanism supports inserting multiple instances of the same marker. Markers can be put in inline functions, inlined static functions, and unrolled loops as well as regular functions. The naming scheme "subsystem_event" is suggested here as a convention intended to limit collisions. Marker names are global to the kernel: they are considered as being the same whether they are in the core kernel image or in modules. Conflicting format strings for markers with the same name will cause the markers to be detected to have a different format string not to be armed and will output a printk warning which identifies the inconsistency: "Format mismatch for probe probe_name (format), marker (format)" Another way to use markers is to simply define the marker without generating any function call to actually call into the marker. This is useful in combination with tracepoint probes in a scheme like this : void probe_tracepoint_name(unsigned int arg1, struct task_struct *tsk); DEFINE_MARKER_TP(marker_eventname, tracepoint_name, probe_tracepoint_name, "arg1 %u pid %d"); notrace void probe_tracepoint_name(unsigned int arg1, struct task_struct *tsk) { struct marker *marker = &GET_MARKER(kernel_irq_entry); /* write data to trace buffers ... */ } * Probe / marker example See the example provided in samples/markers/src Compile them with your kernel. Run, as root : modprobe marker-example (insmod order is not important) modprobe probe-example cat /proc/marker-example (returns an expected error) rmmod marker-example probe-example dmesg |