From Fedora Project Wiki

< QA

m (User:Johannbg/QA/Testing/Tips/Sysrq moved to QA/Sysrq: take out of draft stage)
(Reverting previous edit)
Tag: Manual revert
 
(8 intermediate revisions by 6 users not shown)
Line 10: Line 10:


You need to say "yes" to 'Magic SysRq key (CONFIG_MAGIC_SYSRQ)' when
You need to say "yes" to 'Magic SysRq key (CONFIG_MAGIC_SYSRQ)' when
configuring the kernel. When running a kernel with SysRq compiled in,
configuring the kernel. Stock Fedora and RHEL kernels do have this functionality enabled at compile-time, but the distributions disable it at boot time, by default, using sysctl.conf.
/proc/sys/kernel/sysrq controls the functions allowed to be invoked via
 
the SysRq key. By default the file contains 1 which means that every
To re-enable it at boot time create config file in the <code>sysctl.d</code> directory (e.g. <code>/etc/sysctl.d/90-sysrq.conf</code>) with this line:
possible SysRq request is allowed (in older versions SysRq was disabled
 
by default, and you were required to specifically enable it at run-time
<pre>kernel.sysrq = 1</pre>
but this is not the case any more). Here is the list of possible values
 
in /proc/sys/kernel/sysrq:
When running a kernel with SysRq compiled in, /proc/sys/kernel/sysrq controls the functions allowed to be invoked via the SysRq key. Here is the list of possible values in /proc/sys/kernel/sysrq:
 
* 0 - disable sysrq completely
* 0 - disable sysrq completely
* 1 - enable all functions of sysrq
* 1 - enable all functions of sysrq
Line 40: Line 41:


<pre>sysctl -w kernel.sysrq=1</pre>
<pre>sysctl -w kernel.sysrq=1</pre>
{{admon/warning|Remember to turn it off! }}


{{admon/note|The value of /proc/sys/kernel/sysrq influences only the invocation via a keyboard. Invocation of any operation via /proc/sysrq-trigger is always allowed (by a user with admin privileges).}}
{{admon/note|The value of /proc/sys/kernel/sysrq influences only the invocation via a keyboard. Invocation of any operation via /proc/sysrq-trigger is always allowed (by a user with admin privileges).}}


== How do I use the magic SysRq key? ==
== How do I use the magic SysRq key? ==
Line 50: Line 48:
'''x86'''
'''x86'''


You press the key combo '<Alt><SysRq><command key>'.  
You press the key combo {{key press|Alt|SysRq|<command key>}}.  


'''SPARC'''
'''SPARC'''


You press '<Alt><Stop><command key>'
You press {{key press|Alt|Stop|<command key>}}


'''Serial Console''' (PC style standard serial ports only )
'''Serial Console''' (PC style standard serial ports only )
Line 64: Line 62:
'''PowerPC'''
'''PowerPC'''


Press '<Alt><Print Screen>(or <F13>)<command key>
Press {{key press|Alt|PrtSc}} (or {{key press|F13}}) + {{key press|<command key>}}


'''All Architectures'''
'''All Architectures'''
Line 77: Line 75:
== What are the 'command' keys? ==
== What are the 'command' keys? ==


*'b' - Will immediately reboot the system without syncing or unmounting your disks.
*<code>b</code> - Will immediately reboot the system without syncing or unmounting your disks.
*'c' - Will perform a kexec reboot in order to take a crashdump.
*<code>c</code> - Will perform a kexec reboot in order to take a crashdump.
*'d' - Shows all locks that are held.
*<code>d</code> - Shows all locks that are held.
*'e' - Send a SIGTERM to all processes, except for init.
*<code>e</code> - Send a SIGTERM to all processes, except for init.
*'f' - Will call oom_kill to kill a memory hog process.
*<code>f</code> - Will call oom_kill to kill a memory hog process.
*'g' - Used by kgdb on ppc and sh platforms.
*<code>g</code> - Used by kgdb on ppc and sh platforms.
*'h' - Will display help (any key that is not listed here will bring fourth help )
*<code>h</code> - Will display help (any key that is not listed here will bring forth help )
*'i' - Send a SIGKILL to all processes, except for init.
*<code>i</code> - Send a SIGKILL to all processes, except for init.
*'k' - Secure Access Key (SAK) Kills all programs on the current virtual
*<code>k</code> - Secure Access Key (SAK) Kills all programs on the current virtual terminal.
{{admon/note|See important comments below in SAK section.}}
{{admon/note|See important comments below in SAK section.}}
*'l' - Shows a stack backtrace for all active CPUs.
*<code>l</code> - Shows a stack backtrace for all active CPUs.
*'m' - Will dump current memory info to your console.
*<code>m</code> - Will dump current memory info to your console.
*'n' - Used to make RT tasks nice-able
*<code>n</code> - Used to make RT tasks nice-able
*'o' - Will shut your system off (if configured and supported).
*<code>o</code> - Will shut your system off (if configured and supported).
*'p' - Will dump the current registers and flags to your console.
*<code>p</code> - Will dump the current registers and flags to your console.
*'q' - Will dump a list of all running timers.
*<code>q</code> - Will dump a list of all running timers.
*'r' - Turns off keyboard raw mode and sets it to XLATE.
*<code>r</code> - Turns off keyboard raw mode and sets it to XLATE.
*'s' - Will attempt to sync all mounted filesystems.
*<code>s</code> - Will attempt to sync all mounted filesystems.
*'t' - Will dump a list of current tasks and their information to your console.
*<code>t</code> - Will dump a list of current tasks and their information to your console.
*'u' - Will attempt to remount all mounted filesystems read-only.
*<code>u</code> - Will attempt to remount all mounted filesystems read-only.
*'v' - Dumps Voyager SMP processor info to your console.
*<code>v</code> - Dumps Voyager SMP processor info to your console.
*'w' - Dumps tasks that are in uninterruptable (blocked) state.
*<code>w</code> - Dumps tasks that are in uninterruptable (blocked) state.
*'x' - Used by xmon interface on ppc/powerpc platforms.
*<code>x</code> - Used by xmon interface on ppc/powerpc platforms.
*'0'-'9' - Sets the console log level, controlling which kernel messages will be printed to your console. ('0', for example would make it so that only emergency messages like PANICs or OOPSes would make it to your console.)
*<code>0</code>-<code>9</code> - Sets the console log level, controlling which kernel messages will be printed to your console. ('0', for example would make it so that only emergency messages like PANICs or OOPSes would make it to your console.)


== Okay, so what can I use them for? ==
== Okay, so what can I use them for? ==


Un'r'aw is very handy when your X server or a svgalib program crashes.
Un<code>r</code>aw is very handy when your X server or a svgalib program crashes.


Sa'k' (Secure Access Key) is useful when you want to be sure there is no trojan program running at console which could grab your password when you would try to login. It will kill all programs on given console, thus letting you make sure that the login prompt you see is actually the one from init, not some trojan program.
Sa<code>k</code> (Secure Access Key) is useful when you want to be sure there is no trojan program running at console which could grab your password when you would try to login. It will kill all programs on given console, thus letting you make sure that the login prompt you see is actually the one from init, not some trojan program.
Others find it useful as (System Attention Key) which is useful when you want to exit a program that will not let you switch consoles. (For example, X or a svgalib program.)
Others find it useful as (System Attention Key) which is useful when you want to exit a program that will not let you switch consoles. (For example, X or a svgalib program.)


  {{admon/note|In its true form it is not a true SAK like the one in a c2 compliant system, and it should not be mistaken as such. }}
  {{admon/note|In its true form it is not a true SAK like the one in a c2 compliant system, and it should not be mistaken as such. }}


Re'b'oot is good when you're unable to shut down.
Re<code>b</code>oot is good when you're unable to shut down.


{{admon/note| It's general considered a good practice to 'u'mount first }}
{{admon/note| It's generally considered a good practice to <code>u</code>mount first }}


Crashdump can be used to manually trigger a 'c'rashdump when the system is hung.
Crashdump can be used to manually trigger a <code>c</code>rashdump when the system is hung.


{{admon/note| The kernel needs to have been built with CONFIG_KEXEC enabled! }}
{{admon/note| The kernel needs to have been built with CONFIG_KEXEC enabled! }}


Sync is great when your system is locked up, it allows you to 's'ync your disks and will certainly lessen the chance of data loss and fscking.
Sync is great when your system is locked up, it allows you to <code>s</code>ync your disks and will certainly lessen the chance of data loss and fscking.


{{admon/warning| The sync hasn't taken place until you see the "OK" and "Done" appear on the screen. (If the kernel is really in strife, you may not ever get the OK or Done message. }}
{{admon/warning| The sync hasn't taken place until you see the "OK" and "Done" appear on the screen. (If the kernel is really in strife, you may not ever get the OK or Done message. }}


'U'mount is basically useful in the same ways as 'S'ync.
<code>U</code>mount is basically useful in the same ways as <code>S</code>ync.


The loglevels '0'-'9' are useful when your console is being flooded with kernel messages you do not want to see. Selecting '0' will prevent all but the most urgent kernel messages from reaching your console.
The loglevels <code>0</code>-<code>9</code> are useful when your console is being flooded with kernel messages you do not want to see. Selecting '0' will prevent all but the most urgent kernel messages from reaching your console.


{{admon/note| They will still be logged if syslogd/klogd are alive }}
{{admon/note| They will still be logged if syslogd/klogd are alive }}


T'e'rm and k'i'll are useful if you have some sort of runaway process you are unable to kill any other way, especially if it's spawning other processes.
T<code>e</code>rm and k<code>i</code>ll are useful if you have some sort of runaway process you are unable to kill any other way, especially if it's spawning other processes.


{{admon/note| When experiencing bad kernel panic do <Alt>+<Sysrg>+e then <Alt>+<Sysrg>+u then <Alt>+<Sysrg>+i and finally <Alt>+<Sysrg>+b }}
{{admon/note| When experiencing bad kernel panic do <Alt>+<Sysrg>+e then <Alt>+<Sysrg>+u then <Alt>+<Sysrg>+i and finally <Alt>+<Sysrg>+b }}


== Troubleshoot ==
== Troubleshooting ==
 
===Hanging before initscripts get run===
 
If the machine is hanging before the initscripts get to run, boot with <code>sysrq_always_enabled=1</code>.


=== Sometimes SysRq seems to get 'stuck' after using it, what can I do? ===
=== Sometimes SysRq seems to get 'stuck' after using it, what can I do? ===


Tapping shift, alt, and control on both sides of the keyboard, and hitting an invalid sysrq sequence again
Tapping {{key press|Shift}}, {{key press|Alt}}, and {{key press|Ctrl}} on both sides of the keyboard, and hitting an invalid sysrq sequence again
will fix the problem. (i.e., something like alt-sysrq-z).  
will fix the problem. (i.e., something like {{key press|Alt|SysRq|Z}}).  


Switching to another virtual console (ALT+Fn) and then back again should also help.
Switching to another virtual console ({{key press|Alt|Fn}}) and then back again should also help.


=== I hit SysRq, but nothing seems to happen, what's wrong? ===
=== I hit SysRq, but nothing seems to happen, what's wrong? ===

Latest revision as of 22:37, 1 February 2023

Sysrq

What is the magic SysRq key?

It is a 'magical' key combo you can hit which the kernel will respond to regardless of whatever else it is doing, unless it is completely locked up.

How do I enable the magic SysRq key?

You need to say "yes" to 'Magic SysRq key (CONFIG_MAGIC_SYSRQ)' when configuring the kernel. Stock Fedora and RHEL kernels do have this functionality enabled at compile-time, but the distributions disable it at boot time, by default, using sysctl.conf.

To re-enable it at boot time create config file in the sysctl.d directory (e.g. /etc/sysctl.d/90-sysrq.conf) with this line:

kernel.sysrq = 1

When running a kernel with SysRq compiled in, /proc/sys/kernel/sysrq controls the functions allowed to be invoked via the SysRq key. Here is the list of possible values in /proc/sys/kernel/sysrq:

  • 0 - disable sysrq completely
  • 1 - enable all functions of sysrq
  • >1 - bitmask of allowed sysrq functions (see below for detailed function description):
    • 2 - enable control of console logging level
    • 4 - enable control of keyboard (SAK, unraw)
    • 8 - enable debugging dumps of processes etc.
    • 16 - enable sync command
    • 32 - enable remount read-only
    • 64 - enable signalling of processes (term, kill, oom-kill)
    • 128 - allow reboot/poweroff
    • 256 - allow nicing of all RT tasks

You can set the value in the file by the following command.

echo "number" >/proc/sys/kernel/sysrq 

So to enable it would be.

echo "1" > /proc/sys/kernel/sysrq

Or also can enable it by doing.

sysctl -w kernel.sysrq=1
The value of /proc/sys/kernel/sysrq influences only the invocation via a keyboard. Invocation of any operation via /proc/sysrq-trigger is always allowed (by a user with admin privileges).

How do I use the magic SysRq key?

x86

You press the key combo Alt+SysRq+<command key>.

SPARC

You press Alt+Stop+<command key>

Serial Console (PC style standard serial ports only )

You send a BREAK, then within 5 seconds a command key.

Sending BREAK twice is interpreted as a normal BREAK.

PowerPC

Press Alt+PrtSc (or F13) + <command key>

All Architectures

Write a character to /proc/sysrq-trigger:

echo t > /proc/sysrq-trigger 
Some keyboards may not have a key labeled 'SysRq'. The 'SysRq' key is also known as the 'Print Screen' key. Also some keyboards cannot handle so many keys being pressed at the same time, so you might have better luck with "press Alt", "press SysRq", "release SysRq", "press <command key>", release everything.

What are the 'command' keys?

  • b - Will immediately reboot the system without syncing or unmounting your disks.
  • c - Will perform a kexec reboot in order to take a crashdump.
  • d - Shows all locks that are held.
  • e - Send a SIGTERM to all processes, except for init.
  • f - Will call oom_kill to kill a memory hog process.
  • g - Used by kgdb on ppc and sh platforms.
  • h - Will display help (any key that is not listed here will bring forth help )
  • i - Send a SIGKILL to all processes, except for init.
  • k - Secure Access Key (SAK) Kills all programs on the current virtual terminal.
See important comments below in SAK section.
  • l - Shows a stack backtrace for all active CPUs.
  • m - Will dump current memory info to your console.
  • n - Used to make RT tasks nice-able
  • o - Will shut your system off (if configured and supported).
  • p - Will dump the current registers and flags to your console.
  • q - Will dump a list of all running timers.
  • r - Turns off keyboard raw mode and sets it to XLATE.
  • s - Will attempt to sync all mounted filesystems.
  • t - Will dump a list of current tasks and their information to your console.
  • u - Will attempt to remount all mounted filesystems read-only.
  • v - Dumps Voyager SMP processor info to your console.
  • w - Dumps tasks that are in uninterruptable (blocked) state.
  • x - Used by xmon interface on ppc/powerpc platforms.
  • 0-9 - Sets the console log level, controlling which kernel messages will be printed to your console. ('0', for example would make it so that only emergency messages like PANICs or OOPSes would make it to your console.)

Okay, so what can I use them for?

Unraw is very handy when your X server or a svgalib program crashes.

Sak (Secure Access Key) is useful when you want to be sure there is no trojan program running at console which could grab your password when you would try to login. It will kill all programs on given console, thus letting you make sure that the login prompt you see is actually the one from init, not some trojan program. Others find it useful as (System Attention Key) which is useful when you want to exit a program that will not let you switch consoles. (For example, X or a svgalib program.)

In its true form it is not a true SAK like the one in a c2 compliant system, and it should not be mistaken as such.

Reboot is good when you're unable to shut down.

It's generally considered a good practice to umount first

Crashdump can be used to manually trigger a crashdump when the system is hung.

The kernel needs to have been built with CONFIG_KEXEC enabled!

Sync is great when your system is locked up, it allows you to sync your disks and will certainly lessen the chance of data loss and fscking.

The sync hasn't taken place until you see the "OK" and "Done" appear on the screen. (If the kernel is really in strife, you may not ever get the OK or Done message.

Umount is basically useful in the same ways as Sync.

The loglevels 0-9 are useful when your console is being flooded with kernel messages you do not want to see. Selecting '0' will prevent all but the most urgent kernel messages from reaching your console.

They will still be logged if syslogd/klogd are alive

Term and kill are useful if you have some sort of runaway process you are unable to kill any other way, especially if it's spawning other processes.

When experiencing bad kernel panic do <Alt>+<Sysrg>+e then <Alt>+<Sysrg>+u then <Alt>+<Sysrg>+i and finally <Alt>+<Sysrg>+b

Troubleshooting

Hanging before initscripts get run

If the machine is hanging before the initscripts get to run, boot with sysrq_always_enabled=1.

Sometimes SysRq seems to get 'stuck' after using it, what can I do?

Tapping Shift, Alt, and Ctrl on both sides of the keyboard, and hitting an invalid sysrq sequence again will fix the problem. (i.e., something like Alt+SysRq+Z).

Switching to another virtual console (Alt+Fn) and then back again should also help.

I hit SysRq, but nothing seems to happen, what's wrong?

There are some keyboards that send different scancodes for SysRq than the pre-defined 0x54. So if SysRq doesn't work out of the box for a certain keyboard, run 'showkey -s' to find out the proper scancode sequence. Then use 'setkeycodes <sequence> 84' to define this sequence to the usual SysRq code (84 is decimal for 0x54). It's probably best to put this command in a boot script.

You exit 'showkey' by not typing anything for ten seconds.

I want to add SysRQ key events to a module, how does it work?

In order to register a basic function with the table, you must first include the header 'include/linux/sysrq.h', this will define everything else you need. Next, you must create a sysrq_key_op struct, and populate it with...

  • The key handler function you will use.
  • A help_msg string, that will print when SysRQ prints help
  • An action_msg string, that will print right before your handler is called. Your handler must conform to the prototype in 'sysrq.h'.

After the sysrq_key_op is created, you can call the kernel function register_sysrq_key(int key, struct sysrq_key_op *op_p); this will register the operation pointed to by 'op_p' at table key 'key', if that slot in the table is blank. At module unload time, you must call the function unregister_sysrq_key(int key, struct sysrq_key_op *op_p), which will remove the key op pointed to by 'op_p' from the key 'key', if and only if it is currently registered in that slot. This is in case the slot has been overwritten since you registered it.

The Magic SysRQ system works by registering key operations against a key op lookup table, which is defined in 'drivers/char/sysrq.c'. This key table has a number of operations registered into it at compile time, but is mutable, and 2 functions are exported for interface to it the register_sysrq_key and unregister_sysrq_key. Of course, never ever leave an invalid pointer in the table. I.e., when your module that called register_sysrq_key() exits, it must call unregister_sysrq_key() to clean up the sysrq key table entry that it used.

Null pointers in the table are always safe.

If for some reason you feel the need to call the handle_sysrq function from within a function called by handle_sysrq, you must be aware that you are in a lock (you are also in an interrupt handler, which means don't sleep!), so you must call __handle_sysrq_nolock instead.