Unit 6: Signals
Table of Contents
1 What are signals and how are they used
A signal is a software interrupt, a way to communicate information to a process about the state of other processes, the operating system, and hardware. A signal is an interrupt in the sense that it can change the flow of the program —when a signal is delivered to a process, the process will stop what its doing, either handle or ignore the signal, or in some cases terminate, depending on the signal.
Signals may also be delivered in an unpredictable way, out of sequence with the program due to the fact that signals may originate outside of the currently executing process. Another way to view signals is that it is a mechanism for handling asynchronous events. As opposed to synchronous events, which is when a standard program executes iterative, that is, one line of code following another. Asynchronous events occur when portions of the program execute out of order. Asynchronous events typically occur due to external events originating at the hardware or operating system; the signal, itself, is the way for the operating system to communicate these events to the processes so that the process can take appropriate action.
1.1 How we use signals
Signals are used for a wide variety of purposes in Unix programming, and we've already used them in smaller contexts. For example, when we are working in the shell and wish to "kill all cat programs" we type the command:
#> killall cat
The killall
command will send a signal to all processes named cat
that says "terminate." The actually signal being sent is SIGTERM
,
whose purposes is to communicate a termination request to a given
process, but the process does not actually have to terminate … more
on that later.
We've also used and looked at signals in the context of terminal
signaling which is how programs stop, start and terminate. When we
type Ctrl-c
that is the same as sending a SIGINT
signal, and
when we type Ctrl-z
that is the same as sending a SIGTSTP
signal,
and when we type fg
or bg
that is the same as sending a SIGCONT
signal.
Each of these signals describe an action that the process should take
in response. This action is outside the normal control flow of the
program, the events arrive asynchronously requiring the process to
interrupt its current operations to respond to the event. For
above signals, the response is clear — SIGTERM
terminate,
SIGSTOP
stop, SIGCONT
continue — but for other signals, the
programmer can choose the correct response, which may be to simply
ignore the signal all together.
2 The Wide World of Signals
Every signal has a name, it starts with SIG
and ends with a
description. We can view all the signals in section 7 of the man
pages, below are the standard Linux signals you're likely to interact
with:
Signal Value Action Comment ---------------------------------------------------------------------- SIGHUP 1 Term Hangup detected on controlling terminal or death of controlling process SIGINT 2 Term Interrupt from keyboard SIGQUIT 3 Core Quit from keyboard SIGILL 4 Core Illegal Instruction SIGABRT 6 Core Abort signal from abort(3) SIGFPE 8 Core Floating point exception SIGKILL 9 Term Kill signal SIGSEGV 11 Core Invalid memory reference SIGPIPE 13 Term Broken pipe: write to pipe with no readers SIGALRM 14 Term Timer signal from alarm(2) SIGTERM 15 Term Termination signal SIGUSR1 30,10,16 Term User-defined signal 1 SIGUSR2 31,12,17 Term User-defined signal 2 SIGCHLD 20,17,18 Ign Child stopped or terminated SIGCONT 19,18,25 Cont Continue if stopped SIGSTOP 17,19,23 Stop Stop process SIGTSTP 18,20,24 Stop Stop typed at tty SIGTTIN 21,21,26 Stop tty input for background process SIGTTOU 22,22,27 Stop tty output for background process
2.1 Signal Names and Values
Notice that each signal has a name, value, and default action. The
signal name should start to become a bit more familiar, the value of
a signal is actually the same as the signal itself. In fact, the
signal name is just a #defined
value, and we can see this by
looking at the sys/signal.h
header file:
#define SIGHUP 1 /* hangup */
#define SIGINT 2 /* interrupt */
#define SIGQUIT 3 /* quit */
#define SIGILL 4 /* illegal instruction (not reset when caught) */
#define SIGTRAP 5 /* trace trap (not reset when caught) */
#define SIGABRT 6 /* abort() */
#define SIGPOLL 7 /* pollable event ([XSR] generated, not supported) */
#define SIGFPE 8 /* floating point exception */
#define SIGKILL 9 /* kill (cannot be caught or ignored) */
//(...)
In code we use both the #defined
value and the number. In general,
it is easier to remember the name of the signal, but some signals are
often referred to by value, in particular, SIGKILL
, whose value 9
is affectionately used in the phrase: "Kill 9 that process."
2.2 Default Actions of Signals
Each signal has a default action. There are four described in the table:
Term
: The process will terminateCore
: The process will terminate and produce a core dump file that traces the process state at the time of termination.Ign
: The process will ignore the signalStop
: The process will stop, like with aCtrl-Z
Cont
: The process will continue from being stopped
As we willsee later, for some signals, we can change the default
actions. A few signals, which are control signals, cannot have their
default action changed, these include SIGKILL
and SIGABRT
, which
is why "kill 9" is the ultimate kill statement.
3 Signals from the Command Line
Terminology for delivering signals is to "kill" a process with the
kill
command. The kill
command is actually poorly named —
originally, it was only used to kill or terminate a process, but it is
currently used to send any kind of signal to a process. The difference
between kill
and killall
is that kill
only sends signals to
process identified by their pid, killall
sends the signal to all
process of a given name.
3.1 Preparing for the kill
A good exercise to explore the variety of signals and how to use them is to actually use them from the command line. To start, we can open two terminals, in one, we execute the loop program:
/*loop.c*/
int main(){ while(1); }
Which will just loop forever, and in other terminal, we will kill
this process with various signals to see how it responds. Let's
start with a signal we all love to hate, the signal that indicates a
Segmentation Fault occurs. You might not have realized this, but a
Segmentation Fault is a signal generated from the O.S. that
something bad has happened. Let's simulate that effect:
killall -SIGSEGV loop
And in the in the terminal where loop
is running, the result is
eerily familiar.
#>./loop Segmentation fault: 11
The 11 following the message is the signal number: 11 is the signal
number for SIGSGV
. Note that the default response to a SIGSEGV
is to terminate with a core dump.
We can explore some of the more esoteric signals and see similar results occur when the program terminates:
| Signal | Output | |------------+-----------------------------| | SIGKILL | Killed: 9 | | SIGQUIT | Quit: 3 | | SIGILL | Illegal instruction: 4 | | SIGABRT | Abort trap: 6 | | SIGFPE | Floating point exception: 8 | | SIGPIPE | (no output) | | SIGALAR | Alarm clock: 14 | | SIGUSR1 | User defined signal 1: 30 | | SIGUSR2 | User defined signal 2: 31 | |------------+-----------------------------|
3.2 Sending Terminal Signals with Kill
Let's restart the loop program and use kill
to simulate terminal
signaling. We've been discussing how the terminal control will
deliver signals to stop, continue, and terminate a process; there's
no mystery here. Those signals are signals that you can send yourself
with kill
.
Let's look at starting the loop program again, but this time
killall -SIGSTOP loop
And again, the result in the other terminal is quite familiar:
#>./loop [1]+ Stopped ./loop
If we were to run jobs
, we can see that loop
is stopped in the
background. This is the same as typing Ctrl-z
in the terminal.
#> jobs [1]+ Stopped ./loop
Before, we'd continue the loop program with a call to bg
or fg
,
but we can use kill
to do that too. From the other terminal:
killall -SIGCONT loop
And, after we run jobs, the loop
program is running in the
background:
#> jobs [1]+ Running ./loop &
Finally, let's terminate the loop program. The Ctrl-c
from the
terminal actually generates the SIGINT
signal, which stands for
"interrupt" because a Ctrl-c
initiates an interrupt of the
foreground process, which by default terminates the process.
killall -SIGINT loop
And the expected result:
#> jobs [1]+ Interrupt: 2 ./loop &
4 Handling and Generating Signals
Now that we have a decent understanding of signals and how they
communicate information to a process, let's move on to investigate how
we can write program that take some action based on a signal. This is
described as signal handling, a program that handles a signal,
either by ignoring it or taking some action when the signal is
delivered. We will also explore how signals can be sent from one
program to another, again, we'll use a kill
for that.
4.1 Hello world of Signal Handling
The primary system call for signal handling is signal()
, which given
a signal and function, will execute the function whenever the signal
is delivered. This function is called the signal handler because it
handles the signal. The signal()
function has a strange
declaration:
int signal(int signum, void (*handler)(int))
That is, signal takes two arguments: the first argument is the signal
number, such as SIGSTOP
or SIGINT
, and the second is a reference
to a handler function whose first argument is an int and returns
void. It's probably best to explore signal()
through an example, and
hello world program is where we always start.
#include <stdlib.h>
#include <stdio.h>
#include <signal.h> /*for signal() and raise()*/
void hello(int signum){
printf("Hello World!\n");
}
int main(){
//execute hello() when receiving signal SIGUSR1
signal(SIGUSR1, hello);
//send SIGUSR1 to the calling process
raise(SIGUSR1);
}
The above program first establishes a signal handler for the user
signal SIGUSR1
. The signal handling function hello()
does as
expected: prints "Hello World!" to stdout. The program then sends
itself the SIGUSR1
signal, which is accomplished via raise()
,
and the result of executing the program is the beautiful phrase:
#> ./hello_signal Hello World!
4.2 Asynchronous Execution
Some key points to take away from the hello program is that the
second argument to signal()
is a function pointer, a reference to
a function to call. This tells the operating system that whenever
this signal is sent to this process, run this function as the signal
handler.
Also, the execution of the signal handler is asynchronous, which means the current state of the program will be paused while the signal handler executes, and then execution will resume from the pause point, much like context switching.
Let's look at another example hello world program:
/* hello_loop.c*/
void hello(int signum){
printf("Hello World!\n");
}
int main(){
//Handle SIGINT with hello
signal(SIGINT, hello);
//loop forever!
while(1);
}
The above program will set a signal handler for SIGINT
the signal
that is generated when you type Ctrl-C
. The question is, when we
execute this program, what will happen when we type Ctrl-C
?
To start, let's consider the execution of the program. It will
register the signal handler and then will enter the infinite
loop. When we hit Ctrl-C
, we can all agree that the signal handler
hello()
should execute and "Hello World!" prints to the screen,
but the program was in an infinite loop. In order to print "Hello
World!" it must have been the case that it broke the loop to execute
the signal handler, right? So it should exit the loop as well as the
program. Let's see:
#> ./hello_loop ^CHello World! ^CHello World! ^CHello World! ^CHello World! ^CHello World! ^CHello World! ^CHello World! ^\Quit: 3
As the output indicates, every time we issued Ctrl-C
"Hello
World!" prints, but the program returns to the infinite loop. It is
only after issuing a SIGQUIT
signal with Ctrl-\
did the program
actually exit.
While the interpretation that the loop would exit is reasonable, it doesn't consider the primary reason for signal handling, that is, asynchronous event handling. That means the signal handler acts out of the standard flow of the control of the program; in fact, the whole program is saved within a context, and a new context is created just for the signal handler to execute in. If you think about it some more, you realize that this is pretty cool, and also a totally new way to view programming.
4.3 Inter Process Communication
Signals are also a key means for inter-process communication. One
process can send a signal to another indicating that an action
should be taken. To send a signal to a particular process, we use
the kill()
system call. The function declaration is below.
int kill(pid_t pid, int signum);
Much like the command line version, kill()
takes a process
identifier and a signal, in this case the signal value as an int
,
but the value is #defined
so you can use the name. Let's see it
in use.
/*ipc_signal.c*/
void hello(){
printf("Hello World!\n");
}
int main(){
pid_t cpid;
pid_t ppid;
//set handler for SIGUSR1 to hello()
signal(SIGUSR1, hello);
if ( (cpid = fork()) == 0){
/*CHILD*/
//get parent's pid
ppid = getppid();
//send SIGUSR1 signal to parrent
kill(ppid, SIGUSR1);
exit(0);
}else{
/*PARENT*/
//just wait for child to terminate
wait(NULL);
}
}
In this program, first a signal handler is established for
SIGUSR1
, the hello()
function. After the fork, the parent calls
wait()
and the child will communicate to the parent by "killing"
it with the SIGUSR1
signal. The result is that the handler is
invoked in the parent and "Hello World!" is printed to stdout from
the parent.
While this is a small example, signals are integral to inter
process communication. In previous lessons, we've discussed how to
communicate data between process with pipe()
, signals is the way
process communicate state changes and other asynchronous
events. Perhaps most relevant is state change in child
processes. The SIGCHLD
signal is the signal that gets delivered
to the parent when a child terminates. So far, we've been handling
this signal implicitly through wait()
, but you can choose instead
to handle SIGCHLD
and take different actions when a child
terminates. We'll look at that in more detail in a future lesson.
4.4 Ignoring Signals
So far, our handlers have been doing things — mostly, printing
"Hello World!" — but we might just want our handler to do
nothing, essentially, ignoring the signal. That is easy enough to
write in code, for example, here is a program that will ignore
SIGINT
by handling the signal and do nothing:
/*ingore_sigint.c*/
#include <signal.h>
#include <sys/signal.h>
void nothing(int signum){ /*DO NOTHING*/ }
int main(){
signal(SIGINT, nothing);
while(1);
}
And if we run this program, we see that, yes, it Ctrl-c
is
ineffective and we have to use Ctrl-\
to quit the program:
>./ignore_sigint ^C^C^C^C^C^C^C^C^C^C^\Quit: 3
But, it would seem like a pain to always have to write the silly
little ignore function that does nothing, and so, when there is a
need, there is a way. The signal.h
header defines a set of actions
that can be used in place of the handler:
SIG_IGN
: Ignore the signalSIG_DFL
: Replace the current signal handler with the default handler
With these keywords, we can rewrite the program simply as:
int main(){
// using SIG_IGN
signal(SIGINT, SIG_IGN);
while(1);
}
4.5 Changing and Reverting to the default handler
Setting a signal handler is not a singular event. You can always change the handler and you can also revert the handler back to default state. For example, consider the following program:
/*you_shot_me.c*/
void handler_3(int signum){
printf("Don't you dare shoot me one more time!\n");
//Revert to default handler, will exit on next SIGINT
signal(SIGINT, SIG_DFL);
}
void handler_2(int signum){
printf("Hey, you shot me again!\n");
//switch handler to handler_3
signal(SIGINT, handler_3);
}
void handler_1(int signum){
printf("You shot me!\n");
//switch handler to handler_2
signal(SIGINT, handler_2);
}
int main(){
//Handle SIGINT with handler_1
signal(SIGINT, handler_1);
//loop forever!
while(1);
}
The program first initiates handler_1()
as the signal handler for
SIGINT
. After the first Ctrl-c
, in the signal handler, the
handler is changed to handler_2()
, and after the second Ctrl-c
,
it is change again to handler_3()
from handler_2()
. Finally, in
handler_3()
the default signal handler is reestablished, which is
to terminate on SIGINT
, and that is what we see in the output:
#> ./you_shout_me ^CYou shot me! ^CHey, you shot me again! ^CDon't you dare shoot me one more time! ^C
4.6 Some signals are more equal than others
The last note on signal handling is that not all signals are created equal, and some signals are more equal than others. That means, that you cannot handle all signals because it could potentially place the system in an unrecoverable state.
The two signals that can never be ignored or handled are: SIGKILL
and SIGSTOP
. Let's look at an example:
/* ignore_stop.c */
int main(){
//ignore SIGSTOP ?
signal(SIGSTOP, SIG_IGN);
//infinite loop
while(1);
}
The above program tries to set the ignore signal handler for
SIGSTOP
, and then goes into an infinite loop. If we execute the
program, we find that oure efforts were fruitless:
#>./ignore_stop ^Z [1]+ Stopped ./ignore_stop
The program did stop. And we can see the same for a program that
ignores SIGKILL
.
int main(){
//ignore SIGSTOP ?
signal(SIGKILL, SIG_IGN);
//infinite loop
while(1);
}
#>./ignore_kill & [1] 13129 #>kill -SIGKILL 13129 [1]+ Killed: 9 ./ignore_kill
The reasons for this are clearer when you consider that all programs must have a way to stop and terminate. These processes cannot be interfered with otherwise operating system would loose control of execution traces.
4.7 Checking Errors of signal()
The signal()
function returns a pointer to the previous signal
handler, which means that here, again, is a system call that we
cannot error check in the typical way, by checking if the return
value is less than 0. This is because a pointer type is unsigned,
there is no such thing as negative pointers.
Instead, a special value is used SIG_ERR
which we can compare
the return value of signal()
. Here, again, is the program where
we try and ignore SIGKILL
, but this time with proper error
checking:
/*signal_errorcheck.c*/
int main(){
//ignore SIGSTOP ?
if( signal(SIGKILL, SIG_IGN) == SIG_ERR){
perror("signal");;
exit(1);
}
//infinite loop
while(1);
}
And the output from the perror()
is clear:
#>./signal_errorcheck signal: Invalid argument
The invalid argument is SIGKILL
which cannot be handled or
ignored. It can only KILL!
5 Alarm Signals and SIGALRM
In the last lesson, we explored signal handling and user generated
signals, particularly those from the terminal, the SIGKILL
, and
the user signals, or the SIGUSR1
and SIGUSR2
. In this lessons,
we'll explore O.S. generated signals. We'll start with SIGALRM
.
5.1 Setting an alarm
A SIGALRM
signal is delivered by the Operating System via a
request from the user occuring after some amount of time. To
request an alarm, use the alarm()
system call:
unsigned int alarm(unsigned int seconds);
After seconds
have passed since requesting the alarm()
, the
SIGALRM
signal is delivered. The default behavior of SIGALRM
is
to terminate, so we can catch and handle the signal, leading to a
nice hello world program:
#include <stdio.h>
#include <stdlib.h>
#include <unistd.h>
#include <signal.h>
#include <sys/signal.h>
void alarm_handler(int signum){
printf("Buzz Buzz Buzz\n");
}
int main(){
//set up alarm handler
signal(SIGALRM, alarm_handler);
//schedule alarm for 1 second
alarm(1);
//do not proceed until signal is handled
pause();
}
The program looks very much like our other signal handling
programs, except this time the signal is delivered via the
alarm()
call. Additionally, we are introducing a new system call,
pause()
.
The pause()
call will "pause" the program until a signal is
delivered and handled. Pausing is an effective way to avoid busy
waiting, e.g., while(1);
, because the process is suspended during
a pause and awoken following the return of the signal handler.
5.2 Recurring Alarms
Alarm can be set continually, but only one alarm is allowed per
process. Subsequent calls to alarm()
will reset the previous
alarm. Suppose, now, that we want to write a program that will
continually alarm every 1 second, we would need to reset the alarm
once the signal is delivered. The natural place to do that is in
the signal handler:
/* buzz_buzz.c*/
void alarm_handler(int signum){
printf("Buzz Buzz Buzz\n");
//set a new alarm for 1 second
alarm(1);
}
int main(){
//set up alarm handler
signal(SIGALRM, alarm_handler);
//schedule the first alarm
alarm(1);
//pause in a loop
while(1){
pause();
}
}
After the first SIGALRM
is delivered and "Buzz Buzz Buzz" is
printed, another alarm is schedule via alarm(1)
. The process will
resume after the pause()
, but since it is in a loop, it will return
to a suspended state. The result is an alarm clock buzzing every 1
second. Running with time
utility, after about 4 seconds, we saw
4 buzzers.
#> time ./buzz_buzz Buzz Buzz Buzz Buzz Buzz Buzz Buzz Buzz Buzz Buzz Buzz Buzz ^C real 0m4.473s user 0m0.001s sys 0m0.002s
One thing to note about this example is that while the signal
handler code runs asynchronously, it is part of the process
program. Calling alarm()
not in the main method is a perfectly
fine thing to do and necessary.
5.3 Resetting Alarms
Let's suppose we want to add a snooze function to our alarm. If the
user enters Ctrl-c
then we want to reset the alarm to 5 seconds
before buzzing again, like snooze. We can easily add a signal
handler to do that.
void sigint_handler(int signum){
printf("Snoozing!\n");
//schedule next alarm for 5 seconds
alarm(5);
}
void alarm_handler(int signum){
printf("Buzz Buzz Buzz\n");
//set a new alarm for 1 second
alarm(1);
}
int main(){
//set up alarm handler
signal(SIGALRM, alarm_handler);
//set up signint handler
signal(SIGINT, sigint_handler);
//schedule the first alarm
alarm(1);
//pause in a loop
while(1){
pause();
}
}
And we can see that the output matches our expectation using the
time utility. Note we need to use Ctrl-\
to quit the process.
#>time ./snooze_buzz Buzz Buzz Buzz Buzz Buzz Buzz ^CSnoozing! Buzz Buzz Buzz Buzz Buzz Buzz ^CSnoozing! Buzz Buzz Buzz Buzz Buzz Buzz ^\Quit: 3 real 0m15.469s user 0m0.001s sys 0m0.003s
There is some interesting dilemas here: What happened to the last
alarm? And, what happens if I type Ctrl-C
multiple times, how
long will it snooze? The anser is, only one alarm may be schedule
at one time. Calling alarm()
again will reset any previous
alarms, so the answer to the questions is that the previous alarm
is replaced and subsequent snoozes only resets the previous snooze
back to 5 seconds.
If that's the case, how might we unschedule a previously scheduled
alarm. The way to do that is by scheduling an alarm for 0 seconds,
alarm(0)
. For example, we can finish our alarm clock by adding an
"off" switch that listens for Ctrl-\
or SIGQUIT
, which will
unschedule the alarm and reset the signal handler for Ctrl-c
back
to the default.
void sigquit_handler(int signum){
printf("Alarm Off\n");
//turn off all pending alarms
alarm(0);
//reinstate default handler for SIGINT
// Ctrl-C will now terminate program
signal(SIGINT, SIG_DFL);
}
void sigint_handler(int signum){
printf("Snoozing!\n");
//schedule next alarm for 5 seconds
alarm(5);
}
void alarm_handler(int signum){
printf("Buzz Buzz Buzz\n");
//set a new alarm for 1 second
alarm(1);
}
int main(){
//set up alarm handler
signal(SIGALRM, alarm_handler);
//set up signint handler
signal(SIGINT, sigint_handler);
//set up signint handler
signal(SIGQUIT, sigquit_handler);
//schedule the first alarm
alarm(1);
//pause in a loop
while(1){
pause();
}
}
And we can track the output:
#>./offswitch_buzz Buzz Buzz Buzz Buzz Buzz Buzz Buzz Buzz Buzz ^CSnoozing! ^\Alarm Off ^C
While this is a simple example, it demonstrates a lot of power in signal handling and how even in asynchronous settings, a lot of power programming concepts can be used. It's a totally different way to communicate with programs.
6 sigaction()
and Reentrant Functions
Now, we're starting to get a bit better with signal handling, we need to turn our attention back to the pesky asynchronous nature of signal handling. In the previous part, we harnessed asynchronicity to program with just the signal handlers, but we now need to consider how signal handlers might affect the primary control flow, in particularly when a system call is executed.
6.1 sigaction()
To start, we need to learn a more advanced form of the signal()
system call, sigaction()
. Like signal()
, sigaction()
allows
the programmer to specify a signal handler for a given signal, but
it also enables the programmer to retrieve additional information
about the signaling process and set additional flags and etc.
The decleration of sigaction()
is as follows:
int sigaction(int signum, const struct sigaction *act,
struct sigaction *oldact);
The first argument is the signal to be handled, while the second and
third arguments are references to a struct sigaction
. It is in the
struct sigaction
that we set the handler function and additional
arguments. It has the following members:
struct sigaction {
void (*sa_handler)(int);
void (*sa_sigaction)(int, siginfo_t *, void *);
sigset_t sa_mask;
int sa_flags;
};
The first two fields, sa_handler
and sa_sigaction
are function
references to signal handlers; sa_handler
has the same type as the
handlers we've been using previously, and we can now write a simple
hello world program with sigaction()
.
void handler(int signum){
printf("Hello World!\n");
}
int main(){
//declare a struct sigaction
struct sigaction action;
//set the handler
action.sa_handler = handler;
//call sigaction with the action structure
sigaction(SIGALRM, &action, NULL);
//schedule an alarm
alarm(1);
//pause
pause();
}
We'll look at using the more advanced sa_sigaction
handler later,
but there are other important differences between signal()
and
sigaction()
that are worth exploring. In particular, using
sigaction()
allows us to explores reentrant functions.
6.2 Reentrant Functions
Recall that when a signal handler is invoked, this is done outside the control flow of the program. Normally, the asynchronous invocation of the signal handler is not problematic: the context of the program is saved; the signal handler runs; and, the original context of the program is restored.
However, what happens when the context of the program is within a
blocking function, like reading from a device (read()
) or waiting
for a process to terminate (wait()
). The arrival of the signal
and invocation of the signal handler will interrupt the process,
waking it from a blocking state to execute the signal handler, but
what happens when it returns to the program?
The answer to that question depends on the operation being
performed. Most functions are reentrant and can be restarted in
such cases, but others are explicitly not. For example pause()
is
explicitly not reentrant by design; once interrupted, it should not
return to a blocking state. But functions like read()
and
wait()
need to be told to restart.
6.3 Interrupting System call EINTR
Let's first consider a simple example. Here is a rude little program that will ask for a users name, but if they don't answer within 1 second, it starts barking at the user "What's taking so long?"
void handler(int signum){
printf("What's taking so long?\n");
alarm(1);
}
int main(){
char name[1024];
struct sigaction action;
action.sa_handler = handler;
sigaction(SIGALRM, &action, NULL);
alarm(1);
printf("What is your name?\n");
//scanf returns the number of items scanned
if( scanf("%s", name) != 1){
perror("scanf fail");
exit(1);
}
printf("Hello %s!\n", name);
}
Running it, you can see that, yes, if you were to enter your name quickly, the program plays nice:
#> ./scanf_fail What is your name? adam Hello adam!
But, if you're late at all, it should start the barking process, but that's actually not what happens:
#> ./scanf_fail What is your name? What's taking so long? scanf fail: Interrupted system call
Instead, we get an error in the scanf()
function, which is a
library function that must call read()
to read from standard
input. The read()
is interrupted, which results in the error
message "Interrupted system call" whose error number is
EINTR
. From the man page for read:
EINTR The call was interrupted by a signal before any data was read; see signal(7).
And we can follow up by reading in man 7 signal
:
Interruption of System Calls and Library Functions by Signal Handlers If a signal handler is invoked while a system call or library function call is blocked, then either: * the call is automatically restarted after the signal handler returns; or * the call fails with the error EINTR. Which of these two behaviors occurs depends on the interface and whether or not the signal handler was established using the SA_RESTART flag (see sigaction(2)).
To avoid this scenario we need to set an additional flag for
sigaction()
:
struct sigaction {
void (*sa_handler)(int);
void (*sa_sigaction)(int, siginfo_t *, void *);
sigset_t sa_mask;
int sa_flags; //<---
};
What we are going to do, is update our annoying program from before
to use a SA_RESTART
flag so that read()
will be restarted after
the signal handler returns:
6.4 SA_RESTART
If we take another look at the struct sigaction
, there is a field
for flags:
void handler(int signum){
printf("What's taking so long?\n");
alarm(1);
}
int main(){
char name[1024];
struct sigaction action;
action.sa_handler = handler;
action.sa_flags = SA_RESTART; //<-- restart
sigaction(SIGALRM, &action, NULL);
alarm(1);
printf("What is your name?\n");
//scanf returns the number of items scanned
if( scanf("%s", name) != 1){
perror("scanf fail");
exit(1);
}
printf("Hello %s!\n", name);
}
6.5 Not all System Calls are Reentrant
It might seem like we've solved all the problems with the
SA_RESTART
flag, but not all system calls are reentrant. You can
see a complete listing in man 7 signal
, but we'll focus on one
you might encounter in your programing. The sleep()
system call
is not reentrant.
We can see this with a simple example:
void handler(int signum){
printf("Alarm\n");
alarm(1);
}
int main(){
struct sigaction action;
action.sa_handler = handler;
action.sa_flags = SA_RESTART; //<-- restart
sigaction(SIGALRM, &action, NULL);
//alarm in 1 second
alarm(1);
//meanwhile sleep for 2 seconds
sleep(2);
//how long does the program run for?
}
A handler for SIGALRM
is established with the SA_RESTART
flag,
so all should be good. An alarm is scheduled for 1 second and then
the program should sleep for 2 seconds. The question is: How long
does the program take to run?
There are two possibilities. First, it could take 2 seconds because
SIGALRM
is handled the sleep()
is restarted with an remaining 1
second to sleep, totaling 2 seconds worth runtime. Alternatively,
the program will run for 1 second; once SIGALRM
is handled, the
sleep will not be restarted, and the program terminates with 1
second of runtime. Let's run it and find out.
#> time ./sleep_restart Alarm real 0m1.005s user 0m0.001s sys 0m0.002s
The program runs for only 1 second, and that is because sleep()
is not reentrant. It cannot be restarted after a signal
handler. This is just a singular example, but there are other
system calls that meet these conditions, some you might also use,
like send()
and recv()
for network socket programming, and
understanding the properties of reentrant system calls is important
to becoming an effective systems programmer.