A sleep queue is a structure that holds the list of threads asleep on a wait channel. Each thread that is not asleep on a wait channel carries a sleep queue structure around with it. When a thread blocks on a wait channel, it donates its sleep queue structure to that wait channel. Sleep queues associated with a wait channel are stored in a hash table.
The sleep queue hash table holds sleep queues for wait channels that have at least one blocked thread. Each entry in the hash table is called a sleepqueue chain. The chain contains a linked list of sleep queues and a spin mutex. The spin mutex protects the list of sleep queues as well as the contents of the sleep queue structures on the list. Only one sleep queue is associated with a given wait channel. If multiple threads block on a wait channel than the sleep queues associated with all but the first thread are stored on a list of free sleep queues in the master sleep queue. When a thread is removed from the sleep queue it is given one of the sleep queue structures from the master queue's free list if it is not the only thread asleep on the queue. The last thread is given the master sleep queue when it is resumed. Since threads may be removed from the sleep queue in a different order than they are added, a thread may depart from a sleep queue with a different sleep queue structure than the one it arrived with.
The sleepq_lock
function locks the spin mutex of the
sleep queue chain that maps to a specific wait channel. The sleepq_lookup
function looks in the hash table for the master
sleep queue associated with a given wait channel. If no master sleep queue is found, it
returns NULL
. The sleepq_release
function unlocks the spin mutex associated with a
given wait channel.
A thread is added to a sleep queue via the sleepq_add
.
This function accepts the wait channel, a pointer to the mutex that protects the wait
channel, a wait message description string, and a mask of flags. The sleep queue chain
should be locked via sleepq_lock
before this function is
called. If no mutex protects the wait channel (or it is protected by Giant), then the
mutex pointer argument should be NULL
. The flags argument
contains a type field that indicates the kind of sleep queue that the thread is being
added to and a flag to indicate if the sleep is interruptible (SLEEPQ_INTERRUPTIBLE
). Currently there are only two types of
sleep queues: traditional sleep queues managed via the msleep
and wakeup
functions (SLEEPQ_MSLEEP
) and condition variable sleep queues (SLEEPQ_CONDVAR
). The sleep queue type and lock pointer argument
are used solely for internal assertion checking. Code that calls sleepq_add
should explicitly unlock any interlock protecting the
wait channel after the associated sleepqueue chain has been locked via sleepq_lock
and before blocking on the sleep queue via one of the
waiting functions.
A timeout for a sleep is set by invoking sleepq_set_timeout
. The function accepts the wait channel and the
timeout time as a relative tick count as its arguments. If a sleep should be interrupted
by arriving signals, the sleepq_catch_signals
function
should be called as well. This function accepts the wait channel as its only parameter.
If there is already a signal pending for this thread, then sleepq_catch_signals
will return a signal number; otherwise, it
will return 0.
Once a thread has been added to a sleep queue, it blocks using one of the sleepq_wait
functions. There are four wait functions depending on
whether or not the caller wishes to use a timeout or have the sleep aborted by caught
signals or an interrupt from the userland thread scheduler. The sleepq_wait
function simply waits until the current thread is
explicitly resumed by one of the wakeup functions. The sleepq_timedwait
function waits until either the thread is
explicitly resumed or the timeout set by an earlier call to sleepq_set_timeout
expires. The sleepq_wait_sig
function waits until either the thread is
explicitly resumed or its sleep is aborted. The sleepq_timedwait_sig
function waits until either the thread is
explicitly resumed, the timeout set by an earlier call to sleepq_set_timeout
expires, or the thread's sleep is aborted. All
of the wait functions accept the wait channel as their first parameter. In addition, the
sleepq_timedwait_sig
function accepts a second boolean
parameter to indicate if the earlier call to sleepq_catch_signals
found a pending signal.
If the thread is explicitly resumed or is aborted by a signal, then a value of zero is
returned by the wait function to indicate a successful sleep. If the thread is resumed by
either a timeout or an interrupt from the userland thread scheduler then an appropriate
errno value is returned instead. Note that since sleepq_wait
can only return 0 it does not return anything and the
caller should assume a successful sleep. Also, if a thread's sleep times out and is
aborted simultaneously then sleepq_timedwait_sig
will
return an error indicating that a timeout occurred. If an error value of 0 is returned
and either sleepq_wait_sig
or sleepq_timedwait_sig
was used to block, then the function sleepq_calc_signal_retval
should be called to check for any
pending signals and calculate an appropriate return value if any are found. The signal
number returned by the earlier call to sleepq_catch_signals
should be passed as the sole argument to sleepq_calc_signal_retval
.
Threads asleep on a wait channel are explicitly resumed by the sleepq_broadcast
and sleepq_signal
functions. Both functions accept the wait channel from which to resume threads, a
priority to raise resumed threads to, and a flags argument to indicate which type of
sleep queue is being resumed. The priority argument is treated as a minimum priority. If
a thread being resumed already has a higher priority (numerically lower) than the
priority argument then its priority is not adjusted. The flags argument is used for
internal assertions to ensure that sleep queues are not being treated as the wrong type.
For example, the condition variable functions should not resume threads on a traditional
sleep queue. The sleepq_broadcast
function resumes all
threads that are blocked on the specified wait channel while sleepq_signal
only resumes the highest priority thread blocked on
the wait channel. The sleep queue chain should first be locked via the sleepq_lock
function before calling these functions.
A sleeping thread may have its sleep interrupted by calling the sleepq_abort
function. This function must be called with sched_lock
held and the thread must be queued on a sleep queue. A
thread may also be removed from a specific sleep queue via the sleepq_remove
function. This function accepts both a thread and a
wait channel as an argument and only awakens the thread if it is on the sleep queue for
the specified wait channel. If the thread is not on a sleep queue or it is on a sleep
queue for a different wait channel, then this function does nothing.
- Compare/contrast with sleep queues.
- Lookup/wait/release. - Describe TDF_TSNOBLOCK race.
- Priority propagation.
- Should we require mutexes to be owned for mtx_destroy() since we can not safely assert that they are unowned by anyone else otherwise?
- Use a critical section...
- Describe the races with contested mutexes
- Why it is safe to read mtx_lock of a contested mutex when holding the turnstile chain lock.
This, and other documents, can be downloaded from ftp://ftp.FreeBSD.org/pub/FreeBSD/doc/.
For questions about FreeBSD, read the documentation before contacting <questions@FreeBSD.org>.
For questions about this documentation, e-mail <doc@FreeBSD.org>.