Disabling allocations in `pre_exec` (and signal handlers)

# Proposal

## Problem statement



Allocating inside [`CommandExt::pre_exec`](https://doc.rust-lang.org/nightly/std/os/unix/process/trait.CommandExt.html#tymethod.pre_exec)[^1] is a footgun.

[^1]: `Command` can be materialized on `unix` in two ways -- [`Command::spawn`](https://doc.rust-lang.org/nightly/std/process/struct.Command.html#method.spawn) and [`CommandExt::exec`](https://doc.rust-lang.org/nightly/std/os/unix/process/trait.CommandExt.html#tymethod.exec). All `pre_exec` madness about async-signal-safety applies only to `spawn` variant.

[According to POSIX](https://pubs.opengroup.org/onlinepubs/9799919799/functions/V2_chap02.html#:~:text=The%20following%20table%20defines%20a%20set%20of%20functions%20and%20function-like%20macros%20that%20shall%20be%20async-signal-safe), `malloc` is not async-signal-safe, meaning that it's not guaranteed to work after fork in a multi-thread process. In practice, this sometimes [leads to deadlocks](https://github.com/rust-lang/rust/blob/e0e204f3e97ad5f79524b9c259dc38df606ed82c/library/std/src/sys/process/unix/unix.rs#L247-L276), although some libcs take liberty in making it fine to call `malloc`/`free` after fork.

[2024 revision of POSIX](https://pubs.opengroup.org/onlinepubs/9799919799/functions/fork.html) says that "the application shall ensure that the child process only executes async-signal-safe operations until such time as one of the `exec` functions is successful". This does not clearly indicate that executing them is UB. However, [the previous revision](https://pubs.opengroup.org/onlinepubs/9699919799/functions/fork.html) says that, under the same conditions, it is undefined behavior to invoke `fork` if async-signal-unsafe `pthread_atfork` handlers are registered. Since this is just invoking async-signal-unsafe functions with a layer of indirection, I believe this wording implies executing async-signal-unsafe operations after `fork` is UB in general. Musl authors seemingly [agree with this interpretation](https://lists.uclibc.org/pipermail/uclibc/2011-March/045130.html).

Yet it can be tricky to know if the function is allocating, or at least not immediately obvious, as allocations are hidden and every Rust function can allocate. This problem is exacerbated by lack of tools to detect allocations inside `pre_exec`: Miri [can't analyze `pre_exec` hooks](https://play.rust-lang.org/?version=stable&mode=debug&edition=2024&gist=a862ce9a6e8a3e470b792854b6070986), clippy does not lint against using these functions and Valgrind remains silent. We can't make `pre_exec` safe, but we can at least help people recognise bugs in their code.

## Motivating examples or use cases



During efforts to make panicking work normally after fork it was [discovered that a simple `catch_unwind` solution does not reliably work across systems because of allocations](https://github.com/rust-lang/rust/pull/80263#issuecomment-774228420) and it was [suggested to make global allocator abort after fork](https://github.com/rust-lang/rust/pull/80263#issuecomment-774272370). Though `std` moved forward with the other solution to "don't unwind past `fork()` in child", it still could be helpful to know where these allocations were.

The `pre_exec` allocations footgun is mostly hit by external code, though. @purplesyringa recently found many projects that unknowingly use allocating [`Error::other` or `Error::new`](https://github.com/rust-lang/rust/pull/148971) in `pre_exec` hooks. It is not clear how developers could find this bug.

1. Even though `chdir(2)` is async-signal-safe, Rust's [`std::env::set_current_dir`](https://doc.rust-lang.org/nightly/std/env/fn.set_current_dir.html) is not, because it has to allocate to append NUL-byte to path. This applies to all filesystem-related functions. (Note that [`Command::current_dir`](https://doc.rust-lang.org/nightly/std/process/struct.Command.html#method.current_dir) could not be used in https://github.com/alacritty/alacritty/issues/8751 because the intention was to swallow errors.)
2. Allocating functions can be deeply nested in `pre_exec`, like filesystem accesses inside `Cgroup::add_task` in https://github.com/openanolis/cryptpilot/issues/54, or could be abstracted away, like in `logger` in https://github.com/alacritty/alacritty/pull/8756, so simple clippy lints could not help detect allocations.
3. Project maintainers showed interest in such functionality: https://github.com/reubeno/brush/pull/777#issuecomment-3563251506.

## Solution sketch



The idea is to make allocations inside `spawn` variant `pre_exec` a library UB and integrate checks into [`ub-checks` efforts](https://github.com/rust-lang/rust/issues/120848). Although POSIX mentions only multi-threaded processes, std could possibly have created helper threads, and lack of helper threads should probably be considered an implementation detail by treating all programs as multi-threaded in the `pre_exec` case.

While `malloc(3)` is async-signal-unsafe, certain custom allocators may be guaranteed to work correctly after `fork` (e.g. bump allocators). Therefore the check must be opt-in for the allocator.

Since async-signal-safe contexts also appear in signal handlers, it would be prudent to make the API useful for that purpose as well. We don't have to expose it right away, but the developed solution should be forward-compatible.

All in all, the plan is to expose a function to query whether we are in an "async signal" context (like [`std::thread::panicking`](https://doc.rust-lang.org/nightly/std/thread/fn.panicking.html)) and a "guard" (like [`DropGuard`](https://doc.rust-lang.org/nightly/std/mem/struct.DropGuard.html)).

```rust
// Names are up to bikeshed
mod thread {
    #[thread_local]
    static ASYNC_SIGNAL_NESTING: Cell<usize> = const { Cell::new(0) };

    pub fn is_async_signal_context() -> bool {
        ASYNC_SIGNAL_NESTING.get() != 0
    }

    pub struct AsyncSignalGuard;

    impl AsyncSignalGuard {
        fn new() -> Self {
            ASYNC_SIGNAL_NESTING.set(ASYNC_SIGNAL_NESTING.get() + 1);
            Self
        }
    }

    impl Drop for AsyncSignalGuard {
        fn drop(&mut self) {
            ASYNC_SIGNAL_NESTING.set(ASYNC_SIGNAL_NESTING.get() - 1);
        }
    }
}
```

Allocators would use this API to make sure they were not called inside an async signal context:

```rust
debug_assert!(
    !std::thread::is_async_signal_context(),
    "This allocator does not support allocating in async-signal-safe context."
);
```

The allocator, or a custom wrapper, could alternatively choose to use `assert!` to run such checks even in release mode. The `System` allocator would have to use a different mechanism than `debug_assert!` if `std` is pre-built.

Users of raw `fork`, like `Command::spawn`, and signal handlers may enter the async signal context with `AsyncSignalGuard`:

```rust
let pid = unsafe { cvt(libc::fork())? };
if pid == 0 {
    std::panic::always_abort();
    let _guard = std::thread::AsyncSignalGuard::new();
    // remaining code here
}
```

These functions should be stubbed on `#[cfg(not(unix))]` so that allocators don't have to always use `#[cfg(unix)]` (e.g. because we may want to introduce such checks on other platforms).

Note that `panic!` inside `pre_exec` will not show traceback now ([playground](https://play.rust-lang.org/?version=stable&mode=debug&edition=2024&gist=fed6652f80128f6e04b62e14c1d5d1c9)), and using this approach with allocator will produce a near-useless message "you allocate in async-signal-unsafe context" without telling where exactly that allocation happened. Though I believe one can add a breakpoint to `panic` and view traceback in a debugger, it has a less nicer experience than desired.

To sum up, this proposal consists of two parts:

- A new public API, and
- Updating `spawn` to enter an async signal context and the `System` allocator to panic when invoked in an async signal context.

## Alternatives



### Custom allocator published on `crates.io`

It is possible to write a custom allocator that would wrap `System` and register `pthread_atfork` handler to set a flag to disable allocations (this only detects `fork` and not `_Fork`; though another fork detection mechanism may not have this issue). However:

- Users would need to actively opt in this allocator, and third-party tools lack good discoverability compared to first-party tools like Miri. To use this debug mechanism, you'd have to be aware of the possibility of allocations, but the main reason this problem is wide-spread is lack of this knowledge.
- It does not open opportunity to mark other, non-allocating `std` functions as async-signal-unsafe, limiting the applicability.
- There is a many-to-many correspondence between async signal contexts (`fork`, manual syscalls, manually set up signal handlers) and async-signal-unsafe functions (allocators, libc wrappers, unwinders). For the checks to work, both have to use the same registry to enter async signal contexts and assert safety, so there would have to be exactly one such crate in the ecosystem, with no possibility of switching without losing functionality. That crate might as well be `std`.

### Replace "async signal context" with "no-alloc context"

Focusing on "no-alloc context" will make it impossible to track non-allocating async-signal-unsafe functions, e.g. functions using global mutexes or non-reentrant functions. However, "no-alloc context" is a platform-independent idea and might be useful outside of `cfg(unix)`. It is not clear which approach is better.

### Just wait for effects

*Suggested in https://github.com/bootc-dev/containers-image-proxy-rs/pull/109#issuecomment-3562849233*

Effects in Rust are not coming soon, whereas the proposed solution is simple enough for an incremental improvement.

### Clippy lint against certain functions

*Suggested in https://github.com/jamesmcm/vopono/pull/340#issuecomment-3542161352*

Clippy can't see through functions, and a lot of APIs unknowingly allocate. The lint would have a lot of false negatives and therefore be near-useless. A more complicated analysis would just be an ad-hoc implementation of effects.

## Links and related work



- Aborting on panic after `fork`: https://github.com/rust-lang/rust/pull/80263, https://github.com/rust-lang/rust/pull/81858, https://github.com/rust-lang/rust/issues/84438
- Similar proposal idea: https://github.com/rust-lang/rust/pull/80263#issuecomment-774272370, https://github.com/rust-lang/rust/pull/81858#pullrequestreview-617053754
- Unsafe uses of `Error::new`/`Error::other`: https://github.com/rust-lang/rust/pull/148971

## What happens now?

This issue contains an API change proposal (or ACP) and is part of the libs-api team [feature lifecycle]. Once this issue is filed, the libs-api team will review open proposals as capability becomes available. Current response times do not have a clear estimate, but may be up to several months.

[feature lifecycle]: https://std-dev-guide.rust-lang.org/development/feature-lifecycle.html

## Possible responses

The libs team may respond in various different ways. First, the team will consider the *problem* (this doesn't require any concrete solution or alternatives to have been proposed):

- We think this problem seems worth solving, and the standard library might be the right place to solve it.
- We think that this probably doesn't belong in the standard library.

Second, if there's a concrete solution:

- We think this specific solution looks roughly right, approved, you or someone else should implement this. (Further review will still happen on the subsequent implementation PR.)
- We're not sure this is the right solution, and the alternatives or other materials don't give us enough information to be sure about that. Here are some questions we have that aren't answered, or rough ideas about alternatives we'd want to see discussed.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Disabling allocations in `pre_exec` (and signal handlers) #701

Proposal

Problem statement

Motivating examples or use cases

Solution sketch

Alternatives

Custom allocator published on `crates.io`

Replace "async signal context" with "no-alloc context"

Just wait for effects

Clippy lint against certain functions

Links and related work

What happens now?

Possible responses

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Disabling allocations in pre_exec (and signal handlers) #701

Description

Proposal

Problem statement

Motivating examples or use cases

Solution sketch

Alternatives

Custom allocator published on crates.io

Replace "async signal context" with "no-alloc context"

Just wait for effects

Clippy lint against certain functions

Links and related work

What happens now?

Possible responses

Footnotes

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions

Disabling allocations in `pre_exec` (and signal handlers) #701

Custom allocator published on `crates.io`