Back to Bear

Wrapper-based command interception

docs/requirements/interception-wrapper-mechanism.md

4.1.56.3 KB
Original Source

Intent

When the user runs bear -- make on a system where LD_PRELOAD is not available (macOS with SIP, Windows) or not desired, Bear intercepts compiler invocations by placing wrapper executables on PATH ahead of the real compilers. The build system invokes the wrapper instead of the real compiler; the wrapper reports the execution to Bear and then forwards the call to the real compiler. The build completes normally and the user gets a compilation database without modifying the build system.

Acceptance criteria

  • Compiler invocations are intercepted and appear in the compilation database
  • The build process completes normally -- interception does not alter build output or exit codes
  • Bare compiler names in environment variables (e.g. CC=gcc) are resolved via PATH before creating wrappers
  • The wrapper reports the real compiler's absolute path, not the wrapper path
  • The .bear/ directory is created in the current working directory
  • The .bear/ directory is cleaned up when Bear exits
  • On Windows, executable name lookup is case-insensitive and extension- insensitive (cl, cl.exe, and CL.EXE all match)
  • Reporting failures do not affect the build
  • Signal forwarding works -- if the user sends SIGINT to the build, the wrapper forwards it to the real compiler and returns its exit code

Non-functional constraints

  • Must not alter build output or exit codes
  • Must handle concurrent builds (parallel make) -- each wrapper opens its own TCP connection for reporting
  • Platform: works on all supported platforms (Linux, macOS, FreeBSD, Windows)
  • The wrapper binary path must be discoverable at runtime, not baked in at build time (issue #668)
  • The .bear/ directory name is deterministic so that paths written during ./configure survive into the make phase

Known limitations

Only intercepts known compilers: Unlike preload mode, which intercepts all exec calls regardless of the executable, wrapper mode only intercepts compilers that Bear knows about. If the build uses a compiler that is not in a recognized environment variable or not on PATH at Bear startup time, it will not be intercepted.

PATH ordering conflicts (issue #445): The .bear/ directory must be first in PATH. If another tool (e.g. ccache's masquerade directory) is also first in PATH, the ordering can cause conflicts. See interception-wrapper-recursion for the specific ccache recursion problem and its solution.

Wrapper directory lifetime (issue #654): If the user runs bear -- ./configure and bear -- make as separate commands, the .bear/ directory is cleaned up after ./configure exits. The Makefile may have recorded paths into .bear/ (e.g. as the compiler path), causing "No such file or directory" errors during make. The workaround is to combine both steps under a single Bear invocation: bear -- sh -c './configure && make'. Using separate Bear invocations will always fail because the directory is removed when the first exits.

Wrapper path must be discoverable (issue #668): The bear-wrapper binary path was previously baked in at build time via a compile-time constant. This broke prebuilt packages (conda-forge, scoop, system packages) where the install path differs from the build path. The wrapper path must be resolved relative to the bear binary at runtime.

Cross-compilers may not be discovered (issue #561): Bear discovers compilers from environment variables and common names. Cross-compilers with unusual names (e.g. arm-none-eabi-gcc) are only intercepted if they appear in CC, CXX, or similar variables.

Testing

Given a project with a single C source file:

When the user configures wrapper mode and runs bear -- cc -c test.c, then compile_commands.json is created with one entry for test.c, and the compiler path in the entry is an absolute path to the real compiler (not .bear/cc), and the build exit code is preserved.

Given CC=gcc (a bare name) in the environment:

When the user runs bear -- make, then Bear resolves gcc via PATH to an absolute path (e.g. /usr/bin/gcc), creates .bear/gcc as a wrapper, and the compilation database contains /usr/bin/gcc as the compiler.

Given a build that uses an absolute compiler path (CC=/usr/bin/gcc):

When the user runs bear -- make, then Bear creates .bear/gcc pointing to /usr/bin/gcc, and the wrapper intercepts the invocation correctly.

Given a parallel build with multiple source files:

When the user runs bear -- make -j4 with wrapper mode, then all compilations are intercepted, and the compilation database contains one entry per source file.

Given a build that fails partway through:

When the user runs bear -- make and one compilation fails, then Bear's exit code matches the build's exit code, and the compilation database still contains entries for all attempted compilations.

Given a Windows build with CC=cl (no .exe extension):

When Bear looks up cl in the wrapper configuration, then it matches case-insensitively and without requiring the .exe extension, and the wrapper is created correctly.

Given a build where the compiler is not in any environment variable:

When the build script directly invokes /opt/custom/bin/mycc -c test.c without setting CC, then the invocation is not intercepted in wrapper mode (this is a known limitation -- preload mode would catch it).

Given a successful build:

When bear -- make completes, then the .bear/ directory is removed automatically, and no wrapper artifacts remain in the working directory.

Notes

  • Related requirement: interception-preload-mechanism (alternative interception mode using LD_PRELOAD).
  • Related requirement: interception-wrapper-recursion (ccache recursion prevention in wrapper mode).
  • A wrapper executable stands in for the real compiler on PATH; the build system invokes it in place of the compiler it expects.
  • GitHub issues #686, #681 cover Windows/MSYS2 wrapper mode improvements.
  • GitHub issue #609 describes user confusion about when to use wrapper vs. preload mode. The default selection (wrapper on macOS/Windows, preload on Linux) handles the common case; the configuration file covers the rest.

Rationale