Reliability

Reliability expectations and invariants for Loong.

The public reader-facing summary for this material lives in ../site/reference/security-and-reliability.mdx. This file remains the repository-native reliability and invariant reference.

Route By Audience

If you are trying to...	Start here
read the public summary first	`../site/reference/security-and-reliability.mdx`
inspect repository-native invariants and verification expectations	this file
understand the broader repository docs layering	`README.md`

Read This File When

you need the source-level invariant and verification contract
you are checking whether a change weakens an existing reliability guarantee
you need the repository-native commands and guardrails behind the public summary

Reliability Areas

Area	What this file covers
build invariants	CI-parity commands that must stay green
runtime guardrails	execution behavior that should not silently degrade
architecture guardrails	machine-checkable boundary and complexity rules
kernel and channel invariants	fail-closed and persistence behavior that should remain stable

Build Invariants

These must hold at every commit on every branch:

cargo fmt --all -- --check passes
cargo clippy --workspace --all-targets --all-features -- -D warnings passes
cargo test --workspace passes
cargo test --workspace --all-features passes (test count evolves with the codebase; CI is the source of truth)

Enforced by: CI (.github/workflows/ci.yml, surfaced through the aggregate build check). The optional scripts/pre-commit hook mirrors these cargo gates locally.

Feedback Lanes

Quick local loop — use task verify:quick for a faster edit/test cycle, or task test:packages PACKAGES='-p loong-app -p loong-spec' when you want a targeted package subset. This lane keeps curated daemon smoke coverage but skips the heaviest integration suite and does not replace CI parity. When you are iterating inside daemon integration tests, task test:daemon:domains gives you the optional sharded domain binaries and task test:daemon:heavy still runs the canonical full integration binary.
Changed-package loop — use task verify:changed to scope local Rust tests to the packages touched in the current working tree plus reverse dependents. When that closure includes loong, the loop still runs the curated daemon integration smoke subset by default and now escalates daemon test-file edits into the matching optional domain shards by reading the shard entry files as the source of truth, with a fallback to the full integration binary for broad harness changes.
CI parity — the build check runs workspace fmt/clippy, default workspace tests on Ubuntu and Windows, the all-features workspace test lane on Ubuntu, and explicit feature-delta compile checks for loong-spec test-hooks plus the browser-without-web.fetch app feature set.
Extended gate — task verify:full adds smoke and benchmark validation on top of the canonical local verification bar.

Runtime Stability Guardrails

Wasm trap behavior is platform-aware by default — on macOS, signals_based_traps is disabled to avoid trap-handler abort instability under parallel bridge tests.
Runtime override is explicit — set LOONG_WASM_SIGNALS_BASED_TRAPS=true|false to force trap behavior for diagnostics/experiments.
Daemon stress helper is scriptable — run ./scripts/stress_daemon_tests.sh 10 default,2,1 for manual repeated daemon test validation across thread modes.
Trap-mode matrix is available when needed — set LOONG_STRESS_WASM_TRAPS_MODES=auto,false,true to sweep daemon tests across trap behavior modes during targeted investigation.

Architecture Stability Guardrails

Complexity budgets are machine-checkable — run ./scripts/check_architecture_boundaries.sh directly, or task check:architecture when the optional task CLI wrapper is installed, to inspect module line/function budgets for architecture hotspots (spec_runtime, spec_execution, provider/mod, memory/mod, acp/manager, acp/acpx, channel/registry, config/channels, chat, channel/mod, conversation/turn_coordinator, tools/mod, daemon/lib, daemon/onboard_cli). The checker classifies each hotspot by foundation, structural_size, and operational_density pressure so reviews can distinguish large-surface drift from runtime-density risk.
Memory operation literals are boundary-guarded — memory core operation strings (append_turn, window, clear_session) must remain centralized in crates/app/src/memory/* and never spread into callsites.
spec stays detached from app — the architecture guardrails treat any direct loong-app dependency in crates/spec/Cargo.toml as a boundary regression, and ./scripts/check_dep_graph.sh must stay green.
Strict enforcement is the blocking gate — use LOONG_ARCH_STRICT=true ./scripts/check_architecture_boundaries.sh directly, or task check:architecture:strict when the optional task CLI wrapper is installed, to make architecture budget violations fail non-zero. This check is part of task verify, task verify:full, and CI.

Kernel Invariants

Token authorization is fail-closed — if the policy engine cannot determine authorization (e.g., mutex poisoned), the operation is denied.
Audit events are never silently dropped — kernel sinks fail closed on write errors instead of silently downgrading. Production app bootstraps default to FanoutAuditSink backed by ~/.loong/audit/events.jsonl, LoongKernel::new() defaults to InMemoryAuditSink, and spec/test/demo helpers may intentionally use explicit in-memory audit seams for side-effect-free reporting. NoopAuditSink remains reserved for callers that explicitly opt into new_without_audit(...) or wire a noop sink themselves.
Pack registration is idempotent-safe — duplicate pack IDs return DuplicatePack error, never silently overwrite.
Generation-based revocation is monotonic — the revocation threshold only increases, never decreases.
TaskState transitions are irreversible from terminal states — Completed and Faulted states cannot transition.

MVP Channel Invariants

Kernel context is bootstrapped at startup — the base CLI loop and shipped service-channel runtimes create KernelContext before processing messages.
Memory persistence failures are surfaced — persist_turn errors propagate to the caller, never silently swallowed.
Provider errors have two modes — Propagate (return error) or InlineMessage (synthetic reply). Behavior is explicit per operator or channel surface.

Test Expectations

Kernel crate: property tests (proptest) for capability boundary invariants
All crates: deterministic tests (no time-dependent flakes)
Multi-threaded tests use #[tokio::test(flavor = "multi_thread", worker_threads = 2)]

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reliability

Route By Audience

Read This File When

Reliability Areas

Build Invariants

Feedback Lanes

Runtime Stability Guardrails

Architecture Stability Guardrails

Kernel Invariants

MVP Channel Invariants

Test Expectations

FilesExpand file tree

RELIABILITY.md

Latest commit

History

RELIABILITY.md

File metadata and controls

Reliability

Route By Audience

Read This File When

Reliability Areas

Build Invariants

Feedback Lanes

Runtime Stability Guardrails

Architecture Stability Guardrails

Kernel Invariants

MVP Channel Invariants

Test Expectations