IntentGuard

IntentGuard lets you test code intent with natural language assertions.

Use it when a property is real and test-worthy, but awkward to encode with ordinary assertions: architecture rules, security practices, documentation contracts, error-handling conventions, or other cross-cutting code qualities. IntentGuard checks referenced code with a local model and raises AssertionError when the judgement fails.

IntentGuard complements traditional tests. Keep unit tests for exact outputs, edge cases, state changes, and safety-critical behavior. Use IntentGuard where a custom AST walk, linter rule, or review checklist would be noisy or expensive.

Important

The current default model is IntentGuard-1-qwen2.5-coder-1.5b, with 92.5% accuracy and 92.3% precision in the validation suite.

Installation

pip install intentguard

Quick Start

With pytest

import intentguard as ig

def test_code_properties():
    ig.assert_code(
        "Classes in {module} should follow the Single Responsibility Principle",
        {"module": my_module}
    )

    ig.assert_code(
        "All database queries in {module} should be parameterized to prevent SQL injection",
        {"module": db_module}
    )

With unittest

import unittest
import intentguard as ig

class TestCodeQuality(unittest.TestCase):
    def test_error_handling(self):
        ig.assert_code(
            "All API endpoints in {module} should have proper input validation",
            {"module": api_module}
        )

Good Fits

IntentGuard works best for high-level properties that are easy to describe and hard to check directly:

"All public methods in {module} should have docstrings with Parameters and Returns sections."
"All API endpoints in {module} should validate input before using it."
"All methods in {module} should log errors before re-raising them."

Avoid using it for exact numeric results, runtime behavior that must be executed, or anything that needs perfect determinism. Model judgement is useful signal, not proof.

How It Works

assert_code() receives a natural language assertion and code references.
Code references are converted into source snippets.
IntentGuard builds a structured prompt and checks the cache.
On cache miss, the local model evaluates the assertion num_evaluations times.
A strict majority decides the result. Ties fail.
The result is cached for repeat runs.

Near-Deterministic Results

IntentGuard is designed for repeatable judgements, not guaranteed determinism. It uses low-temperature sampling, repeated evaluation, strict majority voting, and caching to make results stable in normal test runs. Fresh model evaluations can still vary, especially after changing the assertion, code, model, temperature, or evaluation count.

Configure repeatability:

import intentguard as ig

ig.set_default_options(
    ig.IntentGuardOptions(
        num_evaluations=7,  # More evaluations make majority vote more stable
        temperature=0.1,    # Lower temperature reduces sampling variance
    )
)

Use module-level ig.assert_code(...) for ordinary tests. Use ig.IntentGuard(options) when one test class, suite, or subsystem needs isolated options.

Model

IntentGuard uses a custom 1.5B parameter model, fine-tuned from Qwen2.5-Coder-1.5B for code property verification. It runs locally through llamafile, so code is not sent to a hosted API by default.

Performance

Model	Accuracy	Precision	Recall
(current model) IntentGuard-1-qwen2.5-coder-1.5b	92.5%	92.3%	89.4%
(previous model) IntentGuard-1-llama3.2-1b	92.4%	91.0%	91.0%
(reference model) gpt-4o-mini	89.3%	85.3%	90.2%

Validation Methodology

The validation suite is intentionally strict:

Each test example gets 15 total evaluations (5 trials x 3 evaluations per trial)
A voting mechanism is applied within each group (jury size = 3)
A test passes only if all 5 trials succeed with majority agreement (2 out of 3 or better)

For more details, see the validation documentation.

Compatibility

IntentGuard requires Python 3.10+. OS and architecture support come from llamafile:

Linux 2.6.18+
macOS 23.1.0+ (GPU support on ARM64)
Windows 10+ (AMD64)
FreeBSD 13+
NetBSD 9.2+ (AMD64)
OpenBSD 7+ (AMD64)

Local Development Environment Setup

Prerequisites: Python 3.10+, uv.
Clone: git clone <repository_url> && cd intentguard
Install dev dependencies: make install or uv sync --dev --group validation --group dataset
Run tests & checks: make test

Useful commands:

make install: Installs development dependencies.
make install-prod: Installs production dependencies only.
make check: Runs linting checks (ruff check).
make format-check: Checks code formatting (ruff format --check).
make mypy: Runs static type checking (mypy).
make unittest: Runs unit tests.
make test: Runs all checks and tests.
make clean: Removes the virtual environment.
make help: Lists available make commands.

License

MIT License

Name		Name	Last commit message	Last commit date
Latest commit History 61 Commits
.agents/skills		.agents/skills
.github		.github
ai_research		ai_research
design		design
intentguard		intentguard
tests		tests
validation		validation
.gitignore		.gitignore
AGENTS.md		AGENTS.md
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
ROADMAP.md		ROADMAP.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

IntentGuard

Installation

Quick Start

With pytest

With unittest

Good Fits

How It Works

Near-Deterministic Results

Model

Performance

Validation Methodology

Compatibility

Local Development Environment Setup

License

About

Uh oh!

Releases 13

Sponsor this project

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

IntentGuard

Installation

Quick Start

With pytest

With unittest

Good Fits

How It Works

Near-Deterministic Results

Model

Performance

Validation Methodology

Compatibility

Local Development Environment Setup

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 13

Sponsor this project

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages