Kernel-Lane (Lane 4)

Kernel-Lane is the fourth isolated lane in the lattice. Its role is hardware-focused: compile, profile, benchmark, and optimize CUDA kernels for your RTX 5060 stack.

It exists so that GPU and performance work can move fast without destabilizing the governance, verification, or orchestration lanes.

How Lane 4 Fits the Other 3

Lane	Primary Role	Output
Archivist	Governance and cross-lane arbitration	Decisions, routing, escalation
Library	Verification and attestation	Proof, hardening, validation
SwarmMind	Multi-agent behavior and execution strategies	Behavioral traces, coordination patterns
Kernel-Lane (this repo)	CUDA kernel performance engineering	Pinned release artifacts + perf evidence

Integration Contract

Kernel-Lane does not replace the first three lanes. It feeds them:

Archivist gets release decisions and promotion metadata.
Library gets benchmark/profile evidence for verification.
SwarmMind gets stable, pinned kernel artifacts to consume in runtime experiments.

Mission

Isolate GPU optimization from core governance/runtime lanes.
Produce measurable performance improvements with reproducible evidence.
Promote only immutable, pinned releases that other lanes can safely consume.

Hard Boundaries

In scope:

CUDA kernel source and compile flags
Nsight Systems/Compute profiling
Benchmark automation and regression checks
Release packaging and manifests

Out of scope:

Editing global Kilo/OpenCode routing config
Cross-lane governance policy edits
Direct runtime orchestration logic in other repos

Upstream Source and Intake

Primary intake source: S:\snac-v2\kimi-shared
Intake pattern: copy/snapshot into this lane, then optimize here
Do not tune directly in shared runtime lanes

Release-Only Consumption Rule

Other lanes must consume only promoted release artifacts listed in:

releases/index.json
releases/<version>/manifest.json

Never consume:

build/ outputs
unversioned temporary artifacts
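
As a rough illustration of the rule above, a consuming lane could guard its artifact paths before loading anything. This is a minimal sketch, assuming paths are given relative to the repository root; the function name `is_release_artifact` is hypothetical and is not part of this repo's scripts.

```python
from pathlib import PurePosixPath

# Root directory named in the README; everything else here is an
# illustrative assumption, not this lane's actual tooling.
RELEASE_ROOT = "releases"


def is_release_artifact(path: str) -> bool:
    """Return True only for repo-relative paths shaped like releases/<version>/<file>."""
    # Accept both Windows and POSIX separators.
    parts = PurePosixPath(path.replace("\\", "/")).parts
    # Drop "." components so ".\build\x" and "build/x" are treated alike.
    parts = tuple(p for p in parts if p != ".")
    # Refuse traversal such as releases/../build/x.zip.
    if ".." in parts:
        return False
    # Need at least releases/<version>/<file>; releases/index.json is the
    # lookup table, not an artifact in its own right.
    if len(parts) < 3:
        return False
    return parts[0] == RELEASE_ROOT
```

With this check, `releases/v0.1.0/manifest.json` is accepted, while `.\build\Release\kernels-v0.1.0.zip` and anything reached through `..` are rejected.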

Promotion Gate (Required)

A release is valid only if all five of the following exist:

Built artifact
Benchmark report JSON
Nsight Systems report
Nsight Compute report
Release manifest

Promotion command:

.\scripts\promote-release.ps1 `
  -Version v0.1.0 `
  -ArtifactPath .\build\Release\kernels-v0.1.0.zip `
  -BenchmarkReportPath .\benchmarks\reports\baseline.json `
  -NsysReportPath .\profiles\nsys\baseline.nsys-rep `
  -NcuReportPath .\profiles\ncu\baseline.ncu-rep `
  -Notes "Initial optimized baseline"
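
The five-item gate can also be checked independently of the promotion script. The sketch below is an assumption about how such a check might look, not the logic inside promote-release.ps1; the key names and the function `missing_gate_inputs` are hypothetical.

```python
from pathlib import Path

# The five required items come from the README list; the key names are
# hypothetical labels chosen for this sketch.
REQUIRED = (
    "artifact",
    "benchmark_report",
    "nsys_report",
    "ncu_report",
    "manifest",
)


def missing_gate_inputs(paths: dict) -> list:
    """Return the names of required items that are absent or are not files."""
    missing = []
    for name in REQUIRED:
        value = paths.get(name)
        # A missing key, an empty value, or a path that is not a regular
        # file all count as a failed gate item.
        if not value or not Path(value).is_file():
            missing.append(name)
    return missing
```

An empty list means all five items are present; any other result names what still has to be produced before the release counts as valid.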

Typical Workflow

Intake snapshot -> Build -> Benchmark -> Profile -> Compare vs baseline -> Promote release -> Hand off manifest

Hand-off should include:

version
manifest path
key speedup/regression metrics
known tradeoffs
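
One way to keep hand-offs uniform is a small record that carries the four fields listed above. This is a sketch only: the class `HandOff`, its field names, and the `speedup` helper are hypothetical, and the README does not prescribe a hand-off file format.

```python
import json
from dataclasses import asdict, dataclass, field


@dataclass
class HandOff:
    """Hypothetical hand-off record mirroring the four items in the README."""

    version: str
    manifest_path: str
    metrics: dict = field(default_factory=dict)
    known_tradeoffs: list = field(default_factory=list)

    def to_json(self) -> str:
        # Sorted keys keep the output stable across runs, which makes
        # hand-off records easy to diff.
        return json.dumps(asdict(self), indent=2, sort_keys=True)


def speedup(baseline_ms: float, candidate_ms: float) -> float:
    """Speedup factor for a lower-is-better timing; above 1.0 means faster."""
    if baseline_ms <= 0 or candidate_ms <= 0:
        raise ValueError("timings must be positive")
    return baseline_ms / candidate_ms
```

The caller would fill `metrics` from its own benchmark reports, for example storing the result of `speedup(...)` per kernel, and pass the JSON along with the manifest path.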

Quick Start

Set-Location S:\kernel-lane
.\scripts\env-check.ps1
.\scripts\build-kernels.ps1 -Configuration Release
.\scripts\run-benchmarks.ps1 -Name baseline
.\scripts\run-profiles.ps1 -ExecutablePath .\build\Release\your-binary.exe -Args "--size 1048576" -Name baseline

Why This Matters

Kernel work is high-impact and high-risk. By isolating it as Lane 4, you get:

faster iteration on CUDA tuning
safer integration into the main lattice
reproducible results, because every release is backed by benchmark and profile evidence

Headless Profiling

Run the combined Nsight Systems + Nsight Compute headless capture:

.\scripts\run-headless-profiling.ps1 -DurationSec 60

Artifacts are written to profiles/headless/. Use .\scripts\check-profiling-regression.ps1 to compare against the baseline.
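
For intuition, a baseline comparison usually reduces to a per-metric relative change checked against a tolerance. The sketch below assumes lower-is-better metrics (such as kernel durations) held in plain dictionaries; it is not the logic of check-profiling-regression.ps1, and the function name and the 5% default tolerance are hypothetical.

```python
def find_regressions(baseline: dict, candidate: dict, tolerance: float = 0.05) -> dict:
    """Return {metric: relative_change} for metrics that got worse by more than tolerance.

    Assumes every metric is lower-is-better, so a positive relative
    change means the candidate is slower than the baseline.
    """
    regressions = {}
    for name, base in baseline.items():
        # Skip metrics the candidate run did not report, and baselines
        # that cannot be used as a denominator.
        if name not in candidate or base <= 0:
            continue
        change = (candidate[name] - base) / base
        if change > tolerance:
            regressions[name] = change
    return regressions
```

An empty result means no metric slowed down beyond the tolerance; otherwise the returned changes can go straight into the hand-off notes.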
