Commit bcf7496
committed
chore(release): 2.3.0 [skip ci]
# [2.3.0](v2.2.0...v2.3.0) (2026-03-11)
### Bug Fixes
* add --break-system-packages for pip installs + pip.conf bypass PEP 668 ([14430c4](14430c4))
* allow clippy too_many_arguments for run_task_pipeline ([6eb69c2](6eb69c2))
* auto-install deps, python3 symlink, detect full commands in fail_to_pass, language-aware test scripts ([a38497f](a38497f))
* config test race condition with env var mutex ([2963325](2963325))
* correct Basilica API types and SSH key support ([63d8174](63d8174))
* enable apt/sudo in Basilica containers ([d83cb8c](d83cb8c))
* expose agent_output and agent_patch in TaskResult and API responses ([348c251](348c251))
* extract_agent_only for /evaluate - no tasks/ dir required ([2b90ee1](2b90ee1))
* filter out apt-get/system commands from install (Basilica blocks syscalls), keep project-level installs ([e5365da](e5365da))
* full clone for commit checkout, explicit pip/pytest symlinks ([a0c1d6f](a0c1d6f))
* handle null test_patch from HuggingFace API (deserialize null as empty string) ([492d068](492d068))
* increase clone/install timeout from 180s to 600s ([95cecc3](95cecc3))
* install base tools, runtimes, and filter redundant deps for Basilica ([80a3a0c](80a3a0c))
* install corepack/yarn/pnpm globally via npm in Dockerfile ([b7183e8](b7183e8))
* move workspace to /home/agent/sessions, fix node_modules permissions, improve agent code error handling ([1ced355](1ced355))
* normalize repo URL in parse_task (add github.com prefix) ([398a6fd](398a6fd))
* pip 22 compatibility for base tools and install commands ([68bb93f](68bb93f))
* remove redundant into_iter() for clippy ([eaf2a7c](eaf2a7c))
* report task status incrementally during batch execution ([4440fd8](4440fd8))
* resolve all clippy warnings for CI ([2b3ae9d](2b3ae9d))
* revert Dockerfile git-lfs changes, add GIT_LFS_SKIP_SMUDGE to snapshot clone ([7130823](7130823))
* run agent from repo_dir CWD, use absolute path to agent.py ([cc6bcde](cc6bcde))
* run as root (Basilica blocks sudo), remove sudo prefix logic ([477a433](477a433))
* sudo for apt-get in install commands, add golang/corepack/sudo to Dockerfile ([1aceb88](1aceb88))
* upgrade Go to 1.23 and Node to 20 LTS in Dockerfile ([67ca713](67ca713))
* use :id path params for Axum 0.7 (not {id} which is 0.8) ([5dfa0c1](5dfa0c1))
### Features
* /evaluate endpoint using stored agent + TRUSTED_VALIDATORS whitelist ([b6aee7a](b6aee7a))
* add /code-hash endpoint for code integrity verification ([0a8e01b](0a8e01b))
* add /upload-agent-json endpoint for JSON-based agent upload ([9cfa1da](9cfa1da))
* add Basilica API client for container provisioning ([8a0afca](8a0afca))
* add install field from swe-forge dataset, fix default split to train, add openssh-client ([737ab1f](737ab1f))
* add POST /submit_tasks endpoint + fix HuggingFace dataset compat ([d92444c](d92444c))
* agent user with sudo for apt-install, run all commands as non-root agent ([e3f574a](e3f574a))
* agent ZIP upload frontend with env vars + SUDO_PASSWORD auth ([3aa5184](3aa5184))
* auto-install language runtimes from install_config version fields ([25b2e94](25b2e94))
* change default max_concurrent_tasks from 8 to 6, support CONCURRENTLY_TASKS env var ([eaba581](eaba581))
* extract full agent project instead of concatenating files ([3ac1023](3ac1023))
* fat Docker image with all language runtimes (java, rust, pnpm, unzip, etc.) ([3855f2d](3855f2d))
* fetch task definitions from HF repo (workspace.yaml + tests/), remove auto_install hack ([7162a39](7162a39))
* propagate agent_env to run_agent and pass --instruction arg to Python agents ([d922264](d922264))
* replace per-file HF downloads with bulk git clone snapshot ([6036b78](6036b78))
* run each task in its own Basilica container via SSH ([432107b](432107b))
* swe-bench/swe-forge integration - extend WorkspaceConfig with fail_to_pass/pass_to_pass/install_config/difficulty fields - parse swe-forge workspace.yaml native fields as test script fallback - capture git diff (agent patch) after agent execution - add /dataset endpoint to fetch from HuggingFace CortexLM/swe-forge - wire fail_to_pass/pass_to_pass in dataset entry conversion ([814259e](814259e))1 parent 57c3877 commit bcf7496
1 file changed
Lines changed: 53 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
54 | 54 | | |
55 | 55 | | |
56 | 56 | | |
| 57 | + | |
| 58 | + | |
| 59 | + | |
| 60 | + | |
| 61 | + | |
| 62 | + | |
| 63 | + | |
| 64 | + | |
| 65 | + | |
| 66 | + | |
| 67 | + | |
| 68 | + | |
| 69 | + | |
| 70 | + | |
| 71 | + | |
| 72 | + | |
| 73 | + | |
| 74 | + | |
| 75 | + | |
| 76 | + | |
| 77 | + | |
| 78 | + | |
| 79 | + | |
| 80 | + | |
| 81 | + | |
| 82 | + | |
| 83 | + | |
| 84 | + | |
| 85 | + | |
| 86 | + | |
| 87 | + | |
| 88 | + | |
| 89 | + | |
| 90 | + | |
| 91 | + | |
| 92 | + | |
| 93 | + | |
| 94 | + | |
| 95 | + | |
| 96 | + | |
| 97 | + | |
| 98 | + | |
| 99 | + | |
| 100 | + | |
| 101 | + | |
| 102 | + | |
| 103 | + | |
| 104 | + | |
| 105 | + | |
| 106 | + | |
| 107 | + | |
| 108 | + | |
| 109 | + | |
57 | 110 | | |
58 | 111 | | |
59 | 112 | | |
| |||
0 commit comments