Skip to content

feat: improve tdd skill score from 83% to 93%#11

Open
yogesh-tessl wants to merge 1 commit into
8thlight:mainfrom
yogesh-tessl:improve/skill-review-optimization
Open

feat: improve tdd skill score from 83% to 93%#11
yogesh-tessl wants to merge 1 commit into
8thlight:mainfrom
yogesh-tessl:improve/skill-review-optimization

Conversation

@yogesh-tessl
Copy link
Copy Markdown

@yogesh-tessl yogesh-tessl commented May 15, 2026

Hey @ericjohnolson 👋

really cool to see CLAUDE.md and AGENTS.md both in here. The 7 skills under praxis covering the full dev cycle from research through TDD to reflection show a thoughtful approach to how you want AI to work through problems.

ran your skills through tessl skill review at work and found some targeted improvements for tdd. Here's the before/after:

Skill Before After Change
tdd 83% 93% +10%

picked tdd because it had the most improvement headroom alongside reflect, and TDD is core to your RPI methodology, the implement skill's three-agent orchestration relies on the same Core Rules and red-green discipline defined here.

Changes made

Description improvements (biggest impact):

  • Rewrote from jargon-heavy label ("L3/L4 altitude testing") to a concrete description listing specific actions: guides writing one test at a time, ZOMBIES test discovery, boundary-level and integration testing, structured session log tracking
  • Added explicit "Use when..." clause with natural trigger terms users would actually say ("writing unit tests", "doing TDD", "start with tests")
  • Added "Do NOT use for..." negative trigger pointing to implement for automated plan execution
  • Wrapped description in quoted string (standard frontmatter format)

Trigger improvements:

  • Added three natural-language trigger phrases: "unit tests for this module", "start with tests", "TDD this feature"

Content improvements:

  • Removed introductory philosophy paragraph ("TDD is a design technique...") - Claude already understands TDD motivation, and the behavioral constraints are already encoded in the Core Rules section
  • Added plain-language parenthetical for L3/L4 jargon in the boundary testing reference link ("boundary and integration-level testing (L3/L4 altitude)")

quick honest disclosure. I work at https://github.com/tesslio where we build tooling around skills like these. Not a pitch, just saw room for improvement and wanted to contribute.

if you want to self-improve your skills, or define your own scenarios to pressure test, just ask your agent (Claude Code, Codex, etc.) to evaluate and optimize your skill with Tessl. Ping me @yogesh-tessl, if you hit any snags.

Hey @ericjohnolson 👋

I ran your skills through `tessl skill review` at work and found some targeted improvements for `tdd`. Here's the full before/after:

| Skill | Before | After | Change |
|-------|--------|-------|--------|
| **tdd** | **83%** | **93%** | **+10%** |
| adr | 95% | — | — |
| research | 93% | — | — |
| harness | 93% | — | — |
| implement | 93% | — | — |
| plan-tasks | 92% | — | — |
| reflect | 83% | — | — |

I picked `tdd` because it had the most improvement headroom alongside `reflect`, and TDD is core to your RPI methodology — the `implement` skill's three-agent orchestration relies on the same Core Rules and red-green discipline defined here.

<details>
<summary>Changes made</summary>

**Description improvements (biggest impact):**
- Rewrote from jargon-heavy label ("L3/L4 altitude testing") to a concrete description listing specific actions: guides writing one test at a time, ZOMBIES test discovery, boundary-level and integration testing, structured session log tracking
- Added explicit "Use when..." clause with natural trigger terms users would actually say ("writing unit tests", "doing TDD", "start with tests")
- Added "Do NOT use for..." negative trigger pointing to `implement` for automated plan execution
- Wrapped description in quoted string (standard frontmatter format)

**Trigger improvements:**
- Added three natural-language trigger phrases: "unit tests for this module", "start with tests", "TDD this feature"

**Content improvements:**
- Removed introductory philosophy paragraph ("TDD is a design technique...") — Claude already understands TDD motivation, and the behavioral constraints are already encoded in the Core Rules section
- Added plain-language parenthetical for L3/L4 jargon in the boundary testing reference link ("boundary and integration-level testing (L3/L4 altitude)")

</details>

I also stress-tested your `tdd` skill against a ZOMBIES-driven circular buffer implementation and it correctly guided the full zero-one-many-boundary progression without skipping a step. Kudos for that.

Honest disclosure — I work at @tesslio where we build tooling around skills like these. Not a pitch — just saw room for improvement and wanted to contribute.

Want to self-improve your skills? Just point your agent (Claude Code, Codex, etc.) at [this Tessl guide](https://docs.tessl.io/evaluate/optimize-a-skill-using-best-practices) and ask it to optimize your skill. Ping me — [@yogesh-tessl](https://github.com/yogesh-tessl) — if you hit any snags.

Thanks in advance 🙏
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant