Skip to content

ci(claude): extract inline workflow scripts to .github/scripts/#447

Merged
heskew merged 1 commit intomainfrom
workflow/extract-inline-scripts
May 1, 2026
Merged

ci(claude): extract inline workflow scripts to .github/scripts/#447
heskew merged 1 commit intomainfrom
workflow/extract-inline-scripts

Conversation

@heskew
Copy link
Copy Markdown
Member

@heskew heskew commented May 1, 2026

Draft, stacked on #444. Will rebase to main once #444 merges.

Summary

The inline run: | blocks in claude-review.yml and claude-mention.yml had grown past the reviewable-in-YAML threshold — the log step alone was 142 lines. Pulls them out to standalone scripts:

Script Replaces Was
.github/scripts/compose-review-scope.sh "Compose review scope from layers" step 36 lines inline
.github/scripts/find-prior-review-comment.sh "Find prior review comment" step (from #444) 10 lines inline
.github/scripts/log-review-to-ai-review-log.sh "Log review to ai-review-log" step 142 lines inline
.github/scripts/parse-claude-mention.sh "Parse mention" step (claude-mention.yml) 17 lines inline

Each script documents inputs, outputs, and the rationale for non-obvious mechanics in its own header.

Mechanics

Workflows invoke via bash .github/scripts/<name>.sh rather than direct path execution. Sidesteps the +x bit being dropped on Windows checkouts and the "forgot to chmod the new script" footgun. The #!/usr/bin/env bash shebangs are informational only — bash invocation reads the file directly.

Out of scope (intentional)

  • Tests. Per discussion, deliberately deferred to a follow-up PR that adds npm run test:workflows coverage. The pure-logic parts (title generation, marker detection, mention parsing) are the natural test targets; gh/curl integration aspects need a fixture story.
  • ci(claude): two-job auth gate (App token + CODEOWNERS-driven team check) + invariants validator #417's inline scripts (auth-gate). When that branch updates, its scripts adopt the same .github/scripts/ structure as part of that PR's review fixup.
  • Cross-repo sharing (oauth, harper-pro, future repos). Per next-week's roadmap discussion — likely a reusable workflow + scripts mirrored or vendored. Not a blocker for landing this here.

Test plan

  • Open or push to a PR after this lands → confirm the review workflow runs end-to-end (compose scope, find prior comment, agent run, log to ai-review-log) with no behavior change vs. pre-extraction.
  • @claude mention a PR → confirm parse-claude-mention.sh correctly gates proceed and selects the right model.
  • Trigger a "no blockers" run on an open PR → confirm the title is no blockers (not 0 finding(s)) and the comment threads into the existing log issue.

🤖 Generated with Claude Code

Base automatically changed from workflow/marker-based-review-edit to main May 1, 2026 18:37
The inline `run: |` blocks in claude-review.yml and claude-mention.yml
have grown past the point of being reviewable inside YAML — the log
step alone was 142 lines. Pulls them out to standalone bash scripts:

- .github/scripts/compose-review-scope.sh    (was 36-line inline)
- .github/scripts/find-prior-review-comment.sh (was 10-line inline)
- .github/scripts/log-review-to-ai-review-log.sh (was 142-line inline)
- .github/scripts/parse-claude-mention.sh    (was 17-line inline)

Each script documents its inputs, outputs, and the rationale for
non-obvious mechanics inline. Workflows invoke via
`bash .github/scripts/<name>.sh` rather than direct path execution
— sidesteps the +x bit being dropped on Windows checkouts and the
"forgot to chmod the new script" footgun. The `#!/usr/bin/env bash`
shebangs are now informational only.

Stacked on top of #444 (marker-based review-comment edit), which
introduces the "Find prior review comment" step and the marker'd
log-step lookup. Will rebase to main once #444 merges.

Tests for these scripts are deliberately out of scope — Nathan's
preference per discussion. A separate PR will add coverage via
`npm run test:workflows` (or similar) once we settle on the test
runner shape.

#417 (auth-gate work) carries its own inline scripts and is not
touched here. When that branch updates, its scripts adopt the same
.github/scripts/ structure as part of that PR's review fixup.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
@heskew heskew force-pushed the workflow/extract-inline-scripts branch from dd61acc to 6bed023 Compare May 1, 2026 18:40
@heskew heskew marked this pull request as ready for review May 1, 2026 18:40
@heskew heskew requested review from a team as code owners May 1, 2026 18:40
@heskew heskew merged commit 3c426fc into main May 1, 2026
41 of 44 checks passed
@heskew heskew deleted the workflow/extract-inline-scripts branch May 1, 2026 18:45
kriszyp pushed a commit that referenced this pull request May 5, 2026
…heck

Symptom: harper PR #411 from a real org member was silently skipped —
`Claude PR Review` evaluated its job-level `if:` to false and ran zero
steps.

Cause: GitHub's webhook `author_association` is unreliable. It reports
`CONTRIBUTOR` (or `NONE`) for org members with private membership AND
for users whose repo access comes via team membership rather than
direct collaborator status. Real HarperFast team members fall into
both buckets. Forcing visibility changes is hostile UX, and the
collaborators API would admit a broader population (read-only
collaborators, default-org-permission users) than we want.

Fix: two-job pattern with team-membership check.

Each workflow has an `authorize` job that runs first, mints an
installation token from a HarperFast-org-owned GitHub App
(Members:Read scope), and checks team membership. The work/review
job has ONE `if: needs.authorize.outputs.authorized == 'true'`. No
step-level guards, no individual user list to maintain.

The App token lives in the authorize job ONLY — the work job uses
the default GITHUB_TOKEN, so the org-read capability never reaches
the agent step.

CODEOWNERS-driven trust set:
  - The auth check reads `.github/CODEOWNERS` via the default token
    and extracts every `@HarperFast/<team>` handle as the trust set.
  - Same set as people we trust to review code; alignment by
    construction. New owner team in CODEOWNERS automatically extends
    trigger trust. New consumer repo inherits its own CODEOWNERS.
  - External-org handles are deliberately ignored — only HarperFast
    teams.
  - If CODEOWNERS is missing, empty, or has no @HarperFast handles,
    falls back to @HarperFast/developers.

Per-workflow specifics:

- claude-review.yml: checks BOTH the PR author
  (`pull_request.user.login`) AND the event actor (`github.actor`).
  A non-trusted user pushing to a trusted user's PR branch changes
  the actor without changing the PR author; refusing those events
  closes that loophole. claude[bot] is admitted explicitly so
  AI-authored PRs from the issue-to-PR pipeline get reviewed
  (ADMIT_CLAUDE_BOT=true).
- claude-mention.yml: checks the commenter. claude[bot] not admitted
  here (only humans trigger mentions).
- claude-issue-to-pr.yml: checks the LABELER (github.actor), not the
  issue author. The labeler must already have at least triage
  permission; a maintainer labeling an external-author issue is a
  legitimate way to invoke the agent on community reports.

Per the post-#447 convention, the auth-check bash lives in
`.github/scripts/authorize-claude-workflow.sh` (shared across all
three workflows; parameterized by env vars). Workflows invoke via
`bash .github/scripts/...`.

Defense-in-depth lint:
- New `.github/workflows/auth-gate-invariants.yml` runs on any PR
  touching a `claude-*.yml` workflow file. Validates structurally
  via `bash .github/scripts/validate-auth-gate-invariants.sh`:
  * `authorize` job exists.
  * `authorize.outputs.authorized` wired to a step output.
  * `actions/create-github-app-token` present and pinned to a SHA.
  * `authorize.permissions` has no `write` scopes.
  * `HARPERFAST_AI_CLIENT_ID` and `HARPERFAST_AI_APP_PRIVATE_KEY`
    secrets referenced.
  * Every non-authorize job has `needs: authorize` and an exact
    `if: needs.authorize.outputs.authorized == 'true'` (no
    compound expressions, no tautologies).
  Make this a REQUIRED status check on `main` via branch
  protection. Subtle attacks on the bash logic are caught by
  CODEOWNERS review on `.github/`.

Required (organization-level) secrets — must be set on the
HarperFast org for any consumer repo to authorize a Claude run:
  - HARPERFAST_AI_CLIENT_ID       (the App's Client ID, like Iv23li…)
  - HARPERFAST_AI_APP_PRIVATE_KEY (.pem file contents)

Replaces #417's two earlier commits (which were on a stale base
that pre-dated #432, #437, #438, #439, #442, #444, #447).

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant