Skip to content

fix(redesign): R34 CI fix + gen-over-gen QA pipeline + inbox stagger animations#371

Merged
HomenShum merged 1 commit into
mainfrom
fix/ci-pipeline-upgrade-r34
May 20, 2026
Merged

fix(redesign): R34 CI fix + gen-over-gen QA pipeline + inbox stagger animations#371
HomenShum merged 1 commit into
mainfrom
fix/ci-pipeline-upgrade-r34

Conversation

@HomenShum
Copy link
Copy Markdown
Owner

Summary

  • CI fix: Resolve record check failure — pr-demo-video.yml removed cache: npm (package-lock.json is gitignored) and switched npm ci to npm install --legacy-peer-deps
  • QA pipeline upgrade: Gen-over-gen milestone tracking with semantic P1 fingerprinting, dimension trend arrows, scenario stability badges, generation narrative summaries
  • UI polish: Stagger fade-in animations on inbox rows (40ms cascade, 400ms cap), smoother surface transitions, prefers-reduced-motion respected

Changes

File What
.github/workflows/pr-demo-video.yml Fix CI record check — no lockfile dependency
scripts/ui/interactiveGeminiQaPipeline.mjs Milestone diff engine, semantic P1 fingerprints, generation tracking
src/features/redesign/primitives.css rd-stagger-fade-in keyframe + surface transitions
src/features/redesign/surfaces/InboxSurface.tsx Stagger animation on inbox list items

Verification

  • npx tsc --noEmit — 0 errors
  • npm run build — clean (23s)
  • 24 rounds of QA history (R11→R34), score 91/100
  • Gen 5 avg 89.5 (+0.7 vs Gen 1 baseline 88.8)

Test plan

  • Verify pr-demo-video workflow succeeds on next merged PR
  • Run R35 QA round to confirm stagger animation scores improve
  • Visual check: inbox items fade in sequentially, no pop-in

🤖 Generated with Claude Code

@HomenShum HomenShum enabled auto-merge (squash) May 20, 2026 23:33
@vercel
Copy link
Copy Markdown

vercel Bot commented May 20, 2026

The latest updates on your projects. Learn more about Vercel for GitHub.

Project Deployment Actions Updated (UTC)
nodebench-ai Ready Ready Preview, Comment May 20, 2026 11:48pm

Request Review

@augmentcode
Copy link
Copy Markdown

augmentcode Bot commented May 20, 2026

🤖 Augment PR Summary

Summary: This PR fixes the PR demo video CI workflow, upgrades the interactive Gemini QA pipeline to support gen-over-gen milestone tracking, and adds inbox list stagger animations with smoother surface transitions.

Changes:

  • CI: removed cache: npm (lockfile is gitignored) and replaced npm ci with npm install --legacy-peer-deps in pr-demo-video.yml
  • QA pipeline: added generation segmentation (GEN_SIZE=5), milestone comparisons (baseline/midpoint/gen boundaries/prev), and per-generation summary stats
  • P1 tracking: introduced semantic P1 fingerprinting to dedupe stochastic re-flags by normalized issue text + surface
  • Trends: expanded to all-time screenshot/video dimension trend + stats and scenario stability reporting
  • UI: added rd-stagger-fade-in keyframes + inbox row inline delays for cascading list item entry
  • UI polish: added opacity/transform transitions on inbox rows/cards and a reduced-motion override for the stagger animation

Technical Notes: The comparison output now records both milestone diffs and per-generation aggregates, and prints compact grouped trajectories by generation to make long QA histories easier to interpret.

🤖 Was this summary useful? React with 👍 or 👎

Copy link
Copy Markdown

@augmentcode augmentcode Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Review completed. 3 suggestions posted.

Fix All in Augment

Comment augment review to trigger a new review at any time.

if (!prevFingerprints.has(fp)) {
const detail = currFpMap.get(fp);
// Check if this is actually a stochastic re-flag (appeared in older rounds, disappeared, reappeared)
const isStochastic = allHistoryFingerprints.has(fp) && !prevFingerprints.has(fp);
Copy link
Copy Markdown

@augmentcode augmentcode Bot May 20, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

scripts/ui/interactiveGeminiQaPipeline.mjs:877 — isStochastic is true for any fingerprint seen in older history, so a true regression after a long gap can be reported as both stochastic (with a “not a real regression” note) and regressed. Consider making the stochastic vs regression buckets mutually exclusive so the report doesn’t contradict itself.

Severity: medium

Fix This in Augment

🤖 Was this useful? React with 👍 or 👎, or 🚀 if it prevented an incident/outage.

const start = valid[0];
const end = valid[valid.length - 1];
const delta = end - start;
const velocity = +(delta / valid.length).toFixed(2);
Copy link
Copy Markdown

@augmentcode augmentcode Bot May 20, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

scripts/ui/interactiveGeminiQaPipeline.mjs:717 — velocity is presented later as “per round”, but it’s computed as delta / valid.length, which underestimates per-step change (typically delta / (valid.length - 1)). This can skew downstream flags like the DECLINING threshold that checks velocity < -0.3.

Severity: low

Fix This in Augment

🤖 Was this useful? React with 👍 or 👎, or 🚀 if it prevented an incident/outage.


- name: Install dependencies
run: npm ci
run: npm install --legacy-peer-deps
Copy link
Copy Markdown

@augmentcode augmentcode Bot May 20, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

.github/workflows/pr-demo-video.yml:57 — Switching to npm install --legacy-peer-deps without a lockfile makes dependency resolution non-deterministic across runs and can introduce flaky CI failures. If the goal is a reliable “record” check, it may be worth ensuring the install step is still reproducible/stable for Playwright recording.

Severity: medium

Fix This in Augment

🤖 Was this useful? React with 👍 or 👎, or 🚀 if it prevented an incident/outage.

…animations

Fix the `record` CI check failure (pr-demo-video workflow needed npm install
instead of npm ci since package-lock.json is gitignored). Upgrade the Gemini
QA pipeline with generation-over-generation milestone tracking, semantic P1
fingerprinting, and dimension trend analysis. Add stagger fade-in animations
to inbox list items and smoother surface content transitions.

- CI: Remove `cache: npm` from setup-node, switch npm ci → npm install
- Pipeline: Milestone diff engine (baseline, gen boundaries, midpoint, previous)
- Pipeline: Semantic P1 fingerprinting for stochastic variance filtering
- Pipeline: Dimension trend arrows with velocity, scenario stability badges
- UI: rd-stagger-fade-in keyframe for inbox rows (40ms cascade, 400ms cap)
- UI: Transition smoothness on inbox-list, v3-card, inbox-row elements
- UI: prefers-reduced-motion respected for new animations

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
@github-actions
Copy link
Copy Markdown

✅ Dogfood Visual QA Gate: PASSED

Check Status
Screenshots 23 captured (pass)
Walkthrough 9 chapters (pass)
Key Frames 9 extracted (pass)
Scribe Steps 8 how-to steps (pass)
Build success
Artifacts

Download the dogfood-evidence-f58e6a7 artifact from the Actions tab for full screenshots, frames, and walkthrough video.


Generated by Dogfood QA Gate

@HomenShum HomenShum merged commit c8066a6 into main May 20, 2026
16 checks passed
@HomenShum HomenShum deleted the fix/ci-pipeline-upgrade-r34 branch May 20, 2026 23:59
@github-actions
Copy link
Copy Markdown

Demo: walkthrough of the surfaces this PR changed is available as a workflow artifact (pr-demo-371) at https://github.com/HomenShum/nodebench-ai/actions/runs/26197036474

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants