Skip to content

feat: add Playwright E2E tests and CI for LLM dashboard#1

Merged
jaikoo merged 2 commits into
mainfrom
feat/e2e-ci-seed-fix-tab-tests-v2
Feb 16, 2026
Merged

feat: add Playwright E2E tests and CI for LLM dashboard#1
jaikoo merged 2 commits into
mainfrom
feat/e2e-ci-seed-fix-tab-tests-v2

Conversation

@jaikoo
Copy link
Copy Markdown
Owner

@jaikoo jaikoo commented Feb 16, 2026

Summary

  • 21 Playwright E2E tests covering all 11 LLM dashboard tabs: Overview, Usage, Latency, Models, Traces, Search, Prompts, Scores, Sessions, Tools, Feedback
  • Seed timing fix: feedback/budget/scores seeding moved to seedPostFlush() called after flush wait, eliminating 404 warnings
  • Score seeding: seeds quality (0.85) and relevance (0.72) scores for Scores tab test
  • GitHub Actions CI: E2E workflow triggers on push/PR to main — builds bloop, installs Playwright, runs all tests

Test plan

  • All 21 E2E tests pass locally
  • No seed warnings for feedback/budget (traces flushed before POST)
  • CI workflow runs on this PR

Add 21 Playwright E2E tests covering all LLM dashboard tabs: Overview,
Usage, Latency, Models, Traces, Search, Prompts, Scores, Sessions,
Tools, and Feedback. Tests verify tab rendering, data display, drill-
down navigation, span tree hierarchy, feedback submission, budget
editing, and prompt version comparison.

Seed timing ensures feedback/budget/scores POST after trace flush so
traces exist in SQLite before mutation endpoints are called.

Add GitHub Actions workflow for E2E on push/PR to main.
Remote main has 8 LLM tabs (no Sessions, Tools, Feedback UI).
Remove tests for features not in remote: budget gauge/edit, feedback
buttons/tab, sessions drill-down, tools table, prompt version
comparison, span tree hierarchy. Use flexible tab count assertion.
@jaikoo jaikoo merged commit 1de1ff9 into main Feb 16, 2026
9 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant