Skip to content

Replace video corpus with episodic fixtures#31

Merged
brianmeyer merged 1 commit into
masterfrom
codex/rec-153-episodic-video-corpus
May 17, 2026
Merged

Replace video corpus with episodic fixtures#31
brianmeyer merged 1 commit into
masterfrom
codex/rec-153-episodic-video-corpus

Conversation

@brianmeyer
Copy link
Copy Markdown
Owner

Summary

  • replace the tiny UAT video clips with compact episodic-memory fixtures and richer transcript sidecars
  • add sidecar search text plus related image/document metadata so video benchmarks can exercise parent-memory, transcript, and related-artifact retrieval
  • update text-to-video benchmark prompts, corpus docs, release docs, and video-quality UAT to match current embed/hybrid modes and deterministic CI behavior

Validation

  • python3 -m pytest -q tests/test_video_corpus.py tests/test_video_sidecars.py tests/test_cross_modal_benchmark_defs.py
  • python3 benchmarks/cross_modal_ablation.py --dry-run
  • bash tests/uat/test_video_quality.sh
  • python3 -m pytest -q
  • .venv/bin/python -m pip wheel . -w /tmp/recallforge-wheel

Notes

  • python3 -m build and .venv/bin/python -m twine check were not available locally because build/twine are not installed; pip wheel succeeded with an isolated build.
  • Research references: Ego4D episodic-memory benchmark and Pillow ImageDraw docs for caption rendering.

@brianmeyer brianmeyer merged commit b43e5b1 into master May 17, 2026
4 checks passed
@brianmeyer brianmeyer deleted the codex/rec-153-episodic-video-corpus branch May 17, 2026 18:42
Copy link
Copy Markdown

@chatgpt-codex-connector chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: d1393d90fd

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

"effects": "zoom_in",
"overlay_text": "Pasta recipe",
"scenario": "Kitchen memory from a weeknight family recipe session.",
"images": ["food_pasta_dish.png", "handwritten_notes.png", "food_pasta_dish.png"],
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

P2 Badge Use distinct image files for each captioned cooking scene

cooking_tutorial.mp4 reuses food_pasta_dish.png for scenes 1 and 3, but _burn_text_onto_image writes captioned frames to a temp filename keyed only by source.stem; when the same source appears twice with different scene_captions, the later render overwrites the earlier one, so scene 1 ends up showing scene 3’s caption. This makes the generated fixture inconsistent with its intended timeline/transcript and can skew video-retrieval benchmarks that rely on scene text cues.

Useful? React with 👍 / 👎.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant