Add sitemap + robots.txt to leaderboard.querygym.com#15

Merged
radinhamidi merged 1 commit into chore/nav-and-sitemap-stacked from chore/leaderboard-sitemap
Apr 30, 2026

@radinhamidi
Member

Summary

Mirrors the SEO surface that querygym.com already has, on the leaderboard side.

  • `@astrojs/sitemap` integration in `reproducibility/site/astro.config.mjs`. Home gets priority 1.0; dataset/method/model index pages get 0.5; run-detail pages get 0.3 with a monthly change frequency (they are mostly internal-link targets, so search engines should focus on the index pages).
  • `reproducibility/site/public/robots.txt` with the standard "Allow all + sitemap pointer" shape.
  • `<link rel="sitemap">` in `reproducibility/site/src/layouts/Default.astro` for browser discoverability.
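
For reviewers who want to see the priority tiers spelled out, here is a minimal sketch of the kind of function that could be passed as the `serialize` option of `@astrojs/sitemap`. It is not the code from this PR: the `/runs/` path prefix, the `SitemapEntry` type, and the fallback of 0.5 for pages the summary does not mention (such as about) are all assumptions made for illustration.

```typescript
// Hypothetical sketch, not the PR's actual config.
// Assumption: run-detail pages live under /runs/<id>/ (path layout is not stated in the PR).
type ChangeFreq = "daily" | "weekly" | "monthly";

interface SitemapEntry {
  url: string;
  priority?: number;
  changefreq?: ChangeFreq;
}

// Assigns the tiers described above: home 1.0, run-detail 0.3 monthly, everything else 0.5.
function serialize(item: SitemapEntry): SitemapEntry {
  // Strip scheme and host so only the path is matched; a bare origin counts as "/".
  const path = item.url.replace(/^https?:\/\/[^/]+/, "") || "/";

  if (path === "/") {
    return { ...item, priority: 1.0 };
  }
  if (path.startsWith("/runs/")) {
    return { ...item, priority: 0.3, changefreq: "monthly" };
  }
  // Assumption: index pages and any remaining pages share the 0.5 tier.
  return { ...item, priority: 0.5 };
}
```

In an Astro config this would be wired in roughly as `sitemap({ serialize })`; keeping the tier logic in a plain function makes it easy to unit-test without running a build.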

Generates 37 URLs on the current empty-results state (1 home + 32 per-dataset + datasets/methods/models index + about). Shard count grows as runs land via `submit_run.py`.

Where each robots.txt lives

Each origin needs its own `robots.txt` in that site's `public/`:

  • `querygym.com/robots.txt` → `web/site/public/robots.txt` (existed already)
  • `leaderboard.querygym.com/robots.txt` → `reproducibility/site/public/robots.txt` (added in this PR)
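
The "Allow all + sitemap pointer" shape is small enough to show in full. The sketch below builds that text for a given origin; the `robotsTxt` helper is hypothetical, and the exact bytes of the committed file may differ (the file in this PR is static, not generated).

```typescript
// Hypothetical helper illustrating the "Allow all + sitemap pointer" shape.
// The real robots.txt in this PR is a static file; its exact contents are not shown here.
function robotsTxt(origin: string): string {
  return [
    "User-agent: *",                         // applies to every crawler
    "Allow: /",                              // nothing is disallowed
    "",
    `Sitemap: ${origin}/sitemap-index.xml`,  // absolute URL, one per origin
    "",
  ].join("\n");
}

const leaderboardRobots = robotsTxt("https://leaderboard.querygym.com");
```

Because the `Sitemap:` line carries an absolute URL, each origin needs its own copy of the file pointing at its own sitemap index.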

Stacked

Base is `chore/nav-and-sitemap-stacked` (PR #14), which is in turn stacked on `chore/copy-polish` (PR #13). Once all three merge in order, this PR's diff narrows to just the leaderboard files.

Test plan

  • `pnpm -F @qg/leaderboard build` succeeds.
  • `dist/sitemap-index.xml` and `dist/sitemap-0.xml` generated.
  • `dist/robots.txt` present, points at sitemap-index.
  • `sitemap-0.xml` contains 37 `<url>` entries.
  • After deploy: `https://leaderboard.querygym.com/sitemap-index.xml` resolves; submit to GSC.
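
The two content checks in the list above could be scripted along these lines. This is a sketch under assumptions: it treats each sitemap entry as one `<url>` element (the sitemaps.org format), and the function names are hypothetical. Reading the files from `dist/` is left to the caller.

```typescript
// Hypothetical post-build checks; function names are illustrative only.
// Expected count from the PR summary: 1 home + 32 per-dataset + 3 indexes + 1 about.
const EXPECTED_URLS = 1 + 32 + 3 + 1; // 37

// Counts opening <url> elements in a sitemap shard (assumes sitemaps.org format).
function countUrlEntries(sitemapXml: string): number {
  return (sitemapXml.match(/<url>/g) ?? []).length;
}

// True if some line of robots.txt is exactly the pointer to this origin's sitemap index.
function pointsAtSitemapIndex(robots: string, origin: string): boolean {
  const wanted = `Sitemap: ${origin}/sitemap-index.xml`;
  return robots.split(/\r?\n/).some((line) => line.trim() === wanted);
}
```

A caller would pass in the contents of `dist/sitemap-0.xml` and `dist/robots.txt` and compare the count against `EXPECTED_URLS`, which will need updating as runs land.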

🤖 Generated with Claude Code

Mirror what querygym.com already has:
- @astrojs/sitemap integration. Home priority 1.0, per-dataset/method/
  model index pages 0.5, run-detail pages 0.3 monthly (they're mostly
  internal-link targets so search engines should focus on the index
  pages).
- robots.txt with the standard "Allow all + sitemap pointer" shape.
- <link rel="sitemap"> in the layout for browser discoverability.

Generates 37 URLs (1 home + 32 per-dataset + datasets/methods/models
indexes + about) on the current empty-results state. The shard count
will grow as runs land via submit_run.py.

Stacked on chore/nav-and-sitemap-stacked (PR #14).

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
@radinhamidi radinhamidi merged commit 26c72ca into chore/nav-and-sitemap-stacked Apr 30, 2026
2 checks passed
radinhamidi added a commit that referenced this pull request Apr 30, 2026
Recover orphaned PRs #14 + #15 (nav links + leaderboard sitemap)
@radinhamidi radinhamidi deleted the chore/leaderboard-sitemap branch April 30, 2026 05:20