-
Notifications
You must be signed in to change notification settings - Fork 19
fix: use multiple seeds per task in regression baselines #33
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Changes from all commits
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -11,7 +11,7 @@ | |
| "probes": 192, | ||
| "true_positives": 64 | ||
| }, | ||
| "seed": 8, | ||
| "seed": 84, | ||
| "size": 128, | ||
| "task": "memory_index" | ||
| } | ||
|
|
||
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -1,42 +1,202 @@ | ||
| [ | ||
| { | ||
| "path": "data/io_pipeline/regression.json", | ||
| "path": "data\\io_pipeline\\regression.json", | ||
| "task": "io_pipeline" | ||
| }, | ||
| { | ||
| "path": "data/ordering_core/regression.json", | ||
| "path": "data\\io_pipeline\\regression.json", | ||
|
Member
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. why |
||
| "task": "io_pipeline" | ||
| }, | ||
| { | ||
| "path": "data\\io_pipeline\\regression.json", | ||
| "task": "io_pipeline" | ||
| }, | ||
| { | ||
| "path": "data\\io_pipeline\\regression.json", | ||
| "task": "io_pipeline" | ||
| }, | ||
| { | ||
| "path": "data\\io_pipeline\\regression.json", | ||
| "task": "io_pipeline" | ||
| }, | ||
|
Comment on lines
1
to
+21
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. 🧩 Analysis chain🏁 Script executed: #!/bin/bash
# Verify if the baseline files can be resolved with current paths
cd data
echo "=== Checking if paths resolve correctly ==="
# List actual directory structure
find . -name "regression.json" -type f
echo ""
echo "=== Testing path resolution ==="
# On POSIX, this literal path should NOT exist
if [ -e "data\\io_pipeline\\regression.json" ]; then
echo "UNEXPECTED: Backslash path exists (likely Windows)"
else
echo "EXPECTED: Backslash path does not exist on POSIX"
fiRepository: iiitl/chuck Length of output: 521 🏁 Script executed: cat data/regression.jsonRepository: iiitl/chuck Length of output: 4537 🏁 Script executed: find . -name "*.py" -type f | head -20Repository: iiitl/chuck Length of output: 753 🏁 Script executed: grep -r "load_regression_file" --include="*.py" -B 2 -A 10Repository: iiitl/chuck Length of output: 2945 🏁 Script executed: grep -r "test_regression\|_load_expected_by_task" --include="*.py" -B 2 -A 10 | head -100Repository: iiitl/chuck Length of output: 1490 🏁 Script executed: grep -r "DEFAULT_REGRESSION_PATH\|test.*regression" --include="*.py" -B 2 -A 5Repository: iiitl/chuck Length of output: 5156 🏁 Script executed: grep -r "def test_" --include="*.py" | grep -i regressionRepository: iiitl/chuck Length of output: 268 🏁 Script executed: grep -A 5 "DEFAULT_REGRESSION_PATH" chuck/regression.pyRepository: iiitl/chuck Length of output: 819 🏁 Script executed: cat tests/test_regression.pyRepository: iiitl/chuck Length of output: 2334 Windows-style path separators will break on POSIX systems. Paths like Additionally, the manifest should contain 10 entries (one per task), not 50. The current 5 duplicate entries per task are redundant, and when Regenerate this file on a POSIX system or use forward slashes ( 🤖 Prompt for AI Agents
Comment on lines
2
to
+21
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Manifest has 50 duplicate entries; should have 10 (one per task). The manifest contains 5 identical entries per task, but each points to the same baseline file. Per the relevant code snippet from self.assertEqual(len(entries), 10)Additionally, After fixing the overwrite bug in 🤖 Prompt for AI Agents |
||
| { | ||
| "path": "data\\ordering_core\\regression.json", | ||
| "task": "ordering_core" | ||
| }, | ||
| { | ||
| "path": "data\\ordering_core\\regression.json", | ||
| "task": "ordering_core" | ||
| }, | ||
| { | ||
| "path": "data/retrieval_core/regression.json", | ||
| "path": "data\\ordering_core\\regression.json", | ||
| "task": "ordering_core" | ||
| }, | ||
| { | ||
| "path": "data\\ordering_core\\regression.json", | ||
| "task": "ordering_core" | ||
| }, | ||
| { | ||
| "path": "data\\ordering_core\\regression.json", | ||
| "task": "ordering_core" | ||
| }, | ||
| { | ||
| "path": "data\\retrieval_core\\regression.json", | ||
| "task": "retrieval_core" | ||
| }, | ||
| { | ||
| "path": "data\\retrieval_core\\regression.json", | ||
| "task": "retrieval_core" | ||
| }, | ||
| { | ||
| "path": "data\\retrieval_core\\regression.json", | ||
| "task": "retrieval_core" | ||
| }, | ||
| { | ||
| "path": "data/data_encoding/regression.json", | ||
| "path": "data\\retrieval_core\\regression.json", | ||
| "task": "retrieval_core" | ||
| }, | ||
| { | ||
| "path": "data\\retrieval_core\\regression.json", | ||
| "task": "retrieval_core" | ||
| }, | ||
| { | ||
| "path": "data\\data_encoding\\regression.json", | ||
| "task": "data_encoding" | ||
| }, | ||
| { | ||
| "path": "data\\data_encoding\\regression.json", | ||
| "task": "data_encoding" | ||
| }, | ||
| { | ||
| "path": "data\\data_encoding\\regression.json", | ||
| "task": "data_encoding" | ||
| }, | ||
| { | ||
| "path": "data\\data_encoding\\regression.json", | ||
| "task": "data_encoding" | ||
| }, | ||
| { | ||
| "path": "data/graph_analytics/regression.json", | ||
| "path": "data\\data_encoding\\regression.json", | ||
| "task": "data_encoding" | ||
| }, | ||
| { | ||
| "path": "data\\graph_analytics\\regression.json", | ||
| "task": "graph_analytics" | ||
| }, | ||
| { | ||
| "path": "data\\graph_analytics\\regression.json", | ||
| "task": "graph_analytics" | ||
| }, | ||
| { | ||
| "path": "data\\graph_analytics\\regression.json", | ||
| "task": "graph_analytics" | ||
| }, | ||
| { | ||
| "path": "data\\graph_analytics\\regression.json", | ||
| "task": "graph_analytics" | ||
| }, | ||
| { | ||
| "path": "data\\graph_analytics\\regression.json", | ||
| "task": "graph_analytics" | ||
| }, | ||
| { | ||
| "path": "data/prime_analytics/regression.json", | ||
| "path": "data\\prime_analytics\\regression.json", | ||
| "task": "prime_analytics" | ||
| }, | ||
| { | ||
| "path": "data/memory_tier/regression.json", | ||
| "path": "data\\prime_analytics\\regression.json", | ||
| "task": "prime_analytics" | ||
| }, | ||
|
Member
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Do you realise why each entry is occuring 5 time? |
||
| { | ||
| "path": "data\\prime_analytics\\regression.json", | ||
| "task": "prime_analytics" | ||
| }, | ||
| { | ||
| "path": "data\\prime_analytics\\regression.json", | ||
| "task": "prime_analytics" | ||
| }, | ||
| { | ||
| "path": "data\\prime_analytics\\regression.json", | ||
| "task": "prime_analytics" | ||
| }, | ||
| { | ||
| "path": "data\\memory_tier\\regression.json", | ||
| "task": "memory_tier" | ||
| }, | ||
| { | ||
| "path": "data\\memory_tier\\regression.json", | ||
| "task": "memory_tier" | ||
| }, | ||
| { | ||
| "path": "data/memory_index/regression.json", | ||
| "path": "data\\memory_tier\\regression.json", | ||
| "task": "memory_tier" | ||
| }, | ||
| { | ||
| "path": "data\\memory_tier\\regression.json", | ||
| "task": "memory_tier" | ||
| }, | ||
| { | ||
| "path": "data\\memory_tier\\regression.json", | ||
| "task": "memory_tier" | ||
| }, | ||
| { | ||
| "path": "data\\memory_index\\regression.json", | ||
| "task": "memory_index" | ||
| }, | ||
| { | ||
| "path": "data\\memory_index\\regression.json", | ||
| "task": "memory_index" | ||
| }, | ||
| { | ||
| "path": "data\\memory_index\\regression.json", | ||
| "task": "memory_index" | ||
| }, | ||
| { | ||
| "path": "data/compute_core/regression.json", | ||
| "path": "data\\memory_index\\regression.json", | ||
| "task": "memory_index" | ||
| }, | ||
| { | ||
| "path": "data\\memory_index\\regression.json", | ||
| "task": "memory_index" | ||
| }, | ||
| { | ||
| "path": "data\\compute_core\\regression.json", | ||
| "task": "compute_core" | ||
| }, | ||
| { | ||
| "path": "data\\compute_core\\regression.json", | ||
| "task": "compute_core" | ||
| }, | ||
| { | ||
| "path": "data\\compute_core\\regression.json", | ||
| "task": "compute_core" | ||
| }, | ||
| { | ||
| "path": "data\\compute_core\\regression.json", | ||
| "task": "compute_core" | ||
| }, | ||
| { | ||
| "path": "data/relational_fusion/regression.json", | ||
| "path": "data\\compute_core\\regression.json", | ||
| "task": "compute_core" | ||
|
Member
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. chuck currently supports linux, mac and WSL. These windows style path style must be reverted |
||
| }, | ||
| { | ||
| "path": "data\\relational_fusion\\regression.json", | ||
| "task": "relational_fusion" | ||
| }, | ||
| { | ||
| "path": "data\\relational_fusion\\regression.json", | ||
| "task": "relational_fusion" | ||
| }, | ||
| { | ||
| "path": "data\\relational_fusion\\regression.json", | ||
| "task": "relational_fusion" | ||
| }, | ||
| { | ||
| "path": "data\\relational_fusion\\regression.json", | ||
| "task": "relational_fusion" | ||
| }, | ||
| { | ||
| "path": "data\\relational_fusion\\regression.json", | ||
| "task": "relational_fusion" | ||
| } | ||
| ] | ||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Could you please tell your reasoning behind these changes