Skip to content

Add task: happy-dog-latex#46

Open
Leolty wants to merge 1 commit into
cocoabench:mainfrom
Leolty:happy-dog-latex
Open

Add task: happy-dog-latex#46
Leolty wants to merge 1 commit into
cocoabench:mainfrom
Leolty:happy-dog-latex

Conversation

@Leolty
Copy link
Copy Markdown
Collaborator

@Leolty Leolty commented Jan 30, 2026

A complex and interesting task requires GUI, visual inspection, and complex coding

@Leolty Leolty requested review from Ber666 and zzn-nzz January 30, 2026 11:26
@zzn-nzz
Copy link
Copy Markdown
Collaborator

zzn-nzz commented Feb 5, 2026

Hi @Leolty , this is a creative and challenging task with an interesting idea. It requires many crucial abilities in realistic scenarios, including web interaction, visual perception, and code implementation and execution.

I have two small concerns:

  • Note-head counts problem: I believe the answer makes sense, but if the number of correct models is not specified, I would assume that GPT-5.2-Thinking (Extended) has five note-heads based on the confusing definition of note-head. You might consider explicitly specifying the number of models that satisfy the requirement that “note-head counts match the original image.”

  • The agent output example in evaluation.md seems to be missing.

Once these two issues are resolved, I’d be happy to include this amazing task. Thank you!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants