Skip to content

[Benchmark]Add TextHalu-Bench#1561

Open
Sammy20207109 wants to merge 5 commits into
open-compass:mainfrom
Sammy20207109:add_texthalubench
Open

[Benchmark]Add TextHalu-Bench#1561
Sammy20207109 wants to merge 5 commits into
open-compass:mainfrom
Sammy20207109:add_texthalubench

Conversation

@Sammy20207109

Copy link
Copy Markdown

Overview

This PR updates TextHaluBench after reviewer feedback. Following your feedback, we conducted a thorough inspection of the benchmark and manually re-checked the samples one by one.

Motivation

Previous benchmarks like ST-VQA and TextVQA are dominated by semantically clear samples, which may overestimate models' true visual grounding. TextHalu-Bench provides a more challenging evaluation set for non-semantic scene text evaluation.

Dataset

Focuses on non-semantic text: isolated numbers, incomplete words, rare or out-of-vocabulary tokens.
Two subtasks:Spotting ,Understanding

We have now updated the dataset and refreshed the corresponding files in this PR. We would greatly appreciate it if you could take another look when convenient. Please let us know if you notice any remaining issues or have additional suggestions for improvement.Thank you again for your time and valuable feedback.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants