Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Doubts about the evaluation results #33

Open
Yanllan opened this issue May 22, 2024 · 1 comment
Open

Doubts about the evaluation results #33

Yanllan opened this issue May 22, 2024 · 1 comment

Comments

@Yanllan
Copy link

Yanllan commented May 22, 2024

First of all, congratulations on completing such a work, but I am confused about the results in the technical report.
The reported results are referred to as "zero-shot evaluation of RadFM", however, the vqa-rad dataset exists in your training dataset, isn't it somewhat paradoxical that? Or did you do something with the dataset or did I misinterpret the paper?

@chaoyi-wu
Copy link
Owner

Thanks for your question. Here our “zero-shot” setting is more referred as a prompting method, distinguished with few-shot, CoT and so on, following FLAN https://arxiv.org/pdf/2301.13688. In other word, "zero-shot" here is not directly linked with any domain shift.

We talk about the transferring ability on unseen tasks in section "Generalization to Unseen Classes in PadChest" which may be more aligned with your understanding for "zero-shot".

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants