'Just because you are right, doesn't mean I am wrong': Overcoming a bottleneck in the development and evaluation of open-ended visual question answering (VQA) tasks

Man Luo; Shailaja Keyur Sampat; Riley Tallman; Yankai Zeng; Manuha Vancha; Akarshan Sajja; Chitta Baral

'Just because you are right, doesn't mean I am wrong': Overcoming a bottleneck in the development and evaluation of open-ended visual question answering (VQA) tasks

Man Luo, Shailaja Keyur Sampat, Riley Tallman, Yankai Zeng, Manuha Vancha, Akarshan Sajja, Chitta Baral

Computing and Augmented Intelligence, School of (IAFSE-SCAI)

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

7 Scopus citations

Abstract

GQA (Hudson and Manning, 2019) is a dataset for real-world visual reasoning and compositional question answering. We found that many answers predicted by the best vision-language models on the GQA dataset do not match the ground-truth answer but still are semantically meaningful and correct in the given context. In fact, this is the case with most existing visual question answering (VQA) datasets where they assume only one ground-truth answer for each question. We propose Alternative Answer Sets (AAS) of ground-truth answers to address this limitation, which is created automatically using off-the-shelf NLP tools. We introduce a semantic metric based on AAS and modify top VQA solvers to support multiple plausible answers for a question. We implement this approach on the GQA dataset and show the performance improvements.

Original language	English (US)
Title of host publication	EACL 2021 - 16th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference
Publisher	Association for Computational Linguistics (ACL)
Pages	2766-2771
Number of pages	6
ISBN (Electronic)	9781954085022
State	Published - 2021
Event	16th Conference of the European Chapter of the Associationfor Computational Linguistics, EACL 2021 - Virtual, Online Duration: Apr 19 2021 → Apr 23 2021

Publication series

Name	EACL 2021 - 16th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference

Conference

Conference	16th Conference of the European Chapter of the Associationfor Computational Linguistics, EACL 2021
City	Virtual, Online
Period	4/19/21 → 4/23/21

ASJC Scopus subject areas

Software
Computational Theory and Mathematics
Linguistics and Language

Cite this

Luo, M., Sampat, S. K., Tallman, R., Zeng, Y., Vancha, M., Sajja, A., & Baral, C. (2021). 'Just because you are right, doesn't mean I am wrong': Overcoming a bottleneck in the development and evaluation of open-ended visual question answering (VQA) tasks. In EACL 2021 - 16th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference (pp. 2766-2771). (EACL 2021 - 16th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference). Association for Computational Linguistics (ACL).

'Just because you are right, doesn't mean I am wrong': Overcoming a bottleneck in the development and evaluation of open-ended visual question answering (VQA) tasks. / Luo, Man; Sampat, Shailaja Keyur; Tallman, Riley et al.
EACL 2021 - 16th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference. Association for Computational Linguistics (ACL), 2021. p. 2766-2771 (EACL 2021 - 16th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Luo, M, Sampat, SK, Tallman, R, Zeng, Y, Vancha, M, Sajja, A & Baral, C 2021, 'Just because you are right, doesn't mean I am wrong': Overcoming a bottleneck in the development and evaluation of open-ended visual question answering (VQA) tasks. in EACL 2021 - 16th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference. EACL 2021 - 16th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference, Association for Computational Linguistics (ACL), pp. 2766-2771, 16th Conference of the European Chapter of the Associationfor Computational Linguistics, EACL 2021, Virtual, Online, 4/19/21.

Luo M, Sampat SK, Tallman R, Zeng Y, Vancha M, Sajja A et al. 'Just because you are right, doesn't mean I am wrong': Overcoming a bottleneck in the development and evaluation of open-ended visual question answering (VQA) tasks. In EACL 2021 - 16th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference. Association for Computational Linguistics (ACL). 2021. p. 2766-2771. (EACL 2021 - 16th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference).

Luo, Man ; Sampat, Shailaja Keyur ; Tallman, Riley et al. / 'Just because you are right, doesn't mean I am wrong' : Overcoming a bottleneck in the development and evaluation of open-ended visual question answering (VQA) tasks. EACL 2021 - 16th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference. Association for Computational Linguistics (ACL), 2021. pp. 2766-2771 (EACL 2021 - 16th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference).

@inproceedings{517e41426eda4fa399518f7d7d51286d,

title = "'Just because you are right, doesn't mean I am wrong': Overcoming a bottleneck in the development and evaluation of open-ended visual question answering (VQA) tasks",

abstract = "GQA (Hudson and Manning, 2019) is a dataset for real-world visual reasoning and compositional question answering. We found that many answers predicted by the best vision-language models on the GQA dataset do not match the ground-truth answer but still are semantically meaningful and correct in the given context. In fact, this is the case with most existing visual question answering (VQA) datasets where they assume only one ground-truth answer for each question. We propose Alternative Answer Sets (AAS) of ground-truth answers to address this limitation, which is created automatically using off-the-shelf NLP tools. We introduce a semantic metric based on AAS and modify top VQA solvers to support multiple plausible answers for a question. We implement this approach on the GQA dataset and show the performance improvements.",

author = "Man Luo and Sampat, {Shailaja Keyur} and Riley Tallman and Yankai Zeng and Manuha Vancha and Akarshan Sajja and Chitta Baral",

note = "Funding Information: We are thankful to Tejas Gokhale for useful discussions and feedback on this work. We also thank anonymous reviewers for their thoughtful feedback. This work is partially supported by the National Science Foundation grant IIS-1816039. Publisher Copyright: {\textcopyright} 2021 Association for Computational Linguistics; 16th Conference of the European Chapter of the Associationfor Computational Linguistics, EACL 2021 ; Conference date: 19-04-2021 Through 23-04-2021",

year = "2021",

language = "English (US)",

series = "EACL 2021 - 16th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference",

publisher = "Association for Computational Linguistics (ACL)",

pages = "2766--2771",

booktitle = "EACL 2021 - 16th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference",

}

TY - GEN

T1 - 'Just because you are right, doesn't mean I am wrong'

T2 - 16th Conference of the European Chapter of the Associationfor Computational Linguistics, EACL 2021

AU - Luo, Man

AU - Sampat, Shailaja Keyur

AU - Tallman, Riley

AU - Zeng, Yankai

AU - Vancha, Manuha

AU - Sajja, Akarshan

AU - Baral, Chitta

N1 - Funding Information: We are thankful to Tejas Gokhale for useful discussions and feedback on this work. We also thank anonymous reviewers for their thoughtful feedback. This work is partially supported by the National Science Foundation grant IIS-1816039. Publisher Copyright: © 2021 Association for Computational Linguistics

PY - 2021

Y1 - 2021

N2 - GQA (Hudson and Manning, 2019) is a dataset for real-world visual reasoning and compositional question answering. We found that many answers predicted by the best vision-language models on the GQA dataset do not match the ground-truth answer but still are semantically meaningful and correct in the given context. In fact, this is the case with most existing visual question answering (VQA) datasets where they assume only one ground-truth answer for each question. We propose Alternative Answer Sets (AAS) of ground-truth answers to address this limitation, which is created automatically using off-the-shelf NLP tools. We introduce a semantic metric based on AAS and modify top VQA solvers to support multiple plausible answers for a question. We implement this approach on the GQA dataset and show the performance improvements.

AB - GQA (Hudson and Manning, 2019) is a dataset for real-world visual reasoning and compositional question answering. We found that many answers predicted by the best vision-language models on the GQA dataset do not match the ground-truth answer but still are semantically meaningful and correct in the given context. In fact, this is the case with most existing visual question answering (VQA) datasets where they assume only one ground-truth answer for each question. We propose Alternative Answer Sets (AAS) of ground-truth answers to address this limitation, which is created automatically using off-the-shelf NLP tools. We introduce a semantic metric based on AAS and modify top VQA solvers to support multiple plausible answers for a question. We implement this approach on the GQA dataset and show the performance improvements.

UR - http://www.scopus.com/inward/record.url?scp=85107276027&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85107276027&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:85107276027

T3 - EACL 2021 - 16th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference

SP - 2766

EP - 2771

BT - EACL 2021 - 16th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference

PB - Association for Computational Linguistics (ACL)

Y2 - 19 April 2021 through 23 April 2021

ER -

'Just because you are right, doesn't mean I am wrong': Overcoming a bottleneck in the development and evaluation of open-ended visual question answering (VQA) tasks

Abstract

Publication series

Conference

ASJC Scopus subject areas

Other files and links

Fingerprint

Cite this