
Right for the Wrong Reasons: Diagnosing Syntactic Heuristics in Natural Language Inference

T. McCoy, E. Pavlick, and T. Linzen. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 3428--3448. Florence, Italy, Association for Computational Linguistics (July 2019)
DOI: 10.18653/v1/P19-1334

Abstract

A machine learning system can score well on a given test set by relying on heuristics that are effective for frequent example types but break down in more challenging cases. We study this issue within natural language inference (NLI), the task of determining whether one sentence entails another. We hypothesize that statistical NLI models may adopt three fallible syntactic heuristics: the lexical overlap heuristic, the subsequence heuristic, and the constituent heuristic. To determine whether models have adopted these heuristics, we introduce a controlled evaluation set called HANS (Heuristic Analysis for NLI Systems), which contains many examples where the heuristics fail. We find that models trained on MNLI, including BERT, a state-of-the-art model, perform very poorly on HANS, suggesting that they have indeed adopted these heuristics. We conclude that there is substantial room for improvement in NLI systems, and that the HANS dataset can motivate and measure progress in this area.
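To make two of the named heuristics concrete, here is a minimal Python sketch. The abstract names the heuristics but does not define them, so the definitions below are assumptions: the lexical overlap heuristic is taken to predict entailment whenever every hypothesis word also occurs in the premise, and the subsequence heuristic whenever the hypothesis is a contiguous run of premise words. The function names and the whitespace tokenizer are hypothetical, and the constituent heuristic is omitted because it would need a syntactic parser.

```python
def tokens(sentence):
    """Lowercase, split on whitespace, and strip surrounding punctuation."""
    stripped = (w.strip(".,!?").lower() for w in sentence.split())
    return [w for w in stripped if w]


def lexical_overlap(premise, hypothesis):
    """True if every hypothesis word also appears in the premise (assumed definition)."""
    premise_words = set(tokens(premise))
    return all(w in premise_words for w in tokens(hypothesis))


def is_subsequence(premise, hypothesis):
    """True if the hypothesis is a contiguous run of premise words (assumed definition)."""
    p, h = tokens(premise), tokens(hypothesis)
    n = len(h)
    return n > 0 and any(p[i:i + n] == h for i in range(len(p) - n + 1))


# Illustrative pairs where the heuristic fires although the premise
# does NOT entail the hypothesis.
passive = ("The doctor was paid by the actor.", "The doctor paid the actor.")
modifier = ("The doctor near the actor danced.", "The actor danced.")

print(lexical_overlap(*passive))   # heuristic would wrongly predict entailment
print(is_subsequence(*modifier))   # heuristic would wrongly predict entailment
```

A model that relies on either check would label both pairs as entailment, which is the kind of failure a controlled evaluation set such as HANS is meant to expose.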

Description

Right for the Wrong Reasons: Diagnosing Syntactic Heuristics in Natural Language Inference - ACL Anthology

Links and resources

BibTeX key: mccoy-etal-2019-right
entry type: inproceedings
address: Florence, Italy
booktitle: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
year: 2019
month: jul
pages: 3428--3448
publisher: Association for Computational Linguistics
DOI: 10.18653/v1/P19-1334
url: https://www.aclweb.org/anthology/P19-1334


Cite this publication

@inproceedings{mccoy-etal-2019-right,
  abstract = {A machine learning system can score well on a given test set by relying on heuristics that are effective for frequent example types but break down in more challenging cases. We study this issue within natural language inference (NLI), the task of determining whether one sentence entails another. We hypothesize that statistical NLI models may adopt three fallible syntactic heuristics: the lexical overlap heuristic, the subsequence heuristic, and the constituent heuristic. To determine whether models have adopted these heuristics, we introduce a controlled evaluation set called HANS (Heuristic Analysis for NLI Systems), which contains many examples where the heuristics fail. We find that models trained on MNLI, including BERT, a state-of-the-art model, perform very poorly on HANS, suggesting that they have indeed adopted these heuristics. We conclude that there is substantial room for improvement in NLI systems, and that the HANS dataset can motivate and measure progress in this area.},
  added-at = {2021-01-20T09:27:20.000+0100},
  address = {Florence, Italy},
  author = {McCoy, Tom and Pavlick, Ellie and Linzen, Tal},
  biburl = {https://www.bibsonomy.org/bibtex/2e8e8f4e87b645639c4bc8997f4b1bd3f/parismic},
  booktitle = {Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics},
  description = {Right for the Wrong Reasons: Diagnosing Syntactic Heuristics in Natural Language Inference - ACL Anthology},
  doi = {10.18653/v1/P19-1334},
  interhash = {4623822271987e7e16f21cd2760ee061},
  intrahash = {e8e8f4e87b645639c4bc8997f4b1bd3f},
  keywords = {bert_performance dataset instituteclustering quality},
  month = jul,
  pages = {3428--3448},
  publisher = {Association for Computational Linguistics},
  timestamp = {2021-01-20T09:27:20.000+0100},
  title = {Right for the Wrong Reasons: Diagnosing Syntactic Heuristics in Natural Language Inference},
  url = {https://www.aclweb.org/anthology/P19-1334},
  year = 2019
}
