
Quantifying the Evaluation of Heuristic Methods for Textual Data Augmentation

Proceedings of the Sixth Workshop on Noisy User-generated Text (W-NUT 2020), pages 200–208. Online: Association for Computational Linguistics, November 2020
DOI: 10.18653/v1/2020.wnut-1.26

Abstract

Data augmentation has been shown to be effective in providing more training data for machine learning and in producing more robust classifiers. However, for some problems there may be multiple augmentation heuristics, and the choice of which one to use can significantly impact the success of training. In this work, we propose a metric for evaluating augmentation heuristics; specifically, we quantify the extent to which an example is "hard to distinguish" by considering the difference between the distributions of the augmented samples of different classes. Experiments with multiple heuristics on two prediction tasks (positive/negative sentiment and verbosity/conciseness) validate our claims by revealing the connection between the distribution difference of different classes and the classification accuracy.
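The abstract's core idea, measuring how far apart the augmented samples of different classes are, can be illustrated with a small sketch. The code below is a hedged approximation, not the paper's actual metric: it assumes token-frequency distributions as the class representation and Jensen-Shannon divergence as the distance, both of which are illustrative choices; the `aug_pos`/`aug_neg` samples are invented.

```python
import math
from collections import Counter

def token_distribution(texts):
    """Empirical unigram distribution over a list of texts."""
    counts = Counter(tok for t in texts for tok in t.lower().split())
    total = sum(counts.values())
    return {tok: c / total for tok, c in counts.items()}

def js_divergence(p, q):
    """Jensen-Shannon divergence (base 2, bounded in [0, 1])
    between two discrete distributions given as dicts."""
    def kl(a, b):
        return sum(a[k] * math.log2(a[k] / b[k]) for k in a if a[k] > 0)
    keys = set(p) | set(q)
    # tiny smoothing so every key has nonzero mass in both distributions
    p = {k: p.get(k, 1e-12) for k in keys}
    q = {k: q.get(k, 1e-12) for k in keys}
    m = {k: 0.5 * (p[k] + q[k]) for k in keys}
    return 0.5 * kl(p, m) + 0.5 * kl(q, m)

# Hypothetical augmented samples for the two sentiment classes.
aug_pos = ["great movie really great", "loved it"]
aug_neg = ["terrible movie really bad", "hated it"]

# Larger divergence = classes easier to distinguish after augmentation.
score = js_divergence(token_distribution(aug_pos),
                      token_distribution(aug_neg))
```

Under this reading, an augmentation heuristic whose output yields a small divergence produces examples that are "hard to distinguish" across classes, which the paper connects to classification accuracy.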

Description

Quantifying the Evaluation of Heuristic Methods for Textual Data Augmentation - ACL Anthology

Tags

community
