Investigating the Stability of Concrete Nouns in Word Embeddings
B. Pierrejean, and L. Tanguy. Proceedings of the 13th International Conference on Computational Semantics-Short Papers, page 6. (2019)
Abstract
We know that word embeddings trained using neural-based methods (such as word2vec SGNS) are sensitive to stability problems and that across two models trained using the exact same set of parameters, the nearest neighbors of a word are likely to change. All words are not equally impacted by this internal instability and recent studies have investigated features influencing the stability of word embeddings. This stability can be seen as a clue for the reliability of the semantic representation of a word. In this work, we investigate the influence of the degree of concreteness of nouns on the stability of their semantic representation. We show that for English generic corpora, abstract words are more affected by stability problems than concrete words. We also found that to a certain extent, the difference between the degree of concreteness of a noun and its nearest neighbors can partly explain the stability or instability of its neighbors.
Proceedings of the 13th International Conference on Computational Semantics-Short Papers
year
2019
pages
6
file
Pierrejean, Tanguy - Investigating the Stability of Concrete Nouns in Word Embeddings.pdf:C\:\\Users\\Admin\\Documents\\Research\\_Paperbase\\Word Embeddings\\Pierrejean, Tanguy - Investigating the Stability of Concrete Nouns in Word Embeddings.pdf:application/pdf
%0 Conference Paper
%1 pierrejean_investigating_2019
%A Pierrejean, Benedicte
%A Tanguy, Ludovic
%B Proceedings of the 13th International Conference on Computational Semantics-Short Papers
%D 2019
%K Embedding_Variability Word_Embeddings
%P 6
%T Investigating the Stability of Concrete Nouns in Word Embeddings
%X We know that word embeddings trained using neural-based methods (such as word2vec SGNS) are sensitive to stability problems and that across two models trained using the exact same set of parameters, the nearest neighbors of a word are likely to change. All words are not equally impacted by this internal instability and recent studies have investigated features influencing the stability of word embeddings. This stability can be seen as a clue for the reliability of the semantic representation of a word. In this work, we investigate the influence of the degree of concreteness of nouns on the stability of their semantic representation. We show that for English generic corpora, abstract words are more affected by stability problems than concrete words. We also found that to a certain extent, the difference between the degree of concreteness of a noun and its nearest neighbors can partly explain the stability or instability of its neighbors.
@inproceedings{pierrejean_investigating_2019,
abstract = {We know that word embeddings trained using neural-based methods (such as word2vec SGNS) are sensitive to stability problems and that across two models trained using the exact same set of parameters, the nearest neighbors of a word are likely to change. All words are not equally impacted by this internal instability and recent studies have investigated features influencing the stability of word embeddings. This stability can be seen as a clue for the reliability of the semantic representation of a word. In this work, we investigate the influence of the degree of concreteness of nouns on the stability of their semantic representation. We show that for English generic corpora, abstract words are more affected by stability problems than concrete words. We also found that to a certain extent, the difference between the degree of concreteness of a noun and its nearest neighbors can partly explain the stability or instability of its neighbors.},
added-at = {2020-02-21T16:09:44.000+0100},
author = {Pierrejean, Benedicte and Tanguy, Ludovic},
biburl = {https://www.bibsonomy.org/bibtex/2b577e27ca49ee856007c708ea1482438/tschumacher},
booktitle = {Proceedings of the 13th {International} {Conference} on {Computational} {Semantics}-{Short} {Papers}},
file = {Pierrejean, Tanguy - Investigating the Stability of Concrete Nouns in Word Embeddings.pdf:C\:\\Users\\Admin\\Documents\\Research\\_Paperbase\\Word Embeddings\\Pierrejean, Tanguy - Investigating the Stability of Concrete Nouns in Word Embeddings.pdf:application/pdf},
interhash = {0ac2a46052c7f4c6c18daebab1d652de},
intrahash = {b577e27ca49ee856007c708ea1482438},
keywords = {Embedding_Variability Word_Embeddings},
language = {en},
pages = 6,
timestamp = {2020-02-21T16:09:44.000+0100},
title = {Investigating the {Stability} of {Concrete} {Nouns} in {Word} {Embeddings}},
year = 2019
}