Damián Zanette Marcelo Montemurro
Abstract
We investigate the origin of Zipf's law for words in written texts by means of a stochastic dynamic model for text generation. The model incorporates both features related to the general structure of languages and memory effects inherent to the production of long coherent messages in the communication process. It is shown that the multiplicative dynamics of our model lead to rank-frequency distributions in quantitative agreement with empirical data. Our results give support to the linguistic relevance of Zipf's law in human language.
G. Stumme. Proc. 3rd Intl. Conf. on Formal Concept Analysis, volume 3403 of Lecture Notes in Computer Science, page 315-328. Heidelberg, Springer, (2005)
B. Berendt, A. Hotho, and G. Stumme. Proc. of the 1st Intl. Workshop on Representation and Analysis of Web Space, page 1--16. Technical University of Ostrava, (2005)