Are Fictional Voices Distinguishable? Classifying Character Voices in Modern Drama

Proceedings of the 3rd Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, pages 29-34. Minneapolis, USA, Association for Computational Linguistics (June 2019).
DOI: 10.18653/v1/W19-2504

Abstract

According to the literary theory of Mikhail Bakhtin, a dialogic novel is one in which characters speak in their own distinct voices, rather than serving as mouthpieces for their authors. We use text classification to determine which authors best achieve dialogism, looking at a corpus of plays from the late nineteenth and early twentieth centuries. We find that the SAGE model of text generation, which highlights deviations from a background lexical distribution, is an effective method of weighting the words of characters' utterances. Our results show that it is indeed possible to distinguish characters by their speech in the plays of canonical writers such as George Bernard Shaw, whereas characters are clustered more closely in the works of lesser-known playwrights.
