We propose a novel attention network for document annotation with user-generated tags. The network is designed according to human reading and annotation behaviour. Usually, users first digest the title to obtain a rough idea of the topic, and then read the content of the document. Existing research shows that title metadata can largely affect social annotation. To better utilise this information, we design a framework that separates the title from the content of a document and applies a title-guided attention mechanism over each sentence in the content. We also propose two semantic-based loss regularisers that encourage the output of the network to conform to label semantics, i.e. similarity and subsumption. We analyse each part of the proposed system with two real-world open datasets on publication and question annotation. The integrated approach, Joint Multi-label Attention Network (JMAN), significantly outperformed the Bidirectional Gated Recurrent Unit (Bi-GRU) by around 13%-26% and the Hierarchical Attention Network (HAN) by around 4%-12% on both datasets, with around 10%-30% reduction of training time.
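The two ideas in the abstract, title-guided attention over content sentences and a similarity-based loss regulariser, can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes plain dot-product scoring between a title vector and each sentence vector, and a simple pairwise penalty on predicted label probabilities. The function names (`title_guided_attention`, `similarity_penalty`) and the penalty's exact form are hypothetical, chosen only for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def title_guided_attention(sentences, title):
    """Weight each sentence vector by its relevance to the title vector.

    Assumption: relevance is a plain dot product; a trained model would
    typically use learned projections instead.
    """
    scores = [sum(a * b for a, b in zip(s, title)) for s in sentences]
    weights = softmax(scores)
    dim = len(title)
    # Content representation: attention-weighted sum of sentence vectors.
    context = [
        sum(w * s[i] for w, s in zip(weights, sentences)) for i in range(dim)
    ]
    return weights, context


def similarity_penalty(probs, sim):
    """Hypothetical similarity regulariser.

    Penalises different predicted probabilities for labels that the
    matrix `sim` marks as similar (sim[j][k] >= 0).
    """
    total = 0.0
    for j in range(len(probs)):
        for k in range(len(probs)):
            total += sim[j][k] * (probs[j] - probs[k]) ** 2
    return 0.5 * total


# Toy usage: two sentence vectors, a title aligned with the first one.
weights, context = title_guided_attention(
    [[1.0, 0.0], [0.0, 1.0]], [2.0, 0.0]
)
```

In this toy setting the sentence closer to the title receives the larger weight, and the penalty is zero whenever similar labels receive equal probabilities. A subsumption regulariser would need a directed parent-child relation between labels and is not sketched here.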
H. Dong, W. Wang, K. Huang, and F. Coenen. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pages 1348--1354. Minneapolis, Minnesota, Association for Computational Linguistics, (June 2019)