Engineer friends often ask me: Graph Deep Learning sounds great, but are there any big commercial success stories? Is it being deployed in practical applications? Besides the obvious ones–recommendation systems at Pinterest, Alibaba and Twitter–a slightly nuanced success story is the Transformer architecture, which has taken the NLP industry by storm. Through this post, I want to establish links between Graph Neural Networks (GNNs) and Transformers. I’ll talk about the intuitions behind model architectures in the NLP and GNN communities, make connections using equations and figures, and discuss how we could work together to drive progress.
S. Wang, L. Hu, Y. Wang, X. He, Q. Sheng, M. Orgun, L. Cao, F. Ricci, and P. Yu. (2021)cite arxiv:2105.06339Comment: Accepted by IJCAI 2021 Survey Track, copyright is owned to IJCAI. The first systematic survey on graph learning based recommender systems. arXiv admin note: text overlap with arXiv:2004.11718.
A. Dargahi Nobari, N. Reshadatmand, and M. Neshati. Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, page 2035–2038. New York, NY, USA, Association for Computing Machinery, (2017)
D. Gibson, J. Kleinberg, and P. Raghavan. Proceedings of the ninth ACM conference on Hypertext and hypermedia : links, objects, time and space---structure in hypermedia systems links, objects, time and space---structure in hypermedia systems - HYPERTEXT \textquotesingle98, ACM Press, (1998)
N. Peng, H. Poon, C. Quirk, K. Toutanova, and W. Yih. ACL, (2017)cite arxiv:1708.03743Comment: Conditional accepted by TACL in December 2016; published in April 2017; presented at ACL in August 2017.
J. Leskovec, and C. Faloutsos. Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining, page 631--636. ACM, (2006)
S. Shankar, G. Rajendra, K. Ashok, and G. Nanaso. International Journal on Recent and Innovation Trends in Computing and Communication, 3 (3):
1361--1366(March 2015)
L. Satheesh, P. Prabhakar, and A. P. International Journal on Recent and Innovation Trends in Computing and Communication, 3 (1):
394--400(January 2015)
H. Aher, P. Shirode, K. Shinde, and A. Jadhav. International Journal on Recent and Innovation Trends in Computing and Communication, 3 (1):
46--51(January 2015)
J. Yang, and J. Leskovec. Proceedings of the Sixth ACM International Conference on Web Search and Data Mining, page 587--596. New York, NY, USA, ACM, (2013)
T. Zesch, and I. Gurevych. Proceedings of the TextGraphs-2 Workshop (NAACL-HLT), page 1--8. Rochester, Association for Computational Linguistics, (April 2007)
B. Pereira Nunes, R. Kawase, S. Dietze, D. Taibi, M. Casanova, and W. Nejdl. Proceedings of the Web of Linked Entities Workshop in conjuction with the 11th International Semantic Web Conference, volume 906 of CEUR-WS.org, page 45--57. (November 2012)
B. Pereira Nunes, R. Kawase, S. Dietze, D. Taibi, M. Casanova, and W. Nejdl. Proceedings of the Web of Linked Entities Workshop in conjuction with the 11th International Semantic Web Conference, volume 906 of CEUR-WS.org, page 45--57. (November 2012)
B. Keegan, D. Gergle, and N. Contractor. Proceedings of the ACM 2012 conference on Computer Supported Cooperative Work, page 427--436. New York, NY, USA, ACM, (2012)