The paper discusses the capabilities of large pre-trained language models and their limitations in accessing and manipulating knowledge. The authors introduce retrieval-augmented generation (RAG) models that combine pre-trained parametric and non-parametric memory for language generation. The study explores the effectiveness of RAG models in various NLP tasks and compares them with other architectures.
arXiv is a free distribution service and an open-access archive for 2,316,761 scholarly articles in the fields of physics, mathematics, computer science, quantitative biology, quantitative finance, statistics, electrical engineering and systems science, and economics. Materials on this site are not peer-reviewed by arXiv.
H. Neilson, L. Rousseau-Nepton, S. Lawler, and K. Spekkens. (2019). arXiv:1910.02976. Comment: 11 pages. Community paper submitted to the Canadian Long Range Plan 2020, https://casca.ca/?page_id=11499lrp2020/.
A. Cimatti, F. Fraternali, and C. Nipoti. (2019). arXiv:1912.06216. Comment: 17 pages, 3 figures, first introductory chapter of the textbook published by Cambridge University Press. For more information, see https://decdb4ae-c884-4971-9114-5f11b6929fd9.filesusr.com/ugd/f44359_26d2207ea96e4f359636feb5b7473336.pdf.
H. Tajima and F. Fujisawa. (2020). arXiv:2007.00926. Comment: 6 pages, 5 figures, accepted by Scientific and Educational Reports of the Faculty of Science and Technology, Kochi University.
M. Lindvall and J. Molin. (2020). arXiv:2001.07455. Comment: Accepted for presentation in poster format for the ACM CHI'19 Workshop <Emerging Perspectives in Human-Centered Machine Learning>.
A. Slivkins. (2019). arXiv:1904.07272. Comment: The manuscript is complete, but comments are very welcome! To be published with Foundations and Trends in Machine Learning.
A. Alemi and I. Fischer. (2018). arXiv:1807.04162. Comment: Presented at the ICML 2018 workshop on Theoretical Foundations and Applications of Deep Generative Models.
S. Chaplick, H. Förster, M. Kryven, and A. Wolff. (2019). arXiv:1907.08121. Comment: Appears in the Proceedings of the 27th International Symposium on Graph Drawing and Network Visualization (GD 2019).
P. Angelini, S. Chaplick, S. Cornelsen, G. Da Lozzo, and V. Roselli. (2019). arXiv:1903.07595. Comment: Extended version of "Morphing Contact Representations of Graphs", to appear in Proceedings of the 35th International Symposium on Computational Geometry (SoCG 2019).
F. Sultana, A. Sufian, and P. Dutta. (2019). arXiv:1905.01614. Comment: 7 pages, 10 figures, 1 table. Submitted to the 2nd International Conference on Communication, Devices and Computing (ICCDC 2019).
A. Chéritat. (2014). arXiv:1410.4417. Comment: 16 pages, 7 figures. This version has the following changes: Added computer generated images of the key positions S1 and S2. Corrected several minor mistakes. Corrected the proof of the main proposition (I had forgotten to ensure that the top and bottom curves remain embedded during the homotopy) and slightly changed the statement of Lemma 3 to adapt.