Mizan is an advanced clone to Google’s graph processing system Pregel that utilizes online graph vertex migrations to dynamically optimizes the execution of graph algorithms
This page provides a large hyperlink graph for public download. The graph has been extracted from the Common Crawl 2012 web corpus and covers 3.5 billion web pages and 128 billion hyperlinks between these pages. To the best of our knowledge, this graph is the largest hyperlink graph that is available to the public outside companies such as Google, Yahoo, and Microsoft. Below we provide instructions on how to download the graph as well as basic statistics about its topology.
X. Wang, und M. Zhang. Proceedings of the 39th International Conference on Machine Learning, Volume 162 von Proceedings of Machine Learning Research, Seite 23341--23362. PMLR, (17--23 Jul 2022)
H. Nguyen, N. Nguyen, H. Doan, Z. Ahmadi, T. Doan, und L. Jiang. Proceedings of the ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, New York, NY, USA, Association for Computing Machinery, (November 2022)
Y. Yang, C. Huang, L. Xia, und C. Li. Proceedings of the 45th international ACM SIGIR conference on research and development in information retrieval, Seite 1434--1443. (2022)
J. Feng, Y. Chen, F. Li, A. Sarkar, und M. Zhang. Advances in Neural Information Processing Systems, 35, Seite 4776--4790. Curran Associates, Inc., (2022)
X. Liu, T. Zhu, H. Tan, und R. Zhang. The Semantic Web--ISWC 2022: 21st International Semantic Web Conference, Virtual Event, October 23--27, 2022, Proceedings, Seite 284--302. Springer, (2022)
I. Hubner, E. Deeds, und E. Shakhnovich. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 103 (47):
17747-17752(November 2006)