Abstract
We present the Open Graph Benchmark (OGB), a diverse set of challenging and
realistic benchmark datasets to facilitate scalable, robust, and reproducible
graph machine learning (ML) research. OGB datasets are large-scale, encompass
multiple important graph ML tasks, and cover a diverse range of domains, from
social and information networks to biological networks, molecular graphs,
source code ASTs, and knowledge graphs. For each dataset, we provide a
unified evaluation protocol using meaningful application-specific data splits
and evaluation metrics. In addition to building the datasets, we also perform
extensive benchmark experiments for each dataset. Our experiments suggest that
OGB datasets present significant challenges of scalability to large-scale
graphs and out-of-distribution generalization under realistic data splits,
indicating fruitful opportunities for future research. Finally, OGB provides an
automated end-to-end graph ML pipeline that simplifies and standardizes the
process of graph data loading, experimental setup, and model evaluation. OGB
will be regularly updated and welcomes input from the community. OGB datasets
as well as data loaders, evaluation scripts, baseline code, and leaderboards
are publicly available at https://ogb.stanford.edu .