tag :: dataset | BibSonomy

bookmarks (hide)743
display
all
bookmarks only
bookmarks per page
5
10
20
50
100
sort by
added at
title
RSS
BibTeX
XML

1Monthly Twitter n-grams generated from a corpus of more than 2 billion English tweets (2013-2023)
https://zenodo.org/records/15736201
12 days ago by @jaeschke
show all tags
dataset
myown
n-gram
text
twitter
datasetmyownn-gramtexttwitter
(0)
copydelete
- community post
- history of this post
1Multimodal Emotion Dataset Friends MELD
https://affective-meld.github.io/
4 months ago by @topel
show all tags
dataset
emotion
datasetemotion
(0)
copydelete
- community post
- history of this post
1DailyDialog is a high-quality multi-turn open-domain English dialog dataset
https://paperswithcode.com/dataset/dailydialog
a year ago by @topel
show all tags
dataset
dataset
(0)
copydelete
- community post
- history of this post
1Présentation – Corpus d'Etude pour le Français Contemporain (CEFC)
https://repository.ortolang.fr/api/content/cefc-orfeo/4/documentation/site-orfeo/home/index.html
a year ago by @topel
show all tags
dataset
speech
datasetspeech
(0)
copydelete
- community post
- history of this post
1Annual Article Processing Charges (APCs) and number of gold and hybrid open access articles in Web of Science indexed journals published by Elsevier, Sage, Springer-Nature, Taylor & Francis and Wiley 2015-2018
https://zenodo.org/records/7086420
a year ago by @jaeschke
show all tags
academic
access
apc
dataset
oa
open
publishing
research
scholarly
academicaccessapcdatasetoaopenpublishingresearchscholarly
(0)
copydelete
- community post
- history of this post
1An annotated human blastocyst dataset to benchmark deep learning architectures for in vitro fertilization | Scientific Data
https://www.nature.com/articles/s41597-023-02182-3
a year ago by @scch
show all tags
annotated
architecture
benchmark
dataset
deep-learning
human
annotatedarchitecturebenchmarkdatasetdeep-learninghuman
(0)
copydelete
- community post
- history of this post
1Chelsa Climate – Climatologies at high resolution for the earth’s land surface areas
https://chelsa-climate.org/
2 years ago by @annakrause
show all tags
climate
dataset
free
highresolution
climatedatasetfreehighresolution
(0)
copydelete
- community post
- history of this post
1ClimateBench | Zenodo
https://zenodo.org/record/7064308
2 years ago by @annakrause
show all tags
benchmark
climate
dataset
machinelearning
benchmarkclimatedatasetmachinelearning
(0)
copydelete
- community post
- history of this post
1Unknown Data | Mining and consolidating research dataset metadata on the Web
https://unknowndataproject.github.io/
2 years ago by @astrupp
show all tags
crawl
data
dataset
datasets
web
crawldatadatasetdatasetsweb
(0)
copydelete
- community post
- history of this post
1MIT Flickr Audio Caption Corpus
https://groups.csail.mit.edu/sls/downloads/flickraudio/
2 years ago by @topel
show all tags
dataset
dataset
(0)
copydelete
- community post
- history of this post
1The Grid Audio-Visual Speech Corpus | Zenodo
The Grid Corpus is a large multitalker audiovisual sentence corpus designed to support joint computational-behavioral studies in speech perception. In brief, the corpus consists of high-quality audio and video (facial) recordings of 1000 sentences spoken by each of 34 talkers (18 male, 16 female), for a total of 34000 sentences. Sentences are of the form "put red at G9 now". audio_25k.zip contains the wav format utterances at a 25 kHz sampling rate in a separate directory per talker alignments.zip provides word-level time alignments, again separated by talker s1.zip, s2.zip etc contain .jpg videos for each talker [note that due to an oversight, no video for talker t21 is available] The Grid Corpus is described in detail in the paper jasagrid.pdf included in the dataset.
2 years ago by @topel
show all tags
dataset
dataset
(0)
copydelete
- community post
- history of this post
1GitHub - earthspecies/beans: BEANS: The Benchmark of Animal Sounds
https://github.com/earthspecies/beans
2 years ago by @topel
show all tags
dataset
dataset
(0)
copydelete
- community post
- history of this post
2Archive Team: The Twitter Stream Grab : Free Web : Free Download, Borrow and Streaming : Internet Archive
https://archive.org/details/twitterstream
3 years ago by @jaeschke
show all tags
archive
data
dataset
sample
stream
twitter
archivedatadatasetsamplestreamtwitter
(0)
copydelete
- community post
- history of this post
1TREC Washington Post Corpus
https://trec.nist.gov/data/wapost/
3 years ago by @jaeschke
show all tags
corpus
dataset
newspaper
text
trec
corpusdatasetnewspapertexttrec
(0)
copydelete
- community post
- history of this post
1Single Cell Perturbation Dataset Explorer
https://www.scperturb.org/
3 years ago by @becker
show all tags
atac
cell
dataset
datasets
perturbation
seq
single
ataccelldatasetdatasetsperturbationseqsingle
(0)
copydelete
- community post
- history of this post
1Veracity of schema.org for datasets (labeled data) | Kaggle
Dataset or Not? A study on the veracity of semantic markup for dataset pages.
3 years ago by @jaeschke
show all tags
data
dataset
google
kaggle
research
schema.org
unknowndata
datadatasetgooglekaggleresearchschema.orgunknowndata
(0)
copydelete
- community post
- history of this post
1Dataset Search: metadata for datasets | Kaggle
Datasets with DOIs and compact identifiers
3 years ago by @jaeschke
show all tags
data
dataset
doi
google
kaggle
research
search
unknowndata
datadatasetdoigooglekaggleresearchsearchunknowndata
(0)
copydelete
- community post
- history of this post
1Machine Learning-Friendly Biomedical Datasets for Equivalence and Subsumption Ontology Matching | Zenodo
The purpose of these datasets is to support equivalence and subsumption ontology matching. There are five ontology pairs extracted from MONDO and UMLS: Source Ontology Pair Category MONDO OMIM-ORDO Disease MONDO NCIT-DOID Disease UMLS SNOMED-FMA Body UMLS SNOMED-NCIT Pharm UMLS SNOMED-NCIT Neoplas Each pair is associated with three folders: "raw_data", "equiv_match", and "subs_match", corresponding to the downloaded source ontologies, the package for equivalence matching, and the package for subsumption matching. See detailed documentation at: https://krr-oxford.github.io/DeepOnto/#/om_resources. See the incoming OAEI Bio-ML track at: https://www.cs.ox.ac.uk/isg/projects/ConCur/oaei/. See our resource paper at: https://arxiv.org/abs/2205.03447.
3 years ago by @hangdong
show all tags
dataset
iswc
machine_learning
myown
oaei
om
ontology
ontology_matching
datasetiswcmachine_learningmyownoaeiomontologyontology_matching
(0)
copydelete
- community post
- history of this post
1Data | Copernicus Marine
https://resources.marine.copernicus.eu/product-detail/GLOBAL_MULTIYEAR_WAV_001_032/INFORMATION
3 years ago by @annakrause
show all tags
dataset
neuralpde
ocean
datasetneuralpdeocean
(0)
copydelete
- community post
- history of this post
1Whole-cell segmentation of tissue images with human-level performance using large-scale data annotation and deep learning | Nature Biotechnology
https://www.nature.com/articles/s41587-021-01094-0
3 years ago by @becker
show all tags
large
dataset
tissue
single
cell
different
technologies
largedatasettissuesinglecelldifferenttechnologies
(0)
copydelete
- community post
- history of this post

⟨⟨
⟨
1
2
3
⟩
⟩⟩

publications (hide)423
display
all
publications only
publications per page
5
10
20
50
100
sort by
added at
title
author
publication date
entry type
help for advanced sorting...
RSS
BibTeX
RDF
more...

1LAI (Indirect): Light Wand - KSU (FIFE)
T. Shah, and E. Kanemasu. (1994)Data set. Available on-line http://www.daac.ornl.gov from Oak Ridge National Laboratory Distributed Active Archive Center, Oak Ridge, Tennessee, U.S.A. doi:10.3334/ORNLDAAC/43. Also published in D. E. Strebel, D. R. Landis, K. F. Huemmrich, and B. W. Meeson (eds.), Collected Data of the First ISLSCP Field Experiment, Vol. 1: Surface Observations and Non-Image Data Sets. CD-ROM. National Aeronautics and Space Administration, Goddard Space Flight Center, Greenbelt, Maryland, U.S.A. (available from http://www.daac.ornl.gov)..
6 years ago by @karinawilliams
show all tags
dataset
fife
datasetfife
(0)
copydeleteadd this publication to your clipboard
1Canopy Photosynthesis Rates (FIFE)
G. Asrar, and P. Sellers. (1994)Data set . Available on-line http://www.daac.ornl.gov from Oak Ridge National Laboratory Distributed Active Archive Center, Oak Ridge, Tennessee, U.S.A. doi:10.3334/ORNLDAAC/27. Also published in D. E. Strebel, D. R. Landis, K. F. Huemmrich, and B. W. Meeson (eds.), Collected Data of the First ISLSCP Field Experiment, Vol. 1: Surface Observations and Non-Image Data Sets. CD-ROM. National Aeronautics and Space Administration, Goddard Space Flight Center, Greenbelt, Maryland, U.S.A. (available from http://www.daac.ornl.gov)..
6 years ago by @karinawilliams
show all tags
dataset
fife
datasetfife
(0)
copydeleteadd this publication to your clipboard
1Soil Water Properties
P. Sellers, and K. Huemmrich. (1994)Derived Data (FIFE). Data set. Available on-line http://www.daac.ornl.gov from Oak Ridge National Laboratory Distributed Active Archive Center, Oak Ridge, Tennessee, U.S.A. doi:10.3334/ORNLDAAC/117. Also published in D. E. Strebel, D. R. Landis, K. F. Huemmrich, and B. W. Meeson (eds.), Collected Data of the First ISLSCP Field Experiment, Vol. 1: Surface Observations and Non-Image Data Sets. CD-ROM. National Aeronautics and Space Administration, Goddard Space Flight Center, Greenbelt, Maryland, U.S.A. (available from http://www.daac.ornl.gov)..
6 years ago by @karinawilliams
show all tags
dataset
fife
datasetfife
(0)
copydeleteadd this publication to your clipboard
1Soil CO2 Flux Data (FIFE)
J. Norman. (1994)Data set. Available on-line http://www.daac.ornl.gov from Oak Ridge National Laboratory Distributed Active Archive Center, Oak Ridge, Tennessee, U.S.A. doi:10.3334/ORNLDAAC/105. Also published in D. E. Strebel, D. R. Landis, K. F. Huemmrich, and B. W. Meeson (eds.), Collected Data of the First ISLSCP Field Experiment, Vol. 1: Surface Observations and Non-Image Data Sets. CD-ROM. National Aeronautics and Space Administration, Goddard Space Flight Center, Greenbelt, Maryland, U.S.A. (available from http://www.daac.ornl.gov)..
6 years ago by @karinawilliams
show all tags
dataset
fife
datasetfife
(0)
copydeleteadd this publication to your clipboard
1Soil Moisture Neutron Probe Data (FIFE)
E. Kanemasu. (1994)Data set. Available on-line http://www.daac.ornl.gov from Oak Ridge National Laboratory Distributed Active Archive Center, Oak Ridge, Tennessee, U.S.A. doi:10.3334/ORNLDAAC/111. Also published in D. E. Strebel, D. R. Landis, K. F. Huemmrich, and B. W. Meeson (eds.), Collected Data of the First ISLSCP Field Experiment, Vol. 1: Surface Observations and Non-Image Data Sets. CD-ROM. National Aeronautics and Space Administration, Goddard Space Flight Center, Greenbelt, Maryland, U.S.A. (available from http://www.daac.ornl.gov)..
6 years ago by @karinawilliams
show all tags
dataset
fife
datasetfife
(0)
copydeleteadd this publication to your clipboard
1Eddy Correlation. Surface Flux: UNL (FIFE)
S. Verma. (1994)Data set. Available on-line http://www.daac.ornl.gov from Oak Ridge National Laboratory Distributed Active Archive Center, Oak Ridge, Tennessee, U.S.A. doi:10.3334/ORNLDAAC/33. Also published in D. E. Strebel, D. R. Landis, K. F. Huemmrich, and B. W. Meeson (eds.), Collected Data of the First ISLSCP Field Experiment, Vol. 1: Surface Observations and Non-Image Data Sets. CD-ROM. National Aeronautics and Space Administration, Goddard Space Flight Center, Greenbelt, Maryland, U.S.A. (available from http://www.daac.ornl.gov)..
6 years ago by @karinawilliams
show all tags
dataset
fife
datasetfife
(0)
copydeleteadd this publication to your clipboard
1Site Averaged AMS Data: 1987 (Betts)
A. Betts. (1994)Data set. Available on-line http://www.daac.ornl.gov from Oak Ridge National Laboratory Distributed Active Archive Center, Oak Ridge, Tennessee, U.S.A. doi:10.3334/ORNLDAAC/88..
6 years ago by @karinawilliams
show all tags
dataset
fife
datasetfife
(0)
copydeleteadd this publication to your clipboard
8DBLP: Some Lessons Learned
M. Ley. Proc. VLDB Endow., 2 (2): 1493--1500 (August 2009)
6 years ago by @tobias.koopmann
show all tags
authortrails
dataset
dblp
publication
authortrailsdatasetdblppublication
(1)
copydeleteadd this publication to your clipboard
5Construction of the Literature Graph in Semantic Scholar.
W. Ammar, D. Groeneveld, C. Bhagavatula, I. Beltagy, M. Crawford, D. Downey, J. Dunkelberger, A. Elgohary, S. Feldman, V. Ha and 13 other author(s). CoRR, (2018)
6 years ago by @tobias.koopmann
show all tags
authortrails
dataset
publications
authortrailsdatasetpublications
(0)
copydeleteadd this publication to your clipboard
2Detection of Anomalies in Large Scale Accounting Data using Deep Autoencoder Networks.
M. Schreyer, T. Sattarov, D. Borth, A. Dengel, and B. Reimer. CoRR, (2017)
6 years ago by @tritsch
show all tags
anomaly
dataset
erp
fraud
synthetic_data
anomalydataseterpfraudsynthetic_data
(0)
copydeleteadd this publication to your clipboard
3Benchmark of Deep Learning Models on Large Healthcare MIMIC Datasets.
S. Purushotham, C. Meng, Z. Che, and Y. Liu. CoRR, (2017)
6 years ago by @nosebrain
show all tags
dataset
deep
health
learning
mimic
numeric
datasetdeephealthlearningmimicnumeric
(0)
copydeleteadd this publication to your clipboard
9Microsoft COCO: Common Objects in Context
T. Lin, M. Maire, S. Belongie, L. Bourdev, R. Girshick, J. Hays, P. Perona, D. Ramanan, C. Zitnick, and P. Dollár. (2014)cite arxiv:1405.0312Comment: 1) updated annotation pipeline description and figures; 2) added new section describing datasets splits; 3) updated author list.
6 years ago by @jannikd
show all tags
boundingbox
dataset
mask
occlusionprob
boundingboxdatasetmaskocclusionprob
(0)
copydeleteadd this publication to your clipboard
6Placing search in context: The concept revisited
L. Finkelstein, E. Gabrilovich, Y. Matias, E. Rivlin, Z. Solan, G. Wolfman, and E. Ruppin. Proceedings of the 10th international conference on World Wide Web, page 406--414. ACM, (2001)
6 years ago by @h_hashimoto
show all tags
WordSim-353
dataset
similarity
word_semantics
WordSim-353datasetsimilarityword_semantics
(0)
copydeleteadd this publication to your clipboard
1Generalisation in humans and deep neural networks
R. Geirhos, C. Temme, J. Rauber, H. Schütt, M. Bethge, and F. Wichmann. Advances in Neural Information Processing Systems, page 7549--7561. (2018)
7 years ago by @loroch
show all tags
dataset
deep_learning
human_level
noise
robustness
datasetdeep_learninghuman_levelnoiserobustness
(0)
copydeleteadd this publication to your clipboard
2Are we ready for autonomous driving? the kitti vision benchmark suite
A. Geiger, P. Lenz, and R. Urtasun. Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on, page 3354--3361. IEEE, (2012)
7 years ago by @analyst
show all tags
2012
autonomous
dataset
2012autonomousdataset
(0)
copydeleteadd this publication to your clipboard
3Fact checking: Task definition and dataset construction
A. Vlachos, and S. Riedel. Proceedings of the ACL 2014 Workshop on Language Technologies and Computational Social Science, page 18--22. (2014)
7 years ago by @thoni
show all tags
factchecking
definition
dataset
citedby:scholar:count:53
citedby:scholar:timestamp:2018-12-13
factcheckingdefinitiondatasetcitedby:scholar:count:53citedby:scholar:timestamp:2018-12-13
(0)
copydeleteadd this publication to your clipboard
1Classification Technique for Predicting Learning Behavior of Student in Higher Education
M. Desai. International Journal of Trend in Scientific Research and Development, (October 2018)
7 years ago by @ijtsrd
show all tags
Business
Data
Dataset
Economics
Machine
Mining
Supervised
Training
Unsupervised
learning
BusinessDataDatasetEconomicsMachineMiningSupervisedTrainingUnsupervisedlearning
(0)
copydeleteadd this publication to your clipboard
1Antibiotics, gut bugs and the young
H. Browne. Nature Reviews Microbiology, 14 (6): 336 (May 3, 2016)
7 years ago by @karthikraman
show all tags
antibiotics
dataset
gut-microbiome
antibioticsdatasetgut-microbiome
(0)
copydeleteadd this publication to your clipboard
1Microbial Community Assembly And Metabolite Profile Of The Gut Microbiome In Extremely Low Birthweight Infants
S. Wandro, S. Osborne, C. Enriquez, C. Bixby, A. Arrieta, and K. Whiteson. bioRxiv, (Apr 10, 2017)
7 years ago by @karthikraman
show all tags
communities
dataset
gut-microbiome
communitiesdatasetgut-microbiome
(0)
copydeleteadd this publication to your clipboard
3Shotgun Metagenomics of 250 Adult Twins Reveals Genetic and Environmental Impacts on the Gut Microbiome
H. Xie, R. Guo, H. Zhong, Q. Feng, Z. Lan, B. Qin, K. Ward, M. Jackson, Y. Xia, X. Chen and 15 other author(s). Cell Systems, 3 (6): 572--584.e3 (December 2016)
7 years ago by @karthikraman
show all tags
dataset
gut-microbiome
metagenomics
datasetgut-microbiomemetagenomics
(0)
copydeleteadd this publication to your clipboard