Kopieren Löschen Diese Publikation zur Ablage hinzufügen
Community-Eintrag
Versionsverlauf dieses Eintrags
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

The Unreasonable Effectiveness of Data

A. Halevy, P. Norvig, und F. Pereira. IEEE Intelligent Systems, 24 (2): 8-12 (2009)
DOI: 10.1109/MIS.2009.36

Zusammenfassung

At Brown University, there is excitement of having access to the Brown Corpus, containing one million English words. Since then, we have seen several notable corpora that are about 100 times larger, and in 2006, Google released a trillion-word corpus with frequency counts for all sequences up to five words long. In some ways this corpus is a step backwards from the Brown Corpus: it's taken from unfiltered Web pages and thus contains incomplete sentences, spelling errors, grammatical errors, and all sorts of other errors. It's not annotated with carefully hand-corrected part-of-speech tags. But the fact that it's a million times larger than the Brown Corpus outweighs these drawbacks. A trillion-word corpus - along with other Web-derived corpora of millions, billions, or trillions of links, videos, images, tables, and user interactions - captures even very rare aspects of human behavior. So, this corpus could serve as the basis of a complete model for certain tasks - if only we knew how to extract the model from the data.

Links und Ressourcen

BibTeX-Schlüssel: HalevyNorvigPereira09intelligent
Eintragstyp: article
Jahr: 2009
Zeitschrift: IEEE Intelligent Systems
Nummer: 2
Seiten: 8-12
Band: 24
file: IEEE Digital Library:2009/HalevyNorvigPereira09intelligent.pdf:PDF
issn: 1541-1672
groups: public
intrahash: ea313c2efcaa4b17e5bed2b11693abfd
DOI: 10.1109/MIS.2009.36
timestamp: 2010.10.29
username: flint63

@flint63s Tags hervorgehoben

Zitieren Sie diese Publikation

Suchen auf

Metadaten

Zuletzt geändert vor 6 Jahren
Erstellt vor 12 Jahren

Kommentare und Rezensionen
(0)

Es gibt bisher keine Rezension oder Kommentar. Sie können eine schreiben!

BibSonomy

Kopieren Löschen Diese Publikation zur Ablage hinzufügen
Community-Eintrag
Versionsverlauf dieses Eintrags
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

The Unreasonable Effectiveness of Data

Zusammenfassung

Links und Ressourcen

Tags

Community

Zitieren Sie diese Publikation

Mehr Zitationsstile

Suchen auf

Metadaten

Kommentare und Rezensionen
(0)

BibSonomy

KopierenLöschenDiese Publikation zur Ablage hinzufügenCommunity-EintragVersionsverlauf dieses EintragsURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML The Unreasonable Effectiveness of Data

Zusammenfassung

Links und Ressourcen

Tags

Community

Zitieren Sie diese Publikation

Mehr Zitationsstile

Suchen auf

Metadaten

Kommentare und Rezensionen (0)

Kopieren Löschen Diese Publikation zur Ablage hinzufügen
Community-Eintrag
Versionsverlauf dieses Eintrags
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

The Unreasonable Effectiveness of Data

Kommentare und Rezensionen
(0)