group :: bibliothek2.0

Lesezeichen (verstecken)4
Anzeige
alles
nur Lesezeichen
Lesezeichen pro Seite
5
10
20
50
100
sortieren nach
hinzugefügt am
Titel
RSS
BibTeX
XML

1Going Grey? Comparing the OCR Accuracy Levels of Bitonal and Greyscale Images
Newspaper collections are the subject of an increasing number of large-scale digitisation projects. In Papers Past (http://paperspast.natlib.govt.nz), a collection of over a million newspaper pages, the introduction of full-text search has made a wealth of information findable that was previously hidden. The search feature is dependent on text extracted from the newspaper page images with Optical Character Recognition (OCR), so any improvement in OCR accuracy will add value to the collection by improving our users' chances of finding useful information.
vor 16 Jahren von @zeitungsportal
alle anzeigen
newspaper
newzealand
digitization
accuracy
ocr
article
grey
newspapernewzealanddigitizationaccuracyocrarticlegrey
KopierenLöschen
- Community-Eintrag
- Versionsverlauf dieses Eintrags
1How Good Can It Get? Analysing and Improving OCR Accuracy in Large Scale Historic Newspaper Digitisation Programs
This article details the work undertaken by the National Library of Australia Newspaper Digitisation Program on identifying and testing solutions to improve OCR accuracy in large scale newspaper digitisation programs. In 2007 and 2008 several different solutions were identified, applied and tested on digitised material now available in the Australian Newspapers Digitisation Program beta service <http://ndpbeta.nla.gov.au/ndp/del/home>. This article gives a state of the art overview of how OCR software works on newspapers, factors that effect OCR accuracy, methods of measuring accuracy, methods of improving accuracy, and testing methods and results for specific solutions that were considered viable for large scale text digitisation projects.
vor 16 Jahren von @zeitungsportal
alle anzeigen
newspaper
digitization
australia
ocr
article
newspaperdigitizationaustraliaocrarticle
KopierenLöschen
- Community-Eintrag
- Versionsverlauf dieses Eintrags
1Apex CoVantage | IZAAC Demonstration
http://apexcovantage.com/izaac_demo.aspx
vor 16 Jahren von @zeitungsportal
alle anzeigen
newspaper
commercial
izaac
layout
software
video
ocr
structure
newspapercommercializaaclayoutsoftwarevideoocrstructure
KopierenLöschen
- Community-Eintrag
- Versionsverlauf dieses Eintrags
2A Framework for Text Processing and Supporting Access to Collections of Digitized Historical Newspapers
Large quantities of historical newspapers are being digitized and OCRd. We describe a framework for processing the OCRd text to identify articles and extract metadata for them. We describe the article schema and provide examples of features that facilitate automatic indexing of them. For this processing, we employ lexical semantics, structural models, and community content. Furthermore, we describe visualization and summarization techniques that can be used to present the extracted events.
vor 16 Jahren von @zeitungsportal
alle anzeigen
identification
newspaper
layout
metadata
digitization
mets
ocr
article
structure
identificationnewspaperlayoutmetadatadigitizationmetsocrarticlestructure
KopierenLöschen
- Community-Eintrag
- Versionsverlauf dieses Eintrags

⟨⟨
⟨
1
⟩
⟩⟩

Publikationen (verstecken)
Anzeige
alles
nur Publikationen
Publikationen pro Seite
5
10
20
50
100
sortieren nach
hinzugefügt am
Titel
Autor
Erscheinungsdatum
Eintragstyp
Hilfe für erweiterte Sortierung...
RSS
BibTeX
RDF
mehr...

Keine Treffer.

⟨⟨
⟨
⟩
⟩⟩

BibSonomy

Lesezeichen (verstecken)4
Anzeige
alles
nur Lesezeichen
Lesezeichen pro Seite
5
10
20
50
100
sortieren nach
hinzugefügt am
Titel
RSS
BibTeX
XML

1Going Grey? Comparing the OCR Accuracy Levels of Bitonal and Greyscale Images

1How Good Can It Get? Analysing and Improving OCR Accuracy in Large Scale Historic Newspaper Digitisation Programs

1Apex CoVantage | IZAAC Demonstration

2A Framework for Text Processing and Supporting Access to Collections of Digitized Historical Newspapers

Publikationen (verstecken)
Anzeige
alles
nur Publikationen
Publikationen pro Seite
5
10
20
50
100
sortieren nach
hinzugefügt am
Titel
Autor
Erscheinungsdatum
Eintragstyp
Hilfe für erweiterte Sortierung...
RSS
BibTeX
RDF
mehr...

Bibliothek 2.0

Stöbern

Verwandte Tags

Tags

BibSonomy

Lesezeichen (verstecken)4 Anzeigeallesnur LesezeichenLesezeichen pro Seite5102050100 sortieren nachhinzugefügt amTitel RSSBibTeXXML

1Going Grey? Comparing the OCR Accuracy Levels of Bitonal and Greyscale Images

1How Good Can It Get? Analysing and Improving OCR Accuracy in Large Scale Historic Newspaper Digitisation Programs

1Apex CoVantage | IZAAC Demonstration

2A Framework for Text Processing and Supporting Access to Collections of Digitized Historical Newspapers

Publikationen (verstecken) Anzeigeallesnur PublikationenPublikationen pro Seite5102050100 sortieren nachhinzugefügt amTitelAutorErscheinungsdatumEintragstypHilfe für erweiterte Sortierung... RSSBibTeXRDFmehr...

Bibliothek 2.0

Stöbern

Verwandte Tags

Tags

Lesezeichen (verstecken)4
Anzeige
alles
nur Lesezeichen
Lesezeichen pro Seite
5
10
20
50
100
sortieren nach
hinzugefügt am
Titel
RSS
BibTeX
XML

Publikationen (verstecken)
Anzeige
alles
nur Publikationen
Publikationen pro Seite
5
10
20
50
100
sortieren nach
hinzugefügt am
Titel
Autor
Erscheinungsdatum
Eintragstyp
Hilfe für erweiterte Sortierung...
RSS
BibTeX
RDF
mehr...