LAREX – A semi-automatic open-source Tool for Layout
Analysis and Region Extraction on Early Printed Books
C. Reul, U. Springmann, und F. Puppe. 2nd International Conference on Digital Access to Textual Cultural Heritage (DATeCH), (2017)
Zusammenfassung
A semi-automatic open-source tool for layout analysis on early
printed books is presented. LAREX uses a rule based connected
components approach which is very fast, easily comprehensible for
the user and allows an intuitive manual correction if necessary. The
PageXML format is used to support integration into existing OCR
workflows. Evaluations showed that LAREX provides an efficient
and flexible way to segment pages of early printed books.
%0 Journal Article
%1 reul2017larex
%A Reul, Christian
%A Springmann, Uwe
%A Puppe, Frank
%D 2017
%J 2nd International Conference on Digital Access to Textual Cultural Heritage (DATeCH)
%K early_printed_books incunabula layout_analysis myown segmentation
%T LAREX – A semi-automatic open-source Tool for Layout
Analysis and Region Extraction on Early Printed Books
%U https://dl.acm.org/citation.cfm?id=3078097
%X A semi-automatic open-source tool for layout analysis on early
printed books is presented. LAREX uses a rule based connected
components approach which is very fast, easily comprehensible for
the user and allows an intuitive manual correction if necessary. The
PageXML format is used to support integration into existing OCR
workflows. Evaluations showed that LAREX provides an efficient
and flexible way to segment pages of early printed books.
@article{reul2017larex,
abstract = {A semi-automatic open-source tool for layout analysis on early
printed books is presented. LAREX uses a rule based connected
components approach which is very fast, easily comprehensible for
the user and allows an intuitive manual correction if necessary. The
PageXML format is used to support integration into existing OCR
workflows. Evaluations showed that LAREX provides an efficient
and flexible way to segment pages of early printed books.},
added-at = {2017-01-25T10:44:20.000+0100},
author = {Reul, Christian and Springmann, Uwe and Puppe, Frank},
biburl = {https://www.bibsonomy.org/bibtex/2414cb7151f2ff589d9107e907f0ccba0/chreul},
interhash = {330312e9971fd642a2e3d2434c2a599d},
intrahash = {414cb7151f2ff589d9107e907f0ccba0},
journal = {2nd International Conference on Digital Access to Textual Cultural Heritage (DATeCH)},
keywords = {early_printed_books incunabula layout_analysis myown segmentation},
timestamp = {2024-07-23T21:27:07.000+0200},
title = {LAREX – A semi-automatic open-source Tool for Layout
Analysis and Region Extraction on Early Printed Books},
url = {https://dl.acm.org/citation.cfm?id=3078097},
year = 2017
}