Inproceedings

WISHFUL - Website Extraction of Institutional Sources with Heterogeneous Factors and User-Driven Linkage

S. Shahania, M. Spiliopoulou, and D. Broneske.
Information Integration and Web Intelligence: 25th International Conference, IiWAS 2023, Denpasar, Bali, Indonesia, December 4–6, 2023, Proceedings, pages 20–26. Berlin, Heidelberg, Springer-Verlag, (2023)
DOI: 10.1007/978-3-031-48316-5_3

Abstract

Extracting information from diverse websites is increasingly important, especially for analyzing vast data sets to detect trends and gain insights. By studying job ads, researchers can monitor shifts in employer demand, helping policymakers support affected workers and industries. However, extraction faces challenges such as varied website formats, dynamic content, and duplicate data. This study introduces a method for extracting data from diverse private university websites that involves keyword identification, website categorization, and extraction pipelines.

BibTeX key: 10.1007/978-3-031-48316-5_3
entry type: inproceedings
address: Berlin, Heidelberg
booktitle: Information Integration and Web Intelligence: 25th International Conference, IiWAS 2023, Denpasar, Bali, Indonesia, December 4–6, 2023, Proceedings
year: 2023
pages: 20–26
publisher: Springer-Verlag
isbn: 978-3-031-48315-8
numpages: 7
location: Denpasar, Indonesia
DOI: 10.1007/978-3-031-48316-5_3
url: https://doi.org/10.1007/978-3-031-48316-5_3
