Tabula allows you to extract data tables in PDF files into a CSV or Microsoft Excel spreadsheet using a simple, easy-to-use interface. Tabula works on Mac, Windows and Linux.
raleighpublicrecord/dochive · GitHub, DocHive has 2 prerequisites, ImageMagic and Tesserac. coverts pdf pages to images and the OCRs the image. purpose is to extract numeric statistical tables in PDFs for import into spreadsheets.
LexisNexis™ Statistical DataSets is a new online service that enables researchers to build statistical tables and charts from multiple sources in a single interface. This online interactive statistical solution aggregates over 610 licensed and public domain datasets provided by over 55 sources. The DataSets product makes 16.0 billion data points accessible within a single interface. The product exists as a module within LexisNexis Statistical Insight, which up until has covered published statistical tables. Now LexisNexis Statistical brings together the two worlds of published statistics and unpublished numeric data