Protovis composes custom views of data with simple marks such as bars and dots. Unlike low-level graphics libraries that quickly become tedious for visualization, Protovis defines marks through dynamic properties that encode data, allowing inheritance, scales and layouts to simplify construction. Protovis is free and open-source, provided under the BSD License. It uses JavaScript and SVG for web-native visualizations; no plugin required (though you will need a modern web browser)! Although programming experience is helpful, Protovis is mostly declarative and designed to be learned by example.
Recollection seeks to provide the platform, tools and environment that enables the community of NDIIPP Partners to share their collections and data on an ongoing basis. In addition, NDIIPP collections can be showcased from a central point through the activities of the Partners, and not the manual labor of the Library. This allows NDIIPP to maintain the benefits of a distributed network of partners and also take advantage of the collections speaking to one another (Campbell, 2009). Linked data technology is used in Recollection as a basic platform for librarians and curators exposing collections to the Web, and as a source of data to augment these collections. Potential users of the information can more easily discover and analyze this data in a variety of new ways as a result. Not only do consumers of the information have increased access, but collection curators can begin to connect information across collections and from the WWW to enhance collection value with new resources.
A collection of the best open data sets and open-source tools for data science, wrapped in an easy-to-use REST/JSON API. Street Address to Coordinates, File to Text, OCR, Coordinates to Political Areas, Geodict pulls country, city and region names from unstructured English text, and returns their coordinates. IP Address to Coordinates. Removes the parts of the text that seem to be boilerplate, leaving the real sentences. HTML to Text. HTML to Story. Spots text fragments that look like people's names or titles, Text to Times.
The Social Science Data Analysis Network (SSDAN) is a university-based organization that creates demographic media, such as user guides, web sites, and hands-on classroom computer materials that make U.S. census data accessible to policymakers, educators, the media, and informed citizens. SSDAN is directed by demographer William H. Frey and utilizes facilities at the Population Studies Center, University of Michigan.
MemeTracker builds maps of the daily news cycle by analyzing around 900,000 news stories and blog posts per day from 1 million online sources, ranging from mass media to personal blogs. We track the quotes and phrases that appear most frequently over time across this entire online news spectrum. This makes it possible to see how different stories compete for news and blog coverage each day, and how certain stories persist while others fade quickly.
Sophie is open source software for writing, reading and visualizing rich media documents in an interactive, networked environment. The program emerged from the desire to create an easy-to-use application that would allow authors to combine text, images, video, and sound quickly and simply, but with precision and sophistication. Sophie's users are interested in creating robust, elegant, networked, texts and multimedia works without having programming knowledge or training in the use of more complex and costly tools. such as Flash. Sophie was initially designed and developed by the Institute for the Future of the Book.
FITS is currently configured to use a set of 8 tools for identifying, validating, and extracting technical metadata. jhove, exiftool, National Library of New Zealand Metadata Extractor, file utility, droid, ffident, fileinfo, xmlMetadata.
hoover up those sites. Getleft is a web site downloader, that downloads complete web sites according to the settings provided by the user. It automatically changes all the absolute links to relative ones, so you can surf the downloaded pages (web sites) on your local computer without the need to connect to the internet. so that you can surf the site in your hard disk. Getleft supports several filters, allowing you to limit the download to certain files, as well as resuming , following of external links, sitemap and more. Getleft supports proxy connections and can be scheduled to update downloaded pages automatically.
DROID (Digital Record Object Identification) is an automatic file format identification tool. It is the first in a planned series of tools developed by The National Archives under the umbrella of its PRONOM technical registry service.
UpLib is a digital “filing cabinet”, designed for personal use. It allows you to save Web pages, email, photos, scanned bills, receipts, and other documents — even music — for later retrieval. It uses a highly visual interface, presenting saved information as thumbnails of the actual document instead of lines of text.