ThManager is an Open Source Tool for creating and visualizing SKOS RDF vocabularies, a W3C initiative for the representation of knowledge organization systems such as thesauri, classification schemes, subject heading lists, taxonomies, and other types of controlled vocabulary. ThManager facilitates the management of thesauri and other types of controlled vocabularies, such as taxonomies or classification schemes. The tool has been implemented in Java and has the following features:
Multi-platform (Windows, Unix). As it has been developed in Java and the storage of metadata records is managed directly through the file system, the application can be deployed in any platform with the minimum requirement of having installed a Java virtual machine.
Multilingual. The application has been developed following the Java internationalization methodology. Nowadays, there are Spanish and English versions. With little effort, other languages could be supported.
Selection and filtering of the thesauri stored in the local repository.
Description of thesauri by means of metadata in compliance with a Dublin Core based application profile for thesaurus (See application profile) . These metadata can be either visualized in HTML or edited through a form.
Visualization of thesaurus concepts. The visualization interface includes the following widgets:
Alphabetic viewer: It provides the list of thesaurus concepts alphabetically ordered in the selected language.
Hierarchical viewer: It provides a tree showing the hierarchical structure of thesaurus concepts.
Concept viewer: For a selected concept it shows all the properties allowing additionally the navigation to the related concepts by means of hyperlinks.
Search tool: It facilitates search of concepts. The searching process is based on preferred labels allowing the following criteria: "equals", "starts with" and "contains".
Edition of thesaurus content. The tool provides an edition interface to modify the content of a thesaurus: creation of concepts, deletion of concepts, and update of concept properties.
Exchange of thesauri according to SKOS format. The export operation includes the export of thesaurus metadata.
Extraction of related concepts in WordNet. It generates an automatic mapping of thesaurus concepts against the concepts of Wordnet lexical database.
On-line help by means of PDF visualization.
Everything search engine
Locate files and folders by name instantly.
Small installation file
Clean and simple user interface
Quick file indexing
Quick searching
Minimal resource usage
Share files with others easily
Real-time updating
More...
InSight Desktop Search
Easily Search For Files/Folders across HDD.
Support for shared Network Places.
Support for Metadata.
Dedicated Music Search and Playback.
Search Outlook Emails and Contacts.
Search for articles on Wikipedia.
Quick Launch Shortcuts.
Small Index size and Live updations.
InSight Preview.
Quick Disk Indexing Speed : 1-2 min.
Puggle is an open-source desktop search engine written exclusively in Java. It provides full text and metadata search over files, folders, music, photos, web pages and more that are stored locally on your computer.
Katta is a scalable, failure tolerant, distributed, data storage for real time access.
Katta serves large, replicated, indices as shards to serve high loads and very large data sets. These indices can be of different type. Currently implementations are available for Lucene and Hadoop mapfiles.
* Makes serving large or high load indices easy
* Serves very large Lucene or Hadoop Mapfile indices as index shards on many servers
* Replicate shards on different servers for performance and fault-tolerance
* Supports pluggable network topologies
* Master fail-over
* Fast, lightweight, easy to integrate
* Plays well with Hadoop clusters
* Apache Version 2 License
Sphinx is a full-text search engine, distributed under GPL version 2. Commercial license is also available for embedded use.
Generally, it's a standalone search engine, meant to provide fast, size-efficient and relevant fulltext search functions to other applications. Sphinx was specially designed to integrate well with SQL databases and scripting languages. Currently built-in data sources support fetching data either via direct connection to MySQL or PostgreSQL, or using XML pipe mechanism (a pipe to indexer in special XML-based format which Sphinx recognizes).
As for the name, Sphinx is an acronym which is officially decoded as SQL Phrase Index. Yes, I know about CMU's Sphinx project.