"They are built to be human-usable (...) are targeted primarily for storage/retrieval of personal information and serendipitous discovery of group information . (...) The development communities for each are abuzz with ideas for exploiting the structure"
This piece is based on two talks I gave in the spring of 2005 -- one at the O'Reilly ETech conference in March, entitled "Ontology Is Overrated", and one at the IMCExpo in April entitled "Folksonomies & Tags: The rise of user-developed classification." Th
Bow (or libbow) is a library of C code useful for writing statistical text analysis, language modeling and information retrieval programs. The current distribution includes the library, as well as front-ends for document classification (rainbow), document
The aim of the International Journal of Advances in Internet of Things is to provide a forum for scientists and social workers to present and discuss issues in the impact of the Internet to the society and disseminate findings in scientific research on related subjects.
The Cataloger's Reference Shelf is based on 21 MARC manuals and other reference works published by The Library of Congress and frequently accessed by technical services staff. A must see for catalogers!
In this post you will see 5 recipes of supervised classification algorithms applied to small standard datasets that are provided with the scikit-learn library.
Database of animal natural history, distribution, classification, and conservation biology. Contains species accounts about individual animal species and descriptions of levels of organization above the species level, especially phyla, classes, and in some cases, orders and families.
Provides taxonomic, conservation status and distribution information on plants and animals that are extinct, at risk of extinction, or near threatened.
Other articles where Thesaurus is discussed: library: Thesauri: A new use of the term thesaurus, now widespread, dates from the early 1950s in the work of H.P. Luhn, at International Business Machines Corporation (IBM), who was searching for a computer process that could create a list of authorized terms for the indexing…
Concept mining is a discipline at the nexus of data mining, text mining, and linguistics, drawing on artificial intelligence and statistics. It aims to extract concepts from documents.
My diploma thesis about a system to automatically build a multilingual thesaurus from wikipedia, "WikiWord", is finally done. I handed it in yesterday. My research will hopefully help to make Wikipedia more accessible for automatic processing
This introductory course on machine learning will give an overview of many concepts, techniques, and algorithms in machine learning, beginning with topics such as classification and linear regression and ending up with more recent topics such as boosting,
Ein englischer Text von Adam Mathes mit den Themen:The Creation of Metadata, Tagging Content in Del.icio.us and Flickr, From Tags to Folksonomy, Why Folksonomies Work and Areas For Further Research
Scalable and Efficient Data Streaming Algorithms for Detecting Common Content in Internet Traffic. Minho Sung, Abhishek Kumar, Li Li, Jia Wang, Jun Xu. To appear in the Proc. of 2nd IEEE International Workshop on Networking Meets Databases (NetDB'06), April 2006. Sketch Guided Sampling -- Using On-Line Estimates of Flow Size for Adaptive Data Collection. Abhishek Kumar, Jun (Jim) Xu. To appear in the proceedings of IEEE Infocom'06, Barcelona, Spain, April 2006.
LIBLINEAR is a linear classifier for data with millions of instances and features. It supports L2-regularized logistic regression (LR), L2-loss linear SVM, and L1-loss linear SVM.
Main features of LIBLINEAR include
* Same data format as LIBSVM, our general-purpose SVM solver, and also similar usage
* Multi-class classification: 1) one-vs-the rest, 2) Crammer & Singer
* Cross validation for model selection
* Probability estimates (logistic regression only)
* Weights for unbalanced data
* MATLAB/Octave, Java interfaces
Advantages and drawbacks of data organisation in hierarchies, facets and with tags. Problems with finding the needed data without exact knowledge about it.
A. Akyol, Y. Yaslan, and O. Erol. Proceedings of the 9th European Conference on Symbolic
and Quantitative Approaches to Reasoning with
Uncertainty, ECSQARU, volume 4724 of Lecture Notes in Computer Science, page 878--888. Hammamet, Tunisia, Springer, (October 2007)
A. Almal, A. Mitra, R. Datar, P. Lenehan, D. Fry, R. Cote, and W. Worzel. GECCO 2006: Proceedings of the 8th annual conference
on Genetic and evolutionary computation, 1, page 239--246. Seattle, Washington, USA, ACM Press, (8-12 July 2006)