Twitter will release its recently acquired real-time data processing system Storm as open source. This makes the technology for parallelizing database queries available to everyone.
Medvane is an automated bibliome mining system. Medvane's data source includes articles published since 1973 with at least one author from Harvard or its affiliated institutions. Articles from PubMed are analyzed in the contexts of journal, author, subject, and gene. The relationships between these aspects and their evolution over time give a bird's-eye view of biomedical research.
Parallel or distributed mining; Cluster-based data mining algorithms and systems; Grid-based data mining algorithms and systems; Peer-to-Peer based data mining algorithms and systems; Data mining algorithms and systems based on parallel hardware platforms
IB, a quarterly journal, is dedicated to the latest advances in Internet and Business, and the intersection of Economics with business applications. The goal of this journal is to publish cutting-edge research and promote research work in these fast-moving areas. All manuscripts submitted to IB must be previously unpublished and may not be considered for publication elsewhere at any time during IB's review period.
We investigate the statistical filtering of phishing emails, where a classifier is trained on characteristic features of existing emails and subsequently is able to identify new phishing emails with different contents. We propose advanced email features generated by adaptively trained Dynamic Markov Chains and by novel latent Class-Topic Models. On a publicly available test corpus, classifiers using these features are able to reduce the number of misclassified emails by two thirds compared to previous work. Using a recently proposed, more expressive evaluation method, we show that these results are statistically significant. In addition, we successfully tested our approach on a non-public email corpus with a real-life composition.
Jeff Ullman is the Stanford W. Ascherman Professor of Computer Science (Emeritus). His interests include database theory, database integration, data mining, and education using the information infrastructure.
Following up on KMeans Clustering Now Running on Elastic MapReduce, Stephen Green has generously documented the steps that were necessary to get an example of k-Means clustering up and running on Amazon’s Elastic MapReduce (EMR) on the Apache Lucene Mahout wiki.
Email marketing & newsletter analytics. Track more than just open & click rates. Beautiful statistics & reports. Easy to set up. Works with all email service providers.
K. Pandey, R. Yadu, A. Dwivedi, and P. Shukla. International Journal on Recent and Innovation Trends in Computing and Communication, 3 (2): 456--460 (February 2015)
J. Eggermont, A. Eiben, and J. van Hemert. Advances in Intelligent Data Analysis, Third International Symposium, IDA-99, volume 1642 of LNCS, page 281--290. Amsterdam, The Netherlands, Springer-Verlag, (9--11 August 1999)
M. Atzmueller, S. Beer, and F. Puppe. Proc. 22nd International Florida Artificial Intelligence Research Society Conference (FLAIRS), page 402--407. AAAI Press, (2009)
M. Atzmueller, S. Beer, and F. Puppe. Proc. 22nd International Florida Artificial Intelligence Research Society Conference (FLAIRS), accepted, page 372-377. AAAI Press, (2009)
P. Kluegl, M. Atzmueller, and F. Puppe. Proc. LWA 2009, Knowledge Discovery and Machine Learning Track, Darmstadt, Germany, University of Darmstadt, (2009)