Gephi is an open-source software for visualizing and analyzing large networks graphs. Gephi uses a 3D render engine to display graphs in real-time and speed up the exploration. Use Gephi to explore, analyse, spatialise, filter, cluterize, manipulate and export all types of graphs.
From the user's perspective, MDP is a collection of supervised and unsupervised learning algorithms and other data processing units that can be combined into data processing sequences and more complex feed-forward network architectures.
Hazelcast is an open source clustering and highly scalable data distribution platform for Java, which is:
* Lightening-fast; thousands of operations/sec.
* Fail-safe; no losing data after crashes.
* Dynamically scales as new servers added.
* Super-easy to use; include a single jar.
Hazelcast is pure Java. JVMs that are running Hazelcast will dynamically cluster. Although by default Hazelcast will use multicast for discovery, it can also be configured to only use TCP/IP for environments where multicast is not available or preferred.
Following up on KMeans Clustering Now Running on Elastic MapReduce, Stephen Green has generously documented the steps that was necessary to get an example of k-Means clustering up and running on Amazon’s Elastic MapReduce (EMR) on the Apache Lucene Mahout wiki.
TANAGRA is a free DATA MINING software for academic and research purposes. It proposes several data mining methods from exploratory data analysis, statistical learning, machine learning and databases area.
A. Hotho, S. Staab, и G. Stumme. Proceedings of the 2003 IEEE International Conference on Data Mining, стр. 541-544 (Poster. Melbourne, Florida, IEEE Computer Society, (ноября 2003)