Following up on KMeans Clustering Now Running on Elastic MapReduce, Stephen Green has generously documented the steps that was necessary to get an example of k-Means clustering up and running on Amazon’s Elastic MapReduce (EMR) on the Apache Lucene Mahout wiki.
Talis Connected Commons The Talis Connected Commons scheme is intended to directly support the publishing and reuse of Linked Data in the public domain by removing the costs associated with those activities. The scheme is intended to support a wide range of different forms of data publishing. For example scientific researchers seeking to share their research data
Protocol Buffers allow you to define simple data structures in a special definition language, then compile them to produce classes to represent those structures in the language of your choice. These classes come complete with heavily-optimized code to par
Using RhNav - Rhizome Navigation I wrote a data aggregator for Technorati's API. The first result is a video which visualizes blog domains by analysing Technorati's Cosmos (the blogs which link to a particular URL). The video is a screencast of RhNav fetc
Stock Cloud began as data mining experiment with a very simple goal — "Could we extract Business Partnerships by tracking press releases?" To accomplish this we selected a press release distribution agency, MarketWire, and began tracking releases. Usi
Stock Cloud began as data mining experiment with a very simple goal — "Could we extract Business Partnerships by tracking press releases?" To accomplish this we selected a press release distribution agency, MarketWire, and began tracking releases. Usi
G. Truong, S. Gilani, S. Islam, and D. Suter. 2019 Digital Image Computing: Techniques and Applications (DICTA), page 1-8. (December 2019)DST Best Science Paper Award.
X. Ma, Z. Fu, Y. Jiang, M. Yang, and H. Stephen. International Journal of Computer Science and Information Technology (IJCSIT), 9 (3):
31 - 41(June 2017)