tag :: hadoop distributed

bookmarks (31)

katta - distributed lucene
Katta is a scalable, failure-tolerant, distributed data store for real-time access. Katta serves large, replicated indices as shards to handle high loads and very large data sets. These indices can be of different types. Currently, implementations are available for Lucene and Hadoop mapfiles.
* Makes serving large or high-load indices easy
* Serves very large Lucene or Hadoop Mapfile indices as index shards on many servers
* Replicates shards on different servers for performance and fault tolerance
* Supports pluggable network topologies
* Master fail-over
* Fast, lightweight, easy to integrate
* Plays well with Hadoop clusters
* Apache Version 2 License
14 years ago by @stroeh
tags: searchengine suchmaschine distributed hadoop lucene katta

Last.fm – the Blog · Python + Hadoop = Flying Circus Elephant
http://blog.last.fm/2008/05/29/python-hadoop-flying-circus-elephant
13 years ago by @muehlburger
tags: dumbo scaling python clustering mapreduce distributed opensource hadoop programming scalability

Apache Hadoop
http://hadoop.apache.org/
12 years ago by @telekoma
tags: apache ws1213 computing distributed hadoop master seminar:dfs uni

Data-Intensive Information Processing Applications (Spring 2010) | Home
http://www.umiacs.umd.edu/~jimmylin/cloud-2010-Spring/
14 years ago by @muehlburger
tags: nlp cloud computing awm2010 data mapreduce course distributed lectures hadoop

Apache HBase – Apache HBase™ Home
An open-source, non-relational, distributed database modeled after Google’s Bigtable and written in Java.
6 years ago by @mjbrown
tags: apache bigdata versioning web database data distributed tabular hadoop nosql hbase

Apache Mesos
http://mesos.apache.org/
8 years ago by @bshanks
tags: mesos cluster production clustering microservice distributed scale hadoop deploy

Welcome to Apache Hadoop!
The Apache Hadoop project develops open-source software for reliable, scalable, distributed computing, including:
16 years ago by @carlfischer
tags: apache java cluster grid mapreduce distributed opensource hadoop

Amazon Web Services Developer Community : Running Hadoop MapReduce on Amazon EC2 and Amazon S3
Apache's Hadoop project aims to solve these problems by providing a framework for running large data processing applications on clusters of commodity hardware. Combined with Amazon EC2 for running the application, and Amazon S3 for storing the data, we can run large jobs very economically. This paper describes how to use Amazon Web Services and Hadoop to run an ad hoc analysis on a large collection of web access logs that otherwise would have cost a prohibitive amount in either time or money.
16 years ago by @carlfischer
tags: apache cluster ec2 amazon mapreduce distributed hadoop programming

RHIPE - R and Hadoop Integrated Processing Environment
http://www.stat.purdue.edu/~sguha/rhipe/
15 years ago by @dolefulrabbit
tags: analtics r computing mapreduce distributed hadoop statistics

Tom White: Learning MapReduce
http://www.lexemetech.com/2008/03/learning-mapreduce.html
13 years ago by @draganigajic
tags: java distributed tutorial mapReduce hadoop cloudComputing

How Raytheon Researchers are Using Hadoop to Build a Scalable, Distributed Triple Store « Cloudera » Apache Hadoop for the Enterprise
http://www.cloudera.com/blog/2010/03/how-raytheon-researchers-are-using-hadoop-to-build-a-scalable-distributed-triple-store/
15 years ago by @dolefulrabbit
tags: semantic distributed semanticweb triplestore article graph repository scalability cloud rdf database mapreduce hadoop sparql programming


publications (14)

Evaluation of Frequent Itemset Mining Platforms using Apriori and FP-Growth Algorithm
R. Ranjan, and A. Sharma. International Journal of Information Systems & Management Science, (2019)
4 years ago by @becker
tags: subgroup fpm item data distributed frequent itemset flink mining subgroups parallel spark pattern hadoop

HDFS: Erasure-Coded Information Repository System for Hadoop Clusters
A. Patil. International Journal of Trend in Scientific Research and Development, 2 (5): 1957-1960 (August 2018)
6 years ago by @ijtsrd
tags: codes erasure replica-based Hadoop distributed storage performance encoding system parallel file archival clusters

CloudSVM: Training an SVM Classifier in Cloud Computing Systems
F. Catak, and M. Balaban. CoRR, (2013)
11 years ago by @thoni
tags: cloud thema:svm_distributed ss13 computing seminar thema distributed svm training hadoop main

Large-Scale Data Processing Using MapReduce in Cloud Computing Environment
S. Daneshyar, and M. Razmjoo. International Journal on Web Service Computing (IJWSC), 3 (4): 01-13 (December 2012)
5 years ago by @ijwsc
tags: cloud parallel computing Hadoop and distributed processing MapReduce

Large-Scale Data Processing Using MapReduce in Cloud Computing Environment
S. Daneshyar, and M. Razmjoo. International Journal on Web Service Computing (IJWSC), 3 (4): 13 (December 2012)
6 years ago by @ijwsc
tags: cloud parallel computing Hadoop and distributed processing MapReduce

To Generate the Ontology from Java Source Code
S. Gopinath Ganapathy. International Journal of Advanced Computer Science and Applications (IJACSA), (2011)
11 years ago by @thesaiorg
tags: Ontology Hadoop Web Distributed and Parser, File Language System;.,component: QDox Ontology, Metadata; Jena,

SystemML: Declarative machine learning on MapReduce
A. Ghoting, R. Krishnamurthy, E. Pednault, B. Reinwald, V. Sindhwani, S. Tatikonda, Y. Tian, and S. Vaithyanathan. Proceedings of the 2011 IEEE 27th International Conference on Data Engineering, pages 231-242. Washington, DC, USA, IEEE Computer Society, (2011)
11 years ago by @sb3000
tags: bigdata distributed hadoop ml

The Hadoop distributed filesystem: Balancing portability and performance
J. Shafer, S. Rixner, and A. Cox. Performance Analysis of Systems Software (ISPASS), 2010 IEEE International Symposium on, pages 122-133. (March 2010)
12 years ago by @telekoma
tags: seminar distributed hdfs hadoop master seminar:dfs uni filesystem

The Hadoop Distributed File System
K. Shvachko, H. Kuang, S. Radia, and R. Chansler. (2010)
13 years ago by @nosebrain
tags: system file distributed hadoop

Distributed Algorithm for Computing Formal Concepts Using Map-Reduce Framework
P. Krajca, and V. Vychodil. Advances in Intelligent Data Analysis VIII, Springer-Verlag, (2009)
14 years ago by @muehlburger
tags: formal awmhadoop computing algorithms awm2010 distributed hadoop MapReduce algorithm

Map-reduce-merge: simplified relational data processing on large clusters
H. chih Yang, A. Dasdan, R. Hsiao, and D. Parker. SIGMOD '07: Proceedings of the 2007 ACM SIGMOD international conference on Management of data, pages 1029-1040. New York, NY, USA, ACM, (2007)
14 years ago by @flowolf
tags: hadoop-group reduce awmhadoop algorithms distributed parallel grid awm2010 mapreduce merge hadoop map relational

Map-reduce-merge: simplified relational data processing on large clusters
H. chih Yang, A. Dasdan, R. Hsiao, and D. Parker. SIGMOD '07: Proceedings of the 2007 ACM SIGMOD international conference on Management of data, pages 1029-1040. New York, NY, USA, ACM, (2007)
17 years ago by @jhammerb
tags: parallel grid algorithms mapreduce distributed join hadoop

The hadoop distributed file system: Architecture and design
D. Borthakur. Hadoop Project Website, (2007)
13 years ago by @ilativ
tags: science crawler distributed hadoop ws12 architecture

The hadoop distributed file system: Architecture and design
D. Borthakur. The Apache Software Foundation, (2007)
12 years ago by @telekoma
tags: system ws1213 file seminar distributed hadoop master seminar:dfs uni architecture


BibSonomy
