tag :: mapreduce java apache

закладки (спрятать)4
показать
всё
только закладки
закладки на страницу
5
10
20
50
100
RSS
BibTeX
XML

4Welcome to Pig!
Pig is a platform for analyzing large data sets that consists of a high-level language for expressing data analysis programs, coupled with infrastructure for evaluating these programs. The salient property of Pig programs is that their structure is amenable to substantial parallelization, which in turns enables them to handle very large data sets. At the present time, Pig's infrastructure layer consists of a compiler that produces sequences of Map-Reduce programs, for which large-scale parallel implementations already exist Pig's language layer currently consists of a textual language called Pig Latin, which has the following key properties: * Ease of programming. It is trivial to achieve parallel execution of simple, "embarrassingly parallel" data analysis tasks. * Optimization opportunities. The way in which tasks are encoded permits the system to optimize their execution automatically * Extensibility.
13 лет назад , @draganigajic
apache
datamining
hadoop
java
mapreduce
apachedatamininghadoopjavamapreduce
(0)
копироватьудалить
- Запись сообщества
- посмотреть историю записи
1Sqoop « Cloudera » Apache Hadoop for the Enterprise
Sqoop is a tool designed to import data from relational databases into Hadoop. Sqoop uses JDBC to connect to a database. It examines each table’s schema and automatically generates the necessary classes to import data into the Hadoop Distributed File System (HDFS). Sqoop then creates and launches a MapReduce job to read tables from the database via DBInputFormat, the JDBC-based InputFormat. Tables are read into a set of files in HDFS. Sqoop supports both SequenceFile and text-based target and includes performance enhancements for loading data from MySQL.
15 лет назад , @gresch
apache
db
dbms
hadoop
hdfs
java
mapreduce
software
sql
apachedbdbmshadoophdfsjavamapreducesoftwaresql
(0)
копироватьудалить
- Запись сообщества
- посмотреть историю записи
4Apache Mahout - Overview
http://lucene.apache.org/mahout/
15 лет назад , @dolefulrabbit
analytics
apache
clustering
datamining
distributed
hadoop
java
library
lucene
machinelearning
mapreduce
recommendation
scalable
software
analyticsapacheclusteringdataminingdistributedhadoopjavalibrarylucenemachinelearningmapreducerecommendationscalablesoftware
(0)
копироватьудалить
- Запись сообщества
- посмотреть историю записи
19Welcome to Apache Hadoop!
The Apache Hadoop project develops open-source software for reliable, scalable, distributed computing, including:
16 лет назад , @carlfischer
apache
cluster
distributed
grid
hadoop
java
mapreduce
opensource
apacheclusterdistributedgridhadoopjavamapreduceopensource
(0)
копироватьудалить
- Запись сообщества
- посмотреть историю записи

&lang;&lang;
⟨
1
&rang;
⟩⟩

публикации (спрятать)
показать
всё
только публикации
публикации на страницу
5
10
20
50
100
расширенный...
RSS
BibTeX
RDF
дальше...

Нет подходящих.

&lang;&lang;
⟨
&rang;
⟩⟩

BibSonomy

закладки (спрятать)4
показать
всё
только закладки
закладки на страницу
5
10
20
50
100
RSS
BibTeX
XML

4Welcome to Pig!

1Sqoop « Cloudera » Apache Hadoop for the Enterprise

4Apache Mahout - Overview

19Welcome to Apache Hadoop!

публикации (спрятать)
показать
всё
только публикации
публикации на страницу
5
10
20
50
100
расширенный...
RSS
BibTeX
RDF
дальше...

просмотр

сходные по теме тэги

BibSonomy

закладки (спрятать)4 показатьвсётолько закладкизакладки на страницу5102050100 RSSBibTeXXML

4Welcome to Pig!

1Sqoop « Cloudera » Apache Hadoop for the Enterprise

4Apache Mahout - Overview

19Welcome to Apache Hadoop!

публикации (спрятать) показатьвсётолько публикациипубликации на страницу5102050100 расширенный... RSSBibTeXRDFдальше...

просмотр

сходные по теме тэги

закладки (спрятать)4
показать
всё
только закладки
закладки на страницу
5
10
20
50
100
RSS
BibTeX
XML

публикации (спрятать)
показать
всё
только публикации
публикации на страницу
5
10
20
50
100
расширенный...
RSS
BibTeX
RDF
дальше...