|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||
java.lang.Objectcom.cloudera.crunch.impl.mem.MemPipeline
public class MemPipeline
| Method Summary | ||
|---|---|---|
static
|
collectionOf(Iterable<T> collect)
|
|
static
|
collectionOf(T... ts)
|
|
void |
done()
Run any remaining jobs required to generate outputs and then clean up any intermediate data files that were created in this run or previous calls to run. |
|
void |
enableDebug()
Turn on debug logging for jobs that are run from this pipeline. |
|
org.apache.hadoop.conf.Configuration |
getConfiguration()
Returns the Configuration instance associated with this pipeline. |
|
static Pipeline |
getInstance()
|
|
|
materialize(PCollection<T> pcollection)
Create the given PCollection and read the data it contains into the returned Collection instance for client use. |
|
|
read(Source<T> source)
Converts the given Source into a PCollection that is
available to jobs run using this Pipeline instance. |
|
|
read(TableSource<K,V> source)
A version of the read method for TableSource instances that
map to PTables. |
|
PCollection<String> |
readTextFile(String pathName)
A convenience method for reading a text file. |
|
void |
run()
Constructs and executes a series of MapReduce jobs in order to write data to the output targets. |
|
void |
setConfiguration(org.apache.hadoop.conf.Configuration conf)
Set the Configuration to use with this pipeline. |
|
static
|
tableOf(Iterable<Pair<S,T>> pairs)
|
|
static
|
tableOf(S s,
T t,
Object... more)
|
|
static
|
typedCollectionOf(PType<T> ptype,
Iterable<T> collect)
|
|
static
|
typedCollectionOf(PType<T> ptype,
T... ts)
|
|
static
|
typedTableOf(PTableType<S,T> ptype,
Iterable<Pair<S,T>> pairs)
|
|
static
|
typedTableOf(PTableType<S,T> ptype,
S s,
T t,
Object... more)
|
|
void |
write(PCollection<?> collection,
Target target)
Write the given collection to the given target on the next pipeline run. |
|
|
writeTextFile(PCollection<T> collection,
String pathName)
A convenience method for writing a text file. |
|
| Methods inherited from class java.lang.Object |
|---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Method Detail |
|---|
public static Pipeline getInstance()
public static <T> PCollection<T> collectionOf(T... ts)
public static <T> PCollection<T> collectionOf(Iterable<T> collect)
public static <T> PCollection<T> typedCollectionOf(PType<T> ptype,
T... ts)
public static <T> PCollection<T> typedCollectionOf(PType<T> ptype,
Iterable<T> collect)
public static <S,T> PTable<S,T> tableOf(S s,
T t,
Object... more)
public static <S,T> PTable<S,T> typedTableOf(PTableType<S,T> ptype,
S s,
T t,
Object... more)
public static <S,T> PTable<S,T> tableOf(Iterable<Pair<S,T>> pairs)
public static <S,T> PTable<S,T> typedTableOf(PTableType<S,T> ptype,
Iterable<Pair<S,T>> pairs)
public void setConfiguration(org.apache.hadoop.conf.Configuration conf)
PipelineConfiguration to use with this pipeline.
setConfiguration in interface Pipelinepublic org.apache.hadoop.conf.Configuration getConfiguration()
PipelineConfiguration instance associated with this pipeline.
getConfiguration in interface Pipelinepublic <T> PCollection<T> read(Source<T> source)
PipelineSource into a PCollection that is
available to jobs run using this Pipeline instance.
read in interface Pipelinesource - The source of data
public <K,V> PTable<K,V> read(TableSource<K,V> source)
PipelineTableSource instances that
map to PTables.
read in interface Pipelinesource - The source of the data
public void write(PCollection<?> collection,
Target target)
Pipeline
write in interface Pipelinecollection - The collectiontarget - The output targetpublic PCollection<String> readTextFile(String pathName)
Pipeline
readTextFile in interface Pipeline
public <T> void writeTextFile(PCollection<T> collection,
String pathName)
Pipeline
writeTextFile in interface Pipelinepublic <T> Iterable<T> materialize(PCollection<T> pcollection)
Pipeline
materialize in interface Pipelinepcollection - The PCollection to materialize
public void run()
Pipeline
run in interface Pipelinepublic void done()
Pipelinerun.
done in interface Pipelinepublic void enableDebug()
Pipeline
enableDebug in interface Pipeline
|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||