A B C D E F G H I J L M N O P R S T U V W X Y

A

AbstractCommand - Class in com.cloudera.cdk.morphline.base
Base class for convenient implementation of Command classes.
AbstractCommand(Config, Command, Command, MorphlineContext) - Constructor for class com.cloudera.cdk.morphline.base.AbstractCommand
 
AbstractFieldTransformCommand - Class in com.cloudera.cdk.morphline.stdlib
Base class for convenient implementation of commands that do in-place updates on a field.
AbstractFieldTransformCommand(Config, Command, Command, MorphlineContext) - Constructor for class com.cloudera.cdk.morphline.stdlib.AbstractFieldTransformCommand
 
AbstractParser - Class in com.cloudera.cdk.morphline.stdio
Base class for convenient implementation of morphline parsers.
AbstractParser(Config, Command, Command, MorphlineContext) - Constructor for class com.cloudera.cdk.morphline.stdio.AbstractParser
 
AddCurrentTimeBuilder - Class in com.cloudera.cdk.morphline.stdlib
A command that adds the result of System.currentTimeMillis() to a given output field.
AddCurrentTimeBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.AddCurrentTimeBuilder
 
AddLocalHostBuilder - Class in com.cloudera.cdk.morphline.stdlib
A command that adds the name or IP of the local host to a given output field.
AddLocalHostBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.AddLocalHostBuilder
 
addSupportedMimeType(String) - Method in class com.cloudera.cdk.morphline.stdio.AbstractParser
 
AddValuesBuilder - Class in com.cloudera.cdk.morphline.stdlib
For each input field value, add the value to the given record output field.
AddValuesBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.AddValuesBuilder
 
AddValuesIfAbsentBuilder - Class in com.cloudera.cdk.morphline.stdlib
For each input field value, add the value to the given record output field if the value isn't already contained in that field.
AddValuesIfAbsentBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.AddValuesIfAbsentBuilder
 
apply(Object) - Method in class com.cloudera.cdk.data.FieldPartitioner
Apply the partition function to the given value.
apply(Object) - Method in class com.cloudera.cdk.data.partition.HashFieldPartitioner
 
apply(Object) - Method in class com.cloudera.cdk.data.partition.IdentityFieldPartitioner
 
apply(Object) - Method in class com.cloudera.cdk.data.partition.IntRangeFieldPartitioner
 
apply(Object) - Method in class com.cloudera.cdk.data.partition.ListFieldPartitioner
 
apply(Object) - Method in class com.cloudera.cdk.data.partition.MonthFieldPartitioner
 
apply(Object) - Method in class com.cloudera.cdk.data.partition.RangeFieldPartitioner
 
asSource(Dataset, Class<E>) - Static method in class com.cloudera.cdk.data.crunch.CrunchDatasets
Expose the given Dataset as a Crunch ReadableSource.
asTarget(Dataset) - Static method in class com.cloudera.cdk.data.crunch.CrunchDatasets
Expose the given Dataset as a Crunch Target.
ATTACHMENT_BODY - Static variable in class com.cloudera.cdk.morphline.base.Fields
 
ATTACHMENT_CHARSET - Static variable in class com.cloudera.cdk.morphline.base.Fields
 
ATTACHMENT_MIME_TYPE - Static variable in class com.cloudera.cdk.morphline.base.Fields
 
ATTACHMENT_NAME - Static variable in class com.cloudera.cdk.morphline.base.Fields
 
AVRO - Static variable in class com.cloudera.cdk.data.Formats
AVRO: the Avro row-oriented format
AVRO_MEMORY_MIME_TYPE - Static variable in class com.cloudera.cdk.morphline.avro.ReadAvroBuilder
The MIME type identifier that will be filled into output records

B

BASE_ID - Static variable in class com.cloudera.cdk.morphline.base.Fields
 
beginTransaction() - Method in interface com.cloudera.cdk.morphline.solr.DocumentLoader
Begins a transaction
beginTransaction() - Method in class com.cloudera.cdk.morphline.solr.SolrServerDocumentLoader
 
build(Config, Command, Command, MorphlineContext) - Method in interface com.cloudera.cdk.morphline.api.CommandBuilder
Creates and returns a command rooted at the given morphline config.
build() - Method in class com.cloudera.cdk.morphline.api.MorphlineContext.Builder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.avro.ExtractAvroPathsBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.avro.ExtractAvroTreeBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.avro.ReadAvroBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.avro.ReadAvroContainerBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.hadoop.sequencefile.ReadSequenceFileBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.json.ExtractJsonPathsBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.json.ReadJsonBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.saxon.ConvertHTMLBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.saxon.XQueryBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.saxon.XSLTBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.solr.GenerateSolrSequenceKeyBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.solr.LoadSolrBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.solr.SanitizeUnknownSolrFieldsBuilder
 
build() - Method in class com.cloudera.cdk.morphline.solr.SolrMorphlineContext.Builder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.solr.TokenizeTextBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.solrcell.SolrCellBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdio.ReadClobBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdio.ReadCSVBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdio.ReadLineBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdio.ReadMultiLineBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.AddCurrentTimeBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.AddLocalHostBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.AddValuesBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.AddValuesIfAbsentBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.CallParentPipeBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.ContainsBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.ConvertTimestampBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.DecodeBase64Builder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.DropRecordBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.EqualsBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.ExtractURIComponentsBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.ExtractURIQueryParametersBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.FindReplaceBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.GenerateUUIDBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.GrokBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.IfThenElseBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.JavaBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.LogDebugBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.LogErrorBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.LogInfoBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.LogTraceBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.LogWarnBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.NotBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.PipeBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.SeparateAttachmentsBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.SetValuesBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.SplitBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.ToStringBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.TranslateBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.TryRulesBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.tika.decompress.DecompressBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.tika.decompress.UnpackBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.tika.DetectMimeTypeBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.twitter.ReadJsonTestTweetsBuilder
 
buildCommand(Config, Command, Command) - Method in class com.cloudera.cdk.morphline.base.AbstractCommand
Factory method to create a command rooted at the given cmdConfig.
buildCommandChain(Config, String, Command, boolean) - Method in class com.cloudera.cdk.morphline.base.AbstractCommand
Factory method to create the chain of commands rooted at the given rootConfig.

C

CallParentPipeBuilder - Class in com.cloudera.cdk.morphline.stdlib
Command that routes records to the enclosing pipe morphline object.
CallParentPipeBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.CallParentPipeBuilder
 
close() - Method in interface com.cloudera.cdk.data.DatasetReader
Close the reader and release any system resources.
close() - Method in interface com.cloudera.cdk.data.DatasetWriter
Close the writer and release any system resources.
com.cloudera.cdk.data - package com.cloudera.cdk.data
Provides interfaces and classes for reading and writing datasets.
com.cloudera.cdk.data.crunch - package com.cloudera.cdk.data.crunch
Provides helper classes for working with datasets using Crunch.
com.cloudera.cdk.data.filesystem - package com.cloudera.cdk.data.filesystem
Provides a dataset implementation for Hadoop filesystems, like HDFS.
com.cloudera.cdk.data.flume - package com.cloudera.cdk.data.flume
Provides classes for writing datasets via Flume.
com.cloudera.cdk.data.hcatalog - package com.cloudera.cdk.data.hcatalog
Provides a dataset implementation that uses the Hive/HCatalog metastore for metadata, and stores data in a Hadoop FileSystem.
com.cloudera.cdk.data.partition - package com.cloudera.cdk.data.partition
Provides common implementations of FieldPartitioner.
com.cloudera.cdk.morphline.api - package com.cloudera.cdk.morphline.api
This package contains the heart of the morphline API.
com.cloudera.cdk.morphline.avro - package com.cloudera.cdk.morphline.avro
 
com.cloudera.cdk.morphline.base - package com.cloudera.cdk.morphline.base
 
com.cloudera.cdk.morphline.hadoop.sequencefile - package com.cloudera.cdk.morphline.hadoop.sequencefile
 
com.cloudera.cdk.morphline.json - package com.cloudera.cdk.morphline.json
 
com.cloudera.cdk.morphline.saxon - package com.cloudera.cdk.morphline.saxon
 
com.cloudera.cdk.morphline.scriptengine.java - package com.cloudera.cdk.morphline.scriptengine.java
 
com.cloudera.cdk.morphline.solr - package com.cloudera.cdk.morphline.solr
 
com.cloudera.cdk.morphline.solrcell - package com.cloudera.cdk.morphline.solrcell
 
com.cloudera.cdk.morphline.stdio - package com.cloudera.cdk.morphline.stdio
 
com.cloudera.cdk.morphline.stdlib - package com.cloudera.cdk.morphline.stdlib
 
com.cloudera.cdk.morphline.tika - package com.cloudera.cdk.morphline.tika
 
com.cloudera.cdk.morphline.tika.decompress - package com.cloudera.cdk.morphline.tika.decompress
 
com.cloudera.cdk.morphline.twitter - package com.cloudera.cdk.morphline.twitter
 
com.cloudera.cdk.tools - package com.cloudera.cdk.tools
 
CombinedLogFormatConverter - Class in com.cloudera.cdk.tools
A tool for converting files in Combined Log Format to a Dataset.
CombinedLogFormatConverter() - Constructor for class com.cloudera.cdk.tools.CombinedLogFormatConverter
 
Command - Interface in com.cloudera.cdk.morphline.api
A command transforms a record into zero or more records.
CommandBuilder - Interface in com.cloudera.cdk.morphline.api
Interface to create instances of morphline commands.
commitTransaction() - Method in interface com.cloudera.cdk.morphline.solr.DocumentLoader
Sends any outstanding documents to the destination and waits for a positive or negative ack (i.e.
commitTransaction() - Method in class com.cloudera.cdk.morphline.solr.SolrServerDocumentLoader
 
compile(File, String, MorphlineContext, Command, Config...) - Method in class com.cloudera.cdk.morphline.base.Compiler
Parses the given morphlineFile, then finds the morphline with the given morphlineId within, then compiles the morphline and returns the corresponding morphline command.
compile(Config, MorphlineContext, Command) - Method in class com.cloudera.cdk.morphline.base.Compiler
Compiles the given morphline config using the given morphline context.
compile(String, String, Class[]) - Method in class com.cloudera.cdk.morphline.scriptengine.java.FastJavaScriptEngine
 
compile(String) - Method in class com.cloudera.cdk.morphline.scriptengine.java.FastJavaScriptEngine
 
compile(Reader) - Method in class com.cloudera.cdk.morphline.scriptengine.java.FastJavaScriptEngine
 
Compiler - Class in com.cloudera.cdk.morphline.base
Tool to parse and compile a morphline file or morphline config.
Compiler() - Constructor for class com.cloudera.cdk.morphline.base.Compiler
 
CONFIG_KEY_FIELD - Static variable in class com.cloudera.cdk.morphline.hadoop.sequencefile.ReadSequenceFileBuilder
 
CONFIG_VALUE_FIELD - Static variable in class com.cloudera.cdk.morphline.hadoop.sequencefile.ReadSequenceFileBuilder
 
Configs - Class in com.cloudera.cdk.morphline.base
Helpers to traverse and read parts of a HOCON data structure.
Configs() - Constructor for class com.cloudera.cdk.morphline.base.Configs
 
configuration(Configuration) - Method in class com.cloudera.cdk.data.filesystem.FileSystemDatasetRepository.Builder
The Configuration used to find the FileSystem.
configuration(Configuration) - Method in class com.cloudera.cdk.data.hcatalog.HCatalogDatasetRepository.Builder
The Configuration used to find the FileSystem.
ContainsBuilder - Class in com.cloudera.cdk.morphline.stdlib
Command that succeeds if one of the field values of the given named field is equal to one of the the given values, and fails otherwise; Multiple fields can be named, in which case the results are ANDed.
ContainsBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.ContainsBuilder
 
containsLifecycleEvent(Record, Notifications.LifecycleEvent) - Static method in class com.cloudera.cdk.morphline.base.Notifications
 
context - Variable in class com.cloudera.cdk.morphline.api.MorphlineContext.Builder
 
ConvertHTMLBuilder - Class in com.cloudera.cdk.morphline.saxon
Command that converts HTML to XHTML using the TagSoup library.
ConvertHTMLBuilder() - Constructor for class com.cloudera.cdk.morphline.saxon.ConvertHTMLBuilder
 
ConvertTimestampBuilder - Class in com.cloudera.cdk.morphline.stdlib
Command that converts the timestamps in a given field from one of a set of input date formats (in an input timezone) to an output date format (in an output timezone), while respecting daylight savings time rules.
ConvertTimestampBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.ConvertTimestampBuilder
 
copy() - Method in class com.cloudera.cdk.morphline.api.Record
 
create(String, DatasetDescriptor) - Method in interface com.cloudera.cdk.data.DatasetRepository
Create a Dataset with the supplied descriptor.
create(String, DatasetDescriptor) - Method in class com.cloudera.cdk.data.filesystem.FileSystemDatasetRepository
 
create(String, DatasetDescriptor) - Method in class com.cloudera.cdk.data.hcatalog.HCatalogDatasetRepository
 
create() - Method in class com.cloudera.cdk.morphline.api.MorphlineContext.Builder
 
create() - Method in class com.cloudera.cdk.morphline.solr.SolrMorphlineContext.Builder
 
createBindings() - Method in class com.cloudera.cdk.morphline.scriptengine.java.FastJavaScriptEngine
 
createSolrContentHandler(Metadata, SolrParams, IndexSchema) - Method in class com.cloudera.cdk.morphline.solrcell.StripNonCharSolrContentHandlerFactory
 
createSolrContentHandler(Metadata, SolrParams, IndexSchema) - Method in class com.cloudera.cdk.morphline.solrcell.TrimSolrContentHandlerFactory
 
CrunchDatasets - Class in com.cloudera.cdk.data.crunch
A helper class for exposing a filesystem-based dataset as a Crunch ReadableSource or Target.
CrunchDatasets() - Constructor for class com.cloudera.cdk.data.crunch.CrunchDatasets
 

D

Dataset - Interface in com.cloudera.cdk.data
A logical representation of a set of data entities.
DatasetDescriptor - Class in com.cloudera.cdk.data
The structural definition of a Dataset.
DatasetDescriptor(Schema, PartitionStrategy) - Constructor for class com.cloudera.cdk.data.DatasetDescriptor
Create an instance of this class with the supplied Schema, and optional PartitionStrategy.
DatasetDescriptor.Builder - Class in com.cloudera.cdk.data
A fluent builder to aid in the construction of DatasetDescriptors.
DatasetDescriptor.Builder() - Constructor for class com.cloudera.cdk.data.DatasetDescriptor.Builder
 
DatasetException - Exception in com.cloudera.cdk.data
Exception thrown for dataset-related failures.
DatasetException() - Constructor for exception com.cloudera.cdk.data.DatasetException
 
DatasetException(String) - Constructor for exception com.cloudera.cdk.data.DatasetException
 
DatasetException(String, Throwable) - Constructor for exception com.cloudera.cdk.data.DatasetException
 
DatasetException(Throwable) - Constructor for exception com.cloudera.cdk.data.DatasetException
 
DatasetReader<E> - Interface in com.cloudera.cdk.data
A stream-oriented dataset reader.
DatasetReaderException - Exception in com.cloudera.cdk.data
Exception thrown for dataset reader-related failures.
DatasetReaderException() - Constructor for exception com.cloudera.cdk.data.DatasetReaderException
 
DatasetReaderException(String) - Constructor for exception com.cloudera.cdk.data.DatasetReaderException
 
DatasetReaderException(String, Throwable) - Constructor for exception com.cloudera.cdk.data.DatasetReaderException
 
DatasetReaderException(Throwable) - Constructor for exception com.cloudera.cdk.data.DatasetReaderException
 
DatasetRepository - Interface in com.cloudera.cdk.data
A logical repository (storage system) of Datasets.
DatasetRepositoryException - Exception in com.cloudera.cdk.data
Exception thrown for dataset repository-related failures.
DatasetRepositoryException() - Constructor for exception com.cloudera.cdk.data.DatasetRepositoryException
 
DatasetRepositoryException(String) - Constructor for exception com.cloudera.cdk.data.DatasetRepositoryException
 
DatasetRepositoryException(String, Throwable) - Constructor for exception com.cloudera.cdk.data.DatasetRepositoryException
 
DatasetRepositoryException(Throwable) - Constructor for exception com.cloudera.cdk.data.DatasetRepositoryException
 
DatasetWriter<E> - Interface in com.cloudera.cdk.data
A stream-oriented dataset writer.
DatasetWriterException - Exception in com.cloudera.cdk.data
Exception thrown for dataset writer-related failures.
DatasetWriterException() - Constructor for exception com.cloudera.cdk.data.DatasetWriterException
 
DatasetWriterException(String) - Constructor for exception com.cloudera.cdk.data.DatasetWriterException
 
DatasetWriterException(String, Throwable) - Constructor for exception com.cloudera.cdk.data.DatasetWriterException
 
DatasetWriterException(Throwable) - Constructor for exception com.cloudera.cdk.data.DatasetWriterException
 
day(String, String) - Static method in class com.cloudera.cdk.data.partition.PartitionFunctions
 
day(String, String) - Method in class com.cloudera.cdk.data.PartitionStrategy.Builder
Configure a partitioner for extracting the day from a timestamp field.
DayOfMonthFieldPartitioner - Class in com.cloudera.cdk.data.partition
 
DayOfMonthFieldPartitioner(String, String) - Constructor for class com.cloudera.cdk.data.partition.DayOfMonthFieldPartitioner
 
DecodeBase64Builder - Class in com.cloudera.cdk.morphline.stdlib
Command that converts a Base64 encoded String to a byte[] per Section 6.8.
DecodeBase64Builder() - Constructor for class com.cloudera.cdk.morphline.stdlib.DecodeBase64Builder
 
DecompressBuilder - Class in com.cloudera.cdk.morphline.tika.decompress
Command that decompresses the first attachment.
DecompressBuilder() - Constructor for class com.cloudera.cdk.morphline.tika.decompress.DecompressBuilder
 
delete(String) - Method in class com.cloudera.cdk.data.filesystem.FileSystemMetadataProvider
 
delete(String) - Method in interface com.cloudera.cdk.data.MetadataProvider
Delete all metadata associated with the dataset named name.
detectCharset(Record, Charset) - Method in class com.cloudera.cdk.morphline.stdio.AbstractParser
 
DetectMimeTypeBuilder - Class in com.cloudera.cdk.morphline.tika
Command that auto-detects the MIME type of the first attachment, if no MIME type is defined yet.
DetectMimeTypeBuilder() - Constructor for class com.cloudera.cdk.morphline.tika.DetectMimeTypeBuilder
 
DocumentLoader - Interface in com.cloudera.cdk.morphline.solr
A vehicle to load a list of Solr documents into some kind of destination, such as a SolrServer or MapReduce RecordWriter.
doNotify(Record) - Method in class com.cloudera.cdk.morphline.base.AbstractCommand
 
doProcess(Record) - Method in class com.cloudera.cdk.morphline.base.AbstractCommand
 
doProcess(Record) - Method in class com.cloudera.cdk.morphline.stdio.AbstractParser
 
doProcess(Record, InputStream) - Method in class com.cloudera.cdk.morphline.stdio.AbstractParser
 
doProcess(Record) - Method in class com.cloudera.cdk.morphline.stdlib.AbstractFieldTransformCommand
 
drop(String) - Method in interface com.cloudera.cdk.data.DatasetRepository
Drop the named Dataset.
drop(String) - Method in class com.cloudera.cdk.data.filesystem.FileSystemDatasetRepository
 
drop(String) - Method in class com.cloudera.cdk.data.hcatalog.HCatalogDatasetRepository
 
dropPartition(PartitionKey) - Method in interface com.cloudera.cdk.data.Dataset
Drop a partition for a PartitionKey.
DropRecordBuilder - Class in com.cloudera.cdk.morphline.stdlib
Command that silently consumes records without ever emitting any record - think /dev/null.
DropRecordBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.DropRecordBuilder
 

E

ELAPSED_TIME - Static variable in class com.cloudera.cdk.morphline.base.Metrics
 
equals(Object) - Method in class com.cloudera.cdk.data.Format
 
equals(Object) - Method in class com.cloudera.cdk.data.partition.HashFieldPartitioner
 
equals(Object) - Method in class com.cloudera.cdk.data.partition.IdentityFieldPartitioner
 
equals(Object) - Method in class com.cloudera.cdk.data.partition.IntRangeFieldPartitioner
 
equals(Object) - Method in class com.cloudera.cdk.data.partition.ListFieldPartitioner
 
equals(Object) - Method in class com.cloudera.cdk.data.partition.RangeFieldPartitioner
 
equals(Object) - Method in class com.cloudera.cdk.data.PartitionKey
 
equals(Object) - Method in class com.cloudera.cdk.data.PartitionStrategy
 
equals(Object) - Method in class com.cloudera.cdk.morphline.api.Record
 
EqualsBuilder - Class in com.cloudera.cdk.morphline.stdlib
Command that succeeds if all field values of the given named fields are equal to the the given values, and fails otherwise.
EqualsBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.EqualsBuilder
 
eval(String, ScriptContext) - Method in class com.cloudera.cdk.morphline.scriptengine.java.FastJavaScriptEngine
 
eval(Reader, ScriptContext) - Method in class com.cloudera.cdk.morphline.scriptengine.java.FastJavaScriptEngine
 
eval(Object...) - Method in class com.cloudera.cdk.morphline.scriptengine.java.FastJavaScriptEngine.JavaCompiledScript
 
eval(ScriptContext) - Method in class com.cloudera.cdk.morphline.scriptengine.java.FastJavaScriptEngine.JavaCompiledScript
 
evaluate(Record) - Method in class com.cloudera.cdk.morphline.base.FieldExpression
Returns the values of a Record referred to by the given field expression
evaluate(Object...) - Method in class com.cloudera.cdk.morphline.scriptengine.java.ScriptEvaluator
 
ExceptionHandler - Interface in com.cloudera.cdk.morphline.api
Morphline-wide default handler that commands can choose to use to handle exceptions.
ExtractAvroPathsBuilder - Class in com.cloudera.cdk.morphline.avro
Command that uses zero or more avro path expressions to extract values from an Avro object.
ExtractAvroPathsBuilder() - Constructor for class com.cloudera.cdk.morphline.avro.ExtractAvroPathsBuilder
 
ExtractAvroTreeBuilder - Class in com.cloudera.cdk.morphline.avro
Command that converts an attached Avro datum to a morphline record by recursively walking the Avro tree and extracting all data into a single morphline record, with fields named by their path in the Avro tree.
ExtractAvroTreeBuilder() - Constructor for class com.cloudera.cdk.morphline.avro.ExtractAvroTreeBuilder
 
ExtractJsonPathsBuilder - Class in com.cloudera.cdk.morphline.json
Command that uses zero or more JSON path expressions to extract values from a JSON object.
ExtractJsonPathsBuilder() - Constructor for class com.cloudera.cdk.morphline.json.ExtractJsonPathsBuilder
 
ExtractURIComponentsBuilder - Class in com.cloudera.cdk.morphline.stdlib
A command that extracts subcomponents from the URIs contained in the given input field and appends them to output fields with the given prefix, namely scheme, authority, host, port, path, query, fragment, schemeSpecificPart, userInfo.
ExtractURIComponentsBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.ExtractURIComponentsBuilder
 
ExtractURIQueryParametersBuilder - Class in com.cloudera.cdk.morphline.stdlib
A command that extracts the query parameters with a given name from the URIs contained in the given input field and appends them to the given output field.
ExtractURIQueryParametersBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.ExtractURIQueryParametersBuilder
 

F

FastJavaScriptEngine - Class in com.cloudera.cdk.morphline.scriptengine.java
This is a fast script engine for Java programming language - I modified JSR 223 Java Scripting a little for some 100x less overhead for invoking "static" methods.
FastJavaScriptEngine() - Constructor for class com.cloudera.cdk.morphline.scriptengine.java.FastJavaScriptEngine
 
FastJavaScriptEngine.JavaCompiledScript - Class in com.cloudera.cdk.morphline.scriptengine.java
 
FaultTolerance - Class in com.cloudera.cdk.morphline.base
Mission critical, large-scale online production systems need to make progress without downtime despite some issues.
FaultTolerance(boolean, boolean) - Constructor for class com.cloudera.cdk.morphline.base.FaultTolerance
 
FaultTolerance(boolean, boolean, String) - Constructor for class com.cloudera.cdk.morphline.base.FaultTolerance
 
FIELD_NAME - Static variable in class com.cloudera.cdk.morphline.stdlib.AddCurrentTimeBuilder
 
FIELD_NAME - Static variable in class com.cloudera.cdk.morphline.stdlib.AddLocalHostBuilder
 
FIELD_NAME - Static variable in class com.cloudera.cdk.morphline.stdlib.GenerateUUIDBuilder
 
FieldExpression - Class in com.cloudera.cdk.morphline.base
Helper to fetch the values of a field of a Record referred to by a field expression, which is a String of the form @{fieldname}.
FieldExpression(String, Config) - Constructor for class com.cloudera.cdk.morphline.base.FieldExpression
 
FieldPartitioner - Class in com.cloudera.cdk.data
Partitions values for a named field.
FieldPartitioner(String, int) - Constructor for class com.cloudera.cdk.data.FieldPartitioner
 
FieldPartitioner(String, String, int) - Constructor for class com.cloudera.cdk.data.FieldPartitioner
 
Fields - Class in com.cloudera.cdk.morphline.base
Common record field names.
Fields() - Constructor for class com.cloudera.cdk.morphline.base.Fields
 
fileSystem(FileSystem) - Method in class com.cloudera.cdk.data.filesystem.FileSystemDatasetRepository.Builder
The FileSystem to store metadata and dataset files in.
fileSystem(FileSystem) - Method in class com.cloudera.cdk.data.hcatalog.HCatalogDatasetRepository.Builder
The FileSystem to store dataset files in.
FileSystemDatasetRepository - Class in com.cloudera.cdk.data.filesystem
A DatasetRepository that stores data in a Hadoop FileSystem.
FileSystemDatasetRepository(FileSystem, Path) - Constructor for class com.cloudera.cdk.data.filesystem.FileSystemDatasetRepository
Construct a FileSystemDatasetRepository on the given FileSystem and root directory, and a FileSystemMetadataProvider with the same FileSystem and root directory.
FileSystemDatasetRepository(URI) - Constructor for class com.cloudera.cdk.data.filesystem.FileSystemDatasetRepository
Construct a FileSystemDatasetRepository with a root directory at the given URI, and a FileSystemMetadataProvider with the same root directory.
FileSystemDatasetRepository(FileSystem, Path, MetadataProvider) - Constructor for class com.cloudera.cdk.data.filesystem.FileSystemDatasetRepository
Construct a FileSystemDatasetRepository on the given FileSystem and root directory, with the given MetadataProvider for metadata storage.
FileSystemDatasetRepository.Builder - Class in com.cloudera.cdk.data.filesystem
A fluent builder to aid in the construction of FileSystemDatasetRepository instances.
FileSystemDatasetRepository.Builder() - Constructor for class com.cloudera.cdk.data.filesystem.FileSystemDatasetRepository.Builder
 
FileSystemMetadataProvider - Class in com.cloudera.cdk.data.filesystem
A MetadataProvider that stores dataset metadata in a Hadoop FileSystem.
FileSystemMetadataProvider(FileSystem, Path) - Constructor for class com.cloudera.cdk.data.filesystem.FileSystemMetadataProvider
 
find(String, Config, String) - Method in class com.cloudera.cdk.morphline.base.Compiler
Finds the given morphline id within the given morphline config, using the given nameForErrorMsg for error reporting.
FindReplaceBuilder - Class in com.cloudera.cdk.morphline.stdlib
Command that examines each string value in a given field and replaces each substring of the string value that matches the given string literal or grok pattern with the given replacement.
FindReplaceBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.FindReplaceBuilder
 
flush() - Method in interface com.cloudera.cdk.data.DatasetWriter
Force or commit any outstanding data to storage.
format(Format) - Method in class com.cloudera.cdk.data.DatasetDescriptor.Builder
Configure the dataset's format.
Format - Class in com.cloudera.cdk.data
The data format used for encoding the data in a Dataset when stored in a DatasetRepository.
Formats - Class in com.cloudera.cdk.data
Contains constant definitions for the standard Format instances supported by the library.

G

GenerateSolrSequenceKeyBuilder - Class in com.cloudera.cdk.morphline.solr
A command that assigns a record unique key that is the concatenation of the given baseIdField record field, followed by a running count of the record number within the current session.
GenerateSolrSequenceKeyBuilder() - Constructor for class com.cloudera.cdk.morphline.solr.GenerateSolrSequenceKeyBuilder
 
GenerateUUIDBuilder - Class in com.cloudera.cdk.morphline.stdlib
A command that sets a universally unique identifier on all records that are intercepted.
GenerateUUIDBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.GenerateUUIDBuilder
 
get() - Method in class com.cloudera.cdk.data.DatasetDescriptor.Builder
Get an instance of the configured dataset descriptor.
get(String) - Method in interface com.cloudera.cdk.data.DatasetRepository
Get the latest version of a named Dataset.
get() - Method in class com.cloudera.cdk.data.filesystem.FileSystemDatasetRepository.Builder
 
get(String) - Method in class com.cloudera.cdk.data.filesystem.FileSystemDatasetRepository
 
get() - Method in class com.cloudera.cdk.data.hcatalog.HCatalogDatasetRepository.Builder
 
get(String) - Method in class com.cloudera.cdk.data.hcatalog.HCatalogDatasetRepository
 
get(int) - Method in class com.cloudera.cdk.data.PartitionKey
Return the value at the specified index in the key.
get() - Method in class com.cloudera.cdk.data.PartitionStrategy.Builder
Get the configured PartitionStrategy instance.
get(String) - Method in class com.cloudera.cdk.morphline.api.Record
 
getBatchSize() - Method in class com.cloudera.cdk.morphline.solr.SolrLocator
 
getBoolean(Config, String, boolean) - Method in class com.cloudera.cdk.morphline.base.Configs
 
getBoolean(Config, String) - Method in class com.cloudera.cdk.morphline.base.Configs
 
getCardinality() - Method in class com.cloudera.cdk.data.FieldPartitioner
 
getCardinality() - Method in class com.cloudera.cdk.data.PartitionStrategy
Return the cardinality produced by the contained field partitioners.
getCharset(Config, String, Charset) - Method in class com.cloudera.cdk.morphline.base.Configs
 
getChild() - Method in class com.cloudera.cdk.morphline.base.AbstractCommand
 
getCollectionName() - Method in class com.cloudera.cdk.morphline.solr.SolrLocator
 
getCommandBuilder(String) - Method in class com.cloudera.cdk.morphline.api.MorphlineContext
 
getConfig() - Method in class com.cloudera.cdk.morphline.base.AbstractCommand
 
getConfig(Config, String, Config) - Method in class com.cloudera.cdk.morphline.base.Configs
 
getConfig(Config, String) - Method in class com.cloudera.cdk.morphline.base.Configs
 
getConfigList(Config, String, List<? extends Config>) - Method in class com.cloudera.cdk.morphline.base.Configs
 
getConfigList(Config, String) - Method in class com.cloudera.cdk.morphline.base.Configs
 
getConfigs() - Method in class com.cloudera.cdk.morphline.base.AbstractCommand
 
getContext() - Method in class com.cloudera.cdk.morphline.base.AbstractCommand
 
getCounter(String...) - Method in class com.cloudera.cdk.morphline.base.AbstractCommand
 
getDescriptor() - Method in interface com.cloudera.cdk.data.Dataset
Get the DatasetDescriptor associated with this dataset.
getDocumentLoader() - Method in class com.cloudera.cdk.morphline.solr.SolrMorphlineContext
 
getDouble(Config, String, double) - Method in class com.cloudera.cdk.morphline.base.Configs
 
getDouble(Config, String) - Method in class com.cloudera.cdk.morphline.base.Configs
 
getEngine() - Method in class com.cloudera.cdk.morphline.scriptengine.java.FastJavaScriptEngine.JavaCompiledScript
 
getEngineName() - Method in class com.cloudera.cdk.morphline.scriptengine.java.JavaScriptEngineFactory
 
getEngineVersion() - Method in class com.cloudera.cdk.morphline.scriptengine.java.JavaScriptEngineFactory
 
getExceptionHandler() - Method in class com.cloudera.cdk.morphline.api.MorphlineContext
 
getExtension() - Method in class com.cloudera.cdk.data.Format
Get the extension for use in filenames.
getExtensions() - Method in class com.cloudera.cdk.morphline.scriptengine.java.JavaScriptEngineFactory
 
getFactory() - Method in class com.cloudera.cdk.morphline.scriptengine.java.FastJavaScriptEngine
 
getFieldPartitioners() - Method in class com.cloudera.cdk.data.PartitionStrategy
Get the list of field partitioners used for partitioning.
getFields() - Method in class com.cloudera.cdk.morphline.api.Record
 
getFileSystem() - Method in class com.cloudera.cdk.data.filesystem.FileSystemDatasetRepository
 
getFirstValue(String) - Method in class com.cloudera.cdk.morphline.api.Record
 
getFormat() - Method in class com.cloudera.cdk.data.DatasetDescriptor
Get the associated Format that the data is stored in.
getHistogram(String...) - Method in class com.cloudera.cdk.morphline.base.AbstractCommand
 
getIndexSchema() - Method in class com.cloudera.cdk.morphline.solr.SolrLocator
 
getIndexSchema() - Method in class com.cloudera.cdk.morphline.solr.SolrMorphlineContext
 
getInt(Config, String, int) - Method in class com.cloudera.cdk.morphline.base.Configs
 
getInt(Config, String) - Method in class com.cloudera.cdk.morphline.base.Configs
 
getLanguageName() - Method in class com.cloudera.cdk.morphline.scriptengine.java.JavaScriptEngineFactory
 
getLanguageVersion() - Method in class com.cloudera.cdk.morphline.scriptengine.java.JavaScriptEngineFactory
 
getLength() - Method in class com.cloudera.cdk.data.PartitionKey
Return the number of values in the key.
getLifecycleEvents(Record) - Static method in class com.cloudera.cdk.morphline.base.Notifications
 
getLoader() - Method in class com.cloudera.cdk.morphline.solr.SolrLocator
 
getLong(Config, String, long) - Method in class com.cloudera.cdk.morphline.base.Configs
 
getLong(Config, String) - Method in class com.cloudera.cdk.morphline.base.Configs
 
getMetadataProvider() - Method in class com.cloudera.cdk.data.filesystem.FileSystemDatasetRepository
 
getMeter(String...) - Method in class com.cloudera.cdk.morphline.base.AbstractCommand
 
getMethodCallSyntax(String, String, String...) - Method in class com.cloudera.cdk.morphline.scriptengine.java.JavaScriptEngineFactory
 
getMetricRegistry() - Method in class com.cloudera.cdk.morphline.api.MorphlineContext
 
getMimeTypes() - Method in class com.cloudera.cdk.morphline.scriptengine.java.JavaScriptEngineFactory
 
getName() - Method in interface com.cloudera.cdk.data.Dataset
Get the name of a Dataset.
getName() - Method in class com.cloudera.cdk.data.FieldPartitioner
 
getName() - Method in class com.cloudera.cdk.data.Format
Get the format's name.
getNames() - Method in interface com.cloudera.cdk.morphline.api.CommandBuilder
Returns the names with which this command can be invoked.
getNames() - Method in class com.cloudera.cdk.morphline.avro.ExtractAvroPathsBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.avro.ExtractAvroTreeBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.avro.ReadAvroBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.avro.ReadAvroContainerBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.hadoop.sequencefile.ReadSequenceFileBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.json.ExtractJsonPathsBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.json.ReadJsonBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.saxon.ConvertHTMLBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.saxon.XQueryBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.saxon.XSLTBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.scriptengine.java.JavaScriptEngineFactory
 
getNames() - Method in class com.cloudera.cdk.morphline.solr.GenerateSolrSequenceKeyBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.solr.LoadSolrBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.solr.SanitizeUnknownSolrFieldsBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.solr.TokenizeTextBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.solrcell.SolrCellBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdio.ReadClobBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdio.ReadCSVBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdio.ReadLineBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdio.ReadMultiLineBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.AddCurrentTimeBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.AddLocalHostBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.AddValuesBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.AddValuesIfAbsentBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.CallParentPipeBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.ContainsBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.ConvertTimestampBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.DecodeBase64Builder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.DropRecordBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.EqualsBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.ExtractURIComponentsBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.ExtractURIQueryParametersBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.FindReplaceBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.GenerateUUIDBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.GrokBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.IfThenElseBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.JavaBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.LogDebugBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.LogErrorBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.LogInfoBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.LogTraceBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.LogWarnBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.NotBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.PipeBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.SeparateAttachmentsBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.SetValuesBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.SplitBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.ToStringBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.TranslateBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.TryRulesBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.tika.decompress.DecompressBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.tika.decompress.UnpackBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.tika.DetectMimeTypeBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.twitter.ReadJsonTestTweetsBuilder
 
getOutputStatement(String) - Method in class com.cloudera.cdk.morphline.scriptengine.java.JavaScriptEngineFactory
 
getParameter(String) - Method in class com.cloudera.cdk.morphline.scriptengine.java.JavaScriptEngineFactory
 
getParent() - Method in interface com.cloudera.cdk.morphline.api.Command
Returns the parent of this command.
getParent() - Method in class com.cloudera.cdk.morphline.base.AbstractCommand
 
getPartition(PartitionKey, boolean) - Method in interface com.cloudera.cdk.data.Dataset
Get a partition for a PartitionKey, possibly creating the partition if it doesn't already exist.
getPartitions() - Method in interface com.cloudera.cdk.data.Dataset
Return partitions, if this dataset is partitioned.
getPartitionStrategy() - Method in class com.cloudera.cdk.data.DatasetDescriptor
Get the PartitionStrategy, if this dataset is partitioned.
getProgram(String...) - Method in class com.cloudera.cdk.morphline.scriptengine.java.JavaScriptEngineFactory
 
getReader() - Method in interface com.cloudera.cdk.data.Dataset
Get an appropriate DatasetReader implementation based on the underlying Dataset implementation.
getRootDirectory() - Method in class com.cloudera.cdk.data.filesystem.FileSystemDatasetRepository
 
getSchema() - Method in class com.cloudera.cdk.data.DatasetDescriptor
Get the associated Schema.
getSchemaUrl() - Method in class com.cloudera.cdk.data.DatasetDescriptor
Get a URL from which the Schema may be retrieved.
getScriptEngine() - Method in class com.cloudera.cdk.morphline.scriptengine.java.JavaScriptEngineFactory
 
getServerUrl() - Method in class com.cloudera.cdk.morphline.solr.SolrLocator
 
getSolrHomeDir() - Method in class com.cloudera.cdk.morphline.solr.SolrLocator
 
getSolrServer() - Method in class com.cloudera.cdk.morphline.solr.SolrServerDocumentLoader
 
getSourceName() - Method in class com.cloudera.cdk.data.FieldPartitioner
 
getString(Config, String, String) - Method in class com.cloudera.cdk.morphline.base.Configs
 
getString(Config, String) - Method in class com.cloudera.cdk.morphline.base.Configs
 
getStringList(Config, String, List<String>) - Method in class com.cloudera.cdk.morphline.base.Configs
 
getStringList(Config, String) - Method in class com.cloudera.cdk.morphline.base.Configs
 
getTimer(String...) - Method in class com.cloudera.cdk.morphline.base.AbstractCommand
 
getUpperBounds() - Method in class com.cloudera.cdk.data.partition.RangeFieldPartitioner
 
getValues() - Method in class com.cloudera.cdk.data.PartitionKey
 
getWriter() - Method in interface com.cloudera.cdk.data.Dataset
Get an appropriate DatasetWriter implementation based on the underlying Dataset implementation.
getZkHost() - Method in class com.cloudera.cdk.morphline.solr.SolrLocator
 
GrokBuilder - Class in com.cloudera.cdk.morphline.stdlib
The Grok command uses regular expression pattern matching to extract structured fields from unstructured log data.
GrokBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.GrokBuilder
 

H

handleException(Throwable, Record) - Method in interface com.cloudera.cdk.morphline.api.ExceptionHandler
Handle the given exception resulting from the given input record (the record can be null)
handleException(Throwable, Record) - Method in class com.cloudera.cdk.morphline.base.FaultTolerance
 
hash(String, int) - Static method in class com.cloudera.cdk.data.partition.PartitionFunctions
 
hash(String, String, int) - Static method in class com.cloudera.cdk.data.partition.PartitionFunctions
 
hash(String, int) - Method in class com.cloudera.cdk.data.PartitionStrategy.Builder
Configure a hash partitioner with the specified number of buckets .
hash(String, String, int) - Method in class com.cloudera.cdk.data.PartitionStrategy.Builder
Configure a hash partitioner with the specified number of buckets .
hashCode() - Method in class com.cloudera.cdk.data.Format
 
hashCode() - Method in class com.cloudera.cdk.data.partition.HashFieldPartitioner
 
hashCode() - Method in class com.cloudera.cdk.data.partition.IdentityFieldPartitioner
 
hashCode() - Method in class com.cloudera.cdk.data.partition.IntRangeFieldPartitioner
 
hashCode() - Method in class com.cloudera.cdk.data.partition.ListFieldPartitioner
 
hashCode() - Method in class com.cloudera.cdk.data.partition.RangeFieldPartitioner
 
hashCode() - Method in class com.cloudera.cdk.data.PartitionKey
 
hashCode() - Method in class com.cloudera.cdk.data.PartitionStrategy
 
hashCode() - Method in class com.cloudera.cdk.morphline.api.Record
 
HashFieldPartitioner - Class in com.cloudera.cdk.data.partition
 
HashFieldPartitioner(String, int) - Constructor for class com.cloudera.cdk.data.partition.HashFieldPartitioner
 
HashFieldPartitioner(String, String, int) - Constructor for class com.cloudera.cdk.data.partition.HashFieldPartitioner
 
hasNext() - Method in interface com.cloudera.cdk.data.DatasetReader
Tests the reader to see if additional entities can be read.
HCatalogDatasetRepository - Class in com.cloudera.cdk.data.hcatalog
A DatasetRepository that uses the Hive/HCatalog metastore for metadata, and stores data in a Hadoop FileSystem.
HCatalogDatasetRepository() - Constructor for class com.cloudera.cdk.data.hcatalog.HCatalogDatasetRepository
Create an HCatalog dataset repository with managed tables.
HCatalogDatasetRepository(URI) - Constructor for class com.cloudera.cdk.data.hcatalog.HCatalogDatasetRepository
Create an HCatalog dataset repository with external tables.
HCatalogDatasetRepository(FileSystem, Path) - Constructor for class com.cloudera.cdk.data.hcatalog.HCatalogDatasetRepository
Create an HCatalog dataset repository with external tables.
HCatalogDatasetRepository.Builder - Class in com.cloudera.cdk.data.hcatalog
A fluent builder to aid in the construction of HCatalogDatasetRepository instances.
HCatalogDatasetRepository.Builder() - Constructor for class com.cloudera.cdk.data.hcatalog.HCatalogDatasetRepository.Builder
 
hour(String, String) - Static method in class com.cloudera.cdk.data.partition.PartitionFunctions
 
hour(String, String) - Method in class com.cloudera.cdk.data.PartitionStrategy.Builder
Configure a partitioner for extracting the hour from a timestamp field.
HourFieldPartitioner - Class in com.cloudera.cdk.data.partition
 
HourFieldPartitioner(String, String) - Constructor for class com.cloudera.cdk.data.partition.HourFieldPartitioner
 

I

ID - Static variable in class com.cloudera.cdk.morphline.base.Fields
 
identity(String, int) - Static method in class com.cloudera.cdk.data.partition.PartitionFunctions
 
identity(String, int) - Method in class com.cloudera.cdk.data.PartitionStrategy.Builder
Configure an identity partitioner with a cardinality hint of buckets size.
IdentityFieldPartitioner - Class in com.cloudera.cdk.data.partition
 
IdentityFieldPartitioner(String, int) - Constructor for class com.cloudera.cdk.data.partition.IdentityFieldPartitioner
 
IfThenElseBuilder - Class in com.cloudera.cdk.morphline.stdlib
An If-Then-Else command consists of a chain of zero or more conditions commands, as well as a chain of zero or or more commands that are processed if all conditions succeed ("then commands"), as well as a chain of zero or more commands that are processed if one of the conditions fails ("else commands").
IfThenElseBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.IfThenElseBuilder
 
importCommandBuilders(Collection<String>) - Method in class com.cloudera.cdk.morphline.api.MorphlineContext
 
incrementNumRecords() - Method in class com.cloudera.cdk.morphline.stdio.AbstractParser
 
IntRangeFieldPartitioner - Class in com.cloudera.cdk.data.partition
 
IntRangeFieldPartitioner(String, int...) - Constructor for class com.cloudera.cdk.data.partition.IntRangeFieldPartitioner
 
IS_IGNORING_RECOVERABLE_EXCEPTIONS - Static variable in class com.cloudera.cdk.morphline.base.FaultTolerance
 
IS_PRODUCTION_MODE - Static variable in class com.cloudera.cdk.morphline.base.FaultTolerance
 
isOpen() - Method in interface com.cloudera.cdk.data.DatasetReader
 
isOpen() - Method in interface com.cloudera.cdk.data.DatasetWriter
 
isPartitioned() - Method in class com.cloudera.cdk.data.DatasetDescriptor
Returns true if an associated dataset is partitioned (that is, has an associated PartitionStrategy, false otherwise.

J

JavaBuilder - Class in com.cloudera.cdk.morphline.stdlib
The "java" command compiles the given Java code block, wrapped into a Java method with a boolean return type and several parameters, along with a Java class definition that contains the given import statements.
JavaBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.JavaBuilder
 
JavaScriptEngineFactory - Class in com.cloudera.cdk.morphline.scriptengine.java
This is script engine factory for "Java" script engine.
JavaScriptEngineFactory() - Constructor for class com.cloudera.cdk.morphline.scriptengine.java.JavaScriptEngineFactory
 
JobClasspathHelper - Class in com.cloudera.cdk.tools
This class is an helper to copy the jars needed by the job in the Distributed cache.
JobClasspathHelper() - Constructor for class com.cloudera.cdk.tools.JobClasspathHelper
 

L

LIFE_CYLCLE - Static variable in class com.cloudera.cdk.morphline.base.Notifications
 
ListFieldPartitioner - Class in com.cloudera.cdk.data.partition
 
ListFieldPartitioner(String, List<Set<?>>) - Constructor for class com.cloudera.cdk.data.partition.ListFieldPartitioner
 
load(String) - Method in class com.cloudera.cdk.data.filesystem.FileSystemMetadataProvider
 
load(String) - Method in interface com.cloudera.cdk.data.MetadataProvider
Load the dataset descriptor for the dataset name.
load(SolrInputDocument) - Method in interface com.cloudera.cdk.morphline.solr.DocumentLoader
Loads the given document into the destination
load(SolrInputDocument) - Method in class com.cloudera.cdk.morphline.solr.SolrServerDocumentLoader
 
LoadSolrBuilder - Class in com.cloudera.cdk.morphline.solr
A command that loads a record into a SolrServer or MapReduce SolrOutputFormat.
LoadSolrBuilder() - Constructor for class com.cloudera.cdk.morphline.solr.LoadSolrBuilder
 
LOG - Variable in class com.cloudera.cdk.morphline.base.AbstractCommand
 
Log4jAppender - Class in com.cloudera.cdk.data.flume
 
Log4jAppender() - Constructor for class com.cloudera.cdk.data.flume.Log4jAppender
 
Log4jAppender(String, int) - Constructor for class com.cloudera.cdk.data.flume.Log4jAppender
Sets the hostname and port.
LogDebugBuilder - Class in com.cloudera.cdk.morphline.stdlib
Command that logs to slf4j at DEBUG level.
LogDebugBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.LogDebugBuilder
 
LogErrorBuilder - Class in com.cloudera.cdk.morphline.stdlib
Command that logs to slf4j at ERROR level.
LogErrorBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.LogErrorBuilder
 
LogInfoBuilder - Class in com.cloudera.cdk.morphline.stdlib
Command that logs to slf4j at INFO level.
LogInfoBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.LogInfoBuilder
 
LogTraceBuilder - Class in com.cloudera.cdk.morphline.stdlib
Command that logs to slf4j at TRACE level.
LogTraceBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.LogTraceBuilder
 
LogWarnBuilder - Class in com.cloudera.cdk.morphline.stdlib
Command that logs to slf4j at WARN level.
LogWarnBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.LogWarnBuilder
 

M

main(String...) - Static method in class com.cloudera.cdk.tools.CombinedLogFormatConverter
 
MESSAGE - Static variable in class com.cloudera.cdk.morphline.base.Fields
 
metadataProvider(MetadataProvider) - Method in class com.cloudera.cdk.data.filesystem.FileSystemDatasetRepository.Builder
The MetadataProvider for metadata storage.
MetadataProvider - Interface in com.cloudera.cdk.data
Defines a service provider interface for metadata system plugins.
MetadataProviderException - Exception in com.cloudera.cdk.data
Exception thrown for metadata provider-related failures.
MetadataProviderException() - Constructor for exception com.cloudera.cdk.data.MetadataProviderException
 
MetadataProviderException(String) - Constructor for exception com.cloudera.cdk.data.MetadataProviderException
 
MetadataProviderException(String, Throwable) - Constructor for exception com.cloudera.cdk.data.MetadataProviderException
 
MetadataProviderException(Throwable) - Constructor for exception com.cloudera.cdk.data.MetadataProviderException
 
Metrics - Class in com.cloudera.cdk.morphline.base
Common metric names.
Metrics() - Constructor for class com.cloudera.cdk.morphline.base.Metrics
 
MIME_TYPE - Static variable in class com.cloudera.cdk.morphline.avro.ReadAvroContainerBuilder
The MIME type that input attachments shall have
MIME_TYPE - Static variable in class com.cloudera.cdk.morphline.json.ReadJsonBuilder
The MIME type identifier that will be filled into output records
minute(String, String) - Static method in class com.cloudera.cdk.data.partition.PartitionFunctions
 
minute(String, String) - Method in class com.cloudera.cdk.data.PartitionStrategy.Builder
Configure a partitioner for extracting the minute from a timestamp field.
MinuteFieldPartitioner - Class in com.cloudera.cdk.data.partition
 
MinuteFieldPartitioner(String, String) - Constructor for class com.cloudera.cdk.data.partition.MinuteFieldPartitioner
 
month(String, String) - Static method in class com.cloudera.cdk.data.partition.PartitionFunctions
 
month(String, String) - Method in class com.cloudera.cdk.data.PartitionStrategy.Builder
Configure a partitioner for extracting the month from a timestamp field.
MonthFieldPartitioner - Class in com.cloudera.cdk.data.partition
 
MonthFieldPartitioner(String, String) - Constructor for class com.cloudera.cdk.data.partition.MonthFieldPartitioner
 
MorphlineCompilationException - Exception in com.cloudera.cdk.morphline.api
Indicates a parse error.
MorphlineCompilationException(String, Config) - Constructor for exception com.cloudera.cdk.morphline.api.MorphlineCompilationException
 
MorphlineCompilationException(String, Config, Throwable) - Constructor for exception com.cloudera.cdk.morphline.api.MorphlineCompilationException
 
MorphlineCompilationException(String, Config, Throwable, CommandBuilder) - Constructor for exception com.cloudera.cdk.morphline.api.MorphlineCompilationException
 
MorphlineContext - Class in com.cloudera.cdk.morphline.api
Additional user defined parameters that will be passed to all morphline commands.
MorphlineContext() - Constructor for class com.cloudera.cdk.morphline.api.MorphlineContext
For public access use MorphlineContext.Builder.build() instead
MorphlineContext.Builder - Class in com.cloudera.cdk.morphline.api
Helper to construct a MorphlineContext instance.
MorphlineContext.Builder() - Constructor for class com.cloudera.cdk.morphline.api.MorphlineContext.Builder
 
MorphlineRuntimeException - Exception in com.cloudera.cdk.morphline.api
Base class of all morphline exceptions.
MorphlineRuntimeException(String) - Constructor for exception com.cloudera.cdk.morphline.api.MorphlineRuntimeException
 
MorphlineRuntimeException(String, Throwable) - Constructor for exception com.cloudera.cdk.morphline.api.MorphlineRuntimeException
 
MorphlineRuntimeException(Throwable) - Constructor for exception com.cloudera.cdk.morphline.api.MorphlineRuntimeException
 

N

NotBuilder - Class in com.cloudera.cdk.morphline.stdlib
A Not command consists of one nested command, the boolean return value of which is inverted.
NotBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.NotBuilder
 
Notifications - Class in com.cloudera.cdk.morphline.base
Tools for notifications on the control plane.
Notifications() - Constructor for class com.cloudera.cdk.morphline.base.Notifications
 
Notifications.LifecycleEvent - Enum in com.cloudera.cdk.morphline.base
 
notify(Record) - Method in interface com.cloudera.cdk.morphline.api.Command
Processes the given notification on the control plane of the subtree rooted at this command.
notify(Record) - Method in class com.cloudera.cdk.morphline.base.AbstractCommand
 
notifyBeginTransaction(Command) - Static method in class com.cloudera.cdk.morphline.base.Notifications
 
notifyCommitTransaction(Command) - Static method in class com.cloudera.cdk.morphline.base.Notifications
 
notifyRollbackTransaction(Command) - Static method in class com.cloudera.cdk.morphline.base.Notifications
 
notifyShutdown(Command) - Static method in class com.cloudera.cdk.morphline.base.Notifications
 
notifyStartSession(Command) - Static method in class com.cloudera.cdk.morphline.base.Notifications
 
NUM_NOTIFY_CALLS - Static variable in class com.cloudera.cdk.morphline.base.Metrics
 
NUM_PROCESS_CALLS - Static variable in class com.cloudera.cdk.morphline.base.Metrics
 
NUM_RECORDS - Static variable in class com.cloudera.cdk.morphline.base.Metrics
 

O

open() - Method in interface com.cloudera.cdk.data.DatasetReader
Open the reader, allocating any necessary resources required to produce entities.
open() - Method in interface com.cloudera.cdk.data.DatasetWriter
Open the writer, allocating any necessary resources required to store entities.
OUTPUT_MEDIA_TYPE - Static variable in class com.cloudera.cdk.morphline.hadoop.sequencefile.ReadSequenceFileBuilder
 

P

PARQUET - Static variable in class com.cloudera.cdk.data.Formats
PARQUET: the Parquet columnar format
parse(File, Config...) - Method in class com.cloudera.cdk.morphline.base.Compiler
Loads the given config file from the local file system
PartitionFunctions - Class in com.cloudera.cdk.data.partition
Convenience class so you can say, for example, hash("username", 2) in JEXL.
PartitionFunctions() - Constructor for class com.cloudera.cdk.data.partition.PartitionFunctions
 
PartitionKey - Class in com.cloudera.cdk.data
A key for retrieving partitions from a Dataset.
partitionKey(Object...) - Method in class com.cloudera.cdk.data.PartitionStrategy
Construct a partition key with a variadic array of values corresponding to the field partitioners in this partition strategy.
partitionKeyForEntity(Object) - Method in class com.cloudera.cdk.data.PartitionStrategy
Construct a partition key for the given entity.
partitionKeyForEntity(Object, PartitionKey) - Method in class com.cloudera.cdk.data.PartitionStrategy
Construct a partition key for the given entity, reusing the supplied key if not null.
partitionKeyForPath(Dataset, URI) - Static method in class com.cloudera.cdk.data.filesystem.FileSystemDatasetRepository
Get a PartitionKey corresponding to a partition's filesystem path represented as a URI.
partitionStrategy(PartitionStrategy) - Method in class com.cloudera.cdk.data.DatasetDescriptor.Builder
Configure the dataset's partitioning strategy.
PartitionStrategy - Class in com.cloudera.cdk.data
The strategy used to determine how a dataset is partitioned.
PartitionStrategy(FieldPartitioner...) - Constructor for class com.cloudera.cdk.data.PartitionStrategy
Construct a partition strategy with a variadic array of field partitioners.
PartitionStrategy(List<FieldPartitioner>) - Constructor for class com.cloudera.cdk.data.PartitionStrategy
Construct a partition strategy with a list of field partitioners.
PartitionStrategy.Builder - Class in com.cloudera.cdk.data
A fluent builder to aid in the construction of PartitionStrategys.
PartitionStrategy.Builder() - Constructor for class com.cloudera.cdk.data.PartitionStrategy.Builder
 
pathForDataset(String) - Method in class com.cloudera.cdk.data.filesystem.FileSystemDatasetRepository
Implementations should return the fully-qualified path of the data directory for the dataset with the given name.
ping() - Method in interface com.cloudera.cdk.morphline.solr.DocumentLoader
Issues a ping request to check if the server is alive
ping() - Method in class com.cloudera.cdk.morphline.solr.SolrServerDocumentLoader
 
PipeBuilder - Class in com.cloudera.cdk.morphline.stdlib
Factory to create morphline pipe instances.
PipeBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.PipeBuilder
 
populateAvroHeaders(Map<String, String>, Schema, Object) - Method in class com.cloudera.cdk.data.flume.Log4jAppender
 
PREFIX_NAME - Static variable in class com.cloudera.cdk.morphline.stdlib.GenerateUUIDBuilder
 
prepareClasspath(Configuration, Path, Class<?>...) - Method in class com.cloudera.cdk.tools.JobClasspathHelper
 
PRESERVE_EXISTING_NAME - Static variable in class com.cloudera.cdk.morphline.stdlib.AddCurrentTimeBuilder
 
PRESERVE_EXISTING_NAME - Static variable in class com.cloudera.cdk.morphline.stdlib.AddLocalHostBuilder
 
PRESERVE_EXISTING_NAME - Static variable in class com.cloudera.cdk.morphline.stdlib.GenerateUUIDBuilder
 
process(Record) - Method in interface com.cloudera.cdk.morphline.api.Command
Processes the given record on the data plane of this command.
process(Record) - Method in class com.cloudera.cdk.morphline.base.AbstractCommand
 
put(String, Object) - Method in class com.cloudera.cdk.morphline.api.Record
 
putIfAbsent(String, Object) - Method in class com.cloudera.cdk.morphline.api.Record
 

R

range(String, int...) - Static method in class com.cloudera.cdk.data.partition.PartitionFunctions
 
range(String, Comparable<?>...) - Static method in class com.cloudera.cdk.data.partition.PartitionFunctions
 
range(String, int...) - Method in class com.cloudera.cdk.data.PartitionStrategy.Builder
Configure a range partitioner with a set of upperBounds.
range(String, Comparable<?>...) - Method in class com.cloudera.cdk.data.PartitionStrategy.Builder
Configure a range partitioner with a set of upperBounds.
RangeFieldPartitioner - Class in com.cloudera.cdk.data.partition
 
RangeFieldPartitioner(String, Comparable<?>...) - Constructor for class com.cloudera.cdk.data.partition.RangeFieldPartitioner
 
read() - Method in interface com.cloudera.cdk.data.DatasetReader
Fetch the next entity from the reader.
ReadAvroBuilder - Class in com.cloudera.cdk.morphline.avro
Command that parses an InputStream that contains Avro data; for each Avro datum, the command emits a morphline record containing the datum as an attachment in Fields.ATTACHMENT_BODY.
ReadAvroBuilder() - Constructor for class com.cloudera.cdk.morphline.avro.ReadAvroBuilder
 
ReadAvroContainerBuilder - Class in com.cloudera.cdk.morphline.avro
Command that parses an InputStream that contains Avro binary container file data; for each Avro datum, the command emits a morphline record containing the datum as an attachment in Fields.ATTACHMENT_BODY.
ReadAvroContainerBuilder() - Constructor for class com.cloudera.cdk.morphline.avro.ReadAvroContainerBuilder
 
ReadClobBuilder - Class in com.cloudera.cdk.morphline.stdio
Command that emits one record for the entire input stream of the first attachment, interpreting the stream as a Character Large Object (CLOB).
ReadClobBuilder() - Constructor for class com.cloudera.cdk.morphline.stdio.ReadClobBuilder
 
ReadCSVBuilder - Class in com.cloudera.cdk.morphline.stdio
Command that extracts zero or more records from the input stream of the first attachment, representing a Comma Separated Values (CSV) file.
ReadCSVBuilder() - Constructor for class com.cloudera.cdk.morphline.stdio.ReadCSVBuilder
 
ReadJsonBuilder - Class in com.cloudera.cdk.morphline.json
Command that parses an InputStream that contains JSON data; for each JSON object in the stream, the command emits a morphline record containing the object as an attachment in Fields.ATTACHMENT_BODY.
ReadJsonBuilder() - Constructor for class com.cloudera.cdk.morphline.json.ReadJsonBuilder
 
ReadJsonTestTweetsBuilder - Class in com.cloudera.cdk.morphline.twitter
JSON parser that extracts search documents from twitter tweets obtained from the twitter 1% sample firehose with the delimited=length option.
ReadJsonTestTweetsBuilder() - Constructor for class com.cloudera.cdk.morphline.twitter.ReadJsonTestTweetsBuilder
 
ReadLineBuilder - Class in com.cloudera.cdk.morphline.stdio
Command that emits one record per line in the input stream of the first attachment.
ReadLineBuilder() - Constructor for class com.cloudera.cdk.morphline.stdio.ReadLineBuilder
 
ReadMultiLineBuilder - Class in com.cloudera.cdk.morphline.stdio
Multiline log parser that collapse multiline messages into a single record; supports "regex", "what" and "negate" configuration parameters similar to logstash.
ReadMultiLineBuilder() - Constructor for class com.cloudera.cdk.morphline.stdio.ReadMultiLineBuilder
 
ReadSequenceFileBuilder - Class in com.cloudera.cdk.morphline.hadoop.sequencefile
Command that emits one record per sequence file entry in the input stream of the first attachment.
ReadSequenceFileBuilder() - Constructor for class com.cloudera.cdk.morphline.hadoop.sequencefile.ReadSequenceFileBuilder
 
Record - Class in com.cloudera.cdk.morphline.api
A record is a set of named fields where each field has a list of one or more values.
Record() - Constructor for class com.cloudera.cdk.morphline.api.Record
 
RECOVERABLE_EXCEPTION_CLASSES - Static variable in class com.cloudera.cdk.morphline.base.FaultTolerance
 
removeAll(String) - Method in class com.cloudera.cdk.morphline.api.Record
 
removeAttachments(Record) - Static method in class com.cloudera.cdk.morphline.stdio.AbstractParser
 
replaceValues(String, Object) - Method in class com.cloudera.cdk.morphline.api.Record
 
rollbackTransaction() - Method in interface com.cloudera.cdk.morphline.solr.DocumentLoader
Performs a rollback of all non-committed documents pending.
rollbackTransaction() - Method in class com.cloudera.cdk.morphline.solr.SolrServerDocumentLoader
 
rootDirectory(Path) - Method in class com.cloudera.cdk.data.filesystem.FileSystemDatasetRepository.Builder
The root directory for metadata and dataset files.
rootDirectory(URI) - Method in class com.cloudera.cdk.data.filesystem.FileSystemDatasetRepository.Builder
The root directory for metadata and dataset files.
rootDirectory(Path) - Method in class com.cloudera.cdk.data.hcatalog.HCatalogDatasetRepository.Builder
The root directory for dataset files.
rootDirectory(URI) - Method in class com.cloudera.cdk.data.hcatalog.HCatalogDatasetRepository.Builder
The root directory for dataset files.
run(String...) - Method in class com.cloudera.cdk.tools.CombinedLogFormatConverter
 

S

SanitizeUnknownSolrFieldsBuilder - Class in com.cloudera.cdk.morphline.solr
Command that sanitizes record fields that are unknown to Solr schema.xml by either deleting them (renameToPrefix is absent or a zero length string), or by moving them to a field prefixed with the given renameToPrefix (e.g.
SanitizeUnknownSolrFieldsBuilder() - Constructor for class com.cloudera.cdk.morphline.solr.SanitizeUnknownSolrFieldsBuilder
 
save(String, DatasetDescriptor) - Method in class com.cloudera.cdk.data.filesystem.FileSystemMetadataProvider
 
save(String, DatasetDescriptor) - Method in interface com.cloudera.cdk.data.MetadataProvider
Save the dataset descriptor for the dataset name.
schema(Schema) - Method in class com.cloudera.cdk.data.DatasetDescriptor.Builder
Configure the dataset's schema.
schema(File) - Method in class com.cloudera.cdk.data.DatasetDescriptor.Builder
Configure the dataset's schema from a File.
schema(InputStream) - Method in class com.cloudera.cdk.data.DatasetDescriptor.Builder
Configure the dataset's schema from an InputStream.
schema(URI) - Method in class com.cloudera.cdk.data.DatasetDescriptor.Builder
Configure the dataset's schema from a URI.
schema(String) - Method in class com.cloudera.cdk.data.DatasetDescriptor.Builder
Configure the dataset's schema from a String.
schema(Class<T>) - Method in class com.cloudera.cdk.data.DatasetDescriptor.Builder
Configure the dataset's schema via a Java class type.
schemaFromAvroDataFile(File) - Method in class com.cloudera.cdk.data.DatasetDescriptor.Builder
Configure the dataset's schema by using the schema from an existing Avro data file.
schemaFromAvroDataFile(InputStream) - Method in class com.cloudera.cdk.data.DatasetDescriptor.Builder
Configure the dataset's schema by using the schema from an existing Avro data file.
schemaFromAvroDataFile(URI) - Method in class com.cloudera.cdk.data.DatasetDescriptor.Builder
Configure the dataset's schema by using the schema from an existing Avro data file.
ScriptEvaluator<T> - Class in com.cloudera.cdk.morphline.scriptengine.java
Creates and compiles the given Java code block, wrapped into a Java method with the given return type and parameter types, along with a Java class definition that contains the given import statements.
ScriptEvaluator(String, String, Class<T>, String[], Class[], String) - Constructor for class com.cloudera.cdk.morphline.scriptengine.java.ScriptEvaluator
 
SeparateAttachmentsBuilder - Class in com.cloudera.cdk.morphline.stdlib
Command that emits one output record for each attachment in the input record's list of attachments; The result is a list of records, each of which has at most one attachment.
SeparateAttachmentsBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.SeparateAttachmentsBuilder
 
SEQUENCE_FILE_META_DATA - Static variable in class com.cloudera.cdk.morphline.hadoop.sequencefile.ReadSequenceFileBuilder
 
setBatchSize(int) - Method in class com.cloudera.cdk.morphline.solr.SolrLocator
 
setCollectionName(String) - Method in class com.cloudera.cdk.morphline.solr.SolrLocator
 
setDatasetName(String) - Method in class com.cloudera.cdk.data.flume.Log4jAppender
 
setDatasetRepositoryClass(String) - Method in class com.cloudera.cdk.data.flume.Log4jAppender
 
setDatasetRepositoryUri(String) - Method in class com.cloudera.cdk.data.flume.Log4jAppender
 
setDocumentLoader(DocumentLoader) - Method in class com.cloudera.cdk.morphline.solr.SolrMorphlineContext.Builder
 
setExceptionHandler(ExceptionHandler) - Method in class com.cloudera.cdk.morphline.api.MorphlineContext.Builder
 
setIndexSchema(IndexSchema) - Method in class com.cloudera.cdk.morphline.solr.SolrMorphlineContext.Builder
 
setMetricRegistry(MetricRegistry) - Method in class com.cloudera.cdk.morphline.api.MorphlineContext.Builder
 
setServerUrl(String) - Method in class com.cloudera.cdk.morphline.solr.SolrLocator
 
setSolrHomeDir(String) - Method in class com.cloudera.cdk.morphline.solr.SolrLocator
 
SetValuesBuilder - Class in com.cloudera.cdk.morphline.stdlib
For each input field, set the given record output field to the same values.
SetValuesBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.SetValuesBuilder
 
setZkHost(String) - Method in class com.cloudera.cdk.morphline.solr.SolrLocator
 
shutdown() - Method in interface com.cloudera.cdk.morphline.solr.DocumentLoader
Releases allocated resources
shutdown() - Method in class com.cloudera.cdk.morphline.solr.SolrServerDocumentLoader
 
SolrCellBuilder - Class in com.cloudera.cdk.morphline.solrcell
Command that pipes the first attachment of a record into one of the given Tika parsers, then maps the Tika output back to a record using SolrCell.
SolrCellBuilder() - Constructor for class com.cloudera.cdk.morphline.solrcell.SolrCellBuilder
 
SolrLocator - Class in com.cloudera.cdk.morphline.solr
Set of configuration parameters that identify the location and schema of a Solr server or SolrCloud; Based on this information this class can return the schema and a corresponding DocumentLoader.
SolrLocator(MorphlineContext) - Constructor for class com.cloudera.cdk.morphline.solr.SolrLocator
 
SolrLocator(Config, MorphlineContext) - Constructor for class com.cloudera.cdk.morphline.solr.SolrLocator
 
SolrMorphlineContext - Class in com.cloudera.cdk.morphline.solr
A context that is specific to Solr.
SolrMorphlineContext() - Constructor for class com.cloudera.cdk.morphline.solr.SolrMorphlineContext
For public access use SolrMorphlineContext.Builder.build() instead
SolrMorphlineContext.Builder - Class in com.cloudera.cdk.morphline.solr
Helper to construct a SolrMorphlineContext instance.
SolrMorphlineContext.Builder() - Constructor for class com.cloudera.cdk.morphline.solr.SolrMorphlineContext.Builder
 
SolrServerDocumentLoader - Class in com.cloudera.cdk.morphline.solr
A vehicle to load a list of Solr documents into a local or remote SolrServer.
SolrServerDocumentLoader(SolrServer, int) - Constructor for class com.cloudera.cdk.morphline.solr.SolrServerDocumentLoader
 
SplitBuilder - Class in com.cloudera.cdk.morphline.stdlib
Command that divides strings into substrings, by recognizing a separator (a.k.a.
SplitBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.SplitBuilder
 
StripNonCharSolrContentHandlerFactory - Class in com.cloudera.cdk.morphline.solrcell
SolrContentHandler and associated factory that strips non-characters anhd trims on output.
StripNonCharSolrContentHandlerFactory(Collection<String>) - Constructor for class com.cloudera.cdk.morphline.solrcell.StripNonCharSolrContentHandlerFactory
 
SUPPORTED_MIME_TYPES - Static variable in class com.cloudera.cdk.morphline.stdio.AbstractParser
 

T

TIMESTAMP - Static variable in class com.cloudera.cdk.morphline.base.Fields
 
toConfig(String) - Method in class com.cloudera.cdk.morphline.solr.SolrLocator
 
TokenizeTextBuilder - Class in com.cloudera.cdk.morphline.solr
A command that uses the embedded Solr/Lucene Analyzer library to generate tokens from a text string, without sending data to a Solr server.
TokenizeTextBuilder() - Constructor for class com.cloudera.cdk.morphline.solr.TokenizeTextBuilder
 
toString() - Method in class com.cloudera.cdk.data.DatasetDescriptor
 
toString() - Method in class com.cloudera.cdk.data.filesystem.FileSystemDatasetRepository
 
toString() - Method in class com.cloudera.cdk.data.filesystem.FileSystemMetadataProvider
 
toString() - Method in class com.cloudera.cdk.data.Format
 
toString() - Method in class com.cloudera.cdk.data.partition.HashFieldPartitioner
 
toString() - Method in class com.cloudera.cdk.data.partition.IdentityFieldPartitioner
 
toString() - Method in class com.cloudera.cdk.data.partition.IntRangeFieldPartitioner
 
toString() - Method in class com.cloudera.cdk.data.partition.ListFieldPartitioner
 
toString() - Method in class com.cloudera.cdk.data.partition.RangeFieldPartitioner
 
toString() - Method in class com.cloudera.cdk.data.PartitionKey
 
toString() - Method in class com.cloudera.cdk.data.PartitionStrategy
 
toString() - Method in class com.cloudera.cdk.morphline.api.Record
 
toString() - Method in class com.cloudera.cdk.morphline.solr.SolrLocator
 
ToStringBuilder - Class in com.cloudera.cdk.morphline.stdlib
Command that converts the Java objects in a given field via Object.toString() to their string representation, and optionally also applies String.trim().
ToStringBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.ToStringBuilder
 
transformFieldValue(Object) - Method in class com.cloudera.cdk.morphline.stdlib.AbstractFieldTransformCommand
Transforms the given input value to some output value
TranslateBuilder - Class in com.cloudera.cdk.morphline.stdlib
Command that examines each string value in a given field and replaces it with the replacement value defined in a given dictionary aka hash table.
TranslateBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.TranslateBuilder
 
TrimSolrContentHandlerFactory - Class in com.cloudera.cdk.morphline.solrcell
SolrContentHandler and associated factory that trims field values on output.
TrimSolrContentHandlerFactory(Collection<String>) - Constructor for class com.cloudera.cdk.morphline.solrcell.TrimSolrContentHandlerFactory
 
TryRulesBuilder - Class in com.cloudera.cdk.morphline.stdlib
A tryRules command consists of zero or more rules.
TryRulesBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.TryRulesBuilder
 

U

UnpackBuilder - Class in com.cloudera.cdk.morphline.tika.decompress
Command that unpacks the first attachment.
UnpackBuilder() - Constructor for class com.cloudera.cdk.morphline.tika.decompress.UnpackBuilder
 
update(String, DatasetDescriptor) - Method in interface com.cloudera.cdk.data.DatasetRepository
Update an existing Dataset to reflect the supplied descriptor.
update(String, DatasetDescriptor) - Method in class com.cloudera.cdk.data.filesystem.FileSystemDatasetRepository
 
update(String, DatasetDescriptor) - Method in class com.cloudera.cdk.data.hcatalog.HCatalogDatasetRepository
 
USE_IP - Static variable in class com.cloudera.cdk.morphline.stdlib.AddLocalHostBuilder
 

V

validateArguments() - Method in class com.cloudera.cdk.morphline.base.AbstractCommand
 
validateArguments(Config) - Method in class com.cloudera.cdk.morphline.base.Configs
 
validateEnum(Config, String, Class<T>, T...) - Method in class com.cloudera.cdk.morphline.base.Validator
Validates that an enum of the given type with the given value exists, and that this enum is contained in the given list of permitted choices; finally returns that enum object.
validateRange(Config, T, Comparable<T>, Comparable<T>) - Method in class com.cloudera.cdk.morphline.base.Validator
Validates that the given value is contained in the range [min, max]
Validator<T> - Class in com.cloudera.cdk.morphline.base
Simple helper to validate user configurable parameters.
Validator() - Constructor for class com.cloudera.cdk.morphline.base.Validator
 
valueFromString(String) - Method in class com.cloudera.cdk.data.FieldPartitioner
Retrieve the value for the field from the string representation.
valueFromString(String) - Method in class com.cloudera.cdk.data.partition.HashFieldPartitioner
 
valueFromString(String) - Method in class com.cloudera.cdk.data.partition.IdentityFieldPartitioner
 
valueFromString(String) - Method in class com.cloudera.cdk.data.partition.IntRangeFieldPartitioner
 
valueFromString(String) - Method in class com.cloudera.cdk.data.partition.ListFieldPartitioner
 
valueFromString(String) - Method in class com.cloudera.cdk.data.partition.RangeFieldPartitioner
 
valueOf(String) - Static method in enum com.cloudera.cdk.morphline.base.Notifications.LifecycleEvent
Returns the enum constant of this type with the specified name.
values() - Static method in enum com.cloudera.cdk.morphline.base.Notifications.LifecycleEvent
Returns an array containing the constants of this enum type, in the order they are declared.
valueToString(Object) - Method in class com.cloudera.cdk.data.FieldPartitioner
Retrieve the value for the field formatted as a String.
valueToString(Object) - Method in class com.cloudera.cdk.data.partition.DayOfMonthFieldPartitioner
 
valueToString(Object) - Method in class com.cloudera.cdk.data.partition.HourFieldPartitioner
 
valueToString(Object) - Method in class com.cloudera.cdk.data.partition.MinuteFieldPartitioner
 
valueToString(Object) - Method in class com.cloudera.cdk.data.partition.MonthFieldPartitioner
 

W

write(E) - Method in interface com.cloudera.cdk.data.DatasetWriter
Writer an entity of type E to the associated dataset.

X

XQueryBuilder - Class in com.cloudera.cdk.morphline.saxon
Command that parses an InputStream that contains an XML document and runs the given XQuery over the XML document.
XQueryBuilder() - Constructor for class com.cloudera.cdk.morphline.saxon.XQueryBuilder
 
XSLTBuilder - Class in com.cloudera.cdk.morphline.saxon
Command that parses an InputStream that contains an XML document and runs the given XSL Transform over the XML document.
XSLTBuilder() - Constructor for class com.cloudera.cdk.morphline.saxon.XSLTBuilder
 

Y

year(String, String) - Static method in class com.cloudera.cdk.data.partition.PartitionFunctions
 
year(String, String) - Method in class com.cloudera.cdk.data.PartitionStrategy.Builder
Configure a partitioner for extracting the year from a timestamp field.
YearFieldPartitioner - Class in com.cloudera.cdk.data.partition
 
YearFieldPartitioner(String, String) - Constructor for class com.cloudera.cdk.data.partition.YearFieldPartitioner
 

A B C D E F G H I J L M N O P R S T U V W X Y

Copyright © 2013 Cloudera. All rights reserved.