A B C D E F G H I J K L M N O P R S T U V W X Y

A

AbstractCommand - Class in com.cloudera.cdk.morphline.base
Base class for convenient implementation of Command classes.
AbstractCommand(CommandBuilder, Config, Command, Command, MorphlineContext) - Constructor for class com.cloudera.cdk.morphline.base.AbstractCommand
Using the given builder, constructs a command rooted at the given morphline JSON config.
AbstractCommand(Config, Command, Command, MorphlineContext) - Constructor for class com.cloudera.cdk.morphline.base.AbstractCommand
Deprecated. 
AbstractFieldTransformCommand - Class in com.cloudera.cdk.morphline.stdlib
Base class for convenient implementation of commands that do in-place updates on a field.
AbstractFieldTransformCommand(CommandBuilder, Config, Command, Command, MorphlineContext) - Constructor for class com.cloudera.cdk.morphline.stdlib.AbstractFieldTransformCommand
 
AbstractFieldTransformCommand(Config, Command, Command, MorphlineContext) - Constructor for class com.cloudera.cdk.morphline.stdlib.AbstractFieldTransformCommand
Deprecated. 
AbstractParser - Class in com.cloudera.cdk.morphline.stdio
Base class for convenient implementation of morphline parsers.
AbstractParser(CommandBuilder, Config, Command, Command, MorphlineContext) - Constructor for class com.cloudera.cdk.morphline.stdio.AbstractParser
 
AbstractParser(Config, Command, Command, MorphlineContext) - Constructor for class com.cloudera.cdk.morphline.stdio.AbstractParser
Deprecated. 
add(String, Object) - Method in class com.cloudera.cdk.data.Key.Builder
Add a key value for the named field.
AddCurrentTimeBuilder - Class in com.cloudera.cdk.morphline.stdlib
A command that adds the result of System.currentTimeMillis() to a given output field.
AddCurrentTimeBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.AddCurrentTimeBuilder
 
AddLocalHostBuilder - Class in com.cloudera.cdk.morphline.stdlib
A command that adds the name or IP of the local host to a given output field.
AddLocalHostBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.AddLocalHostBuilder
 
addSupportedMimeType(String) - Method in class com.cloudera.cdk.morphline.stdio.AbstractParser
 
AddValuesBuilder - Class in com.cloudera.cdk.morphline.stdlib
For each input field value, add the value to the given record output field.
AddValuesBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.AddValuesBuilder
 
AddValuesIfAbsentBuilder - Class in com.cloudera.cdk.morphline.stdlib
For each input field value, add the value to the given record output field if the value isn't already contained in that field.
AddValuesIfAbsentBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.AddValuesIfAbsentBuilder
 
append(Path, int, Progressable) - Method in class com.cloudera.cdk.morphline.hadoop.rcfile.SingleStreamFileSystem
 
apply(S) - Method in class com.cloudera.cdk.data.FieldPartitioner
Apply the partition function to the given value.
apply(Long) - Method in class com.cloudera.cdk.data.partition.DateFormatPartitioner
 
apply(Object) - Method in class com.cloudera.cdk.data.partition.HashFieldPartitioner
 
apply(S) - Method in class com.cloudera.cdk.data.partition.IdentityFieldPartitioner
 
apply(Integer) - Method in class com.cloudera.cdk.data.partition.IntRangeFieldPartitioner
 
apply(S) - Method in class com.cloudera.cdk.data.partition.ListFieldPartitioner
 
apply(Long) - Method in class com.cloudera.cdk.data.partition.MonthFieldPartitioner
 
apply(String) - Method in class com.cloudera.cdk.data.partition.RangeFieldPartitioner
 
asSource(Dataset<E>, Class<E>) - Static method in class com.cloudera.cdk.data.crunch.CrunchDatasets
Expose the given Dataset as a Crunch ReadableSource.
asTarget(Dataset) - Static method in class com.cloudera.cdk.data.crunch.CrunchDatasets
Expose the given Dataset as a Crunch Target.
ATTACHMENT_BODY - Static variable in class com.cloudera.cdk.morphline.base.Fields
 
ATTACHMENT_CHARSET - Static variable in class com.cloudera.cdk.morphline.base.Fields
 
ATTACHMENT_MIME_TYPE - Static variable in class com.cloudera.cdk.morphline.base.Fields
 
ATTACHMENT_NAME - Static variable in class com.cloudera.cdk.morphline.base.Fields
 
AVRO - Static variable in class com.cloudera.cdk.data.Formats
AVRO: the Avro row-oriented format
AVRO_MEMORY_MIME_TYPE - Static variable in class com.cloudera.cdk.morphline.avro.ReadAvroBuilder
The MIME type identifier that will be filled into output records

B

BASE_ID - Static variable in class com.cloudera.cdk.morphline.base.Fields
 
beginTransaction() - Method in interface com.cloudera.cdk.morphline.solr.DocumentLoader
Begins a transaction
beginTransaction() - Method in class com.cloudera.cdk.morphline.solr.SolrServerDocumentLoader
 
build() - Method in class com.cloudera.cdk.data.DatasetDescriptor.Builder
Build an instance of the configured dataset descriptor.
build() - Method in class com.cloudera.cdk.data.filesystem.FileSystemDatasetRepository.Builder
Build an instance of the configured FileSystemDatasetRepository.
build() - Method in class com.cloudera.cdk.data.filesystem.FileSystemMetadataProvider.Builder
Build an instance of the configured FileSystemMetadataProvider.
build() - Method in class com.cloudera.cdk.data.hcatalog.HCatalogDatasetRepository.Builder
Build an instance of the configured HCatalogDatasetRepository.
build() - Method in class com.cloudera.cdk.data.Key.Builder
Build an instance of the configured key.
build() - Method in class com.cloudera.cdk.data.PartitionStrategy.Builder
Build a configured PartitionStrategy instance.
build(Config, Command, Command, MorphlineContext) - Method in interface com.cloudera.cdk.morphline.api.CommandBuilder
Creates and returns a command rooted at the given morphline JSON config.
build() - Method in class com.cloudera.cdk.morphline.api.MorphlineContext.Builder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.avro.ExtractAvroPathsBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.avro.ExtractAvroTreeBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.avro.ReadAvroBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.avro.ReadAvroContainerBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.avro.ToAvroBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.avro.WriteAvroToByteArrayBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.hadoop.core.DownloadHdfsFileBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.hadoop.rcfile.ReadRCFileBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.hadoop.sequencefile.ReadSequenceFileBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.json.ExtractJsonPathsBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.json.ReadJsonBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.maxmind.GeoIPBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.metrics.servlets.RegisterJVMMetricsBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.metrics.servlets.StartReportingMetricsToHTTPBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.saxon.ConvertHTMLBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.saxon.XQueryBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.saxon.XSLTBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.solr.GenerateSolrSequenceKeyBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.solr.LoadSolrBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.solr.SanitizeUnknownSolrFieldsBuilder
 
build() - Method in class com.cloudera.cdk.morphline.solr.SolrMorphlineContext.Builder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.solr.TokenizeTextBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.solrcell.SolrCellBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdio.ReadClobBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdio.ReadCSVBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdio.ReadLineBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdio.ReadMultiLineBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.AddCurrentTimeBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.AddLocalHostBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.AddValuesBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.AddValuesIfAbsentBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.CallParentPipeBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.ContainsBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.ConvertTimestampBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.DecodeBase64Builder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.DropRecordBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.EqualsBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.ExtractURIComponentBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.ExtractURIComponentsBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.ExtractURIQueryParametersBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.FindReplaceBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.GenerateUUIDBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.GrokBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.IfThenElseBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.JavaBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.LogDebugBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.LogErrorBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.LogInfoBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.LogTraceBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.LogWarnBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.NotBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.PipeBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.SeparateAttachmentsBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.SetValuesBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.SplitBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.SplitKeyValueBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.StartReportingMetricsToCSVBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.StartReportingMetricsToJMXBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.StartReportingMetricsToSLF4JBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.ToByteArrayBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.ToStringBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.TranslateBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.stdlib.TryRulesBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.tika.decompress.DecompressBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.tika.decompress.UnpackBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.tika.DetectMimeTypeBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.twitter.ReadJsonTestTweetsBuilder
 
build(Config, Command, Command, MorphlineContext) - Method in class com.cloudera.cdk.morphline.useragent.UserAgentBuilder
 
buildCommand(Config, Command, Command) - Method in class com.cloudera.cdk.morphline.base.AbstractCommand
Factory method to create a command rooted at the given cmdConfig.
buildCommandChain(Config, String, Command, boolean) - Method in class com.cloudera.cdk.morphline.base.AbstractCommand
Factory method to create the chain of commands rooted at the given rootConfig.

C

CallParentPipeBuilder - Class in com.cloudera.cdk.morphline.stdlib
Command that routes records to the enclosing pipe morphline object.
CallParentPipeBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.CallParentPipeBuilder
 
close() - Method in interface com.cloudera.cdk.data.DatasetReader
Close the reader and release any system resources.
close() - Method in interface com.cloudera.cdk.data.DatasetWriter
Close the writer and release any system resources.
com.cloudera.cdk.data - package com.cloudera.cdk.data
Provides interfaces and classes for reading and writing datasets.
com.cloudera.cdk.data.crunch - package com.cloudera.cdk.data.crunch
Provides helper classes for working with datasets using Crunch.
com.cloudera.cdk.data.filesystem - package com.cloudera.cdk.data.filesystem
Provides a dataset implementation for Hadoop filesystems, like HDFS.
com.cloudera.cdk.data.flume - package com.cloudera.cdk.data.flume
Provides classes for writing datasets via Flume.
com.cloudera.cdk.data.hcatalog - package com.cloudera.cdk.data.hcatalog
Provides a dataset implementation that uses the Hive/HCatalog metastore for metadata, and stores data in a Hadoop FileSystem.
com.cloudera.cdk.data.partition - package com.cloudera.cdk.data.partition
Provides common implementations of FieldPartitioner.
com.cloudera.cdk.morphline.api - package com.cloudera.cdk.morphline.api
This package contains the heart of the morphline API.
com.cloudera.cdk.morphline.avro - package com.cloudera.cdk.morphline.avro
 
com.cloudera.cdk.morphline.base - package com.cloudera.cdk.morphline.base
 
com.cloudera.cdk.morphline.hadoop.core - package com.cloudera.cdk.morphline.hadoop.core
 
com.cloudera.cdk.morphline.hadoop.rcfile - package com.cloudera.cdk.morphline.hadoop.rcfile
 
com.cloudera.cdk.morphline.hadoop.sequencefile - package com.cloudera.cdk.morphline.hadoop.sequencefile
 
com.cloudera.cdk.morphline.json - package com.cloudera.cdk.morphline.json
 
com.cloudera.cdk.morphline.maxmind - package com.cloudera.cdk.morphline.maxmind
 
com.cloudera.cdk.morphline.metrics.servlets - package com.cloudera.cdk.morphline.metrics.servlets
 
com.cloudera.cdk.morphline.saxon - package com.cloudera.cdk.morphline.saxon
 
com.cloudera.cdk.morphline.scriptengine.java - package com.cloudera.cdk.morphline.scriptengine.java
 
com.cloudera.cdk.morphline.solr - package com.cloudera.cdk.morphline.solr
 
com.cloudera.cdk.morphline.solrcell - package com.cloudera.cdk.morphline.solrcell
 
com.cloudera.cdk.morphline.stdio - package com.cloudera.cdk.morphline.stdio
 
com.cloudera.cdk.morphline.stdlib - package com.cloudera.cdk.morphline.stdlib
 
com.cloudera.cdk.morphline.tika - package com.cloudera.cdk.morphline.tika
 
com.cloudera.cdk.morphline.tika.decompress - package com.cloudera.cdk.morphline.tika.decompress
 
com.cloudera.cdk.morphline.twitter - package com.cloudera.cdk.morphline.twitter
 
com.cloudera.cdk.morphline.useragent - package com.cloudera.cdk.morphline.useragent
 
com.cloudera.cdk.tools - package com.cloudera.cdk.tools
 
CombinedLogFormatConverter - Class in com.cloudera.cdk.tools
A tool for converting files in Combined Log Format to a Dataset.
CombinedLogFormatConverter() - Constructor for class com.cloudera.cdk.tools.CombinedLogFormatConverter
 
Command - Interface in com.cloudera.cdk.morphline.api
A command transforms a record into zero or more records.
CommandBuilder - Interface in com.cloudera.cdk.morphline.api
Interface to create instances of morphline commands.
commitTransaction() - Method in interface com.cloudera.cdk.morphline.solr.DocumentLoader
Sends any outstanding documents to the destination and waits for a positive or negative ack (i.e.
commitTransaction() - Method in class com.cloudera.cdk.morphline.solr.SolrServerDocumentLoader
 
compare(String, String) - Method in class com.cloudera.cdk.data.partition.DateFormatPartitioner
 
compare(Integer, Integer) - Method in class com.cloudera.cdk.data.partition.HashFieldPartitioner
 
compare(S, S) - Method in class com.cloudera.cdk.data.partition.IdentityFieldPartitioner
 
compare(Integer, Integer) - Method in class com.cloudera.cdk.data.partition.IntRangeFieldPartitioner
 
compare(Integer, Integer) - Method in class com.cloudera.cdk.data.partition.ListFieldPartitioner
 
compare(String, String) - Method in class com.cloudera.cdk.data.partition.RangeFieldPartitioner
 
compile(File, String, MorphlineContext, Command, Config...) - Method in class com.cloudera.cdk.morphline.base.Compiler
Parses the given morphlineFile, then finds the morphline with the given morphlineId within, then compiles the morphline and returns the corresponding morphline command.
compile(Config, MorphlineContext, Command) - Method in class com.cloudera.cdk.morphline.base.Compiler
Compiles the given morphline config using the given morphline context.
compile(String, String, Class[]) - Method in class com.cloudera.cdk.morphline.scriptengine.java.FastJavaScriptEngine
 
compile(String) - Method in class com.cloudera.cdk.morphline.scriptengine.java.FastJavaScriptEngine
 
compile(Reader) - Method in class com.cloudera.cdk.morphline.scriptengine.java.FastJavaScriptEngine
 
Compiler - Class in com.cloudera.cdk.morphline.base
Tool to parse and compile a morphline file or morphline config.
Compiler() - Constructor for class com.cloudera.cdk.morphline.base.Compiler
 
ConcurrentSchemaModificationException - Exception in com.cloudera.cdk.data
Exception thrown when a schema modification collided with another client trying to modify the schema of the same dataset.
ConcurrentSchemaModificationException(String) - Constructor for exception com.cloudera.cdk.data.ConcurrentSchemaModificationException
 
ConcurrentSchemaModificationException(Throwable) - Constructor for exception com.cloudera.cdk.data.ConcurrentSchemaModificationException
 
ConcurrentSchemaModificationException(String, Throwable) - Constructor for exception com.cloudera.cdk.data.ConcurrentSchemaModificationException
 
CONFIG_KEY_FIELD - Static variable in class com.cloudera.cdk.morphline.hadoop.sequencefile.ReadSequenceFileBuilder
 
CONFIG_VALUE_FIELD - Static variable in class com.cloudera.cdk.morphline.hadoop.sequencefile.ReadSequenceFileBuilder
 
Configs - Class in com.cloudera.cdk.morphline.base
Helpers to traverse and read parts of a HOCON data structure.
Configs() - Constructor for class com.cloudera.cdk.morphline.base.Configs
 
configuration(Configuration) - Method in class com.cloudera.cdk.data.filesystem.FileSystemDatasetRepository.Builder
The Configuration used to find the FileSystem (optional).
configuration(Configuration) - Method in class com.cloudera.cdk.data.filesystem.FileSystemMetadataProvider.Builder
The Configuration used to find the FileSystem.
configuration(Configuration) - Method in class com.cloudera.cdk.data.hcatalog.HCatalogDatasetRepository.Builder
The Configuration used to find the FileSystem.
ContainsBuilder - Class in com.cloudera.cdk.morphline.stdlib
Command that succeeds if one of the field values of the given named field is equal to one of the the given values, and fails otherwise; Multiple fields can be named, in which case the results are ANDed.
ContainsBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.ContainsBuilder
 
containsLifecycleEvent(Record, Notifications.LifecycleEvent) - Static method in class com.cloudera.cdk.morphline.base.Notifications
 
context - Variable in class com.cloudera.cdk.morphline.api.MorphlineContext.Builder
 
ConvertHTMLBuilder - Class in com.cloudera.cdk.morphline.saxon
Command that converts HTML to XHTML using the TagSoup library.
ConvertHTMLBuilder() - Constructor for class com.cloudera.cdk.morphline.saxon.ConvertHTMLBuilder
 
ConvertTimestampBuilder - Class in com.cloudera.cdk.morphline.stdlib
Command that converts the timestamps in a given field from one of a set of input date formats (in an input timezone) to an output date format (in an output timezone), while respecting daylight savings time rules.
ConvertTimestampBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.ConvertTimestampBuilder
 
copy() - Method in class com.cloudera.cdk.morphline.api.Record
Returns a shallow copy of this record.
create(String, DatasetDescriptor) - Method in interface com.cloudera.cdk.data.DatasetRepository
Create a Dataset with the supplied descriptor.
create(String, DatasetDescriptor) - Method in class com.cloudera.cdk.data.filesystem.FileSystemDatasetRepository
 
create(String, DatasetDescriptor) - Method in class com.cloudera.cdk.data.filesystem.FileSystemMetadataProvider
 
create(String, DatasetDescriptor) - Method in class com.cloudera.cdk.data.hcatalog.HCatalogDatasetRepository
 
create(String, DatasetDescriptor) - Method in interface com.cloudera.cdk.data.MetadataProvider
Create a DatasetDescriptor for the dataset named name.
create(String, DatasetDescriptor) - Method in interface com.cloudera.cdk.data.RandomAccessDatasetRepository
 
create() - Method in class com.cloudera.cdk.morphline.api.MorphlineContext.Builder
 
create(Path, FsPermission, boolean, int, short, long, Progressable) - Method in class com.cloudera.cdk.morphline.hadoop.rcfile.SingleStreamFileSystem
 
create() - Method in class com.cloudera.cdk.morphline.solr.SolrMorphlineContext.Builder
 
createBindings() - Method in class com.cloudera.cdk.morphline.scriptengine.java.FastJavaScriptEngine
 
createSolrContentHandler(Metadata, SolrParams, IndexSchema) - Method in class com.cloudera.cdk.morphline.solrcell.StripNonCharSolrContentHandlerFactory
 
createSolrContentHandler(Metadata, SolrParams, IndexSchema) - Method in class com.cloudera.cdk.morphline.solrcell.TrimSolrContentHandlerFactory
 
CrunchDatasets - Class in com.cloudera.cdk.data.crunch
A helper class for exposing a filesystem-based dataset as a Crunch ReadableSource or Target.
CrunchDatasets() - Constructor for class com.cloudera.cdk.data.crunch.CrunchDatasets
 
CSV - Static variable in class com.cloudera.cdk.data.Formats
CSV: comma-separated values (read-only).

D

Dataset<E> - Interface in com.cloudera.cdk.data
A logical representation of a set of data entities.
DatasetDescriptor - Class in com.cloudera.cdk.data
The structural definition of a Dataset.
DatasetDescriptor(Schema, URL, Format, URI, Map<String, String>, PartitionStrategy) - Constructor for class com.cloudera.cdk.data.DatasetDescriptor
Create an instance of this class with the supplied Schema, optional URL, Format, optional location URL, and optional PartitionStrategy.
DatasetDescriptor.Builder - Class in com.cloudera.cdk.data
A fluent builder to aid in the construction of DatasetDescriptors.
DatasetDescriptor.Builder() - Constructor for class com.cloudera.cdk.data.DatasetDescriptor.Builder
 
DatasetDescriptor.Builder(DatasetDescriptor) - Constructor for class com.cloudera.cdk.data.DatasetDescriptor.Builder
Creates a Builder configured to copy descriptor, if it is not modified.
DatasetException - Exception in com.cloudera.cdk.data
Exception thrown for dataset-related failures.
DatasetException() - Constructor for exception com.cloudera.cdk.data.DatasetException
 
DatasetException(String) - Constructor for exception com.cloudera.cdk.data.DatasetException
 
DatasetException(String, Throwable) - Constructor for exception com.cloudera.cdk.data.DatasetException
 
DatasetException(Throwable) - Constructor for exception com.cloudera.cdk.data.DatasetException
 
DatasetExistsException - Exception in com.cloudera.cdk.data
Exception thrown when trying to create a dataset that already exists.
DatasetExistsException(String) - Constructor for exception com.cloudera.cdk.data.DatasetExistsException
 
DatasetExistsException(String, Throwable) - Constructor for exception com.cloudera.cdk.data.DatasetExistsException
 
DatasetExistsException(Throwable) - Constructor for exception com.cloudera.cdk.data.DatasetExistsException
 
DatasetIOException - Exception in com.cloudera.cdk.data
Exception thrown for dataset IO-related failures.
DatasetIOException(String, IOException) - Constructor for exception com.cloudera.cdk.data.DatasetIOException
 
DatasetNotFoundException - Exception in com.cloudera.cdk.data
Exception thrown to indicate that a dataset was not found.
DatasetNotFoundException(String) - Constructor for exception com.cloudera.cdk.data.DatasetNotFoundException
 
DatasetNotFoundException(String, Throwable) - Constructor for exception com.cloudera.cdk.data.DatasetNotFoundException
 
DatasetNotFoundException(Throwable) - Constructor for exception com.cloudera.cdk.data.DatasetNotFoundException
 
DatasetReader<E> - Interface in com.cloudera.cdk.data
A stream-oriented dataset reader.
DatasetReaderException - Exception in com.cloudera.cdk.data
Exception thrown for dataset reader-related failures.
DatasetReaderException() - Constructor for exception com.cloudera.cdk.data.DatasetReaderException
 
DatasetReaderException(String) - Constructor for exception com.cloudera.cdk.data.DatasetReaderException
 
DatasetReaderException(String, Throwable) - Constructor for exception com.cloudera.cdk.data.DatasetReaderException
 
DatasetReaderException(Throwable) - Constructor for exception com.cloudera.cdk.data.DatasetReaderException
 
DatasetRepositories - Class in com.cloudera.cdk.data
Convenience methods for working with DatasetRepository instances.
DatasetRepositories() - Constructor for class com.cloudera.cdk.data.DatasetRepositories
 
DatasetRepository - Interface in com.cloudera.cdk.data
A logical repository (storage system) of Datasets.
DatasetRepositoryException - Exception in com.cloudera.cdk.data
Exception thrown for dataset repository-related failures.
DatasetRepositoryException() - Constructor for exception com.cloudera.cdk.data.DatasetRepositoryException
 
DatasetRepositoryException(String) - Constructor for exception com.cloudera.cdk.data.DatasetRepositoryException
 
DatasetRepositoryException(String, Throwable) - Constructor for exception com.cloudera.cdk.data.DatasetRepositoryException
 
DatasetRepositoryException(Throwable) - Constructor for exception com.cloudera.cdk.data.DatasetRepositoryException
 
DatasetWriter<E> - Interface in com.cloudera.cdk.data
A stream-oriented dataset writer.
DatasetWriterException - Exception in com.cloudera.cdk.data
Exception thrown for dataset writer-related failures.
DatasetWriterException() - Constructor for exception com.cloudera.cdk.data.DatasetWriterException
 
DatasetWriterException(String) - Constructor for exception com.cloudera.cdk.data.DatasetWriterException
 
DatasetWriterException(String, Throwable) - Constructor for exception com.cloudera.cdk.data.DatasetWriterException
 
DatasetWriterException(Throwable) - Constructor for exception com.cloudera.cdk.data.DatasetWriterException
 
dateFormat(String, String, String) - Static method in class com.cloudera.cdk.data.partition.PartitionFunctions
 
dateFormat(String, String, String) - Method in class com.cloudera.cdk.data.PartitionStrategy.Builder
Configure a partitioner that applies a custom date format to a timestamp field.
DateFormatPartitioner - Class in com.cloudera.cdk.data.partition
A FieldPartitioner that formats a timestamp (long) in milliseconds since epoch, such as those returned by System.currentTimeMillis(), using SimpleDateFormat.
DateFormatPartitioner(String, String, String) - Constructor for class com.cloudera.cdk.data.partition.DateFormatPartitioner
Construct a new DateFormatPartitioner for Universal Coordinated Time, UTC (+00:00), and cardinality 1095 (3 years, 1 day = 1 partition).
DateFormatPartitioner(String, String, String, int, TimeZone) - Constructor for class com.cloudera.cdk.data.partition.DateFormatPartitioner
Construct a new DateFormatPartitioner for Universal Coordinated Time, UTC (+00:00).
day(String, String) - Static method in class com.cloudera.cdk.data.partition.PartitionFunctions
 
day(String, String) - Method in class com.cloudera.cdk.data.PartitionStrategy.Builder
Configure a partitioner for extracting the day from a timestamp field.
day(String) - Method in class com.cloudera.cdk.data.PartitionStrategy.Builder
Configure a partitioner for extracting the day from a timestamp field.
DayOfMonthFieldPartitioner - Class in com.cloudera.cdk.data.partition
 
DayOfMonthFieldPartitioner(String, String) - Constructor for class com.cloudera.cdk.data.partition.DayOfMonthFieldPartitioner
 
DecodeBase64Builder - Class in com.cloudera.cdk.morphline.stdlib
Command that converts a Base64 encoded String to a byte[] per Section 6.8.
DecodeBase64Builder() - Constructor for class com.cloudera.cdk.morphline.stdlib.DecodeBase64Builder
 
DecompressBuilder - Class in com.cloudera.cdk.morphline.tika.decompress
Command that decompresses the first attachment.
DecompressBuilder() - Constructor for class com.cloudera.cdk.morphline.tika.decompress.DecompressBuilder
 
delete(String) - Method in interface com.cloudera.cdk.data.DatasetRepository
Delete the named Dataset.
delete(String) - Method in class com.cloudera.cdk.data.filesystem.FileSystemDatasetRepository
 
delete(String) - Method in class com.cloudera.cdk.data.filesystem.FileSystemMetadataProvider
 
delete(String) - Method in class com.cloudera.cdk.data.hcatalog.HCatalogDatasetRepository
 
delete(String) - Method in interface com.cloudera.cdk.data.MetadataProvider
Delete all metadata associated with the dataset named name.
delete(Key) - Method in interface com.cloudera.cdk.data.RandomAccessDataset
Deletes the entity in the dataset with Key key.
delete(E) - Method in interface com.cloudera.cdk.data.RandomAccessDataset
Deletes the entity passed to this method in the dataset.
delete(Path, boolean) - Method in class com.cloudera.cdk.morphline.hadoop.rcfile.SingleStreamFileSystem
 
detectCharset(Record, Charset) - Method in class com.cloudera.cdk.morphline.stdio.AbstractParser
 
DetectMimeTypeBuilder - Class in com.cloudera.cdk.morphline.tika
Command that auto-detects the MIME type of the first attachment, if no MIME type is defined yet.
DetectMimeTypeBuilder() - Constructor for class com.cloudera.cdk.morphline.tika.DetectMimeTypeBuilder
 
DocumentLoader - Interface in com.cloudera.cdk.morphline.solr
A vehicle to load a list of Solr documents into some kind of destination, such as a SolrServer or MapReduce RecordWriter.
doNotify(Record) - Method in class com.cloudera.cdk.morphline.base.AbstractCommand
Processes the given notification on the control plane of the subtree rooted at this command.
doProcess(Record) - Method in class com.cloudera.cdk.morphline.base.AbstractCommand
Processes the given record on the data plane of this command.
doProcess(Record) - Method in class com.cloudera.cdk.morphline.stdio.AbstractParser
 
doProcess(Record, InputStream) - Method in class com.cloudera.cdk.morphline.stdio.AbstractParser
 
doProcess(Record) - Method in class com.cloudera.cdk.morphline.stdlib.AbstractFieldTransformCommand
 
DownloadHdfsFileBuilder - Class in com.cloudera.cdk.morphline.hadoop.core
Command for transferring HDFS files, for example to help with centralized configuration file management.
DownloadHdfsFileBuilder() - Constructor for class com.cloudera.cdk.morphline.hadoop.core.DownloadHdfsFileBuilder
 
dropPartition(PartitionKey) - Method in interface com.cloudera.cdk.data.Dataset
Drop a partition for a PartitionKey.
DropRecordBuilder - Class in com.cloudera.cdk.morphline.stdlib
Command that silently consumes records without ever emitting any record - think /dev/null.
DropRecordBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.DropRecordBuilder
 

E

ELAPSED_TIME - Static variable in class com.cloudera.cdk.morphline.base.Metrics
 
equals(Object) - Method in class com.cloudera.cdk.data.DatasetDescriptor
 
equals(Object) - Method in class com.cloudera.cdk.data.Format
 
equals(Object) - Method in class com.cloudera.cdk.data.Key
 
equals(Object) - Method in class com.cloudera.cdk.data.partition.HashFieldPartitioner
 
equals(Object) - Method in class com.cloudera.cdk.data.partition.IdentityFieldPartitioner
 
equals(Object) - Method in class com.cloudera.cdk.data.partition.IntRangeFieldPartitioner
 
equals(Object) - Method in class com.cloudera.cdk.data.partition.ListFieldPartitioner
 
equals(Object) - Method in class com.cloudera.cdk.data.partition.RangeFieldPartitioner
 
equals(Object) - Method in class com.cloudera.cdk.data.PartitionKey
 
equals(Object) - Method in class com.cloudera.cdk.data.PartitionStrategy
 
equals(Object) - Method in class com.cloudera.cdk.morphline.api.Record
 
EqualsBuilder - Class in com.cloudera.cdk.morphline.stdlib
Command that succeeds if all field values of the given named fields are equal to the the given values, and fails otherwise.
EqualsBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.EqualsBuilder
 
eval(String, ScriptContext) - Method in class com.cloudera.cdk.morphline.scriptengine.java.FastJavaScriptEngine
 
eval(Reader, ScriptContext) - Method in class com.cloudera.cdk.morphline.scriptengine.java.FastJavaScriptEngine
 
eval(Object...) - Method in class com.cloudera.cdk.morphline.scriptengine.java.FastJavaScriptEngine.JavaCompiledScript
 
eval(ScriptContext) - Method in class com.cloudera.cdk.morphline.scriptengine.java.FastJavaScriptEngine.JavaCompiledScript
 
evaluate(Record) - Method in class com.cloudera.cdk.morphline.base.FieldExpression
Returns the values of a Record referred to by the given field expression
evaluate(Object...) - Method in class com.cloudera.cdk.morphline.scriptengine.java.ScriptEvaluator
 
ExceptionHandler - Interface in com.cloudera.cdk.morphline.api
Morphline-wide default handler that commands can choose to use to handle exceptions.
exists(String) - Method in interface com.cloudera.cdk.data.DatasetRepository
Checks if there is a Dataset in this repository named name.
exists(String) - Method in class com.cloudera.cdk.data.filesystem.FileSystemDatasetRepository
 
exists(String) - Method in class com.cloudera.cdk.data.filesystem.FileSystemMetadataProvider
 
exists(String) - Method in class com.cloudera.cdk.data.hcatalog.HCatalogDatasetRepository
 
exists(String) - Method in interface com.cloudera.cdk.data.MetadataProvider
Checks if there is a DatasetDescriptor for the dataset named name.
ExtractAvroPathsBuilder - Class in com.cloudera.cdk.morphline.avro
Command that uses zero or more avro path expressions to extract values from an Avro object.
ExtractAvroPathsBuilder() - Constructor for class com.cloudera.cdk.morphline.avro.ExtractAvroPathsBuilder
 
ExtractAvroTreeBuilder - Class in com.cloudera.cdk.morphline.avro
Command that converts an attached Avro datum to a morphline record by recursively walking the Avro tree and extracting all data into a single morphline record, with fields named by their path in the Avro tree.
ExtractAvroTreeBuilder() - Constructor for class com.cloudera.cdk.morphline.avro.ExtractAvroTreeBuilder
 
ExtractJsonPathsBuilder - Class in com.cloudera.cdk.morphline.json
Command that uses zero or more JSON path expressions to extract values from a JSON object.
ExtractJsonPathsBuilder() - Constructor for class com.cloudera.cdk.morphline.json.ExtractJsonPathsBuilder
 
ExtractURIComponentBuilder - Class in com.cloudera.cdk.morphline.stdlib
A command that extracts a subcomponent from the URIs contained in the given input field and appends it to the given output fields.
ExtractURIComponentBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.ExtractURIComponentBuilder
 
ExtractURIComponentsBuilder - Class in com.cloudera.cdk.morphline.stdlib
A command that extracts subcomponents from the URIs contained in the given input field and appends them to output fields with the given prefix, namely scheme, authority, host, port, path, query, fragment, schemeSpecificPart, userInfo.
ExtractURIComponentsBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.ExtractURIComponentsBuilder
 
ExtractURIQueryParametersBuilder - Class in com.cloudera.cdk.morphline.stdlib
A command that extracts the query parameters with a given name from the URIs contained in the given input field and appends them to the given output field.
ExtractURIQueryParametersBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.ExtractURIQueryParametersBuilder
 

F

FastJavaScriptEngine - Class in com.cloudera.cdk.morphline.scriptengine.java
This is a fast script engine for Java programming language - I modified JSR 223 Java Scripting a little for some 100x less overhead for invoking "static" methods.
FastJavaScriptEngine() - Constructor for class com.cloudera.cdk.morphline.scriptengine.java.FastJavaScriptEngine
 
FastJavaScriptEngine.JavaCompiledScript - Class in com.cloudera.cdk.morphline.scriptengine.java
 
FaultTolerance - Class in com.cloudera.cdk.morphline.base
Mission critical, large-scale online production systems need to make progress without downtime despite some issues.
FaultTolerance(boolean, boolean) - Constructor for class com.cloudera.cdk.morphline.base.FaultTolerance
 
FaultTolerance(boolean, boolean, String) - Constructor for class com.cloudera.cdk.morphline.base.FaultTolerance
 
FIELD_NAME - Static variable in class com.cloudera.cdk.morphline.stdlib.AddCurrentTimeBuilder
 
FIELD_NAME - Static variable in class com.cloudera.cdk.morphline.stdlib.AddLocalHostBuilder
 
FIELD_NAME - Static variable in class com.cloudera.cdk.morphline.stdlib.GenerateUUIDBuilder
 
FieldExpression - Class in com.cloudera.cdk.morphline.base
Helper to fetch the values of a field of a Record referred to by a field expression, which is a String of the form @{fieldname}.
FieldExpression(String, Config) - Constructor for class com.cloudera.cdk.morphline.base.FieldExpression
 
FieldPartitioner<S,T> - Class in com.cloudera.cdk.data
Partitions values for a named field.
FieldPartitioner(String, Class<S>, Class<T>, int) - Constructor for class com.cloudera.cdk.data.FieldPartitioner
 
FieldPartitioner(String, String, Class<S>, Class<T>, int) - Constructor for class com.cloudera.cdk.data.FieldPartitioner
 
Fields - Class in com.cloudera.cdk.morphline.base
Common record field names.
Fields() - Constructor for class com.cloudera.cdk.morphline.base.Fields
 
fileSystem(FileSystem) - Method in class com.cloudera.cdk.data.filesystem.FileSystemDatasetRepository.Builder
The FileSystem to store metadata and dataset files in (optional).
FileSystemDatasetRepository - Class in com.cloudera.cdk.data.filesystem
A DatasetRepository that stores data in a Hadoop FileSystem.
FileSystemDatasetRepository(Configuration, MetadataProvider) - Constructor for class com.cloudera.cdk.data.filesystem.FileSystemDatasetRepository
Construct a FileSystemDatasetRepository for the given MetadataProvider for metadata storage.
FileSystemDatasetRepository.Builder - Class in com.cloudera.cdk.data.filesystem
A fluent builder to aid in the construction of FileSystemDatasetRepository instances.
FileSystemDatasetRepository.Builder() - Constructor for class com.cloudera.cdk.data.filesystem.FileSystemDatasetRepository.Builder
 
FileSystemMetadataProvider - Class in com.cloudera.cdk.data.filesystem
A MetadataProvider that stores dataset metadata in a Hadoop FileSystem.
FileSystemMetadataProvider(Configuration, Path) - Constructor for class com.cloudera.cdk.data.filesystem.FileSystemMetadataProvider
 
FileSystemMetadataProvider.Builder - Class in com.cloudera.cdk.data.filesystem
A fluent builder to aid in the construction of FileSystemMetadataProvider instances.
FileSystemMetadataProvider.Builder() - Constructor for class com.cloudera.cdk.data.filesystem.FileSystemMetadataProvider.Builder
 
find(String, Config, String) - Method in class com.cloudera.cdk.morphline.base.Compiler
Finds the given morphline id within the given morphline config, using the given nameForErrorMsg for error reporting.
FindReplaceBuilder - Class in com.cloudera.cdk.morphline.stdlib
Command that examines each string value in a given field and replaces each substring of the string value that matches the given string literal or grok pattern with the given replacement.
FindReplaceBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.FindReplaceBuilder
 
flush() - Method in interface com.cloudera.cdk.data.DatasetWriter
Force or commit any outstanding data to storage.
format(Format) - Method in class com.cloudera.cdk.data.DatasetDescriptor.Builder
Configure the dataset's format (optional).
format(String) - Method in class com.cloudera.cdk.data.DatasetDescriptor.Builder
Configure the dataset's format from a format name String (optional).
Format - Class in com.cloudera.cdk.data
The data format used for encoding the data in a Dataset when stored in a DatasetRepository.
Formats - Class in com.cloudera.cdk.data
Contains constant definitions for the standard Format instances supported by the library.
fromString(String) - Static method in class com.cloudera.cdk.data.Formats
Return a Format for the format name specified.

G

GenerateSolrSequenceKeyBuilder - Class in com.cloudera.cdk.morphline.solr
A command that assigns a record unique key that is the concatenation of the given baseIdField record field, followed by a running count of the record number within the current session.
GenerateSolrSequenceKeyBuilder() - Constructor for class com.cloudera.cdk.morphline.solr.GenerateSolrSequenceKeyBuilder
 
GenerateUUIDBuilder - Class in com.cloudera.cdk.morphline.stdlib
A command that sets a universally unique identifier on all records that are intercepted.
GenerateUUIDBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.GenerateUUIDBuilder
 
GeoIPBuilder - Class in com.cloudera.cdk.morphline.maxmind
Command that returns Geolocation information for a given IP address, using an efficient in-memory Maxmind database lookup.
GeoIPBuilder() - Constructor for class com.cloudera.cdk.morphline.maxmind.GeoIPBuilder
 
get() - Method in class com.cloudera.cdk.data.DatasetDescriptor.Builder
Deprecated. will be removed in 0.11.0
get() - Method in class com.cloudera.cdk.data.filesystem.FileSystemDatasetRepository.Builder
Deprecated. will be removed in 0.11.0
get() - Method in class com.cloudera.cdk.data.filesystem.FileSystemMetadataProvider.Builder
Deprecated. will be removed in 0.11.0
get() - Method in class com.cloudera.cdk.data.hcatalog.HCatalogDatasetRepository.Builder
Deprecated. will be removed in 0.11.0
get(int) - Method in class com.cloudera.cdk.data.Key
Returns the value for index.
get(int) - Method in class com.cloudera.cdk.data.PartitionKey
Return the value at the specified index in the key.
get() - Method in class com.cloudera.cdk.data.PartitionStrategy.Builder
Deprecated. will be removed in 0.11.0
get(Key) - Method in interface com.cloudera.cdk.data.RandomAccessDataset
Return the entity stored in the dataset at the row specified with Key key.
get(String) - Method in class com.cloudera.cdk.morphline.api.Record
Returns a view of the values associated with the given key.
getBatchSize() - Method in class com.cloudera.cdk.morphline.solr.SolrLocator
 
getBoolean(Config, String, boolean) - Method in class com.cloudera.cdk.morphline.base.Configs
 
getBoolean(Config, String) - Method in class com.cloudera.cdk.morphline.base.Configs
 
getCardinality() - Method in class com.cloudera.cdk.data.FieldPartitioner
 
getCardinality() - Method in class com.cloudera.cdk.data.PartitionStrategy
Return the cardinality produced by the contained field partitioners.
getCharset(Config, String, Charset) - Method in class com.cloudera.cdk.morphline.base.Configs
 
getChild() - Method in class com.cloudera.cdk.morphline.base.AbstractCommand
Returns the child of this command.
getCollectionName() - Method in class com.cloudera.cdk.morphline.solr.SolrLocator
 
getCommandBuilder(String) - Method in class com.cloudera.cdk.morphline.api.MorphlineContext
 
getConfig() - Method in class com.cloudera.cdk.morphline.base.AbstractCommand
Returns the JSON configuration of this command.
getConfig(Config, String, Config) - Method in class com.cloudera.cdk.morphline.base.Configs
 
getConfig(Config, String) - Method in class com.cloudera.cdk.morphline.base.Configs
 
getConfigList(Config, String, List<? extends Config>) - Method in class com.cloudera.cdk.morphline.base.Configs
 
getConfigList(Config, String) - Method in class com.cloudera.cdk.morphline.base.Configs
 
getConfigs() - Method in class com.cloudera.cdk.morphline.base.AbstractCommand
Returns a helper for convenient access to the JSON configuration of this command.
getContext() - Method in class com.cloudera.cdk.morphline.base.AbstractCommand
Returns the context of this command.
getCounter(String...) - Method in class com.cloudera.cdk.morphline.base.AbstractCommand
Returns a counter for maintaining main-memory metrics.
getDataset() - Method in interface com.cloudera.cdk.data.View
Returns the underlying Dataset that this is a View of.
getDescriptor() - Method in interface com.cloudera.cdk.data.Dataset
Get the DatasetDescriptor associated with this dataset.
getDocumentLoader() - Method in class com.cloudera.cdk.morphline.solr.SolrMorphlineContext
 
getDouble(Config, String, double) - Method in class com.cloudera.cdk.morphline.base.Configs
 
getDouble(Config, String) - Method in class com.cloudera.cdk.morphline.base.Configs
 
getEngine() - Method in class com.cloudera.cdk.morphline.scriptengine.java.FastJavaScriptEngine.JavaCompiledScript
 
getEngineName() - Method in class com.cloudera.cdk.morphline.scriptengine.java.JavaScriptEngineFactory
 
getEngineVersion() - Method in class com.cloudera.cdk.morphline.scriptengine.java.JavaScriptEngineFactory
 
getEntrySet(Config) - Method in class com.cloudera.cdk.morphline.base.Configs
 
getExceptionHandler() - Method in class com.cloudera.cdk.morphline.api.MorphlineContext
 
getExtension() - Method in class com.cloudera.cdk.data.Format
Get the extension for use in filenames.
getExtensions() - Method in class com.cloudera.cdk.morphline.scriptengine.java.JavaScriptEngineFactory
 
getFactory() - Method in class com.cloudera.cdk.morphline.scriptengine.java.FastJavaScriptEngine
 
getFieldPartitioners() - Method in class com.cloudera.cdk.data.PartitionStrategy
Get the list of field partitioners used for partitioning.
getFields() - Method in class com.cloudera.cdk.morphline.api.Record
Returns the fields that are stored in this record.
getFileStatus(Path) - Method in class com.cloudera.cdk.morphline.hadoop.rcfile.SingleStreamFileSystem
 
getFirstValue(String) - Method in class com.cloudera.cdk.morphline.api.Record
Returns the first value associated with the given key, or null if no such value exists
getFormat() - Method in class com.cloudera.cdk.data.DatasetDescriptor
Get the associated Format that the data is stored in.
getHealthCheckRegistry() - Method in class com.cloudera.cdk.morphline.api.MorphlineContext
 
getHistogram(String...) - Method in class com.cloudera.cdk.morphline.base.AbstractCommand
Returns a histogram for maintaining main-memory metrics.
getIndexSchema() - Method in class com.cloudera.cdk.morphline.solr.SolrLocator
 
getIndexSchema() - Method in class com.cloudera.cdk.morphline.solr.SolrMorphlineContext
 
getInt(Config, String, int) - Method in class com.cloudera.cdk.morphline.base.Configs
 
getInt(Config, String) - Method in class com.cloudera.cdk.morphline.base.Configs
 
getIOException() - Method in exception com.cloudera.cdk.data.DatasetIOException
 
getLanguageName() - Method in class com.cloudera.cdk.morphline.scriptengine.java.JavaScriptEngineFactory
 
getLanguageVersion() - Method in class com.cloudera.cdk.morphline.scriptengine.java.JavaScriptEngineFactory
 
getLength() - Method in class com.cloudera.cdk.data.PartitionKey
Return the number of values in the key.
getLifecycleEvents(Record) - Static method in class com.cloudera.cdk.morphline.base.Notifications
 
getLoader() - Method in class com.cloudera.cdk.morphline.solr.SolrLocator
 
getLocale(Config, String, Locale) - Method in class com.cloudera.cdk.morphline.base.Configs
 
getLocale(Config, String) - Method in class com.cloudera.cdk.morphline.base.Configs
 
getLocation() - Method in class com.cloudera.cdk.data.DatasetDescriptor
Get the URL location where the data for this Dataset is stored (optional).
getLong(Config, String, long) - Method in class com.cloudera.cdk.morphline.base.Configs
 
getLong(Config, String) - Method in class com.cloudera.cdk.morphline.base.Configs
 
getMetadataProvider() - Method in class com.cloudera.cdk.data.filesystem.FileSystemDatasetRepository
 
getMeter(String...) - Method in class com.cloudera.cdk.morphline.base.AbstractCommand
Returns a meter for maintaining main-memory metrics.
getMethodCallSyntax(String, String, String...) - Method in class com.cloudera.cdk.morphline.scriptengine.java.JavaScriptEngineFactory
 
getMetricRegistry() - Method in class com.cloudera.cdk.morphline.api.MorphlineContext
 
getMimeTypes() - Method in class com.cloudera.cdk.morphline.scriptengine.java.JavaScriptEngineFactory
 
getName() - Method in interface com.cloudera.cdk.data.Dataset
Get the name of a Dataset.
getName() - Method in class com.cloudera.cdk.data.FieldPartitioner
 
getName() - Method in class com.cloudera.cdk.data.Format
Get the format's name.
getNames() - Method in interface com.cloudera.cdk.morphline.api.CommandBuilder
Returns the names with which this command can be invoked.
getNames() - Method in class com.cloudera.cdk.morphline.avro.ExtractAvroPathsBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.avro.ExtractAvroTreeBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.avro.ReadAvroBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.avro.ReadAvroContainerBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.avro.ToAvroBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.avro.WriteAvroToByteArrayBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.hadoop.core.DownloadHdfsFileBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.hadoop.rcfile.ReadRCFileBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.hadoop.sequencefile.ReadSequenceFileBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.json.ExtractJsonPathsBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.json.ReadJsonBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.maxmind.GeoIPBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.metrics.servlets.RegisterJVMMetricsBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.metrics.servlets.StartReportingMetricsToHTTPBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.saxon.ConvertHTMLBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.saxon.XQueryBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.saxon.XSLTBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.scriptengine.java.JavaScriptEngineFactory
 
getNames() - Method in class com.cloudera.cdk.morphline.solr.GenerateSolrSequenceKeyBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.solr.LoadSolrBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.solr.SanitizeUnknownSolrFieldsBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.solr.TokenizeTextBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.solrcell.SolrCellBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdio.ReadClobBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdio.ReadCSVBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdio.ReadLineBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdio.ReadMultiLineBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.AddCurrentTimeBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.AddLocalHostBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.AddValuesBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.AddValuesIfAbsentBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.CallParentPipeBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.ContainsBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.ConvertTimestampBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.DecodeBase64Builder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.DropRecordBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.EqualsBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.ExtractURIComponentBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.ExtractURIComponentsBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.ExtractURIQueryParametersBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.FindReplaceBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.GenerateUUIDBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.GrokBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.IfThenElseBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.JavaBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.LogDebugBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.LogErrorBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.LogInfoBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.LogTraceBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.LogWarnBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.NotBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.PipeBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.SeparateAttachmentsBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.SetValuesBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.SplitBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.SplitKeyValueBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.StartReportingMetricsToCSVBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.StartReportingMetricsToJMXBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.StartReportingMetricsToSLF4JBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.ToByteArrayBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.ToStringBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.TranslateBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.stdlib.TryRulesBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.tika.decompress.DecompressBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.tika.decompress.UnpackBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.tika.DetectMimeTypeBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.twitter.ReadJsonTestTweetsBuilder
 
getNames() - Method in class com.cloudera.cdk.morphline.useragent.UserAgentBuilder
 
getNanoseconds(Config, String, long) - Method in class com.cloudera.cdk.morphline.base.Configs
 
getNanoseconds(Config, String) - Method in class com.cloudera.cdk.morphline.base.Configs
 
getOutputStatement(String) - Method in class com.cloudera.cdk.morphline.scriptengine.java.JavaScriptEngineFactory
 
getParameter(String) - Method in class com.cloudera.cdk.morphline.scriptengine.java.JavaScriptEngineFactory
 
getParent() - Method in interface com.cloudera.cdk.morphline.api.Command
Returns the parent of this command.
getParent() - Method in class com.cloudera.cdk.morphline.base.AbstractCommand
 
getPartition(PartitionKey, boolean) - Method in interface com.cloudera.cdk.data.Dataset
Get a partition for a PartitionKey, possibly creating the partition if it doesn't already exist.
getPartitions() - Method in interface com.cloudera.cdk.data.Dataset
Return partitions, if this dataset is partitioned.
getPartitionStrategy() - Method in class com.cloudera.cdk.data.DatasetDescriptor
Get the PartitionStrategy, if this dataset is partitioned.
getProgram(String...) - Method in class com.cloudera.cdk.morphline.scriptengine.java.JavaScriptEngineFactory
 
getProperty(String) - Method in class com.cloudera.cdk.data.DatasetDescriptor
Get a named property.
getReader() - Method in interface com.cloudera.cdk.data.Dataset
Deprecated. will be removed in 0.11.0; use View.newReader()
getSchema() - Method in class com.cloudera.cdk.data.DatasetDescriptor
Get the associated Schema.
getSchemaUrl() - Method in class com.cloudera.cdk.data.DatasetDescriptor
Get a URL from which the Schema may be retrieved (optional).
getScriptEngine() - Method in class com.cloudera.cdk.morphline.scriptengine.java.JavaScriptEngineFactory
 
getServerUrl() - Method in class com.cloudera.cdk.morphline.solr.SolrLocator
 
getSolrHomeDir() - Method in class com.cloudera.cdk.morphline.solr.SolrLocator
 
getSolrServer() - Method in class com.cloudera.cdk.morphline.solr.SolrServerDocumentLoader
 
getSourceName() - Method in class com.cloudera.cdk.data.FieldPartitioner
 
getSourceType() - Method in class com.cloudera.cdk.data.FieldPartitioner
The type of the source field, which is the type of the type expected by the apply function.
getString(Config, String, String) - Method in class com.cloudera.cdk.morphline.base.Configs
 
getString(Config, String) - Method in class com.cloudera.cdk.morphline.base.Configs
 
getStringList(Config, String, List<String>) - Method in class com.cloudera.cdk.morphline.base.Configs
 
getStringList(Config, String) - Method in class com.cloudera.cdk.morphline.base.Configs
 
getTimer(String...) - Method in class com.cloudera.cdk.morphline.base.AbstractCommand
Returns a timer for maintaining main-memory metrics.
getTimeUnit(Config, String, TimeUnit) - Method in class com.cloudera.cdk.morphline.base.Configs
 
getTimeUnit(Config, String) - Method in class com.cloudera.cdk.morphline.base.Configs
 
getTimeUnit(String) - Method in class com.cloudera.cdk.morphline.base.Configs
 
getType() - Method in class com.cloudera.cdk.data.FieldPartitioner
The type of the target field, which is the type of the return value of the partition function.
getUpperBounds() - Method in class com.cloudera.cdk.data.partition.RangeFieldPartitioner
 
getUri() - Method in class com.cloudera.cdk.morphline.hadoop.rcfile.SingleStreamFileSystem
 
getValues() - Method in class com.cloudera.cdk.data.PartitionKey
 
getWorkingDirectory() - Method in class com.cloudera.cdk.morphline.hadoop.rcfile.SingleStreamFileSystem
 
getWriter() - Method in interface com.cloudera.cdk.data.Dataset
Deprecated. will be removed in 0.11.0; use View.newWriter()
getZkHost() - Method in class com.cloudera.cdk.morphline.solr.SolrLocator
 
GrokBuilder - Class in com.cloudera.cdk.morphline.stdlib
The Grok command uses regular expression pattern matching to extract structured fields from unstructured log data.
GrokBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.GrokBuilder
 

H

handleException(Throwable, Record) - Method in interface com.cloudera.cdk.morphline.api.ExceptionHandler
Handle the given exception resulting from the given input record (the record can be null)
handleException(Throwable, Record) - Method in class com.cloudera.cdk.morphline.base.FaultTolerance
 
hash(String, int) - Static method in class com.cloudera.cdk.data.partition.PartitionFunctions
 
hash(String, String, int) - Static method in class com.cloudera.cdk.data.partition.PartitionFunctions
 
hash(String, int) - Method in class com.cloudera.cdk.data.PartitionStrategy.Builder
Configure a hash partitioner with the specified number of buckets .
hash(String, String, int) - Method in class com.cloudera.cdk.data.PartitionStrategy.Builder
Configure a hash partitioner with the specified number of buckets .
hashCode() - Method in class com.cloudera.cdk.data.DatasetDescriptor
 
hashCode() - Method in class com.cloudera.cdk.data.Format
 
hashCode() - Method in class com.cloudera.cdk.data.Key
 
hashCode() - Method in class com.cloudera.cdk.data.partition.HashFieldPartitioner
 
hashCode() - Method in class com.cloudera.cdk.data.partition.IdentityFieldPartitioner
 
hashCode() - Method in class com.cloudera.cdk.data.partition.IntRangeFieldPartitioner
 
hashCode() - Method in class com.cloudera.cdk.data.partition.ListFieldPartitioner
 
hashCode() - Method in class com.cloudera.cdk.data.partition.RangeFieldPartitioner
 
hashCode() - Method in class com.cloudera.cdk.data.PartitionKey
 
hashCode() - Method in class com.cloudera.cdk.data.PartitionStrategy
 
hashCode() - Method in class com.cloudera.cdk.morphline.api.Record
 
HashFieldPartitioner - Class in com.cloudera.cdk.data.partition
 
HashFieldPartitioner(String, int) - Constructor for class com.cloudera.cdk.data.partition.HashFieldPartitioner
 
HashFieldPartitioner(String, String, int) - Constructor for class com.cloudera.cdk.data.partition.HashFieldPartitioner
 
hasNext() - Method in interface com.cloudera.cdk.data.DatasetReader
Tests the reader to see if additional entities can be read.
hasProperty(String) - Method in class com.cloudera.cdk.data.DatasetDescriptor
Check if a named property exists.
HCatalogDatasetRepository - Class in com.cloudera.cdk.data.hcatalog
A DatasetRepository that uses the Hive/HCatalog metastore for metadata, and stores data in a Hadoop FileSystem.
HCatalogDatasetRepository.Builder - Class in com.cloudera.cdk.data.hcatalog
A fluent builder to aid in the construction of HCatalogDatasetRepository instances.
HCatalogDatasetRepository.Builder() - Constructor for class com.cloudera.cdk.data.hcatalog.HCatalogDatasetRepository.Builder
 
hour(String, String) - Static method in class com.cloudera.cdk.data.partition.PartitionFunctions
 
hour(String, String) - Method in class com.cloudera.cdk.data.PartitionStrategy.Builder
Configure a partitioner for extracting the hour from a timestamp field.
hour(String) - Method in class com.cloudera.cdk.data.PartitionStrategy.Builder
Configure a partitioner for extracting the hour from a timestamp field.
HourFieldPartitioner - Class in com.cloudera.cdk.data.partition
 
HourFieldPartitioner(String, String) - Constructor for class com.cloudera.cdk.data.partition.HourFieldPartitioner
 

I

ID - Static variable in class com.cloudera.cdk.morphline.base.Fields
 
identity(String, int) - Static method in class com.cloudera.cdk.data.partition.PartitionFunctions
Deprecated. Use PartitionFunctions.identity(String, Class, int).
identity(String, Class<S>, int) - Static method in class com.cloudera.cdk.data.partition.PartitionFunctions
 
identity(String, int) - Method in class com.cloudera.cdk.data.PartitionStrategy.Builder
Deprecated. Use PartitionStrategy.Builder.identity(String, Class, int).
identity(String, Class<S>, int) - Method in class com.cloudera.cdk.data.PartitionStrategy.Builder
Configure an identity partitioner for a given type with a cardinality hint of buckets size.
IdentityFieldPartitioner<S extends Comparable> - Class in com.cloudera.cdk.data.partition
 
IdentityFieldPartitioner(String, Class<S>, int) - Constructor for class com.cloudera.cdk.data.partition.IdentityFieldPartitioner
 
IfThenElseBuilder - Class in com.cloudera.cdk.morphline.stdlib
An If-Then-Else command consists of a chain of zero or more conditions commands, as well as a chain of zero or or more commands that are processed if all conditions succeed ("then commands"), as well as a chain of zero or more commands that are processed if one of the conditions fails ("else commands").
IfThenElseBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.IfThenElseBuilder
 
importCommandBuilders(Collection<String>) - Method in class com.cloudera.cdk.morphline.api.MorphlineContext
 
IncompatibleSchemaException - Exception in com.cloudera.cdk.data
Exception thrown when a schema update is not compatible with the previous schema (or schemas) or with existing datasets (if shared storage is used, for example).
IncompatibleSchemaException(String) - Constructor for exception com.cloudera.cdk.data.IncompatibleSchemaException
 
IncompatibleSchemaException(String, Throwable) - Constructor for exception com.cloudera.cdk.data.IncompatibleSchemaException
 
IncompatibleSchemaException(Throwable) - Constructor for exception com.cloudera.cdk.data.IncompatibleSchemaException
 
increment(Key, String, long) - Method in interface com.cloudera.cdk.data.RandomAccessDataset
Increment a field named fieldName on the entity by the specified amount.
incrementNumRecords() - Method in class com.cloudera.cdk.morphline.stdio.AbstractParser
 
IntRangeFieldPartitioner - Class in com.cloudera.cdk.data.partition
 
IntRangeFieldPartitioner(String, int...) - Constructor for class com.cloudera.cdk.data.partition.IntRangeFieldPartitioner
 
IS_IGNORING_RECOVERABLE_EXCEPTIONS - Static variable in class com.cloudera.cdk.morphline.base.FaultTolerance
 
IS_PRODUCTION_MODE - Static variable in class com.cloudera.cdk.morphline.base.FaultTolerance
 
isMeasuringMetrics() - Method in class com.cloudera.cdk.morphline.base.AbstractCommand
Returns whether or not metrics should be measured.
isOpen() - Method in interface com.cloudera.cdk.data.DatasetReader
 
isOpen() - Method in interface com.cloudera.cdk.data.DatasetWriter
 
isPartitioned() - Method in class com.cloudera.cdk.data.DatasetDescriptor
Returns true if an associated dataset is partitioned (that is, has an associated PartitionStrategy), false otherwise.

J

JavaBuilder - Class in com.cloudera.cdk.morphline.stdlib
The "java" command compiles the given Java code block, wrapped into a Java method with a boolean return type and several parameters, along with a Java class definition that contains the given import statements.
JavaBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.JavaBuilder
 
JavaScriptEngineFactory - Class in com.cloudera.cdk.morphline.scriptengine.java
This is script engine factory for "Java" script engine.
JavaScriptEngineFactory() - Constructor for class com.cloudera.cdk.morphline.scriptengine.java.JavaScriptEngineFactory
 
JobClasspathHelper - Class in com.cloudera.cdk.tools
This class is an helper to copy the jars needed by the job in the Distributed cache.
JobClasspathHelper() - Constructor for class com.cloudera.cdk.tools.JobClasspathHelper
 

K

Key - Class in com.cloudera.cdk.data
A key for retrieving entities from a RandomAccessDataset.
Key.Builder - Class in com.cloudera.cdk.data
A fluent builder to aid in the construction of Key objects.
Key.Builder(RandomAccessDataset) - Constructor for class com.cloudera.cdk.data.Key.Builder
Construct a Key.Builder for a RandomAccessDataset.

L

LIFE_CYLCLE - Static variable in class com.cloudera.cdk.morphline.base.Notifications
 
list() - Method in interface com.cloudera.cdk.data.DatasetRepository
List the names of the Datasets in this DatasetRepository.
list() - Method in class com.cloudera.cdk.data.filesystem.FileSystemDatasetRepository
 
list() - Method in class com.cloudera.cdk.data.filesystem.FileSystemMetadataProvider
 
list() - Method in class com.cloudera.cdk.data.hcatalog.HCatalogDatasetRepository
 
list() - Method in interface com.cloudera.cdk.data.MetadataProvider
List the names of the Datasets managed by this MetadataProvider.
ListFieldPartitioner<S> - Class in com.cloudera.cdk.data.partition
 
ListFieldPartitioner(String, List<Set<S>>, Class<S>) - Constructor for class com.cloudera.cdk.data.partition.ListFieldPartitioner
 
listProperties() - Method in class com.cloudera.cdk.data.DatasetDescriptor
List the names of all custom properties set.
listStatus(Path) - Method in class com.cloudera.cdk.morphline.hadoop.rcfile.SingleStreamFileSystem
 
load(String) - Method in interface com.cloudera.cdk.data.DatasetRepository
Get the latest version of a named Dataset.
load(String) - Method in class com.cloudera.cdk.data.filesystem.FileSystemDatasetRepository
 
load(String) - Method in class com.cloudera.cdk.data.filesystem.FileSystemMetadataProvider
 
load(String) - Method in class com.cloudera.cdk.data.hcatalog.HCatalogDatasetRepository
 
load(String) - Method in interface com.cloudera.cdk.data.MetadataProvider
Load the dataset descriptor for the dataset name.
load(String) - Method in interface com.cloudera.cdk.data.RandomAccessDatasetRepository
 
load(SolrInputDocument) - Method in interface com.cloudera.cdk.morphline.solr.DocumentLoader
Loads the given document into the destination
load(SolrInputDocument) - Method in class com.cloudera.cdk.morphline.solr.SolrServerDocumentLoader
 
LoadSolrBuilder - Class in com.cloudera.cdk.morphline.solr
A command that loads a record into a SolrServer or MapReduce SolrOutputFormat.
LoadSolrBuilder() - Constructor for class com.cloudera.cdk.morphline.solr.LoadSolrBuilder
 
location(URI) - Method in class com.cloudera.cdk.data.DatasetDescriptor.Builder
Configure the Dataset's location (optional).
location(Path) - Method in class com.cloudera.cdk.data.DatasetDescriptor.Builder
Configure the Dataset's location (optional).
location(String) - Method in class com.cloudera.cdk.data.DatasetDescriptor.Builder
Configure the Dataset's location (optional).
LOG - Variable in class com.cloudera.cdk.morphline.base.AbstractCommand
 
Log4jAppender - Class in com.cloudera.cdk.data.flume
 
Log4jAppender() - Constructor for class com.cloudera.cdk.data.flume.Log4jAppender
 
Log4jAppender(String, int) - Constructor for class com.cloudera.cdk.data.flume.Log4jAppender
Sets the hostname and port.
LogDebugBuilder - Class in com.cloudera.cdk.morphline.stdlib
Command that logs to slf4j at DEBUG level.
LogDebugBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.LogDebugBuilder
 
LogErrorBuilder - Class in com.cloudera.cdk.morphline.stdlib
Command that logs to slf4j at ERROR level.
LogErrorBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.LogErrorBuilder
 
LogInfoBuilder - Class in com.cloudera.cdk.morphline.stdlib
Command that logs to slf4j at INFO level.
LogInfoBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.LogInfoBuilder
 
LogTraceBuilder - Class in com.cloudera.cdk.morphline.stdlib
Command that logs to slf4j at TRACE level.
LogTraceBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.LogTraceBuilder
 
LogWarnBuilder - Class in com.cloudera.cdk.morphline.stdlib
Command that logs to slf4j at WARN level.
LogWarnBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.LogWarnBuilder
 

M

main(String[]) - Static method in class com.cloudera.cdk.morphline.base.PrettyPrinter
 
main(String...) - Static method in class com.cloudera.cdk.tools.CombinedLogFormatConverter
 
MESSAGE - Static variable in class com.cloudera.cdk.morphline.base.Fields
 
metadataProvider(MetadataProvider) - Method in class com.cloudera.cdk.data.filesystem.FileSystemDatasetRepository.Builder
The MetadataProvider for metadata storage (optional).
MetadataProvider - Interface in com.cloudera.cdk.data
A service provider interface for metadata system plugins.
MetadataProviderException - Exception in com.cloudera.cdk.data
Exception thrown for metadata provider-related failures.
MetadataProviderException() - Constructor for exception com.cloudera.cdk.data.MetadataProviderException
 
MetadataProviderException(String) - Constructor for exception com.cloudera.cdk.data.MetadataProviderException
 
MetadataProviderException(String, Throwable) - Constructor for exception com.cloudera.cdk.data.MetadataProviderException
 
MetadataProviderException(Throwable) - Constructor for exception com.cloudera.cdk.data.MetadataProviderException
 
Metrics - Class in com.cloudera.cdk.morphline.base
Common metric names.
Metrics() - Constructor for class com.cloudera.cdk.morphline.base.Metrics
 
MIME_TYPE - Static variable in class com.cloudera.cdk.morphline.avro.ReadAvroContainerBuilder
The MIME type that input attachments shall have
MIME_TYPE - Static variable in class com.cloudera.cdk.morphline.json.ReadJsonBuilder
The MIME type identifier that will be filled into output records
minute(String, String) - Static method in class com.cloudera.cdk.data.partition.PartitionFunctions
 
minute(String, String) - Method in class com.cloudera.cdk.data.PartitionStrategy.Builder
Configure a partitioner for extracting the minute from a timestamp field.
minute(String) - Method in class com.cloudera.cdk.data.PartitionStrategy.Builder
Configure a partitioner for extracting the minute from a timestamp field.
MinuteFieldPartitioner - Class in com.cloudera.cdk.data.partition
 
MinuteFieldPartitioner(String, String) - Constructor for class com.cloudera.cdk.data.partition.MinuteFieldPartitioner
 
mkdirs(Path, FsPermission) - Method in class com.cloudera.cdk.morphline.hadoop.rcfile.SingleStreamFileSystem
 
month(String, String) - Static method in class com.cloudera.cdk.data.partition.PartitionFunctions
 
month(String, String) - Method in class com.cloudera.cdk.data.PartitionStrategy.Builder
Configure a partitioner for extracting the month from a timestamp field.
month(String) - Method in class com.cloudera.cdk.data.PartitionStrategy.Builder
Configure a partitioner for extracting the month from a timestamp field.
MonthFieldPartitioner - Class in com.cloudera.cdk.data.partition
 
MonthFieldPartitioner(String, String) - Constructor for class com.cloudera.cdk.data.partition.MonthFieldPartitioner
 
MORPHLINE_APP - Static variable in class com.cloudera.cdk.morphline.base.Metrics
 
MorphlineCompilationException - Exception in com.cloudera.cdk.morphline.api
Indicates a parse error.
MorphlineCompilationException(String, Config) - Constructor for exception com.cloudera.cdk.morphline.api.MorphlineCompilationException
 
MorphlineCompilationException(String, Config, Throwable) - Constructor for exception com.cloudera.cdk.morphline.api.MorphlineCompilationException
 
MorphlineCompilationException(String, Config, Throwable, CommandBuilder) - Constructor for exception com.cloudera.cdk.morphline.api.MorphlineCompilationException
 
MorphlineContext - Class in com.cloudera.cdk.morphline.api
Additional user defined parameters that will be passed to all morphline commands.
MorphlineContext() - Constructor for class com.cloudera.cdk.morphline.api.MorphlineContext
For public access use MorphlineContext.Builder.build() instead
MorphlineContext.Builder - Class in com.cloudera.cdk.morphline.api
Helper to construct a MorphlineContext instance.
MorphlineContext.Builder() - Constructor for class com.cloudera.cdk.morphline.api.MorphlineContext.Builder
 
MorphlineRuntimeException - Exception in com.cloudera.cdk.morphline.api
Base class of all morphline exceptions.
MorphlineRuntimeException(String) - Constructor for exception com.cloudera.cdk.morphline.api.MorphlineRuntimeException
 
MorphlineRuntimeException(String, Throwable) - Constructor for exception com.cloudera.cdk.morphline.api.MorphlineRuntimeException
 
MorphlineRuntimeException(Throwable) - Constructor for exception com.cloudera.cdk.morphline.api.MorphlineRuntimeException
 

N

newReader() - Method in interface com.cloudera.cdk.data.View
Get an appropriate DatasetReader implementation based on this View of the underlying Dataset implementation.
newWriter() - Method in interface com.cloudera.cdk.data.View
Get an appropriate DatasetWriter implementation based on this View of the underlying Dataset implementation.
next() - Method in interface com.cloudera.cdk.data.DatasetReader
Fetch the next entity from the reader.
NoSuchDatasetException - Exception in com.cloudera.cdk.data
Deprecated. will be removed in 0.11.0, use DatasetNotFoundException instead
NoSuchDatasetException(String) - Constructor for exception com.cloudera.cdk.data.NoSuchDatasetException
Deprecated.  
NoSuchDatasetException(String, Throwable) - Constructor for exception com.cloudera.cdk.data.NoSuchDatasetException
Deprecated.  
NoSuchDatasetException(Throwable) - Constructor for exception com.cloudera.cdk.data.NoSuchDatasetException
Deprecated.  
NotBuilder - Class in com.cloudera.cdk.morphline.stdlib
A Not command consists of one nested command, the boolean return value of which is inverted.
NotBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.NotBuilder
 
Notifications - Class in com.cloudera.cdk.morphline.base
Tools for notifications on the control plane.
Notifications() - Constructor for class com.cloudera.cdk.morphline.base.Notifications
 
Notifications.LifecycleEvent - Enum in com.cloudera.cdk.morphline.base
 
notify(Record) - Method in interface com.cloudera.cdk.morphline.api.Command
Processes the given notification on the control plane of the subtree rooted at this command.
notify(Record) - Method in class com.cloudera.cdk.morphline.base.AbstractCommand
 
notifyBeginTransaction(Command) - Static method in class com.cloudera.cdk.morphline.base.Notifications
 
notifyCommitTransaction(Command) - Static method in class com.cloudera.cdk.morphline.base.Notifications
 
notifyRollbackTransaction(Command) - Static method in class com.cloudera.cdk.morphline.base.Notifications
 
notifyShutdown(Command) - Static method in class com.cloudera.cdk.morphline.base.Notifications
 
notifyStartSession(Command) - Static method in class com.cloudera.cdk.morphline.base.Notifications
 
NUM_EXCEPTION_RECORDS - Static variable in class com.cloudera.cdk.morphline.base.Metrics
 
NUM_FAILED_RECORDS - Static variable in class com.cloudera.cdk.morphline.base.Metrics
 
NUM_NOTIFY_CALLS - Static variable in class com.cloudera.cdk.morphline.base.Metrics
 
NUM_PROCESS_CALLS - Static variable in class com.cloudera.cdk.morphline.base.Metrics
 
NUM_RECORDS - Static variable in class com.cloudera.cdk.morphline.base.Metrics
 

O

open() - Method in interface com.cloudera.cdk.data.DatasetReader
Open the reader, allocating any necessary resources required to produce entities.
open(String) - Static method in class com.cloudera.cdk.data.DatasetRepositories
Synonym for DatasetRepositories.open(java.net.URI) for String URIs.
open(URI) - Static method in class com.cloudera.cdk.data.DatasetRepositories
Open a DatasetRepository for the given URI.
open() - Method in interface com.cloudera.cdk.data.DatasetWriter
Open the writer, allocating any necessary resources required to store entities.
open(Path, int) - Method in class com.cloudera.cdk.morphline.hadoop.rcfile.SingleStreamFileSystem
 
openRandomAccess(String) - Static method in class com.cloudera.cdk.data.DatasetRepositories
Synonym for DatasetRepositories.openRandomAccess(java.net.URI) for String URIs.
openRandomAccess(URI) - Static method in class com.cloudera.cdk.data.DatasetRepositories
Synonym for DatasetRepositories.open(java.net.URI) for RandomAccessDatasetRepositorys
OUTPUT_MEDIA_TYPE - Static variable in class com.cloudera.cdk.morphline.hadoop.rcfile.ReadRCFileBuilder
 
OUTPUT_MEDIA_TYPE - Static variable in class com.cloudera.cdk.morphline.hadoop.sequencefile.ReadSequenceFileBuilder
 

P

PARQUET - Static variable in class com.cloudera.cdk.data.Formats
PARQUET: the Parquet columnar format
parse(File, Config...) - Method in class com.cloudera.cdk.morphline.base.Compiler
Loads the given config file from the local file system
PartitionFunctions - Class in com.cloudera.cdk.data.partition
Convenience class so you can say, for example, hash("username", 2) in JEXL.
PartitionFunctions() - Constructor for class com.cloudera.cdk.data.partition.PartitionFunctions
 
PartitionKey - Class in com.cloudera.cdk.data
A key for retrieving partitions from a Dataset.
partitionKey(Object...) - Method in class com.cloudera.cdk.data.PartitionStrategy
Construct a partition key with a variadic array of values corresponding to the field partitioners in this partition strategy.
partitionKeyForEntity(Object) - Method in class com.cloudera.cdk.data.PartitionStrategy
Construct a partition key for the given entity.
partitionKeyForEntity(Object, PartitionKey) - Method in class com.cloudera.cdk.data.PartitionStrategy
Construct a partition key for the given entity, reusing the supplied key if not null.
partitionKeyForPath(Dataset, URI) - Static method in class com.cloudera.cdk.data.filesystem.FileSystemDatasetRepository
Get a PartitionKey corresponding to a partition's filesystem path represented as a URI.
partitionStrategy(PartitionStrategy) - Method in class com.cloudera.cdk.data.DatasetDescriptor.Builder
Configure the dataset's partitioning strategy (optional).
PartitionStrategy - Class in com.cloudera.cdk.data
The strategy used to determine how a dataset is partitioned.
PartitionStrategy(List<FieldPartitioner>) - Constructor for class com.cloudera.cdk.data.PartitionStrategy
Construct a partition strategy with a list of field partitioners.
PartitionStrategy.Builder - Class in com.cloudera.cdk.data
A fluent builder to aid in the construction of PartitionStrategys.
PartitionStrategy.Builder() - Constructor for class com.cloudera.cdk.data.PartitionStrategy.Builder
 
ping() - Method in interface com.cloudera.cdk.morphline.solr.DocumentLoader
Issues a ping request to check if the server is alive
ping() - Method in class com.cloudera.cdk.morphline.solr.SolrServerDocumentLoader
 
PipeBuilder - Class in com.cloudera.cdk.morphline.stdlib
Factory to create morphline pipe instances.
PipeBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.PipeBuilder
 
populateAvroHeaders(Map<String, String>, Schema, Object) - Method in class com.cloudera.cdk.data.flume.Log4jAppender
 
PREFIX_NAME - Static variable in class com.cloudera.cdk.morphline.stdlib.GenerateUUIDBuilder
 
prepareClasspath(Configuration, Path, Class<?>...) - Method in class com.cloudera.cdk.tools.JobClasspathHelper
 
PRESERVE_EXISTING_NAME - Static variable in class com.cloudera.cdk.morphline.stdlib.AddCurrentTimeBuilder
 
PRESERVE_EXISTING_NAME - Static variable in class com.cloudera.cdk.morphline.stdlib.AddLocalHostBuilder
 
PRESERVE_EXISTING_NAME - Static variable in class com.cloudera.cdk.morphline.stdlib.GenerateUUIDBuilder
 
PrettyPrinter - Class in com.cloudera.cdk.morphline.base
Utility to nicely indent and format a morphline config file.
PrettyPrinter() - Constructor for class com.cloudera.cdk.morphline.base.PrettyPrinter
 
process(Record) - Method in interface com.cloudera.cdk.morphline.api.Command
Processes the given record on the data plane of this command.
process(Record) - Method in class com.cloudera.cdk.morphline.base.AbstractCommand
 
property(String, String) - Method in class com.cloudera.cdk.data.DatasetDescriptor.Builder
Add a key-value property to the descriptor.
put(E) - Method in interface com.cloudera.cdk.data.RandomAccessDataset
Put the entity into the dataset.
put(String, Object) - Method in class com.cloudera.cdk.morphline.api.Record
Adds the given value to the values currently associated with the given key.
putIfAbsent(String, Object) - Method in class com.cloudera.cdk.morphline.api.Record
Adds the given value to the values currently associated with the given key, iff the key isn't already associated with that same value.

R

RandomAccessDataset<E> - Interface in com.cloudera.cdk.data
A Dataset that supports random access operations on entities.
RandomAccessDatasetRepository - Interface in com.cloudera.cdk.data
A logical repository (storage system) of RandomAccessDatasets.
range(String, int...) - Static method in class com.cloudera.cdk.data.partition.PartitionFunctions
 
range(String, String...) - Static method in class com.cloudera.cdk.data.partition.PartitionFunctions
 
range(String, int...) - Method in class com.cloudera.cdk.data.PartitionStrategy.Builder
Configure a range partitioner with a set of upperBounds.
range(String, String...) - Method in class com.cloudera.cdk.data.PartitionStrategy.Builder
Configure a range partitioner for strings with a set of upperBounds.
RangeFieldPartitioner - Class in com.cloudera.cdk.data.partition
 
RangeFieldPartitioner(String, String...) - Constructor for class com.cloudera.cdk.data.partition.RangeFieldPartitioner
 
RC_FILE_META_DATA - Static variable in class com.cloudera.cdk.morphline.hadoop.rcfile.ReadRCFileBuilder
 
ReadAvroBuilder - Class in com.cloudera.cdk.morphline.avro
Command that parses an InputStream that contains Avro data; for each Avro datum, the command emits a morphline record containing the datum as an attachment in Fields.ATTACHMENT_BODY.
ReadAvroBuilder() - Constructor for class com.cloudera.cdk.morphline.avro.ReadAvroBuilder
 
ReadAvroContainerBuilder - Class in com.cloudera.cdk.morphline.avro
Command that parses an InputStream that contains Avro binary container file data; for each Avro datum, the command emits a morphline record containing the datum as an attachment in Fields.ATTACHMENT_BODY.
ReadAvroContainerBuilder() - Constructor for class com.cloudera.cdk.morphline.avro.ReadAvroContainerBuilder
 
ReadClobBuilder - Class in com.cloudera.cdk.morphline.stdio
Command that emits one record for the entire input stream of the first attachment, interpreting the stream as a Character Large Object (CLOB).
ReadClobBuilder() - Constructor for class com.cloudera.cdk.morphline.stdio.ReadClobBuilder
 
ReadCSVBuilder - Class in com.cloudera.cdk.morphline.stdio
Command that extracts zero or more records from the input stream of the first attachment, representing a Comma Separated Values (CSV) file.
ReadCSVBuilder() - Constructor for class com.cloudera.cdk.morphline.stdio.ReadCSVBuilder
 
ReadJsonBuilder - Class in com.cloudera.cdk.morphline.json
Command that parses an InputStream that contains JSON data; for each JSON object in the stream, the command emits a morphline record containing the object as an attachment in Fields.ATTACHMENT_BODY.
ReadJsonBuilder() - Constructor for class com.cloudera.cdk.morphline.json.ReadJsonBuilder
 
ReadJsonTestTweetsBuilder - Class in com.cloudera.cdk.morphline.twitter
JSON parser that extracts search documents from twitter tweets obtained from the twitter 1% sample firehose with the delimited=length option.
ReadJsonTestTweetsBuilder() - Constructor for class com.cloudera.cdk.morphline.twitter.ReadJsonTestTweetsBuilder
 
ReadLineBuilder - Class in com.cloudera.cdk.morphline.stdio
Command that emits one record per line in the input stream of the first attachment.
ReadLineBuilder() - Constructor for class com.cloudera.cdk.morphline.stdio.ReadLineBuilder
 
ReadMultiLineBuilder - Class in com.cloudera.cdk.morphline.stdio
Multiline log parser that collapse multiline messages into a single record; supports "regex", "what" and "negate" configuration parameters similar to logstash.
ReadMultiLineBuilder() - Constructor for class com.cloudera.cdk.morphline.stdio.ReadMultiLineBuilder
 
ReadRCFileBuilder - Class in com.cloudera.cdk.morphline.hadoop.rcfile
Command for reading Record Columnar (RC) Files.
ReadRCFileBuilder() - Constructor for class com.cloudera.cdk.morphline.hadoop.rcfile.ReadRCFileBuilder
 
ReadSequenceFileBuilder - Class in com.cloudera.cdk.morphline.hadoop.sequencefile
Command that emits one record per sequence file entry in the input stream of the first attachment.
ReadSequenceFileBuilder() - Constructor for class com.cloudera.cdk.morphline.hadoop.sequencefile.ReadSequenceFileBuilder
 
Record - Class in com.cloudera.cdk.morphline.api
A record is a set of named fields where each field has a list of one or more values.
Record() - Constructor for class com.cloudera.cdk.morphline.api.Record
Creates a new empty record.
RECOVERABLE_EXCEPTION_CLASSES - Static variable in class com.cloudera.cdk.morphline.base.FaultTolerance
 
RegisterJVMMetricsBuilder - Class in com.cloudera.cdk.morphline.metrics.servlets
Command that registers metrics that are related to the Java Virtual Machine with the MorphlineContext of the morphline.
RegisterJVMMetricsBuilder() - Constructor for class com.cloudera.cdk.morphline.metrics.servlets.RegisterJVMMetricsBuilder
 
remove() - Method in interface com.cloudera.cdk.data.DatasetReader
Remove the last entity from the reader (OPTIONAL).
removeAll(String) - Method in class com.cloudera.cdk.morphline.api.Record
Removes all values that are associated with the given key
removeAttachments(Record) - Static method in class com.cloudera.cdk.morphline.stdio.AbstractParser
 
rename(Path, Path) - Method in class com.cloudera.cdk.morphline.hadoop.rcfile.SingleStreamFileSystem
 
replaceValues(String, Object) - Method in class com.cloudera.cdk.morphline.api.Record
Removes all values that are associated with the given key, and then associates the given value with the given key.
requiresLayout() - Method in class com.cloudera.cdk.data.flume.Log4jAppender
 
rollbackTransaction() - Method in interface com.cloudera.cdk.morphline.solr.DocumentLoader
Performs a rollback of all non-committed documents pending.
rollbackTransaction() - Method in class com.cloudera.cdk.morphline.solr.SolrServerDocumentLoader
 
rootDirectory(Path) - Method in class com.cloudera.cdk.data.filesystem.FileSystemDatasetRepository.Builder
The root directory for metadata and dataset files.
rootDirectory(URI) - Method in class com.cloudera.cdk.data.filesystem.FileSystemDatasetRepository.Builder
The root directory for metadata and dataset files.
rootDirectory(String) - Method in class com.cloudera.cdk.data.filesystem.FileSystemDatasetRepository.Builder
The root directory for metadata and dataset files.
rootDirectory(Path) - Method in class com.cloudera.cdk.data.filesystem.FileSystemMetadataProvider.Builder
The root directory for metadata files.
rootDirectory(Path) - Method in class com.cloudera.cdk.data.hcatalog.HCatalogDatasetRepository.Builder
The root directory for dataset files.
rootDirectory(URI) - Method in class com.cloudera.cdk.data.hcatalog.HCatalogDatasetRepository.Builder
The root directory for dataset files.
rootDirectory(String) - Method in class com.cloudera.cdk.data.hcatalog.HCatalogDatasetRepository.Builder
The root directory for metadata and dataset files.
run(String...) - Method in class com.cloudera.cdk.tools.CombinedLogFormatConverter
 

S

SanitizeUnknownSolrFieldsBuilder - Class in com.cloudera.cdk.morphline.solr
Command that sanitizes record fields that are unknown to Solr schema.xml by either deleting them (renameToPrefix is absent or a zero length string), or by moving them to a field prefixed with the given renameToPrefix (e.g.
SanitizeUnknownSolrFieldsBuilder() - Constructor for class com.cloudera.cdk.morphline.solr.SanitizeUnknownSolrFieldsBuilder
 
schema(Schema) - Method in class com.cloudera.cdk.data.DatasetDescriptor.Builder
Configure the dataset's schema.
schema(File) - Method in class com.cloudera.cdk.data.DatasetDescriptor.Builder
Configure the dataset's schema from a File.
schema(InputStream) - Method in class com.cloudera.cdk.data.DatasetDescriptor.Builder
Configure the dataset's schema from an InputStream.
schema(Class<T>) - Method in class com.cloudera.cdk.data.DatasetDescriptor.Builder
Configure the dataset's schema via a Java class type.
schemaFromAvroDataFile(File) - Method in class com.cloudera.cdk.data.DatasetDescriptor.Builder
Configure the dataset's schema by using the schema from an existing Avro data file.
schemaFromAvroDataFile(InputStream) - Method in class com.cloudera.cdk.data.DatasetDescriptor.Builder
Configure the dataset's schema by using the schema from an existing Avro data file.
schemaFromAvroDataFile(URI) - Method in class com.cloudera.cdk.data.DatasetDescriptor.Builder
Configure the dataset's schema by using the schema from an existing Avro data file.
schemaLiteral(String) - Method in class com.cloudera.cdk.data.DatasetDescriptor.Builder
Configure the dataset's schema from a String.
SchemaNotFoundException - Exception in com.cloudera.cdk.data
Exception thrown to indicate that there was a problem finding a particular version of a schema.
SchemaNotFoundException(String) - Constructor for exception com.cloudera.cdk.data.SchemaNotFoundException
 
SchemaNotFoundException(String, Throwable) - Constructor for exception com.cloudera.cdk.data.SchemaNotFoundException
 
SchemaNotFoundException(Throwable) - Constructor for exception com.cloudera.cdk.data.SchemaNotFoundException
 
schemaUri(URI) - Method in class com.cloudera.cdk.data.DatasetDescriptor.Builder
Configure the Dataset's schema from a URI.
schemaUri(String) - Method in class com.cloudera.cdk.data.DatasetDescriptor.Builder
Configure the Dataset's schema from a String URI.
SchemaValidationException - Exception in com.cloudera.cdk.data
Exception thrown to indicate that there was a problem parsing or validating a schema.
SchemaValidationException(String) - Constructor for exception com.cloudera.cdk.data.SchemaValidationException
 
SchemaValidationException(Throwable) - Constructor for exception com.cloudera.cdk.data.SchemaValidationException
 
SchemaValidationException(String, Throwable) - Constructor for exception com.cloudera.cdk.data.SchemaValidationException
 
ScriptEvaluator<T> - Class in com.cloudera.cdk.morphline.scriptengine.java
Creates and compiles the given Java code block, wrapped into a Java method with the given return type and parameter types, along with a Java class definition that contains the given import statements.
ScriptEvaluator(String, String, Class<T>, String[], Class[], String) - Constructor for class com.cloudera.cdk.morphline.scriptengine.java.ScriptEvaluator
 
SeparateAttachmentsBuilder - Class in com.cloudera.cdk.morphline.stdlib
Command that emits one output record for each attachment in the input record's list of attachments; The result is a list of records, each of which has at most one attachment.
SeparateAttachmentsBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.SeparateAttachmentsBuilder
 
SEQUENCE_FILE_META_DATA - Static variable in class com.cloudera.cdk.morphline.hadoop.sequencefile.ReadSequenceFileBuilder
 
SerializationException - Exception in com.cloudera.cdk.data
Exception thrown when an object cannot be serialized to, or deserialized from, its binary representation.
SerializationException(String, Throwable) - Constructor for exception com.cloudera.cdk.data.SerializationException
 
setBatchSize(int) - Method in class com.cloudera.cdk.morphline.solr.SolrLocator
 
setCollectionName(String) - Method in class com.cloudera.cdk.morphline.solr.SolrLocator
 
setDatasetName(String) - Method in class com.cloudera.cdk.data.flume.Log4jAppender
 
setDatasetRepositoryClass(String) - Method in class com.cloudera.cdk.data.flume.Log4jAppender
Deprecated. Use datasetRepositoryUri with a 'repo:' URI.
setDatasetRepositoryUri(String) - Method in class com.cloudera.cdk.data.flume.Log4jAppender
 
setDocumentLoader(DocumentLoader) - Method in class com.cloudera.cdk.morphline.solr.SolrMorphlineContext.Builder
 
setExceptionHandler(ExceptionHandler) - Method in class com.cloudera.cdk.morphline.api.MorphlineContext.Builder
 
setHealthCheckRegistry(HealthCheckRegistry) - Method in class com.cloudera.cdk.morphline.api.MorphlineContext.Builder
 
setIndexSchema(IndexSchema) - Method in class com.cloudera.cdk.morphline.solr.SolrMorphlineContext.Builder
 
setMetricRegistry(MetricRegistry) - Method in class com.cloudera.cdk.morphline.api.MorphlineContext.Builder
 
setServerUrl(String) - Method in class com.cloudera.cdk.morphline.solr.SolrLocator
 
setSolrHomeDir(String) - Method in class com.cloudera.cdk.morphline.solr.SolrLocator
 
SetValuesBuilder - Class in com.cloudera.cdk.morphline.stdlib
For each input field, set the given record output field to the same values.
SetValuesBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.SetValuesBuilder
 
setWorkingDirectory(Path) - Method in class com.cloudera.cdk.morphline.hadoop.rcfile.SingleStreamFileSystem
 
setZkHost(String) - Method in class com.cloudera.cdk.morphline.solr.SolrLocator
 
shutdown() - Method in interface com.cloudera.cdk.morphline.solr.DocumentLoader
Releases allocated resources
shutdown() - Method in class com.cloudera.cdk.morphline.solr.SolrServerDocumentLoader
 
SingleStreamFileSystem - Class in com.cloudera.cdk.morphline.hadoop.rcfile
Very simple Filesystem Implementation which serves an InputStream for a given path.
SingleStreamFileSystem(InputStream, Path) - Constructor for class com.cloudera.cdk.morphline.hadoop.rcfile.SingleStreamFileSystem
 
SolrCellBuilder - Class in com.cloudera.cdk.morphline.solrcell
Command that pipes the first attachment of a record into one of the given Tika parsers, then maps the Tika output back to a record using SolrCell.
SolrCellBuilder() - Constructor for class com.cloudera.cdk.morphline.solrcell.SolrCellBuilder
 
SolrLocator - Class in com.cloudera.cdk.morphline.solr
Set of configuration parameters that identify the location and schema of a Solr server or SolrCloud; Based on this information this class can return the schema and a corresponding DocumentLoader.
SolrLocator(MorphlineContext) - Constructor for class com.cloudera.cdk.morphline.solr.SolrLocator
 
SolrLocator(Config, MorphlineContext) - Constructor for class com.cloudera.cdk.morphline.solr.SolrLocator
 
SolrMorphlineContext - Class in com.cloudera.cdk.morphline.solr
A context that is specific to Solr.
SolrMorphlineContext() - Constructor for class com.cloudera.cdk.morphline.solr.SolrMorphlineContext
For public access use SolrMorphlineContext.Builder.build() instead
SolrMorphlineContext.Builder - Class in com.cloudera.cdk.morphline.solr
Helper to construct a SolrMorphlineContext instance.
SolrMorphlineContext.Builder() - Constructor for class com.cloudera.cdk.morphline.solr.SolrMorphlineContext.Builder
 
SolrServerDocumentLoader - Class in com.cloudera.cdk.morphline.solr
A vehicle to load a list of Solr documents into a local or remote SolrServer.
SolrServerDocumentLoader(SolrServer, int) - Constructor for class com.cloudera.cdk.morphline.solr.SolrServerDocumentLoader
 
SplitBuilder - Class in com.cloudera.cdk.morphline.stdlib
Command that divides strings into substrings, by recognizing a separator (a.k.a.
SplitBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.SplitBuilder
 
SplitKeyValueBuilder - Class in com.cloudera.cdk.morphline.stdlib
Command that iterates over the items in a given record input field, interprets each item as a key-value pair where the key and value are separated by the given separator character, and adds the pair's value to the record field named after the pair's key.
SplitKeyValueBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.SplitKeyValueBuilder
 
StartReportingMetricsToCSVBuilder - Class in com.cloudera.cdk.morphline.stdlib
Command that starts periodically appending the metrics of the com.codahale.metrics.MetricRegistry of the morphline context to a set of CSV files, configured via a com.codahale.metrics.CsvReporter.Builder.
StartReportingMetricsToCSVBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.StartReportingMetricsToCSVBuilder
 
StartReportingMetricsToHTTPBuilder - Class in com.cloudera.cdk.morphline.metrics.servlets
Command that exposes liveness status, health check status, metrics state and thread dumps via a set of HTTP URLs served by Jetty, using the AdminServlet.
StartReportingMetricsToHTTPBuilder() - Constructor for class com.cloudera.cdk.morphline.metrics.servlets.StartReportingMetricsToHTTPBuilder
 
StartReportingMetricsToJMXBuilder - Class in com.cloudera.cdk.morphline.stdlib
Command that starts publishing the metrics of the com.codahale.metrics.MetricRegistry of the morphline context to JMX, configured via a com.codahale.metrics.JmxReporter.Builder.
StartReportingMetricsToJMXBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.StartReportingMetricsToJMXBuilder
 
StartReportingMetricsToSLF4JBuilder - Class in com.cloudera.cdk.morphline.stdlib
Command that starts periodically logging the metrics of the com.codahale.metrics.MetricRegistry of the morphline context to SLF4j, configured via a com.codahale.metrics.Slf4jReporter.Builder.
StartReportingMetricsToSLF4JBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.StartReportingMetricsToSLF4JBuilder
 
StripNonCharSolrContentHandlerFactory - Class in com.cloudera.cdk.morphline.solrcell
SolrContentHandler and associated factory that strips non-characters anhd trims on output.
StripNonCharSolrContentHandlerFactory(Collection<String>) - Constructor for class com.cloudera.cdk.morphline.solrcell.StripNonCharSolrContentHandlerFactory
 
SUPPORTED_MIME_TYPES - Static variable in class com.cloudera.cdk.morphline.stdio.AbstractParser
 

T

TIMESTAMP - Static variable in class com.cloudera.cdk.morphline.base.Fields
 
ToAvroBuilder - Class in com.cloudera.cdk.morphline.avro
Command that converts a morphline record to an Avro record.
ToAvroBuilder() - Constructor for class com.cloudera.cdk.morphline.avro.ToAvroBuilder
 
ToByteArrayBuilder - Class in com.cloudera.cdk.morphline.stdlib
Command that converts the Java objects in a given field via Object.toString() to their string representation, and then via String.getBytes(Charset) to their byte array representation.
ToByteArrayBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.ToByteArrayBuilder
 
toConfig(String) - Method in class com.cloudera.cdk.morphline.solr.SolrLocator
 
TokenizeTextBuilder - Class in com.cloudera.cdk.morphline.solr
A command that uses the embedded Solr/Lucene Analyzer library to generate tokens from a text string, without sending data to a Solr server.
TokenizeTextBuilder() - Constructor for class com.cloudera.cdk.morphline.solr.TokenizeTextBuilder
 
toString() - Method in class com.cloudera.cdk.data.DatasetDescriptor
 
toString() - Method in class com.cloudera.cdk.data.filesystem.FileSystemDatasetRepository
 
toString() - Method in class com.cloudera.cdk.data.filesystem.FileSystemMetadataProvider
 
toString() - Method in class com.cloudera.cdk.data.Format
 
toString() - Method in class com.cloudera.cdk.data.Key
 
toString() - Method in class com.cloudera.cdk.data.partition.HashFieldPartitioner
 
toString() - Method in class com.cloudera.cdk.data.partition.IdentityFieldPartitioner
 
toString() - Method in class com.cloudera.cdk.data.partition.IntRangeFieldPartitioner
 
toString() - Method in class com.cloudera.cdk.data.partition.ListFieldPartitioner
 
toString() - Method in class com.cloudera.cdk.data.partition.RangeFieldPartitioner
 
toString() - Method in class com.cloudera.cdk.data.PartitionKey
 
toString() - Method in class com.cloudera.cdk.data.PartitionStrategy
 
toString() - Method in class com.cloudera.cdk.morphline.api.Record
 
toString() - Method in class com.cloudera.cdk.morphline.solr.SolrLocator
 
ToStringBuilder - Class in com.cloudera.cdk.morphline.stdlib
Command that converts the Java objects in a given field via Object.toString() to their string representation, and optionally also applies String.trim().
ToStringBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.ToStringBuilder
 
transformFieldValue(Object) - Method in class com.cloudera.cdk.morphline.stdlib.AbstractFieldTransformCommand
Transforms the given input value to some output value
TranslateBuilder - Class in com.cloudera.cdk.morphline.stdlib
Command that examines each string value in a given field and replaces it with the replacement value defined in a given dictionary aka hash table.
TranslateBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.TranslateBuilder
 
TrimSolrContentHandlerFactory - Class in com.cloudera.cdk.morphline.solrcell
SolrContentHandler and associated factory that trims field values on output.
TrimSolrContentHandlerFactory(Collection<String>) - Constructor for class com.cloudera.cdk.morphline.solrcell.TrimSolrContentHandlerFactory
 
TryRulesBuilder - Class in com.cloudera.cdk.morphline.stdlib
A tryRules command consists of zero or more rules.
TryRulesBuilder() - Constructor for class com.cloudera.cdk.morphline.stdlib.TryRulesBuilder
 

U

UnknownFormatException - Exception in com.cloudera.cdk.data
Exception thrown when a Dataset is in an unknown format.
UnknownFormatException(String) - Constructor for exception com.cloudera.cdk.data.UnknownFormatException
 
UnknownFormatException(String, Throwable) - Constructor for exception com.cloudera.cdk.data.UnknownFormatException
 
UnknownFormatException(Throwable) - Constructor for exception com.cloudera.cdk.data.UnknownFormatException
 
UnpackBuilder - Class in com.cloudera.cdk.morphline.tika.decompress
Command that unpacks the first attachment.
UnpackBuilder() - Constructor for class com.cloudera.cdk.morphline.tika.decompress.UnpackBuilder
 
update(String, DatasetDescriptor) - Method in interface com.cloudera.cdk.data.DatasetRepository
Update an existing Dataset to reflect the supplied descriptor.
update(String, DatasetDescriptor) - Method in class com.cloudera.cdk.data.filesystem.FileSystemDatasetRepository
 
update(String, DatasetDescriptor) - Method in class com.cloudera.cdk.data.filesystem.FileSystemMetadataProvider
 
update(String, DatasetDescriptor) - Method in class com.cloudera.cdk.data.hcatalog.HCatalogDatasetRepository
 
update(String, DatasetDescriptor) - Method in interface com.cloudera.cdk.data.MetadataProvider
Update a DatasetDescriptor for the dataset named name.
update(String, DatasetDescriptor) - Method in interface com.cloudera.cdk.data.RandomAccessDatasetRepository
 
USE_IP - Static variable in class com.cloudera.cdk.morphline.stdlib.AddLocalHostBuilder
 
UserAgentBuilder - Class in com.cloudera.cdk.morphline.useragent
Command that parses user agent strings and returns structured higher level data like user agent family, operating system, version, and device type, using the underlying API and regexes.yaml BrowserScope database from https://github.com/tobie/ua-parser.
UserAgentBuilder() - Constructor for class com.cloudera.cdk.morphline.useragent.UserAgentBuilder
 

V

validateArguments() - Method in class com.cloudera.cdk.morphline.base.AbstractCommand
Helper that checks if the user provided configuration parameters are valid.
validateArguments(Config) - Method in class com.cloudera.cdk.morphline.base.Configs
 
validateEnum(Config, String, Class<T>, T...) - Method in class com.cloudera.cdk.morphline.base.Validator
Validates that an enum of the given type with the given value exists, and that this enum is contained in the given list of permitted choices; finally returns that enum object.
validateRange(Config, T, Comparable<T>, Comparable<T>) - Method in class com.cloudera.cdk.morphline.base.Validator
Validates that the given value is contained in the range [min, max]
Validator<T> - Class in com.cloudera.cdk.morphline.base
Simple helper to validate user configurable parameters.
Validator() - Constructor for class com.cloudera.cdk.morphline.base.Validator
 
valueFromString(String) - Method in class com.cloudera.cdk.data.FieldPartitioner
Deprecated. will be removed in 0.11.0
valueFromString(String) - Method in class com.cloudera.cdk.data.partition.DateFormatPartitioner
Deprecated. 
valueFromString(String) - Method in class com.cloudera.cdk.data.partition.HashFieldPartitioner
Deprecated. 
valueFromString(String) - Method in class com.cloudera.cdk.data.partition.IdentityFieldPartitioner
Deprecated. 
valueFromString(String) - Method in class com.cloudera.cdk.data.partition.IntRangeFieldPartitioner
Deprecated. 
valueFromString(String) - Method in class com.cloudera.cdk.data.partition.ListFieldPartitioner
Deprecated. 
valueFromString(String) - Method in class com.cloudera.cdk.data.partition.RangeFieldPartitioner
Deprecated. 
valueOf(String) - Static method in enum com.cloudera.cdk.morphline.base.Notifications.LifecycleEvent
Returns the enum constant of this type with the specified name.
values() - Static method in enum com.cloudera.cdk.morphline.base.Notifications.LifecycleEvent
Returns an array containing the constants of this enum type, in the order they are declared.
valueToString(T) - Method in class com.cloudera.cdk.data.FieldPartitioner
Deprecated. will be removed in 0.11.0
valueToString(Integer) - Method in class com.cloudera.cdk.data.partition.DayOfMonthFieldPartitioner
Deprecated. 
valueToString(Integer) - Method in class com.cloudera.cdk.data.partition.HourFieldPartitioner
Deprecated. 
valueToString(Integer) - Method in class com.cloudera.cdk.data.partition.MinuteFieldPartitioner
Deprecated. 
valueToString(Integer) - Method in class com.cloudera.cdk.data.partition.MonthFieldPartitioner
Deprecated. 
View<E> - Interface in com.cloudera.cdk.data
A View is a subset of a Dataset.

W

write(E) - Method in interface com.cloudera.cdk.data.DatasetWriter
Writer an entity of type E to the associated dataset.
WriteAvroToByteArrayBuilder - Class in com.cloudera.cdk.morphline.avro
Command that serializes the Avro records contained in the _attachment_body field into a byte array and replaces the _attachment_body field with that byte array.
WriteAvroToByteArrayBuilder() - Constructor for class com.cloudera.cdk.morphline.avro.WriteAvroToByteArrayBuilder
 

X

XQueryBuilder - Class in com.cloudera.cdk.morphline.saxon
Command that parses an InputStream that contains an XML document and runs the given XQuery over the XML document.
XQueryBuilder() - Constructor for class com.cloudera.cdk.morphline.saxon.XQueryBuilder
 
XSLTBuilder - Class in com.cloudera.cdk.morphline.saxon
Command that parses an InputStream that contains an XML document and runs the given XSL Transform over the XML document.
XSLTBuilder() - Constructor for class com.cloudera.cdk.morphline.saxon.XSLTBuilder
 

Y

year(String, String) - Static method in class com.cloudera.cdk.data.partition.PartitionFunctions
 
year(String, String) - Method in class com.cloudera.cdk.data.PartitionStrategy.Builder
Configure a partitioner for extracting the year from a timestamp field.
year(String) - Method in class com.cloudera.cdk.data.PartitionStrategy.Builder
Configure a partitioner for extracting the year from a timestamp field.
YearFieldPartitioner - Class in com.cloudera.cdk.data.partition
 
YearFieldPartitioner(String, String) - Constructor for class com.cloudera.cdk.data.partition.YearFieldPartitioner
 

A B C D E F G H I J K L M N O P R S T U V W X Y

Copyright © 2013 Cloudera. All rights reserved.