public class KinesisUtils
extends Object
Constructor and Description |
---|
KinesisUtils() |
Modifier and Type | Method and Description |
---|---|
static JavaReceiverInputDStream<byte[]> |
createStream(JavaStreamingContext jssc,
String streamName,
String endpointUrl,
Duration checkpointInterval,
com.amazonaws.services.kinesis.clientlibrary.lib.worker.InitialPositionInStream initialPositionInStream,
StorageLevel storageLevel)
Create an input stream that pulls messages from a Kinesis stream.
|
static JavaReceiverInputDStream<byte[]> |
createStream(JavaStreamingContext jssc,
String kinesisAppName,
String streamName,
String endpointUrl,
String regionName,
com.amazonaws.services.kinesis.clientlibrary.lib.worker.InitialPositionInStream initialPositionInStream,
Duration checkpointInterval,
StorageLevel storageLevel)
Create an input stream that pulls messages from a Kinesis stream.
|
static <T> JavaReceiverInputDStream<T> |
createStream(JavaStreamingContext jssc,
String kinesisAppName,
String streamName,
String endpointUrl,
String regionName,
com.amazonaws.services.kinesis.clientlibrary.lib.worker.InitialPositionInStream initialPositionInStream,
Duration checkpointInterval,
StorageLevel storageLevel,
Function<com.amazonaws.services.kinesis.model.Record,T> messageHandler,
Class<T> recordClass)
Create an input stream that pulls messages from a Kinesis stream.
|
static <T> JavaReceiverInputDStream<T> |
createStream(JavaStreamingContext jssc,
String kinesisAppName,
String streamName,
String endpointUrl,
String regionName,
com.amazonaws.services.kinesis.clientlibrary.lib.worker.InitialPositionInStream initialPositionInStream,
Duration checkpointInterval,
StorageLevel storageLevel,
Function<com.amazonaws.services.kinesis.model.Record,T> messageHandler,
Class<T> recordClass,
String awsAccessKeyId,
String awsSecretKey)
Create an input stream that pulls messages from a Kinesis stream.
|
static JavaReceiverInputDStream<byte[]> |
createStream(JavaStreamingContext jssc,
String kinesisAppName,
String streamName,
String endpointUrl,
String regionName,
com.amazonaws.services.kinesis.clientlibrary.lib.worker.InitialPositionInStream initialPositionInStream,
Duration checkpointInterval,
StorageLevel storageLevel,
String awsAccessKeyId,
String awsSecretKey)
Create an input stream that pulls messages from a Kinesis stream.
|
static ReceiverInputDStream<byte[]> |
createStream(StreamingContext ssc,
String streamName,
String endpointUrl,
Duration checkpointInterval,
com.amazonaws.services.kinesis.clientlibrary.lib.worker.InitialPositionInStream initialPositionInStream,
StorageLevel storageLevel)
Create an input stream that pulls messages from a Kinesis stream.
|
static ReceiverInputDStream<byte[]> |
createStream(StreamingContext ssc,
String kinesisAppName,
String streamName,
String endpointUrl,
String regionName,
com.amazonaws.services.kinesis.clientlibrary.lib.worker.InitialPositionInStream initialPositionInStream,
Duration checkpointInterval,
StorageLevel storageLevel)
Create an input stream that pulls messages from a Kinesis stream.
|
static <T> ReceiverInputDStream<T> |
createStream(StreamingContext ssc,
String kinesisAppName,
String streamName,
String endpointUrl,
String regionName,
com.amazonaws.services.kinesis.clientlibrary.lib.worker.InitialPositionInStream initialPositionInStream,
Duration checkpointInterval,
StorageLevel storageLevel,
scala.Function1<com.amazonaws.services.kinesis.model.Record,T> messageHandler,
scala.reflect.ClassTag<T> evidence$1)
Create an input stream that pulls messages from a Kinesis stream.
|
static <T> ReceiverInputDStream<T> |
createStream(StreamingContext ssc,
String kinesisAppName,
String streamName,
String endpointUrl,
String regionName,
com.amazonaws.services.kinesis.clientlibrary.lib.worker.InitialPositionInStream initialPositionInStream,
Duration checkpointInterval,
StorageLevel storageLevel,
scala.Function1<com.amazonaws.services.kinesis.model.Record,T> messageHandler,
String awsAccessKeyId,
String awsSecretKey,
scala.reflect.ClassTag<T> evidence$2)
Create an input stream that pulls messages from a Kinesis stream.
|
static ReceiverInputDStream<byte[]> |
createStream(StreamingContext ssc,
String kinesisAppName,
String streamName,
String endpointUrl,
String regionName,
com.amazonaws.services.kinesis.clientlibrary.lib.worker.InitialPositionInStream initialPositionInStream,
Duration checkpointInterval,
StorageLevel storageLevel,
String awsAccessKeyId,
String awsSecretKey)
Create an input stream that pulls messages from a Kinesis stream.
|
public static <T> ReceiverInputDStream<T> createStream(StreamingContext ssc, String kinesisAppName, String streamName, String endpointUrl, String regionName, com.amazonaws.services.kinesis.clientlibrary.lib.worker.InitialPositionInStream initialPositionInStream, Duration checkpointInterval, StorageLevel storageLevel, scala.Function1<com.amazonaws.services.kinesis.model.Record,T> messageHandler, scala.reflect.ClassTag<T> evidence$1)
Note: The AWS credentials will be discovered using the DefaultAWSCredentialsProviderChain on the workers. See AWS documentation to understand how DefaultAWSCredentialsProviderChain gets the AWS credentials.
ssc
- StreamingContext objectkinesisAppName
- Kinesis application name used by the Kinesis Client Library
(KCL) to update DynamoDBstreamName
- Kinesis stream nameendpointUrl
- Url of Kinesis service (e.g., https://kinesis.us-east-1.amazonaws.com)regionName
- Name of region used by the Kinesis Client Library (KCL) to update
DynamoDB (lease coordination and checkpointing) and CloudWatch (metrics)initialPositionInStream
- In the absence of Kinesis checkpoint info, this is the
worker's initial starting position in the stream.
The values are either the beginning of the stream
per Kinesis' limit of 24 hours
(InitialPositionInStream.TRIM_HORIZON) or
the tip of the stream (InitialPositionInStream.LATEST).checkpointInterval
- Checkpoint interval for Kinesis checkpointing.
See the Kinesis Spark Streaming documentation for more
details on the different types of checkpoints.storageLevel
- Storage level to use for storing the received objects.
StorageLevel.MEMORY_AND_DISK_2 is recommended.messageHandler
- A custom message handler that can generate a generic output from a
Kinesis Record
, which contains both message data, and metadata.evidence$1
- (undocumented)public static <T> ReceiverInputDStream<T> createStream(StreamingContext ssc, String kinesisAppName, String streamName, String endpointUrl, String regionName, com.amazonaws.services.kinesis.clientlibrary.lib.worker.InitialPositionInStream initialPositionInStream, Duration checkpointInterval, StorageLevel storageLevel, scala.Function1<com.amazonaws.services.kinesis.model.Record,T> messageHandler, String awsAccessKeyId, String awsSecretKey, scala.reflect.ClassTag<T> evidence$2)
Note: The given AWS credentials will get saved in DStream checkpoints if checkpointing is enabled. Make sure that your checkpoint directory is secure.
ssc
- StreamingContext objectkinesisAppName
- Kinesis application name used by the Kinesis Client Library
(KCL) to update DynamoDBstreamName
- Kinesis stream nameendpointUrl
- Url of Kinesis service (e.g., https://kinesis.us-east-1.amazonaws.com)regionName
- Name of region used by the Kinesis Client Library (KCL) to update
DynamoDB (lease coordination and checkpointing) and CloudWatch (metrics)initialPositionInStream
- In the absence of Kinesis checkpoint info, this is the
worker's initial starting position in the stream.
The values are either the beginning of the stream
per Kinesis' limit of 24 hours
(InitialPositionInStream.TRIM_HORIZON) or
the tip of the stream (InitialPositionInStream.LATEST).checkpointInterval
- Checkpoint interval for Kinesis checkpointing.
See the Kinesis Spark Streaming documentation for more
details on the different types of checkpoints.storageLevel
- Storage level to use for storing the received objects.
StorageLevel.MEMORY_AND_DISK_2 is recommended.messageHandler
- A custom message handler that can generate a generic output from a
Kinesis Record
, which contains both message data, and metadata.awsAccessKeyId
- AWS AccessKeyId (if null, will use DefaultAWSCredentialsProviderChain)awsSecretKey
- AWS SecretKey (if null, will use DefaultAWSCredentialsProviderChain)evidence$2
- (undocumented)public static ReceiverInputDStream<byte[]> createStream(StreamingContext ssc, String kinesisAppName, String streamName, String endpointUrl, String regionName, com.amazonaws.services.kinesis.clientlibrary.lib.worker.InitialPositionInStream initialPositionInStream, Duration checkpointInterval, StorageLevel storageLevel)
Note: The AWS credentials will be discovered using the DefaultAWSCredentialsProviderChain on the workers. See AWS documentation to understand how DefaultAWSCredentialsProviderChain gets the AWS credentials.
ssc
- StreamingContext objectkinesisAppName
- Kinesis application name used by the Kinesis Client Library
(KCL) to update DynamoDBstreamName
- Kinesis stream nameendpointUrl
- Url of Kinesis service (e.g., https://kinesis.us-east-1.amazonaws.com)regionName
- Name of region used by the Kinesis Client Library (KCL) to update
DynamoDB (lease coordination and checkpointing) and CloudWatch (metrics)initialPositionInStream
- In the absence of Kinesis checkpoint info, this is the
worker's initial starting position in the stream.
The values are either the beginning of the stream
per Kinesis' limit of 24 hours
(InitialPositionInStream.TRIM_HORIZON) or
the tip of the stream (InitialPositionInStream.LATEST).checkpointInterval
- Checkpoint interval for Kinesis checkpointing.
See the Kinesis Spark Streaming documentation for more
details on the different types of checkpoints.storageLevel
- Storage level to use for storing the received objects.
StorageLevel.MEMORY_AND_DISK_2 is recommended.public static ReceiverInputDStream<byte[]> createStream(StreamingContext ssc, String kinesisAppName, String streamName, String endpointUrl, String regionName, com.amazonaws.services.kinesis.clientlibrary.lib.worker.InitialPositionInStream initialPositionInStream, Duration checkpointInterval, StorageLevel storageLevel, String awsAccessKeyId, String awsSecretKey)
Note: The given AWS credentials will get saved in DStream checkpoints if checkpointing is enabled. Make sure that your checkpoint directory is secure.
ssc
- StreamingContext objectkinesisAppName
- Kinesis application name used by the Kinesis Client Library
(KCL) to update DynamoDBstreamName
- Kinesis stream nameendpointUrl
- Url of Kinesis service (e.g., https://kinesis.us-east-1.amazonaws.com)regionName
- Name of region used by the Kinesis Client Library (KCL) to update
DynamoDB (lease coordination and checkpointing) and CloudWatch (metrics)initialPositionInStream
- In the absence of Kinesis checkpoint info, this is the
worker's initial starting position in the stream.
The values are either the beginning of the stream
per Kinesis' limit of 24 hours
(InitialPositionInStream.TRIM_HORIZON) or
the tip of the stream (InitialPositionInStream.LATEST).checkpointInterval
- Checkpoint interval for Kinesis checkpointing.
See the Kinesis Spark Streaming documentation for more
details on the different types of checkpoints.storageLevel
- Storage level to use for storing the received objects.
StorageLevel.MEMORY_AND_DISK_2 is recommended.awsAccessKeyId
- AWS AccessKeyId (if null, will use DefaultAWSCredentialsProviderChain)awsSecretKey
- AWS SecretKey (if null, will use DefaultAWSCredentialsProviderChain)public static ReceiverInputDStream<byte[]> createStream(StreamingContext ssc, String streamName, String endpointUrl, Duration checkpointInterval, com.amazonaws.services.kinesis.clientlibrary.lib.worker.InitialPositionInStream initialPositionInStream, StorageLevel storageLevel)
Note:
- The AWS credentials will be discovered using the DefaultAWSCredentialsProviderChain
on the workers. See AWS documentation to understand how DefaultAWSCredentialsProviderChain
gets AWS credentials.
- The region of the endpointUrl
will be used for DynamoDB and CloudWatch.
- The Kinesis application name used by the Kinesis Client Library (KCL) will be the app name
in SparkConf
.
ssc
- StreamingContext objectstreamName
- Kinesis stream nameendpointUrl
- Endpoint url of Kinesis service
(e.g., https://kinesis.us-east-1.amazonaws.com)checkpointInterval
- Checkpoint interval for Kinesis checkpointing.
See the Kinesis Spark Streaming documentation for more
details on the different types of checkpoints.initialPositionInStream
- In the absence of Kinesis checkpoint info, this is the
worker's initial starting position in the stream.
The values are either the beginning of the stream
per Kinesis' limit of 24 hours
(InitialPositionInStream.TRIM_HORIZON) or
the tip of the stream (InitialPositionInStream.LATEST).storageLevel
- Storage level to use for storing the received objects
StorageLevel.MEMORY_AND_DISK_2 is recommended.public static <T> JavaReceiverInputDStream<T> createStream(JavaStreamingContext jssc, String kinesisAppName, String streamName, String endpointUrl, String regionName, com.amazonaws.services.kinesis.clientlibrary.lib.worker.InitialPositionInStream initialPositionInStream, Duration checkpointInterval, StorageLevel storageLevel, Function<com.amazonaws.services.kinesis.model.Record,T> messageHandler, Class<T> recordClass)
Note: The AWS credentials will be discovered using the DefaultAWSCredentialsProviderChain on the workers. See AWS documentation to understand how DefaultAWSCredentialsProviderChain gets the AWS credentials.
jssc
- Java StreamingContext objectkinesisAppName
- Kinesis application name used by the Kinesis Client Library
(KCL) to update DynamoDBstreamName
- Kinesis stream nameendpointUrl
- Url of Kinesis service (e.g., https://kinesis.us-east-1.amazonaws.com)regionName
- Name of region used by the Kinesis Client Library (KCL) to update
DynamoDB (lease coordination and checkpointing) and CloudWatch (metrics)initialPositionInStream
- In the absence of Kinesis checkpoint info, this is the
worker's initial starting position in the stream.
The values are either the beginning of the stream
per Kinesis' limit of 24 hours
(InitialPositionInStream.TRIM_HORIZON) or
the tip of the stream (InitialPositionInStream.LATEST).checkpointInterval
- Checkpoint interval for Kinesis checkpointing.
See the Kinesis Spark Streaming documentation for more
details on the different types of checkpoints.storageLevel
- Storage level to use for storing the received objects.
StorageLevel.MEMORY_AND_DISK_2 is recommended.messageHandler
- A custom message handler that can generate a generic output from a
Kinesis Record
, which contains both message data, and metadata.recordClass
- Class of the records in DStreampublic static <T> JavaReceiverInputDStream<T> createStream(JavaStreamingContext jssc, String kinesisAppName, String streamName, String endpointUrl, String regionName, com.amazonaws.services.kinesis.clientlibrary.lib.worker.InitialPositionInStream initialPositionInStream, Duration checkpointInterval, StorageLevel storageLevel, Function<com.amazonaws.services.kinesis.model.Record,T> messageHandler, Class<T> recordClass, String awsAccessKeyId, String awsSecretKey)
Note: The given AWS credentials will get saved in DStream checkpoints if checkpointing is enabled. Make sure that your checkpoint directory is secure.
jssc
- Java StreamingContext objectkinesisAppName
- Kinesis application name used by the Kinesis Client Library
(KCL) to update DynamoDBstreamName
- Kinesis stream nameendpointUrl
- Url of Kinesis service (e.g., https://kinesis.us-east-1.amazonaws.com)regionName
- Name of region used by the Kinesis Client Library (KCL) to update
DynamoDB (lease coordination and checkpointing) and CloudWatch (metrics)initialPositionInStream
- In the absence of Kinesis checkpoint info, this is the
worker's initial starting position in the stream.
The values are either the beginning of the stream
per Kinesis' limit of 24 hours
(InitialPositionInStream.TRIM_HORIZON) or
the tip of the stream (InitialPositionInStream.LATEST).checkpointInterval
- Checkpoint interval for Kinesis checkpointing.
See the Kinesis Spark Streaming documentation for more
details on the different types of checkpoints.storageLevel
- Storage level to use for storing the received objects.
StorageLevel.MEMORY_AND_DISK_2 is recommended.messageHandler
- A custom message handler that can generate a generic output from a
Kinesis Record
, which contains both message data, and metadata.recordClass
- Class of the records in DStreamawsAccessKeyId
- AWS AccessKeyId (if null, will use DefaultAWSCredentialsProviderChain)awsSecretKey
- AWS SecretKey (if null, will use DefaultAWSCredentialsProviderChain)public static JavaReceiverInputDStream<byte[]> createStream(JavaStreamingContext jssc, String kinesisAppName, String streamName, String endpointUrl, String regionName, com.amazonaws.services.kinesis.clientlibrary.lib.worker.InitialPositionInStream initialPositionInStream, Duration checkpointInterval, StorageLevel storageLevel)
Note: The AWS credentials will be discovered using the DefaultAWSCredentialsProviderChain on the workers. See AWS documentation to understand how DefaultAWSCredentialsProviderChain gets the AWS credentials.
jssc
- Java StreamingContext objectkinesisAppName
- Kinesis application name used by the Kinesis Client Library
(KCL) to update DynamoDBstreamName
- Kinesis stream nameendpointUrl
- Url of Kinesis service (e.g., https://kinesis.us-east-1.amazonaws.com)regionName
- Name of region used by the Kinesis Client Library (KCL) to update
DynamoDB (lease coordination and checkpointing) and CloudWatch (metrics)initialPositionInStream
- In the absence of Kinesis checkpoint info, this is the
worker's initial starting position in the stream.
The values are either the beginning of the stream
per Kinesis' limit of 24 hours
(InitialPositionInStream.TRIM_HORIZON) or
the tip of the stream (InitialPositionInStream.LATEST).checkpointInterval
- Checkpoint interval for Kinesis checkpointing.
See the Kinesis Spark Streaming documentation for more
details on the different types of checkpoints.storageLevel
- Storage level to use for storing the received objects.
StorageLevel.MEMORY_AND_DISK_2 is recommended.public static JavaReceiverInputDStream<byte[]> createStream(JavaStreamingContext jssc, String kinesisAppName, String streamName, String endpointUrl, String regionName, com.amazonaws.services.kinesis.clientlibrary.lib.worker.InitialPositionInStream initialPositionInStream, Duration checkpointInterval, StorageLevel storageLevel, String awsAccessKeyId, String awsSecretKey)
Note: The given AWS credentials will get saved in DStream checkpoints if checkpointing is enabled. Make sure that your checkpoint directory is secure.
jssc
- Java StreamingContext objectkinesisAppName
- Kinesis application name used by the Kinesis Client Library
(KCL) to update DynamoDBstreamName
- Kinesis stream nameendpointUrl
- Url of Kinesis service (e.g., https://kinesis.us-east-1.amazonaws.com)regionName
- Name of region used by the Kinesis Client Library (KCL) to update
DynamoDB (lease coordination and checkpointing) and CloudWatch (metrics)initialPositionInStream
- In the absence of Kinesis checkpoint info, this is the
worker's initial starting position in the stream.
The values are either the beginning of the stream
per Kinesis' limit of 24 hours
(InitialPositionInStream.TRIM_HORIZON) or
the tip of the stream (InitialPositionInStream.LATEST).checkpointInterval
- Checkpoint interval for Kinesis checkpointing.
See the Kinesis Spark Streaming documentation for more
details on the different types of checkpoints.storageLevel
- Storage level to use for storing the received objects.
StorageLevel.MEMORY_AND_DISK_2 is recommended.awsAccessKeyId
- AWS AccessKeyId (if null, will use DefaultAWSCredentialsProviderChain)awsSecretKey
- AWS SecretKey (if null, will use DefaultAWSCredentialsProviderChain)public static JavaReceiverInputDStream<byte[]> createStream(JavaStreamingContext jssc, String streamName, String endpointUrl, Duration checkpointInterval, com.amazonaws.services.kinesis.clientlibrary.lib.worker.InitialPositionInStream initialPositionInStream, StorageLevel storageLevel)
Note:
- The AWS credentials will be discovered using the DefaultAWSCredentialsProviderChain
on the workers. See AWS documentation to understand how DefaultAWSCredentialsProviderChain
gets AWS credentials.
- The region of the endpointUrl
will be used for DynamoDB and CloudWatch.
- The Kinesis application name used by the Kinesis Client Library (KCL) will be the app name in
SparkConf
.
jssc
- Java StreamingContext objectstreamName
- Kinesis stream nameendpointUrl
- Endpoint url of Kinesis service
(e.g., https://kinesis.us-east-1.amazonaws.com)checkpointInterval
- Checkpoint interval for Kinesis checkpointing.
See the Kinesis Spark Streaming documentation for more
details on the different types of checkpoints.initialPositionInStream
- In the absence of Kinesis checkpoint info, this is the
worker's initial starting position in the stream.
The values are either the beginning of the stream
per Kinesis' limit of 24 hours
(InitialPositionInStream.TRIM_HORIZON) or
the tip of the stream (InitialPositionInStream.LATEST).storageLevel
- Storage level to use for storing the received objects
StorageLevel.MEMORY_AND_DISK_2 is recommended.