Azure Data Lake Storage
This guide shows you how to import data from files stored in Azure Data Lake Storage Gen2 (ADLS Gen2).
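Enable Azure Data Lake Storage using the pinot-adls plugin. In the controller or server, add the plugin to the JVM options, for example -Dplugins.include=pinot-adls, with -Dplugins.dir pointing at your plugins directory.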
By default, Pinot loads every plugin it finds in the plugins directory, so you can just drop this plugin there. If you specify -Dplugins.include, you must list all the plugins you want to use, e.g. pinot-json, pinot-avro, pinot-kafka-2.0.
Azure Blob Storage provides the following options:
accountName: Name of the Azure account under which the storage is created.
accessKey: Access key required for the authentication.
fileSystemName: Name of the file system to use, for example, the container name (similar to the bucket name in S3).
enableChecksum: Enable MD5 checksum for verification. Default is false.
Each of these properties should be prefixed by pinot.[node].storage.factory.adl2. where node is either controller or server depending on the config (the pinot.[node].storage.factory.class.adl2 key itself holds the file system class name), like this:
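A minimal sketch of the four options on a controller. It assumes the options sit under pinot.controller.storage.factory.adl2., with the class.adl2 key reserved for the file system class name. Verify that split against your Pinot version. All values are hypothetical placeholders.

```properties
# Hypothetical placeholder values; replace with your own storage account details.
pinot.controller.storage.factory.adl2.accountName=my-account
pinot.controller.storage.factory.adl2.accessKey=<your-access-key>
pinot.controller.storage.factory.adl2.fileSystemName=my-container
# Optional; MD5 verification is off (false) by default.
pinot.controller.storage.factory.adl2.enableChecksum=true
```

For a server, the same keys would use pinot.server in place of pinot.controller.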
Examples
Job spec
Controller config
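The following is a sketch of what a controller configuration for ADLS Gen2 might look like, not a definitive reference. The paths, account name, key, and file system name are placeholders. The segment fetcher entries assume that the controller should also fetch segments over the adl2 scheme.

```properties
# Deep store location and local scratch space (placeholder paths).
controller.data.dir=adl2://path/to/data/directory/
controller.local.temp.dir=/path/to/local/temp/directory

# Register the ADLS Gen2 file system for the adl2 scheme.
pinot.controller.storage.factory.class.adl2=org.apache.pinot.plugin.filesystem.ADLSGen2PinotFS
pinot.controller.storage.factory.adl2.accountName=my-account
pinot.controller.storage.factory.adl2.accessKey=<your-access-key>
pinot.controller.storage.factory.adl2.fileSystemName=fs-name

# Allow segments to be fetched through the adl2 scheme (assumed to be wanted here).
pinot.controller.segment.fetcher.protocols=file,http,adl2
pinot.controller.segment.fetcher.adl2.class=org.apache.pinot.common.utils.fetcher.PinotFSSegmentFetcher
```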
Server config
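A comparable sketch for a server, under the same assumptions: placeholder credentials, and key names that follow the pinot.server prefix convention. Check the keys against the Pinot version you run.

```properties
# Register the ADLS Gen2 file system for the adl2 scheme.
pinot.server.storage.factory.class.adl2=org.apache.pinot.plugin.filesystem.ADLSGen2PinotFS
pinot.server.storage.factory.adl2.accountName=my-account
pinot.server.storage.factory.adl2.accessKey=<your-access-key>
pinot.server.storage.factory.adl2.fileSystemName=fs-name

# Allow the server to download segments through the adl2 scheme.
pinot.server.segment.fetcher.protocols=file,http,adl2
pinot.server.segment.fetcher.adl2.class=org.apache.pinot.common.utils.fetcher.PinotFSSegmentFetcher
```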
Minion config