Azure Data Lake Storage


Azure Data Lake Storage


Prerequisites

Whitelist CData IPs

To establish a connection to Azure Data Lake Storage, you need to allow access to Azure Data Lake Storage via CData’s IP. When hosting Azure Data Lake Storage behind a firewall, you must safelist these IP addresses in your firewall.

  • Range: 52.224.0.160 to 52.224.0.175

  • CIDR notation: 52.224.0.160/28

Ensure Azure Data Lake Storage is Publicly Accessible

Provide a public facing IP/domain to connect to this data source. The following private IP ranges do not work:

  • 10.0.0.0 to 10.255.255.255

  • 172.16.0.0 to 172.31.255.255

  • 192.168.0.0 to 192.168.255.255

  • 127.0.0.1 (aka ‘localhost’)

Setup Guide

Follow these steps to connect Azure Data Lake Storage to your CData Connect account:

  1. Open the Connections page of the CData Connect dashboard.

  2. Click + Add Connection in the upper-right corner.

  3. Type Azure Data Lake Storage into the search field, then click the data source name.

  4. On the Basic Settings tab of the new connection, enter a connection name or keep the default name.

  5. Enter the following information:

    • File Format—choose the file format type.

    • Auth Scheme—set to Azure AD.

    • Azure Storage Account—the name of the storage account.

    • File System(Optional)—enter the file system protocol.

  6. Click Sign in to connect securely through OAuth. This action opens the Azure Data Lake Storage sign-in page in a new tab.

  7. Log into your Azure Data Lake Storage account and provide the requested permissions (if applicable).

  8. At the top of the CData Connect Add Azure Data Lake Storage Connection page, click Save & Test.

    • If the connection test succeeds, a message indicates that your connection has been created.

    • If the connection test fails, ensure that you entered your login information correctly with no stray spaces or other characters. Then, try again.