Zeppelin

Zeppelin is an advanced, web-based notebook that enables interactive data analysis and visualization. It supports multiple languages such as Python, SQL, Scala, and more, allowing for seamless integration with various data processing backends. With its powerful collaboration features, Zeppelin facilitates real-time sharing and teamwork, making it an ideal tool for data scientists, analysts, and developers working on complex data projects. Its extensible architecture and rich visualization capabilities ensure a versatile and robust environment for conducting and presenting data-driven insights.

Requirements

Below are the prerequisites you need to fulfil before connecting the e6data cluster to Metabase:

  • Users of the e6data cluster must use Apache Zeppelin 0.8.2 or newer.

  • Install the latest e6data jdbc driver.

  • Establish a Direct or TLS connection in your cluster.

  • You can obtain the connection details required from the Connection Information tab in the e6data cluster.

Connection

Get the necessary connection information for your cluster:

Once you have gathered this information, you can create a new connection in Zeppelin. To connect with Zeppelin, follow the below instructions:

  1. To access the list of interpreters available for querying, navigate to the top right corner and click on the Profile. From there, select "Interpreter," which will display the list of interpreters you can choose from for your queries.

  2. Go to the top right corner and click on the Create option.

  1. Use below configurations to setup the new E6 JDBC Interpreter

    1. Interpreter Name - Give the name for the interpreter

    2. Interpreter group - Select jdbc from the drop down menu

    3. From the properties section use below configuration setting

  1. In the Artifact tab of the Dependencies section, provide the file path of the E6 JDBC jar that you have downloaded on the EC2 instance.

  2. Click on save to save the changes.

Driver class remains same and manual input is required on zeppelin i.e. io.e6.jdbc.driver.E6Driver

TLS/HTTP

e6data utilizes globally trusted certificates, ensuring that the host and port provided by the connection dialogue are adequate. Any e6data cluster necessitating authentication must also employ TLS/HTTPS. If adhering to globally trusted certificate best practices, utilize the cluster’s HTTPS URL in the connection string as detailed in the aforementioned steps.

Last updated

#930: Cross account hive GCP

Change request updated