Understanding Catalogs in e6data

Most analytical data is stored in blob storage platforms like AWS S3, GCS, or Azure Blob Storage. The structural metadata of this data (table names, column names, etc.) is typically stored in Metastores like Hive, Glue, Google Dataproc Metastore, Delta Lake, etc.

An e6data Catalog requires access to a Metastore to understand where the data it needs to query is stored.

Last updated