ADL Metadata Extracted
List of all the metadata that Secoda pulls from Azure Data Lake
Supported file formats
Extraction of metadata is only supported for the following file formats:
CSV (*.csv)
TSV (*.tsv)
JSON (*.json)
JSONL (*.jsonl)
Parquet (*.parquet)
Apache Avro (*.avro)
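As a rough illustration of the list above, a file can be checked against the supported extensions before extraction is expected to pick it up. This is a minimal sketch, not Secoda's implementation: the names `SUPPORTED_EXTENSIONS` and `is_supported` are hypothetical, and case-insensitive matching is an assumption the documentation does not confirm.

```python
from pathlib import PurePosixPath

# Extensions taken from the supported formats list above.
# The set and function names are illustrative, not part of any Secoda API.
SUPPORTED_EXTENSIONS = {".csv", ".tsv", ".json", ".jsonl", ".parquet", ".avro"}


def is_supported(path: str) -> bool:
    """Return True if the file's final extension is in the supported list."""
    # Assumption: matching is case-insensitive; the source does not say.
    return PurePosixPath(path).suffix.lower() in SUPPORTED_EXTENSIONS
```

Only the final extension is examined, so under this sketch a compressed file such as `orders.csv.gz` would not count as supported.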
What does Secoda extract from Azure Data Lake?
Database (Storage Account containers are referred to as databases in Secoda)
Name
Schema (Directories within a container are referred to as schemas in Secoda)
Name
Table (Files are referred to as tables in Secoda)
Name
File format type (Parquet, CSV, JSON, etc.)
Columns (Fields in files are referred to as columns in Secoda)
Name
Data type
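The naming scheme above (container to database, directory to schema, file to table) can be sketched as a small mapping function. This is an illustration under stated assumptions, not Secoda's actual logic: `SecodaLocation` and `map_path` are hypothetical names, nested directories are assumed to be joined into a single schema name, and the table name is assumed to keep the file extension.

```python
from dataclasses import dataclass
from pathlib import PurePosixPath


@dataclass
class SecodaLocation:
    database: str     # Storage Account container
    schema: str       # directory within the container
    table: str        # file name
    file_format: str  # derived from the file extension


def map_path(container: str, file_path: str) -> SecodaLocation:
    """Map a container and a file path inside it to database/schema/table names."""
    p = PurePosixPath(file_path)
    # Assumption: nested directories collapse into one schema name, and a
    # file at the container root gets an empty schema.
    parent = str(p.parent)
    schema = "" if parent == "." else parent
    # Assumption: the table name keeps the extension.
    return SecodaLocation(
        database=container,
        schema=schema,
        table=p.name,
        file_format=p.suffix.lstrip(".").lower(),
    )
```

For example, `map_path("analytics", "sales/2024/orders.parquet")` would yield the database `analytics`, the schema `sales/2024`, the table `orders.parquet`, and the file format `parquet`.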
Important Notes
When multiple versions of a file exist, Secoda extracts metadata from the most recent version only.
Extraction is limited to 100,000 files per container.