Loading data

Bringing your own data

It's possible to point your Lakehouse node at an arbitrary S3 bucket with Delta Tables inside of it. However, there are some major caveats to doing so. (These will be resolved in future releases.)

Caveats

  • The tables must be stored as Delta Lake Tables within the location.
  • A Delta Lake Table (or Delta Table) is a folder of Parquet files along with some JSON metadata.

Loading data into your bucket

You can use the lakehouse-loader utility to export data from an arbitrary Postgres instance to Delta Tables in a storage bucket. See Delta Lake Table tools for more information on how to get and use that utility.

For more details, see querying delta tables.


Could this page be better? Report a problem or suggest an addition!