In this lesson we will:

  • Lesson contents 1
  • Lesson contents 2
  • Lesson contents 3

Querying External Files

ClickHouse allows us query data that is stored outside of ClickHouse in formats such as Parquet, Iceberg, Delta, CSV etc.

This is useful in a few situatoins.

Firstly, if we want to ingest that data into ClickHouse, it is sometimes useful to query it and then use that query as the basis for the ingest.

select * from 

Querying Data Lakes

We may also wish to. One example of this is when we maintain a data lake.

The usual mechanism for querying data held within a database or data warehouse is to ingest the data into the database and let it manage it for us.

Icnreasingly however, many companies are looking to decouple this, and store their data in other formats such as Parquet.

In some intsances, they are running a data lake, and want to keep this data avaialble.

In this instance, it is useful to treat our database as a query engine, and allow us to query data that is stored elsewhere.

How Does It Work

Querying is easy

select * from file ()
Next Lesson:
03

Querying Object Storage

In this lesson we will learn about querying data that is stored in cloud object storage in locations such as AWS S3.

0h 15m




Work With The Experts In Real-Time Analytics & AI

we help enterprise organisations deploy powerful real-time Data, Analytics and AI solutions based on ClickHouse, the worlds fastest open-source database.

Join our mailing list for regular insights:

We help enterprise organisations deploy advanced data, analytics and AI enabled systems based on modern cloud-native technology.

© 2024 Ensemble. All Rights Reserved.