Understanding _dlt_id and _dlt_parent_id in dlt
Learn what _dlt_id and _dlt_parent_id mean in dlt, why they appear in normalized tables, and how to use them in SQL joins.
Learn what _dlt_id and _dlt_parent_id mean in dlt, why they appear in normalized tables, and how to use them in SQL joins.
Learn how dlt normalizes nested JSON into parent and child tables in DuckDB using a simple students and courses example.
Learn when to use replace, append, or merge in a dlt pipeline with practical DuckDB examples and beginner-friendly explanations.
Learn how to load a local JSON file into DuckDB with dlt, inspect the result with SQL, and understand a simple data pipeline pattern.
Learn PySpark Structured Streaming step by step by building a beginner-friendly real-time data pipeline with JSON files, readStream, writeStream, output modes, triggers, and checkpoints.
Learn how to load a local CSV file into DuckDB with dlt, inspect the result with SQL, and understand the basic pipeline pattern.
Learn what dbt snapshots are, why they matter, and how to track historical changes in your data warehouse with a beginner-friendly customer status example.
Learn what dbt exposures are, why they matter, and how to connect dbt models to dashboards, reports, notebooks, and machine learning workflows.
📘 Introduction In this hands-on tutorial, you will learn how to load data from a REST API into DuckDB using dlt. This is a great first local data pipeline because you do not need a cloud warehouse, a complex setup, or production credentials. We will use dlt to fetch data...