Understanding _dlt_id and _dlt_parent_id in dlt
Learn what _dlt_id and _dlt_parent_id mean in dlt, why they appear in normalized tables, and how to use them in SQL joins.
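As a rough illustration of the join pattern, the sketch below hand-builds two tables shaped like the ones dlt produces when it normalizes a nested list: a parent table with a _dlt_id column and a child table whose _dlt_parent_id points back to it. It uses Python's built-in sqlite3 instead of a real dlt pipeline, and the table names, rows, and ID values are made up for the example (real dlt tables also carry other columns, such as _dlt_load_id).

```python
import sqlite3

# In-memory database standing in for the destination (e.g. DuckDB).
conn = sqlite3.connect(":memory:")

# Hypothetical tables mimicking dlt's normalized layout:
# the child table is named <parent>__<nested_field>.
conn.executescript(
    """
    CREATE TABLE students (
        name TEXT,
        _dlt_id TEXT              -- unique row key of the parent row
    );
    CREATE TABLE students__courses (
        title TEXT,
        _dlt_parent_id TEXT,      -- points to students._dlt_id
        _dlt_list_idx INTEGER,    -- position of the item in the nested list
        _dlt_id TEXT              -- unique row key of the child row
    );
    INSERT INTO students VALUES ('Alice', 'p1'), ('Bob', 'p2');
    INSERT INTO students__courses VALUES
        ('Math',    'p1', 0, 'c1'),
        ('Physics', 'p1', 1, 'c2'),
        ('History', 'p2', 0, 'c3');
    """
)

# Rebuild the nested relationship: child._dlt_parent_id = parent._dlt_id.
rows = conn.execute(
    """
    SELECT s.name, c.title
    FROM students AS s
    JOIN students__courses AS c
      ON c._dlt_parent_id = s._dlt_id
    ORDER BY s.name, c._dlt_list_idx
    """
).fetchall()

for name, title in rows:
    print(name, title)
```

The same ON clause should carry over to a DuckDB destination; only the connection code would differ.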
As a Data Engineer, I collect, extract, and transform raw data to deliver clean, reliable, and usable datasets.
Learn how dlt normalizes nested JSON into parent and child tables in DuckDB using a simple students and courses example.
Learn when to use replace, append, or merge in a dlt pipeline with practical DuckDB examples and beginner-friendly explanations.
Learn how to load a local JSON file into DuckDB with dlt, inspect the result with SQL, and understand a simple data pipeline pattern.
Learn PySpark Structured Streaming step by step by building a beginner-friendly real-time data pipeline with JSON files, readStream, writeStream, output modes, triggers, and checkpoints.
Compare FastAPI and Flask for serving AI agents, ML APIs, LLM backends, and production-ready Python services.
Learn how to load a local CSV file into DuckDB with dlt, inspect the result with SQL, and understand the basic pipeline pattern.
Learn what dbt snapshots are, why they matter, and how to track historical changes in your data warehouse with a beginner-friendly customer status example.
Learn what dbt exposures are, why they matter, and how to connect dbt models to dashboards, reports, notebooks, and machine learning workflows.