Data Engineer

Data Engineer

As a Data Engineer, I collect, extract and transform raw data in order to provide clean, reliable and usable data.

142 posts
How to Ingest Data from Kafka Streams to Delta Tables Using PySpark in Databricks
Academy Membership DatabricksPySpark

How to Ingest Data from Kafka Streams to Delta Tables Using PySpark in Databricks

📘 Introduction Real-time data ingestion is a critical part of modern data architectures. Organizations need to process and store continuous streams of information for analytics, monitoring, and machine learning. Databricks, with the combined power of PySpark and Delta Lake, provides an efficient way to build end-to-end streaming pipelines that handle data...

Overview of all important YAML Files in dbt
Academy Membership dbtData Engineering

Overview of all important YAML Files in dbt

📘Introduction When working with dbt (data build tool), YAML files are the backbone of your project’s configuration. They define how dbt behaves, how your models connect to data sources, and how metadata, documentation, and tests are managed. Understanding these YAML files and knowing where they are located within your...

How to Generate a Hash from Multiple Columns in PySpark
Academy Membership PySparkData Engineering

How to Generate a Hash from Multiple Columns in PySpark

📘 Introduction When processing massive datasets in PySpark, it’s often necessary to uniquely identify rows or efficiently detect changes across records. Using multiple columns as a composite key can quickly become cumbersome and inefficient — especially during joins or deduplication. A better solution is to generate a single hash value derived...

Configuring DEV and PROD Environments in dbt
Academy Membership dbtData Engineering

Configuring DEV and PROD Environments in dbt

📘Introduction In this hands-on dbt tutorial, you’ll learn how to configure separate development (DEV) and production (PROD) environments to safely build, test, and deploy your data models. We’ll walk through why environment separation matters and how to configure your profiles.yml so you can switch between environments with...

You’ve successfully subscribed to Deep Learning Nerds | The ultimate Learning Platform for AI and Data Science
Welcome back! You’ve successfully signed in.
Great! You’ve successfully signed up.
Success! Your email is updated.
Your link has expired
Success! Check your email for magic link to sign-in.