Deep Learning Nerds Academy
The ultimate Learning Platform for AI, Data Science, Data Analytics and Data Engineering.

How to Generate a Hash from Multiple Columns in PySpark
Academy Membership PySparkData Engineering

How to Generate a Hash from Multiple Columns in PySpark

📘 Introduction When processing massive datasets in PySpark, it’s often necessary to uniquely identify rows or efficiently detect changes across records. Using multiple columns as a composite key can quickly become cumbersome and inefficient — especially during joins or deduplication. A better solution is to generate a single hash value derived...

Configuring DEV and PROD Environments in dbt
Academy Membership dbtData Engineering

Configuring DEV and PROD Environments in dbt

📘Introduction In this hands-on dbt tutorial, you’ll learn how to configure separate development (DEV) and production (PROD) environments to safely build, test, and deploy your data models. We’ll walk through why environment separation matters and how to configure your profiles.yml so you can switch between environments with...

Using Jinja loops for flexible Aggregations in dbt
Academy Membership dbtData Engineering

Using Jinja loops for flexible Aggregations in dbt

📘Introduction In this hands-on dbt tutorial, you’ll learn how to make your aggregations dynamic and flexible using Jinja loops inside your SQL models. Instead of writing multiple aggregation functions, you’ll see how to dynamically generate aggregation logic — saving time and reducing repetitive SQL code. 🎓 Preparing for dbt Analytics...

You’ve successfully subscribed to Deep Learning Nerds | The ultimate Learning Platform for AI and Data Science
Welcome back! You’ve successfully signed in.
Great! You’ve successfully signed up.
Success! Your email is updated.
Your link has expired
Success! Check your email for magic link to sign-in.