Data Engineer

As a Data Engineer, I collect, extract and transform raw data in order to provide clean, reliable and usable data.

151 posts
Query CSV files in Python using DuckDB
Academy Membership, DuckDB, Python

📘 Introduction: One of DuckDB’s most useful features is the ability to query CSV files directly, with no need to load them into a database first. This tutorial will guide you through running SQL queries on a CSV file using Python. ✅ Prerequisites: Before you begin, make sure you have: 🐍☑️ Installed Python...

Query a Pandas DataFrame using DuckDB
Academy Membership, DuckDB, Python

📘 Introduction: If you enjoy working with pandas but wish you could use clean, powerful SQL at any time, then DuckDB is the right tool for you. With DuckDB, you can query your DataFrames instantly without having to set up a database, run a server, or change your workflow. ✅ Prerequisites: Before...

How to install DuckDB with pip
Academy Membership, DuckDB, Data Engineering

📘 Introduction: In this blog post, we’ll walk through how to install DuckDB, a fast and lightweight analytical database engine designed for modern data workflows. Whether you're working on data analysis, ETL pipelines, or experimenting with in-process SQL queries, DuckDB is incredibly easy to set up and use....

PySpark coalesce() Function Explained
Academy Membership, PySpark, Python

📘 Introduction: In many real-world datasets, the same type of information can appear in more than one column. A customer may provide an email address, a phone number, or a backup contact, and different systems may populate different fields. When you want to select the first available non-null value from several...
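
The "first available non-null value" idea can be sketched in plain Python before touching Spark. This is only an illustration of the semantics, not the post's own code: the helper `first_non_null` and the sample customer records are hypothetical. In PySpark, `pyspark.sql.functions.coalesce` applies the same rule column by column.

```python
from typing import Any, Optional


def first_non_null(*values: Any) -> Optional[Any]:
    """Return the first argument that is not None, or None if all are None."""
    for value in values:
        if value is not None:
            return value
    return None


# Hypothetical customer records: each system filled in a different field.
customers = [
    {"name": "Ana", "email": "ana@example.com", "phone": None, "backup_contact": None},
    {"name": "Ben", "email": None, "phone": "555-0100", "backup_contact": "ben.alt@example.com"},
    {"name": "Cy", "email": None, "phone": None, "backup_contact": None},
]

# Priority order: email first, then phone, then backup contact.
preferred_contacts = [
    first_non_null(c["email"], c["phone"], c["backup_contact"]) for c in customers
]
print(preferred_contacts)
```

The order of the arguments sets the priority, and a row where every field is missing still yields a null result, so downstream code may need to handle that case.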

Using pre-hooks in dbt: A Hands-On Guide
Academy Membership, dbt, Data Engineering

📘 Introduction: In this hands-on dbt tutorial, you'll learn how to use pre-hooks to automate tasks such as creating backup tables before a model runs. Pre-hooks allow you to execute SQL before your dbt model builds, which is useful for auditing, data quality checks, or preparing the environment. 🎓 Preparing...

Kafka Producers and Consumers Explained: How Data Flows in Apache Kafka
Academy Membership, Kafka

📘 Introduction: Modern systems produce endless streams of real-time data, from app events and online purchases to sensor readings and transactions. To handle this flow efficiently, applications need a fast, reliable way to move data between services as it happens. That’s where Apache Kafka comes in. Kafka is a distributed...
