Deep Learning Nerds Academy
The ultimate Learning Platform for AI, Data Science, Data Analytics and Data Engineering.

PySpark - Parse a Column of JSON Strings
Academy Membership, PySpark, Python

Introduction: Parsing JSON strings with PySpark is an essential task when working with large datasets in JSON format. By transforming JSON data into a structured format, you can enable efficient processing and analysis. PySpark provides a powerful way to parse these JSON strings and extract their contents into separate columns,...

PySpark - Convert Column from String to Timestamp Format
Academy Membership, PySpark, Python

Introduction: In data processing, it's common to find timestamp fields as strings. Converting these string representations into proper timestamp formats is crucial for accurate data analysis and processing. In this tutorial, we will explore how to convert a string to a timestamp column in a PySpark DataFrame. Import...

Overview of Materialization types in dbt
Academy Membership, dbt, Data Engineering

Introduction: In this tutorial, we’ll dive into the concept of Materializations in dbt and explore the different types available. Understanding materializations is key to optimizing how data is stored, queried, and updated within your dbt projects. Whether you're just starting with dbt or preparing for the dbt...

PySpark - Convert Column from String to Date Format
Academy Membership, PySpark, Python

Introduction: In data processing, it's common to find date fields as strings. Converting these string representations into proper date formats is crucial for accurate data analysis and processing. In this tutorial, we will explore how to convert a string to a date column in a PySpark DataFrame. Import...

dbt Core vs dbt Cloud
Academy Membership, dbt, Data Engineering

Introduction: dbt comes in two versions - dbt Core and dbt Cloud. While both provide the core functionality for data transformation, they serve different purposes and are suited to different requirements. In this tutorial, we’ll dive into the features of dbt Core and dbt Cloud, highlighting the key differences...

Overview of essential dbt commands

Introduction: dbt empowers data professionals to transform raw data into clean, analytics-ready datasets. Mastering its essential commands is key to unlocking its full potential. In this tutorial, we’ll explore the key dbt commands, providing both the syntax and a short description for each command. This tutorial is also a...

PySpark - Extract a Substring from a DataFrame Column
Academy Membership, PySpark, Python

Introduction: When dealing with large datasets in PySpark, it's common to encounter situations where you need to manipulate string data within your DataFrame columns. One such operation is extracting a portion of a string, also known as a substring, from a column. In this tutorial, we will...
