Deep Learning Nerds Academy
The ultimate Learning Platform for AI, Data Science, Data Analytics and Data Engineering.

PySpark - Drop Columns from a DataFrame

Introduction: In this tutorial, we want to drop columns from a PySpark DataFrame. To do this, we use the drop() method of PySpark. Import Libraries: First, we import the following Python modules: from pyspark.sql import SparkSession. Create SparkSession: Before we can work with PySpark, we need...

PySpark - Add Columns to a DataFrame

Introduction: In this tutorial, we want to add columns to a PySpark DataFrame. To do this, we use the withColumn() method of PySpark. Import Libraries: First, we import the following Python modules: from pyspark.sql import SparkSession from pyspark.sql.functions import lit, col, sum, when from...

PySpark - Select Columns from a DataFrame

Introduction: In this tutorial, we want to select specific columns from a PySpark DataFrame. To do this, we use the select() method of PySpark in different variants. Import Libraries: First, we import the following Python modules: from pyspark.sql import SparkSession and from pyspark.sql.functions import col. Create...

PySpark - Rename Columns of a DataFrame

Introduction: In this tutorial, we want to rename a PySpark DataFrame column. To do this, we use the withColumnRenamed() method of PySpark. Import Libraries: First, we import the following Python modules: from pyspark.sql import SparkSession. Create SparkSession: Before we can work with PySpark, we need to...

PostgreSQL - SELECT DISTINCT

Introduction: In this tutorial, we want to select data from a table using a SQL statement. The result should not contain duplicate rows. To do this, we use the SELECT statement with the DISTINCT clause. We have already created the table student with the following data: Example...
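
The excerpt above is cut off before its example, so the following is only a minimal sketch of what SELECT DISTINCT does. It runs against Python's built-in sqlite3 module as a stand-in for PostgreSQL, and the columns and rows of the student table are hypothetical, since the tutorial's actual data is not shown here.

```python
import sqlite3

# Assumption: an in-memory SQLite database stands in for PostgreSQL.
# SELECT DISTINCT behaves the same way for this simple query.
conn = sqlite3.connect(":memory:")

# Hypothetical schema and rows; the tutorial's real student table is not shown.
conn.execute("CREATE TABLE student (name TEXT, subject TEXT)")
conn.executemany(
    "INSERT INTO student (name, subject) VALUES (?, ?)",
    [("Anna", "Math"), ("Ben", "Physics"), ("Anna", "Math"), ("Cara", "Math")],
)

# Plain SELECT returns every row, including the repeated ("Anna", "Math").
all_rows = conn.execute("SELECT name, subject FROM student").fetchall()

# DISTINCT over both columns removes rows that are identical in name AND subject.
distinct_rows = conn.execute(
    "SELECT DISTINCT name, subject FROM student ORDER BY name"
).fetchall()

# DISTINCT over one column returns each value of that column once.
distinct_subjects = conn.execute(
    "SELECT DISTINCT subject FROM student ORDER BY subject"
).fetchall()

conn.close()

print(all_rows)
print(distinct_rows)
print(distinct_subjects)
```

Note that DISTINCT applies to the whole selected row, not to a single column: two rows count as duplicates only if all selected columns match.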

Pandas - Filter Rows from a DataFrame

Introduction: In this tutorial, we want to filter rows from a Pandas DataFrame based on specific conditions. To do this, we use boolean conditions combined with logical operators, the loc indexer and the query() method of Pandas. Import Libraries: First, we import the following Python modules: import pandas...
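
As a rough illustration of the three filtering approaches named in the excerpt above, the sketch below applies the same condition in each style. The DataFrame contents and column names are hypothetical, because the tutorial's own data is not part of this excerpt.

```python
import pandas as pd

# Hypothetical sample data; not taken from the tutorial.
df = pd.DataFrame(
    {
        "name": ["Anna", "Ben", "Cara", "Dan"],
        "age": [23, 31, 27, 35],
        "city": ["Berlin", "Munich", "Berlin", "Hamburg"],
    }
)

# 1) Boolean mask with logical operators (& = and, | = or, ~ = not).
#    Each condition needs its own parentheses.
mask_result = df[(df["age"] > 25) & (df["city"] == "Berlin")]

# 2) The same condition with loc, which uses square brackets
#    and can also restrict the returned columns.
loc_result = df.loc[(df["age"] > 25) & (df["city"] == "Berlin"), ["name", "age"]]

# 3) The same condition written as a query() expression string.
query_result = df.query("age > 25 and city == 'Berlin'")

print(mask_result)
print(loc_result)
print(query_result)
```

All three select the same rows; loc is convenient when only some columns are needed, and query() tends to read more easily for longer conditions.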

PySpark - Convert PySpark to Pandas DataFrame

Introduction: In this tutorial, we want to convert a PySpark DataFrame into a Pandas DataFrame with a specific schema. To do this, we use the toPandas() method of PySpark. Import Libraries: First, we import the following Python modules: from pyspark.sql import SparkSession. Create SparkSession: Before we...

PySpark - Convert Pandas to PySpark DataFrame

Introduction: In this tutorial, we want to convert a Pandas DataFrame into a PySpark DataFrame with a specific schema. To do this, we use the createDataFrame() function of PySpark. Import Libraries: First, we import the following Python modules: import pandas as pd from pyspark.sql import...

PostgreSQL - SELECT

Introduction: In this tutorial, we want to select data from a table using a SQL statement. To do this, we use the SELECT statement. We have already created the table student with the following data: Example 1 - SELECT specific columns: We want to select the columns...
