Introduction
In this tutorial, we one-hot encode a categorical column of a Pandas DataFrame. To do this, we use the get_dummies() function of Pandas.
Import Libraries
First, we import the following Python module:
import pandas as pd
Create Pandas DataFrame
Next, we create a Pandas DataFrame with some example data from a dictionary:
data = {
"language": ["Python", "Python", "Java", "JavaScript"],
"framework": ["Django", "FastAPI", "Spring", "ReactJS"],
"users": [20000, 9000, 7000, 5000]
}
df = pd.DataFrame(data)
df
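Before encoding, it can help to check which distinct values the categorical column contains, since each one becomes its own dummy column. The following is a minimal sketch that rebuilds the example DataFrame from above; the variable name categories is only chosen for illustration:

```python
import pandas as pd

# Same example data as above
df = pd.DataFrame({
    "language": ["Python", "Python", "Java", "JavaScript"],
    "framework": ["Django", "FastAPI", "Spring", "ReactJS"],
    "users": [20000, 9000, 7000, 5000],
})

# Distinct values of the column we plan to encode (sorted for readability)
categories = sorted(df["language"].unique())
print(categories)

# How often each value occurs
print(df["language"].value_counts())
```

With this data, the "language" column has three distinct values, so we can expect three dummy columns.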
Create Dummy Variables
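Now, we would like to one-hot encode the column "language" of the Pandas DataFrame.
To do this, we convert the column into dummy variables by using the get_dummies() function of Pandas. Each distinct value of "language" becomes its own column, named with the prefix "language_", and the original column is dropped: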
df = pd.get_dummies(df, columns=['language'])
df
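The data type of the dummy columns depends on the Pandas version: recent versions return boolean columns by default, while older ones return integers. The sketch below assumes we want 0/1 integers and therefore passes the dtype parameter of get_dummies(). It also shows the drop_first parameter, which omits the first category. The variable names encoded and reduced are only chosen for illustration:

```python
import pandas as pd

# Same example data as above
df = pd.DataFrame({
    "language": ["Python", "Python", "Java", "JavaScript"],
    "framework": ["Django", "FastAPI", "Spring", "ReactJS"],
    "users": [20000, 9000, 7000, 5000],
})

# One dummy column per category, stored as 0/1 integers
encoded = pd.get_dummies(df, columns=["language"], dtype=int)
print(encoded.columns.tolist())

# drop_first=True omits the first category ("Java" here), which is
# then implied by zeros in all remaining dummy columns
reduced = pd.get_dummies(df, columns=["language"], dtype=int, drop_first=True)
print(reduced.columns.tolist())
```

Dropping one category can be useful for models that are sensitive to perfectly correlated input columns, such as linear regression. Whether this is needed depends on the model you train afterwards.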
Conclusion
Congratulations! Now you are one step closer to becoming an AI expert. You have seen that it is very easy to one-hot encode a categorical variable of a Pandas DataFrame: we can simply use the get_dummies() function of Pandas. Try it yourself!