Introduction

In this tutorial, we will explain step-by-step how o read an Excel file into a PySpark DataFrame in Databricks.

Configure Cluster

First, install on a Databricks cluster the spark-excel library (also referred as com.crealytics.spark.excel).

To do this, select your Databricks cluster in the "Compute" page and navigate to the "Libraries" tab.

Click on the "Install new" button.

You can view this post with the tier: Academy Membership

Join academy now to read the post and get access to the full library of premium posts for academy members only.

Join Academy Already have an account? Sign In