In Databricks, I uploaded my source data under a volume folder (e.g., /Volumes/my_catalog/my_schema/landing_source/). Now, I want to create a DataFrame or table using this volume path as the source.
1. Using Spark API:
I am able to successfully create a DataFrame using the spark.read API:
df = spark.read.format("csv") \
    .option("header", "true") \
    .load("/Volumes/my_catalog/my_schema/landing_source/")
2. Using SQL CREATE TABLE with datasource:
When I try to create a table directly using SQL with the volume path as the data source:
CREATE TABLE IF NOT EXISTS my_catalog.my_schema.my_table
USING CSV
OPTIONS (
  path "/Volumes/my_catalog/my_schema/landing_source/",
  header "true",
  inferSchema "true"
)
I get the following exception:
[RequestId=4800ec53-1a9d-946e-b61f-0ab7c93b88fc ErrorClass=INVALID_PARAMETER_VALUE.INVALID_PARAMETER_VALUE] Missing cloud file system scheme
JVM stacktrace:
com.databricks.sql.managedcatalog.UnityCatalogServiceException
  at com.databricks.managedcatalog.ErrorDetailsHandler.wrapServiceException(ErrorDetailsHandler.scala:111)
  at com.databricks.managedcatalog.ErrorDetailsHandler.wrapServiceException$(ErrorDetailsHandler.scala:66)
  at com.databricks.managedcatalog.ManagedCatalogClientImpl.wrapServiceException(ManagedCatalogClientImpl.scala:266)
  at com.databricks.managedcatalog.ManagedCatalogClientImpl.recordAndWrapExceptionBase(ManagedCatalogClientImpl.scala:7309)
  at com.databricks.managedcatalog.ManagedCatalogClientImpl.recordAndWrapException(ManagedCatalogClientImpl.scala:7295)
  at com.databricks.managedcatalog.ManagedCatalogClientImpl.generateTemporaryPathCredentials(ManagedCatalogClientImpl.scala:6277)
Currently I am using a Databricks trial account for my learning. How can I fix this issue?
Instead of passing the path in OPTIONS, try using the LOCATION parameter.
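A sketch of that suggestion, reusing the catalog, schema, and table names from the question (untested against a trial workspace; Unity Catalog external tables normally point at cloud storage registered as an external location, so a `/Volumes/` path may still be rejected as a LOCATION):

```sql
-- Suggested fix: move the path out of OPTIONS into LOCATION.
CREATE TABLE IF NOT EXISTS my_catalog.my_schema.my_table
USING CSV
OPTIONS (
  header "true",
  inferSchema "true"
)
LOCATION '/Volumes/my_catalog/my_schema/landing_source/';

-- If the volume path is still rejected as a table location, an alternative
-- is a CTAS that reads the volume files into a managed table:
CREATE TABLE IF NOT EXISTS my_catalog.my_schema.my_table
AS SELECT * FROM read_files(
  '/Volumes/my_catalog/my_schema/landing_source/',
  format => 'csv',
  header => true
);
```

The CTAS form sidesteps the "Missing cloud file system scheme" error because the resulting table is managed by Unity Catalog rather than an external table bound to the volume path.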