I’m using Cloudera Quickstart VM which includes Apache Spark 1.6.0 and Apache Hive.
I want to connect Spark with Hive so that I can run SQL queries on my Hive tables directly from PySpark.
-
Please clarify your specific problem or provide additional details to highlight exactly what you need. As it's currently written, it's hard to tell exactly what you're asking.Toni– Toni2025-11-08 09:28:49 +00:00Commented Nov 8 at 9:28
-
Thank you for your message. My specific problem is that I have data stored in a Hive database, and I want to perform ETL using Apache Spark in Cloudera. However, I am finding it very difficult to perform this process and need guidance on how to connect Spark with Hive and run ETL tasks effectively. Please let me know if you need any additional details.Akhlaq Ahmad– Akhlaq Ahmad2025-11-08 22:27:05 +00:00Commented Nov 8 at 22:27
Add a comment
|