-5

I’m using Cloudera Quickstart VM which includes Apache Spark 1.6.0 and Apache Hive.
I want to connect Spark with Hive so that I can run SQL queries on my Hive tables directly from PySpark.

2
  • Please clarify your specific problem or provide additional details to highlight exactly what you need. As it's currently written, it's hard to tell exactly what you're asking. Commented Nov 8 at 9:28
  • Thank you for your message. My specific problem is that I have data stored in a Hive database, and I want to perform ETL using Apache Spark in Cloudera. However, I am finding it very difficult to perform this process and need guidance on how to connect Spark with Hive and run ETL tasks effectively. Please let me know if you need any additional details. Commented Nov 8 at 22:27

0

Start asking to get answers

Find the answer to your question by asking.

Ask question

Explore related questions

See similar questions with these tags.