spark-submit --master yarn-cluster --deploy-mode cluster test.py
end up with error
import pandas as pd ImportError: No module named pandas
this is the only error I see.
using anaconda python distribution 2.7
PYSPARK_VENV]/lib/python2.7/site-packages/ location has pandas.