I noticed I can use CASE-THEN with Spark if I use an SQLContext and the .sql() function. Is there a way to use this in a JAVA syntax, directly on dataframes, too? How?
Now, I write:
SparkConf sparkConf = new SparkConf();
JavaSparkContext ctx = new JavaSparkContext(sparkConf);
SQLContext sqlContext = new SQLContext(ctx);
DataFrame df = //some imported data
df.registerTempTable("df");
sqlContext.sql("SELECT *use case-then in here* FROM df");
I'm looking for something like
df.select(case("this").then("that"));