There is a program called "cufflinks" which is run as follows:
cufflinks -o <output-dir> <input-file>
This program takes 1 file as input and generates 4 files as output in the "output-dir".
I am trying to run the same program on a Hadoop cluster using Runtime.exec() in a mapper class. I am setting
output-dir=/some/path/on/HDFS
I was expecting that the 4 files will be generated on HDFS as o/p. However, that is not true and the o/p directory on HDFS does not contain any of these 4 files.
I then tried setting
output-dir=/tmp/output/
and it worked.
Can anyone please suggest why it does not work on HDFS? What do I need to do to make it work on HDFS?
Thanks.