The raw data looks like the following:
YAPM1,20100901,23:36:01.563,Quote,,,,,,,4563,,,,,,
YAPM1,20100901,23:36:03.745,Quote,,,,,4537,,,,,,,,
In the first row, the value (4563) is preceded by more empty columns than the value in the second row. I parse each line as follows:
val tokens = List.fromString(line, ',')
The result:
List(YAPM1, 20100901, 23:36:01.563, Quote, 4563)
List(YAPM1, 20100901, 23:36:03.745, Quote, 4537)
The empty columns are dropped during parsing, so the resulting Lists give no way to deduce which rows had the extra columns. How can I parse the lines so that this information is preserved?
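For illustration, here is a minimal sketch of one direction that might work, assuming it is acceptable to use Java's `String.split` with a negative limit instead of `List.fromString`. With a negative limit, empty fields (including trailing ones) are kept, so the position of each value survives. The names `KeepEmptyColumns`, `tokenize`, and `nonEmptyIndices` are hypothetical helpers introduced only for this example.

```scala
object KeepEmptyColumns {
  // Split on commas and keep every field, including empty ones.
  // The negative limit stops split from discarding trailing empty strings.
  def tokenize(line: String): List[String] =
    line.split(",", -1).toList

  // Positions of the fields that actually contain data.
  def nonEmptyIndices(tokens: List[String]): List[Int] =
    tokens.zipWithIndex.collect { case (t, i) if t.nonEmpty => i }

  def main(args: Array[String]): Unit = {
    val rows = List(
      "YAPM1,20100901,23:36:01.563,Quote,,,,,,,4563,,,,,,",
      "YAPM1,20100901,23:36:03.745,Quote,,,,,4537,,,,,,,,"
    )
    rows.foreach { line =>
      val tokens = tokenize(line)
      // The column index of the value now differs between the two rows.
      println(s"${tokens.length} fields, non-empty at ${nonEmptyIndices(tokens)}")
    }
  }
}
```

Because the empty strings stay in the list, rows can be told apart by the index at which the value appears, or by comparing `tokens.length` if rows differ in total column count.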