I have a column of ArrayType in PySpark. I want to filter the values inside the array of each row (I don't want to filter out any rows!) without using a UDF.
For instance, given this dataset with column A of ArrayType:
| A          |
______________
|[-2, 1, 7]  |
|[1]         |
|[-4, -1, -3]|
If I want to keep only the positive values, the output would be:
| A          |
______________
|[1, 7]      |
|[1]         |
|[]          |