I have a dataset consisting of C/C++ functions, one per row. I want to take each function, split it into a list of words (A), and append each list A to a list B, so that B is a list of lists in Python.
So far I have been using the code below, but my dataset has 128312 items and it is slow.
Can this be improved? I am open to suggestions.
functionSourceDF = hdf.get('functionSource')
.
.
.
FSDarray = []
for i in range(0,size):
    FSDarray.append(functionSourceDF[i].split(" "))
FSDarray = np.array(FSDarray)
Thank you.
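
One possible direction, as a minimal sketch rather than a measured fix: a list comprehension does the same splitting without the indexed loop and the repeated `append` calls. The names `function_sources` and `tokenize_all` are hypothetical, and the two sample strings only stand in for the rows read from the HDF5 file. It may also be worth skipping the final `np.array(...)` conversion, since the token lists have different lengths and recent NumPy versions need `dtype=object` to build an array from them.

```python
# Hypothetical stand-in for the rows read from the HDF5 file.
function_sources = [
    "int add(int a, int b) { return a + b; }",
    "void noop() { }",
]


def tokenize_all(sources, sep=" "):
    """Split every source string on `sep` and return a list of token lists."""
    # A comprehension iterates over the rows directly, avoiding
    # per-row indexing and repeated list.append calls.
    return [src.split(sep) for src in sources]


token_lists = tokenize_all(function_sources)
print(token_lists[1])
```

Splitting on `" "` matches the original loop. Calling `split()` with no argument would also split on tabs and newlines and drop empty strings, which may or may not be what is wanted for source code.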