I'd like to know the PySpark equivalent of the reset_index() method in pandas. When I call it the same way I would in pandas:
data.reset_index()
I get an error:
AttributeError: 'DataFrame' object has no attribute 'reset_index'
As the other comments mentioned, if you do need to add an index column to your DataFrame, you can use monotonically_increasing_id:
from pyspark.sql.functions import monotonically_increasing_id
df = df.withColumn("index_column", monotonically_increasing_id())
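One caveat: unlike reset_index() in pandas, the IDs this produces are unique and increasing but not consecutive. The Spark documentation describes the current implementation as putting the partition ID in the upper 31 bits and the record number within each partition in the lower 33 bits. The sketch below is plain Python, not Spark; simulated_id is a hypothetical helper, and the two-partition, three-row layout is an assumption chosen only to show what the gaps look like.

```python
# Plain-Python illustration (no Spark required) of the documented ID layout:
# partition ID in the upper 31 bits, row position within the partition in the lower 33 bits.

def simulated_id(partition_id: int, row_in_partition: int) -> int:
    """Hypothetical helper that mimics the documented layout of monotonically_increasing_id."""
    return (partition_id << 33) + row_in_partition

# Assumed layout for illustration: 2 partitions with 3 rows each.
ids = [simulated_id(p, r) for p in range(2) for r in range(3)]
print(ids)  # [0, 1, 2, 8589934592, 8589934593, 8589934594]
```

If you need a gap-free 0..n-1 index like the one pandas gives you, one common approach is row_number() from pyspark.sql.functions over a Window with an orderBy, minus 1. Be aware that a window without partitionBy moves all rows into a single partition, which can be slow on large data.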