I have a dataset that looks like this:
postcode house_number col2 col3
xxx xxx xxx xxx
xxx xxx xxx xxx
I want to group the data by postcode and house_number. If two rows have the same postcode and house_number, they are the same property. I then want to construct a unique_id for each property (in other words, rows sharing a unique_id must have the same postcode and house_number, but their values for col2 / col3 might differ), something like:
unique_id postcode house_number col2 col3
0 111 222 xxx xxx
0 111 222 xxx xxx
1 xxx xxx xxx xxx
.....
I tried new_df = ppd_df.groupby(['postcode','house_number']).reset_index(), but it raised AttributeError: 'DataFrameGroupBy' object has no attribute 'reset_index'. I'm also not sure how to construct the unique_id column. Can someone help, please? Thanks.
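For reference, here is a minimal sketch of one way this might be done. The error occurs because groupby returns a DataFrameGroupBy object rather than a DataFrame, so reset_index is not available until an aggregation or transformation has been applied. One option is the groupby method ngroup(), which numbers each group from 0 upward. The sample values below are made up for illustration and only stand in for the real ppd_df; sort=False is an assumption that ids should follow the order in which properties first appear.

```python
import pandas as pd

# Hypothetical sample data standing in for the real ppd_df
ppd_df = pd.DataFrame({
    "postcode": ["111", "111", "333", "111"],
    "house_number": ["222", "222", "444", "555"],
    "col2": ["a", "b", "c", "d"],
    "col3": ["e", "f", "g", "h"],
})

# ngroup() assigns one integer per (postcode, house_number) pair;
# sort=False numbers the groups in order of first appearance
group_ids = ppd_df.groupby(["postcode", "house_number"], sort=False).ngroup()

# Work on a copy and put unique_id in front as the first column
new_df = ppd_df.copy()
new_df.insert(0, "unique_id", group_ids)

print(new_df)
```

Rows with the same postcode and house_number get the same unique_id, while col2 and col3 are left untouched. If the ids do not need to follow first-appearance order, sort=False can be dropped.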