I am doing an outer join of two pandas DataFrames on multiple columns:
DF1:
ColumnA ColumnB ColumnC ColumnD
1 2 3 4
1 2 3 4
DF2:
ColumnE ColumnF ColumnG ColumnH
1 2 3 4
1 2 3 4
Merging code:
import pandas as pd
df = pd.merge(DF1, DF2, left_on=['ColumnA','ColumnB','ColumnC','ColumnD'], right_on=['ColumnE','ColumnF','ColumnG','ColumnH'], how='outer')
Actual outcome:
ColumnA ColumnB ColumnC ColumnD ColumnE ColumnF ColumnG ColumnH
1 2 3 4 1 2 3 4
1 2 3 4 1 2 3 4
1 2 3 4 1 2 3 4
1 2 3 4 1 2 3 4
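For reference, a minimal reproducible sketch of the setup above. The frames are rebuilt from the tables shown, so the integer values and dtypes are an assumption about the real data. It illustrates that every row in DF1 pairs with every DF2 row carrying the same key combination, so 2 matching rows on each side yield 2 x 2 = 4 rows:

```python
import pandas as pd

# Both frames contain the same key combination twice (values taken from the tables above)
DF1 = pd.DataFrame({"ColumnA": [1, 1], "ColumnB": [2, 2], "ColumnC": [3, 3], "ColumnD": [4, 4]})
DF2 = pd.DataFrame({"ColumnE": [1, 1], "ColumnF": [2, 2], "ColumnG": [3, 3], "ColumnH": [4, 4]})

df = pd.merge(
    DF1,
    DF2,
    left_on=["ColumnA", "ColumnB", "ColumnC", "ColumnD"],
    right_on=["ColumnE", "ColumnF", "ColumnG", "ColumnH"],
    how="outer",
)

# Each of the 2 left rows matches each of the 2 right rows
print(df.shape)  # (4, 8)
```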
Expected outcome (the rows should appear only twice, because the combination of column values matches exactly between the two datasets):
ColumnA ColumnB ColumnC ColumnD ColumnE ColumnF ColumnG ColumnH
1 2 3 4 1 2 3 4
1 2 3 4 1 2 3 4
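One possible way to get this shape, sketched under the assumption that duplicate rows should be paired one-to-one in order of appearance: number the repeats of each key combination with groupby(...).cumcount() and include that counter in the join keys. The helper column name "occurrence" is hypothetical and only used for illustration:

```python
import pandas as pd

DF1 = pd.DataFrame({"ColumnA": [1, 1], "ColumnB": [2, 2], "ColumnC": [3, 3], "ColumnD": [4, 4]})
DF2 = pd.DataFrame({"ColumnE": [1, 1], "ColumnF": [2, 2], "ColumnG": [3, 3], "ColumnH": [4, 4]})

left_keys = ["ColumnA", "ColumnB", "ColumnC", "ColumnD"]
right_keys = ["ColumnE", "ColumnF", "ColumnG", "ColumnH"]

# "occurrence" (hypothetical name) numbers repeated key combinations 0, 1, 2, ...
left = DF1.assign(occurrence=DF1.groupby(left_keys).cumcount())
right = DF2.assign(occurrence=DF2.groupby(right_keys).cumcount())

# Joining on the keys plus the counter pairs the n-th duplicate with the n-th duplicate
paired = pd.merge(
    left,
    right,
    left_on=left_keys + ["occurrence"],
    right_on=right_keys + ["occurrence"],
    how="outer",
).drop(columns="occurrence")

print(paired.shape)  # (2, 8)
```

If the duplicate rows carry no extra information, calling drop_duplicates() on each frame before merging is another option, but that would leave a single row here rather than two.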
Can someone advise where I am going wrong?