Pandas: Multiple columns into one column

Question

I have the following data (2 columns, 4 rows):

Column 1: A, B, C, D

Column 2: E, F, G, H

I am attempting to combine the columns into one column to look like this (1 column, 8 rows):

Column 3: A, B, C, D, E, F, G, H

I am using a pandas DataFrame and have tried several functions (append, concat, etc.) with no success. Any help would be most appreciated!

Henry Ecker · Accepted Answer · 2021-10-30 20:49:51Z

42

The trick is to use stack(), which moves the column labels into the row index and returns the values row by row:

df.stack().reset_index()
    
   level_0   level_1  0
0        0  Column 1  A
1        0  Column 2  E
2        1  Column 1  B
3        1  Column 2  F
4        2  Column 1  C
5        2  Column 2  G
6        3  Column 1  D
7        3  Column 2  H
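Note that stack() walks the frame row by row, so the values come out interleaved (A, E, B, F, ...) rather than in the A to H order the question asks for. A minimal sketch of two ways to get column-by-column order, assuming the same two-column frame as in the question; the names row_major, column_major, melted and source are only illustrative:

```python
import pandas as pd

df = pd.DataFrame({'Column 1': ['A', 'B', 'C', 'D'],
                   'Column 2': ['E', 'F', 'G', 'H']})

# stack() goes row by row, so the two columns interleave.
row_major = df.stack().reset_index(drop=True)

# Transposing first makes stack() go through one original column at a time.
column_major = df.T.stack().reset_index(drop=True)

# melt() also goes column by column and records which column each value came from.
melted = df.melt(var_name='source', value_name='Column 3')

print(column_major.tolist())
print(melted)
```

Which one fits better probably depends on whether you still need to know the source column of each value; melt() keeps it, the transposed stack() with drop=True does not.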

edited Oct 30, 2021 at 20:49

Henry Ecker♦

answered Feb 14, 2018 at 11:01

Nickpick

1 Comment

Martin Over a year ago

Aren't the values in the rightmost column of this answer in a wrong order compared to a column asked for by the OP?

Henry Ecker · Accepted Answer · 2021-10-30 20:50:41Z

17

Update

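pandas has a built-in method for this, stack, which does what you want; see the other answer.

This was my first answer, written many years ago before I knew about stack. Note that Series.append, used below, was removed in pandas 2.0, so the snippet needs an older pandas version: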

In [227]:

df = pd.DataFrame({'Column 1':['A', 'B', 'C', 'D'],'Column 2':['E', 'F', 'G', 'H']})
df
Out[227]:
  Column 1 Column 2
0        A        E
1        B        F
2        C        G
3        D        H

[4 rows x 2 columns]

In [228]:

df['Column 1'].append(df['Column 2']).reset_index(drop=True)
Out[228]:
0    A
1    B
2    C
3    D
4    E
5    F
6    G
7    H
dtype: object
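On current pandas the same result can be had with pd.concat, since Series.append no longer exists as of pandas 2.0. A minimal sketch, assuming the df defined above; the name combined and the 'Column 3' label are only illustrative:

```python
import pandas as pd

df = pd.DataFrame({'Column 1': ['A', 'B', 'C', 'D'],
                   'Column 2': ['E', 'F', 'G', 'H']})

# Concatenate the two columns end to end; ignore_index=True renumbers
# the result 0..7, replacing the reset_index(drop=True) step.
combined = pd.concat([df['Column 1'], df['Column 2']], ignore_index=True)
combined = combined.rename('Column 3')

print(combined)
```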

edited Oct 30, 2021 at 20:50

Henry Ecker♦

answered May 1, 2014 at 15:00

EdChum

Zero · Accepted Answer · 2017-10-12 15:27:42Z

12

You can flatten the values in column-major order using ravel, which is much faster.

In [1238]: df
Out[1238]:
  Column 1 Column 2
0        A        E
1        B        F
2        C        G
3        D        H

In [1239]: pd.Series(df.values.ravel('F'))
Out[1239]:
0    A
1    B
2    C
3    D
4    E
5    F
6    G
7    H
dtype: object
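As one of the comments on this answer points out, DataFrame.to_numpy() is preferred to DataFrame.values these days. A minimal sketch of the same idea with to_numpy(), assuming every column has the same dtype (with mixed dtypes the array is coerced to a common one); the names flat and interleaved are only illustrative:

```python
import pandas as pd

df = pd.DataFrame({'Column 1': ['A', 'B', 'C', 'D'],
                   'Column 2': ['E', 'F', 'G', 'H']})

# order='F' reads the 2-D array column by column (column-major order).
flat = pd.Series(df.to_numpy().ravel(order='F'), name='Column 3')

# For comparison, the default order='C' reads row by row and interleaves the columns.
interleaved = df.to_numpy().ravel(order='C')

print(flat)
```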

Details

Medium

In [1245]: df.shape
Out[1245]: (4000, 2)

In [1246]: %timeit pd.Series(df.values.ravel('F'))
10000 loops, best of 3: 86.2 µs per loop

In [1247]: %timeit df['Column 1'].append(df['Column 2']).reset_index(drop=True)
1000 loops, best of 3: 816 µs per loop

Large

In [1249]: df.shape
Out[1249]: (40000, 2)

In [1250]: %timeit pd.Series(df.values.ravel('F'))
10000 loops, best of 3: 87.5 µs per loop

In [1251]: %timeit df['Column 1'].append(df['Column 2']).reset_index(drop=True)
100 loops, best of 3: 1.72 ms per loop
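The timings above were taken with %timeit in IPython on an old pandas. To repeat the comparison outside IPython on a current version, something like the following sketch may help. It assumes pd.concat as a stand-in for the removed Series.append, and the test data, the helper names with_ravel and with_concat, and the repeat count are all made up for illustration; no particular numbers are implied.

```python
import timeit

import pandas as pd

# Hypothetical test data: 40,000 rows in two string columns.
df = pd.DataFrame({'Column 1': list('ABCD') * 10_000,
                   'Column 2': list('EFGH') * 10_000})

def with_ravel():
    # Flatten the underlying array in column-major order.
    return pd.Series(df.to_numpy().ravel(order='F'))

def with_concat():
    # Put the second column after the first and renumber the index.
    return pd.concat([df['Column 1'], df['Column 2']], ignore_index=True)

# Both approaches should give the same values before speed is compared.
assert with_ravel().tolist() == with_concat().tolist()

runs = 100
ravel_seconds = timeit.timeit(with_ravel, number=runs) / runs
concat_seconds = timeit.timeit(with_concat, number=runs) / runs
print(f'ravel:  {ravel_seconds * 1e6:.1f} µs per call')
print(f'concat: {concat_seconds * 1e6:.1f} µs per call')
```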

answered Oct 12, 2017 at 15:27

Zero

2 Comments

smci Over a year ago

df.values is thunking out to the underlying array, and calling numpy.ravel() on it. But pandas offers stack().

Frank Over a year ago

DataFrame.to_numpy() is preferred to DataFrame.values.

mechanical_meat · Accepted Answer · 2014-05-01 15:02:40Z

4

What you appear to be asking for is simply another view of your data. If there is no reason for the data to be in two columns in the first place, just create one column. If, however, you need to combine them for presentation in some other tool, you can do something like:

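import itertools as it
import pandas as pd

df = pd.DataFrame({1: ['a', 'b', 'c', 'd'], 2: ['e', 'f', 'g', 'h']})
# chain(*df.values) walks the rows; sorted() then orders the values alphabetically,
# which matches the column order here only because the data is already sorted
sorted(it.chain(*df.values))
# -> ['a', 'b', 'c', 'd', 'e', 'f', 'g', 'h']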
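If the values are not already in alphabetical order, sorted() will rearrange them. A minimal sketch of a variant that keeps each column's own order by chaining the columns rather than the rows; the shuffled sample data and the name by_column are only illustrative:

```python
import itertools as it

import pandas as pd

# Hypothetical data where the first column is deliberately not sorted.
df = pd.DataFrame({1: ['d', 'b', 'c', 'a'], 2: ['e', 'f', 'g', 'h']})

# Chain the columns one after another, preserving the order inside each column.
by_column = list(it.chain.from_iterable(df[col] for col in df.columns))

print(by_column)
```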

answered May 1, 2014 at 15:02

mechanical_meat

Bending Rodriguez · Accepted Answer · 2024-02-27 10:55:17Z

0

for column in test:
    for index, row in test.iterrows():
        print(row[column])

Try this (test is your DataFrame), and instead of printing, collect each value and build the new df from them.
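A minimal sketch of what collecting the values could look like, assuming test is a frame like the one in the question; the names values and result and the 'Column 3' label are only illustrative. This loop is fine for small frames, but the vectorised approaches in the other answers will likely scale better.

```python
import pandas as pd

# Hypothetical stand-in for the `test` frame used in the loop.
test = pd.DataFrame({'Column 1': ['A', 'B', 'C', 'D'],
                     'Column 2': ['E', 'F', 'G', 'H']})

values = []
for column in test:                      # iterating a DataFrame yields column labels
    for index, row in test.iterrows():   # each row is a Series keyed by column label
        values.append(row[column])

# Build the one-column result from the collected values.
result = pd.DataFrame({'Column 3': values})
print(result)
```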

edited Feb 27, 2024 at 10:55

answered Feb 27, 2024 at 10:54

Bending Rodriguez
