Append a String to a String in a List

Question

I am reading an excel table:

import pandas as pd

df = pd.read_excel('file.xlsx', usecols = 'A,B,C')
print(df)

Now I want to create a list with every row in the table as string. In addition I want to add a 'X' at the end of every string in the list:

keylist = []
list1, list2, list3 = df['A'].tolist(), df['B'].tolist(), df['C'].tolist()

for i in zip(list1, list2, list3):
    val = map(str, i)
    keylist.append('/'.join(val))
    keylist += 'X'

print(keylist)

Everything works except the 'adding a X' part. This results in:

['blue/a/a1', 'X', 'blue/a/a2', 'X', ....

But what I want is:

['blue/a/a1/X', 'blue/a/a2/X',

Thanks beforehand.

keylist is an array, so doing += is the same as adding the the array. You would want to do something moreso akin to val. — Fallenreaper
– Fallenreaper, Commented Apr 3, 2018 at 17:08
Did you try val = map(str, i) keylist.append('/'.join(val+'X')) in your for loop! — SalGorithm
– SalGorithm, Commented Apr 3, 2018 at 17:09

jezrael · Accepted Answer · 2018-04-03 17:43:58Z

8

I think better is:

d = {'A': ['blue', 'blue', 'blue', 'red', 'red', 'red', 'yellow', 
           'yellow', 'green', 'green', 'green'],
     'B': ['a', 'a', 'b', 'c', 'c', 'c', 'd', 'e', 'f', 'f', 'g'], 
     'C': ['a1', 'a2', 'b1', 'c1', 'c2', 'c3', 'd1', 'e1', 'f1', 'f2', 'g1']}
df = pd.DataFrame(d)
print (df)
         A  B   C
0     blue  a  a1
1     blue  a  a2
2     blue  b  b1
3      red  c  c1
4      red  c  c2
5      red  c  c3
6   yellow  d  d1
7   yellow  e  e1
8    green  f  f1
9    green  f  f2
10   green  g  g1

keylist = df.apply(lambda x: '/'.join(x), axis=1).add('/X').values.tolist()
print (keylist)

['blue/a/a1/X', 'blue/a/a2/X', 'blue/b/b1/X', 'red/c/c1/X', 'red/c/c2/X', 
 'red/c/c3/X', 'yellow/d/d1/X', 'yellow/e/e1/X', 
 'green/f/f1/X', 'green/f/f2/X', 'green/g/g1/X']

Or if only few columns:

keylist = (df['A'] + '/' + df['B'] + '/' + df['C'] + '/X').values.tolist()

Some timings:

#[110000 rows x 3 columns]
df = pd.concat([df] * 10000, ignore_index=True)

In [364]: %%timeit
     ...: (df['A'] + '/' + df['B'] + '/' + df['C'] + '/X').values.tolist()
     ...: 
60.2 ms ± 1.04 ms per loop (mean ± std. dev. of 7 runs, 10 loops each)

In [365]: %%timeit
     ...: df.apply(lambda x: '/'.join(x), axis=1).add('/X').tolist()
     ...: 
2.48 s ± 39.1 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)


In [366]: %%timeit
     ...: list1, list2, list3 = df['A'].tolist(), df['B'].tolist(), df['C'].tolist()
     ...: for i in zip(list1, list2, list3):
     ...:     val = map(str, i)
     ...:     keylist.append('/'.join(val))
     ...:     keylist[-1] += '/X'
     ...: 
192 ms ± 78.5 ms per loop (mean ± std. dev. of 7 runs, 10 loops each)

In [367]: %%timeit
     ...: df.iloc[:,0].str.cat([df[c] for c in df.columns[1:]],sep='/').tolist()
     ...: 
61.1 ms ± 540 µs per loop (mean ± std. dev. of 7 runs, 10 loops each)

In [368]: %%timeit
     ...: df.assign(New='X').apply('/'.join,1).tolist()
     ...: 
2.51 s ± 76.8 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

In [369]: %%timeit
     ...: ['{0}/{1}/{2}/X'.format(i, j, k) for i, j, k in df.values.tolist()]
74.6 ms ± 2.27 ms per loop (mean ± std. dev. of 7 runs, 10 loops each)

edited Apr 3, 2018 at 17:43

answered Apr 3, 2018 at 17:09

jezrael

868k103 gold badges1.4k silver badges1.3k bronze badges

Sign up to request clarification or add additional context in comments.

6 Comments

Fallenreaper Over a year ago

join works on an array, doing it your way, will give: blue/a/a2X. if you look at my answer, you can easily augment this by adding it to the val array... so that way JOIN works correctly.

Nick is tired Over a year ago

@Fallenreaper ah, but yours will provide a syntax error because of a stray . ;)

Fallenreaper Over a year ago

touche. hahahaha

jezrael Over a year ago

@Fallenreaper - hmmm, I guess you dont downvote, right?

BENY Over a year ago

Again ? sign ...:-(

|

jpp · Accepted Answer · 2018-04-03 17:11:02Z

1

Here is one way using a list comprehension with str.format:

res = ['{0}/{1}/{2}/X'.format(i, j, k) for i, j, k in df.values.tolist()]

# ['blue/a/a1/X', 'blue/a/a2/X', 'blue/b/b1/X', 'red/c/c1/X', ...]

There is no need, as in this solution, to split into 3 lists and zip them.

answered Apr 3, 2018 at 17:11

jpp

166k37 gold badges301 silver badges362 bronze badges

Comments

BENY · Accepted Answer · 2018-04-03 17:14:54Z

1

Base on pandas

df.assign(New='X').apply('/'.join,1).tolist()
Out[812]: ['blue/a/a1/X', 'blue/a/a2/X', 'blue/b/b1/X']

answered Apr 3, 2018 at 17:14

BENY

324k22 gold badges176 silver badges250 bronze badges

Comments

Fallenreaper · Accepted Answer · 2018-04-03 17:15:58Z

1

You are doing += do the keylist which adds to that list, you need to do it to the val array.

for i in zip(list1, list2, list3):
  val = map(str,i)
  val += 'X' # you can combine this and the above if you want to look like:
  #val = map(str, i) + 'X'
  keylist.append("/".join(val))
print(keylist)

edited Apr 3, 2018 at 17:15

answered Apr 3, 2018 at 17:10

Fallenreaper

10.8k15 gold badges77 silver badges143 bronze badges

3 Comments

Fallenreaper Over a year ago

why? val is a map, so when you use join, it recognizes the entry and adds it there?

Fallenreaper Over a year ago

Its all good. :) I am just using the variables OP uses, otherwise id have named it something a bit more human readable. :)

jezrael Over a year ago

Sorry, I dont check it carefully.

DJK · Accepted Answer · 2018-04-03 17:36:49Z

0

You can use the cat string operation to join the columns into a single series with a specified sep argument. Then simply convert the new series into a list

 df
         A  B   C
0     blue  a  a1
1     blue  a  a2
2     blue  b  b1
3      red  c  c1
4      red  c  c2
5      red  c  c3
6   yellow  d  d1
7   yellow  e  e1
8    green  f  f1
9    green  f  f2
10   green  g  g1

df.iloc[:,0].str.cat([df[c] for c in df.columns[1:]],sep='/').tolist()

['blue/a/a1', 'blue/a/a2', 'blue/b/b1', 'red/c/c1', 'red/c/c2', 'red/c/c3', 'yellow/d/d1', 'yellow/e/e1', 'green/f/f1', 'green/f/f2', 'green/g/g1']

edited Apr 3, 2018 at 17:36

answered Apr 3, 2018 at 17:25

DJK

9,3424 gold badges28 silver badges41 bronze badges

Comments

Austin · Accepted Answer · 2018-04-03 17:55:37Z

0

You could add /X to last item in list everytime in the loop:

for i in zip(list1, list2, list3):
    val = map(str, i)
    keylist.append('/'.join(val))
    keylist[-1] += '/X'

# ['blue/a/a1/X', 'blue/a/a2/X',....]

edited Apr 3, 2018 at 17:55

answered Apr 3, 2018 at 17:10

Austin

26.1k4 gold badges28 silver badges52 bronze badges

Collectives™ on Stack Overflow

Append a String to a String in a List

6 Answers 6

6 Comments

Comments

Comments

3 Comments

Comments

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

6 Answers 6

6 Comments

Comments

Comments

3 Comments

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Related