Python regex to find connected digits [duplicate]

Question

I have raw txt files and need to use regex to search each digit separated by space.

Question, data format is like:

   6   3   1   0
   7   3   1   0
   8   35002   0
   9   34104   0

My regex is:

(?P<COORD>\d+)

The matched output for first two lines are, (6,3,1,0) and (7,3,1,0) which are correct. However, it doesn't apply to last two lines, their output are (8, 35002, 0) and (9, 34104, 0). The correct grouping numbers should be (8, 3, 5002, 0) and (9, 3, 4104, 0). How can I solve this?

This is a fixed-width text, see stackoverflow.com/questions/4914008/… — Wiktor Stribiżew
– Wiktor Stribiżew, Commented Nov 29, 2021 at 15:50
(?P<COORD>(?<= {4})|(?<= {3})\d|(?<= {2})\d{2}|(?<= )\d{3}|\d{4}) — logi-kal
– logi-kal, Commented Nov 29, 2021 at 16:37
@horcrux This code works. How can I rename these 4 groups of digits in different name? — Kelvin Lo
– Kelvin Lo, Commented Nov 30, 2021 at 12:12
my_regex = "".join([r" *(?P<COORD%s>(?<= {4})|(?<= {3})\d|(?<= {2})\d{2}|(?<= )\d{3}|\d{4})" % i for i in range(1,5)]) gives you this regex — logi-kal
– logi-kal, Commented Nov 30, 2021 at 14:09
@horcrux thank you! I wish I can give you the best answer if you don't mind adding an answer — Kelvin Lo
– Kelvin Lo, Commented Nov 30, 2021 at 15:35

Shanavas M · Accepted Answer · 2021-11-29 16:13:09Z

0

If the numbers are aligned and the width of the columns are fixed, You can use

width = 4
for line in lines:
    columns = [ line[j: j + width] for j in range(0, len(line), width)]
    numbers = list(map(lambda x: int(x.strip()), columns))
    # or a one liner
    print(list(int(line[j:j+width].strip()) for j in range(0, len(line), width)))

answered Nov 29, 2021 at 16:13

Shanavas M

1,6391 gold badge18 silver badges25 bronze badges

Sign up to request clarification or add additional context in comments.

1 Comment

Kelvin Lo Over a year ago

Is it possible to use regex? Because I have other lines in string.

Collectives™ on Stack Overflow

Python regex to find connected digits [duplicate]

1 Answer 1

1 Comment

Linked

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

1 Comment

Linked

Related