Lines missing in python

Question

I am writing a code in python where I am removing all the text after a specific word but in output lines are missing. I have a text file in unicode which have 3 lines:

my name is test1
my name is
my name is test 2

What I want is to remove text after word "test" so I could get the output as below

my name is test
my name is
my name is test

I have written a code but it does the task but also removes the second line "my name is" My code is below

txt = ""
with open(r"test.txt", 'r') as fp:
    for line in fp.readlines():
        splitStr = "test"
        index = line.find(splitStr)
        if index > 0:
            txt += line[:index + len(splitStr)] + "\n"
with open(r"test.txt", "w") as fp:
    fp.write(txt)

You may need to add an else block. The second line does not have the word 'test'. — techrhl
– techrhl, Commented Apr 1, 2022 at 9:36
When the substring is not in the string, you're not adding that line to the output. Also, this could be done in an easier way with txt += line.strip().split(splitStr)[0], avoiding unnecessary ifs — Shinra tensei
– Shinra tensei, Commented Apr 1, 2022 at 9:39

Kamar · Accepted Answer · 2022-04-01 09:56:33Z

2

Your code does not append the line if the splitStr is not defined.

txt = ""
with open(r"test.txt", 'r') as fp:
for line in fp.readlines():
    splitStr = "test"
    index = line.find(splitStr)
    if index != -1:
        txt += line[:index + len(splitStr)] + "\n"
    else:
        txt += line
with open(r"test.txt", "w") as fp:
    fp.write(txt)

edited Apr 1, 2022 at 9:56

answered Apr 1, 2022 at 9:39

Kamar

3722 silver badges7 bronze badges

Sign up to request clarification or add additional context in comments.

3 Comments

user2261062 Over a year ago

what if find returns 0 (element found at the beginning)? you are then not removing what's after the split element but appending the whole line instead. try with the string "test hello". it should return "test" but it returns "test hello"

techrhl Over a year ago

I don't think we need a new line (\\n) in the else block. The line itself has that. This code will work but writes an extra line in output file.

Kamar Over a year ago

Corrected the code with the suggestions

Karkar · Accepted Answer · 2022-04-01 09:51:46Z

0

It looks like if there is no keyword found the index become -1. So you are avoiding the lines w/o keyword. I would modify your if by adding the condition as follows:

txt = ""
with open(r"test.txt", 'r') as fp:
    for line in fp.readlines():
        splitStr = "test"
        index = line.find(splitStr)
        if index > 0:
            txt += line[:index + len(splitStr)] + "\n"
        elif index < 0:
            txt += line 
with open(r"test.txt", "w") as fp:
    fp.write(txt)

No need to add \n because the line already contains it.

answered Apr 1, 2022 at 9:51

Karkar

466 bronze badges

1 Comment

user2261062 Over a year ago

how is this answer different from Mohamad Kamar one? Also you are missing the case where index is 0

buhtz · Accepted Answer · 2022-04-01 10:42:44Z

In my solution I simulate the input file via io.StringIO. Compared to your code my solution remove the else branch and only use one += operater. Also splitStr is set only one time and not on each iteration. This makes the code more clear and reduces possible errore sources.

import io

# simulates a file for this example
the_file = io.StringIO("""my name is test1
my name is
my name is test 2""")

txt = ""
splitStr = "test"

with the_file as fp:
    # each line
    for line in fp.readlines():
        # cut somoething?
        if splitStr in line:
            # find index
            index = line.find(splitStr)
            # cut after 'splitStr' and add newline
            line = line[:index + len(splitStr)] + "\n"

        # append line to output
        txt += line

print(txt)

When handling with files in Python 3 it is recommended to use pathlib for that like this.

import pathlib
file_path = pathlib.Path("test.txt")

# read from wile
with file_path.open('r') as fp:
    # do something

# write back to the file
with file_path.open('w') as fp:
    # do something

Schnitte · Accepted Answer · 2022-04-01 09:47:53Z

-1

Suggestion:

for line in fp.readlines():
     i = line.find('test')
     if i != -1:
         line = line[:i]

edited Apr 1, 2022 at 9:47

answered Apr 1, 2022 at 9:39

Schnitte

1,2276 silver badges16 bronze badges

4 Comments

user2261062 Over a year ago

when not found and find returns -1 your code will do line = line[:-1]. Is this an intended behaviour?

Schnitte Over a year ago

That would cut off the last character in the line, so I agree it is unintended and thank you for the suggestion. A way to sort this out would be to put line = line[:i] into an if block: if i != -1: I'll amend my suggestion accordingly.

user2261062 Over a year ago

this now removes the whole line if the split element is not found, which is the problem the OP is facing. Also if the element is found it removes the splitting element, but what should be removed s whatever is after the split element

user18071561 Over a year ago

Thankyou everyone. I got the solution Thanks again everyone

Collectives™ on Stack Overflow

Lines missing in python

4 Answers 4

3 Comments

1 Comment

Comments

4 Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

4 Answers 4

3 Comments

1 Comment

Comments

4 Comments

Your Answer

Sign up or log in

Post as a guest

Related