Reading Data from a website and using python regex

Question

I'm trying to read data from a site line by line and keep only the lines that start with two digits, a colon, two digits, a colon, and two more digits (e.g. 00:00:00). Matches are exported to another file.

I am getting a syntax error at the colons in my regex.

#!/usr/bin/python

import urllib2
import re

#imported urllib2 to collect the data. imported re for regular expressions to test format.


#creating our output file
f=open("output.txt", "r+")

#opening a file like object using urllib
webpage= urllib2.open("https://code.wireshark.org/review/gitweb?p=wireshark.git;a=blob_plain;f=manuf")


#string used to store the output
str=""

#string used to store current line
temp=""


#add while loop to read in that data. line by line. 
temp=webpage.readline()
if temp.re.search([0-9][0-9]:[0-9][0-9]:[0-9][0-9]):

  str.concat(temp)
  temp=""

You need to escape the colon by adding a \ in front of it. The colon is an operator in regex.
– blasko, Commented Aug 4, 2015 at 23:19
Since when, @blasko? The issue is just missing quotes ("") around the regex.
– Jake Griffin, Commented Aug 4, 2015 at 23:29
@JakeGriffin helping him with the problem he will encounter once he adds the quotes.
– blasko, Commented Aug 4, 2015 at 23:35
My point was that colons are not an operator in regex. "[0-9][0-9]:[0-9][0-9]:[0-9][0-9]" works just fine. Colons are only (part of) an operator in the (?: ... ) syntax.
– Jake Griffin, Commented Aug 4, 2015 at 23:39
@blasko oh thanks! I also changed the code f=open("output.txt", "r+") to w+. However, now I get the error "AttributeError: 'str' object has no attribute 're'" from the if line.
– DannyG, Commented Aug 4, 2015 at 23:43

weirdev · Accepted Answer · 2015-08-04 23:19:37Z

2

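The pattern is written as bare code instead of a string, which is what causes the syntax error. Pass it as a string, and call re.search with the line as its second argument (a str object has no re attribute):

if re.search("[0-9][0-9]:[0-9][0-9]:[0-9][0-9]", temp):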
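For a fuller picture, here is a minimal sketch of the filtering step on its own, without the network fetch. The names `PREFIX`, `matching_lines`, `save_matches` and the sample lines are made up for illustration and do not come from the question. It uses `match` rather than `search`, on the assumption that only lines *starting* with the pattern are wanted, as the question describes.

```python
import re

# Two digits, colon, two digits, colon, two digits. The colon is an ordinary
# character in a regex and needs no escaping.
PREFIX = re.compile(r"[0-9][0-9]:[0-9][0-9]:[0-9][0-9]")


def matching_lines(lines):
    """Return the lines that begin with a prefix such as 00:00:00."""
    # Pattern.match only matches at the start of the string; search would
    # also accept the pattern appearing later in the line.
    return [line for line in lines if PREFIX.match(line)]


def save_matches(lines, out_path):
    """Write the matching lines from an iterable of text lines to out_path."""
    with open(out_path, "w") as out:
        out.writelines(matching_lines(lines))


# Hypothetical sample input, not copied from the real page.
sample = [
    "# comment line\n",
    "00:00:0C\tExample vendor\n",   # third pair contains a letter: no match
    "00:00:01\tAnother vendor\n",   # starts with digits:digits:digits
    "see 12:34:56 later\n",         # pattern is not at the start: no match
]
print(matching_lines(sample))
```

To apply this to the page, the lines could come from the object returned by `urllib2.urlopen(...)` in Python 2 (for example via `readlines()`). In Python 3 the equivalent call is `urllib.request.urlopen(...)`, which yields bytes, so each line would need decoding before matching against a text pattern.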

