how to read (or parse) EXCEL comments using python

Question

I have several excel files that use lots of comments for saving information. For example, one cell has value 2 and there is a comment attached to the cell saying "2008:2#2009:4". it seems that value 2 is for the current year (2010) value. The comment keeps all previous year values separated by '#'. I would like to create a dictionary to keep all this info like {2008:2, 2009:4, 2010:2} but I don't know how to parse (or read) this comment attached to the cell. Python excel readin module has this function (reading in comment)?

skjerns · Accepted Answer · 2018-01-27 21:52:53Z

5

You can do this without an Excel COM object using openpyxl:

from openpyxl import load_workbook

workbook = load_workbook('/tmp/data.xlsx')
first_sheet = workbook.get_sheet_names()[0]
worksheet = workbook.get_sheet_by_name(first_sheet)

for row in worksheet.iter_rows():
    for cell in row:
        if cell.comment:
            print(cell.comment.text)

The parsing of the comments itself can be done the same as with Steven Rumbalski's answer.

(example adapted from here)

answered Jan 27, 2018 at 21:52

skjerns

2,2702 gold badges19 silver badges26 bronze badges

Sign up to request clarification or add additional context in comments.

1 Comment

Elsa Li Over a year ago

openpyxl doesnot support xls files...Is there any other way to read comments from xls files? Thanks

Steven Rumbalski · Accepted Answer · 2010-09-15 15:20:08Z

3

Normally for reading from Excel, I would suggest using xlrd, but xlrd does not support comments. So instead use the Excel COM object:

from win32com.client import Dispatch
xl = Dispatch("Excel.Application")
xl.Visible = True
wb = xl.Workbooks.Open("Book1.xls")
sh = wb.Sheets("Sheet1")
comment = sh.Cells(1,1).Comment.Text()

And here's how to parse the comment:

comment = "2008:2#2009:4"
d = {}
for item in comment.split('#'):
    key, val = item.split(':')
    d[key] = val

Often, Excel comments are on two lines with the first line noting who created the comment. If so your code would look more like this:

comment = """Steven:
2008:2#2009:4"""
_, comment = comment.split('\n')
d = {}
for item in comment.split('#'):
    key, val = item.split(':')
    d[key] = val

answered Sep 15, 2010 at 15:20

Steven Rumbalski

45.7k10 gold badges96 silver badges125 bronze badges

2 Comments

George Hilliard Over a year ago

This is surprisingly clean. The only problem is that it requires you to have Excel installed. Is there a way to do this with LibreOffice? Bonus points if it works on Linux.

Elsa Li Over a year ago

Does it have to iterate cell by cell? Can it extract all the comments from a column? Thanks @Steven Rumbalski

elena.kim · Accepted Answer · 2021-05-13 10:07:27Z

0

After running the last posted code here, can you store that information later in a word document?

from openpyxl import load_workbook
    
workbook = load_workbook('/tmp/data.xlsx')
first_sheet = workbook.get_sheet_names()[0]
worksheet = workbook.get_sheet_by_name(first_sheet)

for row in worksheet.iter_rows():
    for cell in row:
        if cell.comment:
            print(cell.comment.text)

edited May 13, 2021 at 10:07

elena.kim

9664 gold badges14 silver badges22 bronze badges

answered May 12, 2021 at 22:55

C. Heins

1

Collectives™ on Stack Overflow

how to read (or parse) EXCEL comments using python

3 Answers 3

1 Comment

2 Comments

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

3 Answers 3

1 Comment

2 Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Related