How we can use iter_rows() in Python openpyxl package?

Eghbal picture Eghbal · Apr 22, 2015 · Viewed 50k times · Source

I'm using openpyxl package in Python(Canopy) to use excel files. We have this tutorial in this link : LINK

you can also use the openpyxl.worksheet.Worksheet.iter_rows() method:

>>> tuple(ws.iter_rows('A1:C2'))
((<Cell Sheet1.A1>, <Cell Sheet1.B1>, <Cell Sheet1.C1>),
 (<Cell Sheet1.A2>, <Cell Sheet1.B2>, <Cell Sheet1.C2>))

>>> for row in ws.iter_rows('A1:C2'):
...        for cell in row:
...            print cell
<Cell Sheet1.A1>
<Cell Sheet1.B1>
<Cell Sheet1.C1>
<Cell Sheet1.A2>
<Cell Sheet1.B2>
<Cell Sheet1.C2>

How we can import openpyxl.worksheet.Worksheet.iter_rows() method in python? I used this code:

import openpyxl as op
ms = op.load_workbook('mtest.xlsx')

ws = ms.active

op.worksheet.Worksheet.iter_rows()

This code returns:

type object 'Worksheet' has no attribute 'iter_rows' 

What is the problem?

Answer

DNA picture DNA · Apr 22, 2015

As shown in the tutorial, you need to call the iter_rows method on an instance of a worksheet, for example (for openpyxl 2.5.14 or earlier):

>>> for row in ws.iter_rows('A1:C2'):
...        for cell in row:
...            print cell

or

>>> for row in ws.iter_rows(min_row=1, max_col=3, max_row=2):
...    for cell in row:
...        print(cell)
<Cell Sheet1.A1>
<Cell Sheet1.B1>
<Cell Sheet1.C1>
<Cell Sheet1.A2>
<Cell Sheet1.B2>
<Cell Sheet1.C2>

As your error message states, you are calling it on the Worksheet type, which won't work; it needs to be called on an object:

op.worksheet.Worksheet.iter_rows()  # wrong

See also this example in another answer.

For older versions of openpyxl, you may need to ensure that you enable iterators when loading your workbook - see this thread. This isn't required for more recent versions.

Here's a complete example which I just tested in the Python REPL (with openpyxl 1.8.3):

>>> import openpyxl as op
>>> wb = op.load_workbook('/tmp/test.xlsx', use_iterators=True)
>>> ws = wb.active
>>> for row in ws.iter_rows():
...   for cell in row:
...     print cell
... 
RawCell(row=1, column='A', coordinate='A1', internal_value=1.0, data_type='n', style_id='0', number_format='general')
RawCell(row=1, column='B', coordinate='B1', internal_value=10.0, data_type='n', style_id='0', number_format='general')
...