Tabula-py - ImportError: No module named tabula

AgentX picture AgentX · Aug 9, 2017 · Viewed 9.6k times · Source

I am trying to use Tabula-py to read a pdf. I installed tabula-py through pip install tabula-py

I have also installed the required dependencies

requests
pandas
pytest
flake8

My code is currently as follows:

import tabula
import pandas as pd

df = tabula.read_pdf("report.pdf", pages=2)
print(df)

I am getting the following error:

Traceback (most recent call last):
File "tabula_pdf_reader.py", line 1, in <module>
  import tabula
ImportError: No module named tabula

Any inputs to what I am missing here?

Answer

Jai picture Jai · Feb 2, 2019

I faced this same issue in Ubuntu.

First, check the version of the JDK and JRE that are installed on your machine by running java --version and javac --version. Each should have a version greater than 7.

Then use pip3 to install tabula.