Top "Pypdf" questions

A Pure-Python library built as a PDF toolkit.

How can I extract a JavaScript from a PDF file with a command line tool?

How can I extract a JavaScript object from a PDF file using a command line tool? I am trying to …

javascript python-2.7 pdf cmd pypdf
Whitespace gone from PDF extraction, and strange word interpretation

Using the snippet below, I've attempted to extract the text data from this PDF file. import pyPdf def get_text(…

python pdf unicode pypdf
PDF - Remove White Margins

I would like to know a way to remove white margins from a PDF file. Just like Adobe Acrobat X …

pdf pdf-generation itext ghostscript pypdf
How to install a module for python 2.6 on CentOS?

After I install python 2.6 on CentOS by: wget http://download.fedoraproject.org/pub/epel/5/i386/epel-release-5-4.noarch.rpm sudo …

python centos yum pypdf
Extract images from PDF using python PyPDF2

Is there any way to extract images as stream from pdf document (using PyPDF2 library)? Also is it possible to …

python pdf image-processing reportlab pypdf
Parsing a PDF with no /Root object using PDFMiner

I'm trying to extract text from a large number of PDFs using PDFMiner python bindings. The module I wrote works …

python pypdf pdf-parsing pdf-manipulation
Highlight text in a PDF with Python

I'm working on custom search engine for my PDF data corpus. I have a transformation layer which is able to …

python pdf search pypdf pdfminer
Reading pdf files line by line using python

I used the following code to read the pdf file, but it does not read it. What could possibly be …

python pypdf
finding on which page a search string is located in a pdf document using python

Which python packages can I use to find out out on which page a specific “search string” is located ? I …

python pdf pypdf
How to close pyPDF "PdfFileReader" Class file handle

this should be very simple question, for which I couldn't find answer by Google search: How to close file handle …

python pypdf