Top "Extract" questions

Questions related to retrieving specific information from a (typically minimally structured) data source, such as a web site, media file, source code collection or compressed archive (in which case the desired information is one or more original, uncompressed files).

Extracting files from merge module

All I want is a command-line tool that can extract files from a merge module (.msm) onto disk. Said differently, …

installation windows-installer extract merge-module orca
How to extract images from flash viewers?

This deals with the (diverse) flash viewers that let you zoom in on images on websites. I’m trying to …

flash image screen-scraping extract shockwave
Programmatically Extract PDF Tables

I have a bunch of PDF docs with tabular data in them which I need to extract into a more …

.net pdf extract acrobat
Creating or assigning variables from a dictionary in Python

I tried to ask a question normally once in here but nobody understands what I want to ask. So I've …

python variables extract
OpenCV: color extraction based on Gaussian mixture model

I am trying to use opencv EM algorithm to do color extraction.I am using the following code based on …

c++ opencv colors extract gaussian
Extracting image from PDF with /CCITTFaxDecode filter

I have a pdf that was generated from scanning software. The pdf has 1 TIFF image per page. I want to …

image pdf itextsharp extract
Extract Data from HTML using PHP

Here is what I am looking for : I have a Link which displays some data on HTML format : http://www.118.…

php html extract html-content-extraction
Extract information from Gmail with Python

I have come through solutions to extract useful information from selected received emails in Gmail mailbox. Aim in this example …

python gmail extract imaplib
how can we extract text from pdf using itextsharp with spaces?

I am using below method to extract pdf text line by line. But problem that, it is not reading spaces …

c# pdf extract extraction pdf-reader
Convert webarchive to html

I managed to collect the behavior of a complex web site into a webarchive. Thereafter I would like to turn …

html extract webarchive