Popular "text-extraction" questions | Page 3

My question is sort of like this question but I have more constraints: I know the document's are reasonably sane …

c# html d text-extraction

I am looking to get the filename from the end of a filepath string, say $text = "bob/hello/myfile.zip"; …

php substring filenames filepath text-extraction

i want to detect text area from image as a preprocessing step for tesseract OCR engine, the engine works well …

c++ image-processing tesseract text-extraction

Given the following HTML: <p><span class="xn-location">OAK RIDGE, N.J.</span>, <…

regex html-content-extraction text-extraction

Using sed or similar how would you extract lines from a file? If I wanted lines 1, 5, 1010, 20503 from a file, how …

unix sed awk line-numbers text-extraction

I find this question, but it uses command line, and I do not want to call a Python script in …

python text-extraction pdfminer

Is there an (unobtrusive, to the user) way to get all the text in a page with Javascript? I could …

javascript text text-extraction

From a string that contains a lot of HTML, how can I extract all the text from <h1>&…

php text-extraction domparser

I'm trying to get my way through Poppler and its (lack of) documentation. What I want to do is a …

c++ pdf text-extraction poppler

sudo python3 -m pip install textract sudo apt-get install textract pip install textract sudo apt-get install swig I want to …

python-3.5 text-extraction

Top "Text-extraction" questions