Re: [dev] sple - A simple PDF links/emails extracotr.

From: Jochen Sprickerhof <dwm_AT_jochen.sprickerhof.de>
Date: Thu, 7 May 2015 17:07:59 +0200

* Jason Woofenden <jason_AT_jasonwoof.com> [2015-05-07 10:09]:
> pdftohtml -stdout foo.pdf | sed -ne 's/href="\([^"]\+\)"/\n\1\n/g' -e 's/\(^[^\n]*\n\|\(\n\)\)\([^\n]*\)\n[^\n]*/\2\3/gp'

I would use grep ;). Using my urlselct from [1] I would write:

pdftotext foo.pdf - | urlselect

Cheers Jochen

[1] http://lists.suckless.org/dev/1504/26641.html

Received on Thu May 07 2015 - 17:07:59 CEST

This archive was generated by hypermail 2.3.0 : Thu May 07 2015 - 17:12:13 CEST