Re: [dev] sple - A simple PDF links/emails extracotr.

Contemporary messages sorted: [ by date ] [ by thread ] [ by subject ] [ by author ] [ by messages with attachments ]

From: Hiltjo Posthuma <hiltjo_AT_codemadness.org>
Date: Fri, 8 May 2015 10:16:18 +0200

On Sun, May 3, 2015 at 11:06 PM, Hypsurus <hypsurus_AT_mail.ru> wrote:
> Hello to dev_AT_suckless,
>
> i just started a new project: "sple - simple PDF links extractor"
>
> the uses for sple:
> 1. to extract email addresses/links from a multiple PDF files
>
> i working on the following features:
> 2. parse PDF documents Objects (Autohr/Date/Fonts/Images/)
> 3. extract images/fonts from the PDF file.
>
> the goal is writing sple without any external library.
>
> i would like to develop sple under suckless.org
>
> how can i do it?
>
> Source code for the project:
> https://github.com/Hypsurus/sple
>
> thanks
>
> --
> hypsurus

Mupdf[0] also contains a tool "mudraw", with it you can output to
various formats including text:

mudraw -o - -F txt input.pdf | grep 'whatever'

Kind regards,
HIltjo

[0] http://mupdf.com/
Received on Fri May 08 2015 - 10:16:18 CEST

This archive was generated by hypermail 2.3.0 : Fri May 08 2015 - 10:24:07 CEST