Re: [dev] Stripping html from email

From: Antoni Grzymala <antoni_AT_chopin.edu.pl>
Date: Thu, 26 Aug 2010 12:39:33 +0200

Suraj Kurapati dixit (2010-08-23, 21:05):

> On Mon, Aug 23, 2010 at 8:46 PM, Anthony J. Bentley
> <anthonyjbentley_AT_gmail.com> wrote:
> >> Is there currently a tool or script that I can use to strip html
> >> from emails?
> >
> > mhshow-show-text/html: lynx -dump %F | less
> >
> > Lynx sucks but it sorta works well enough here, I guess.
>
> I find that w3m does a much better job of HTML to plain-text
> conversion than Lynx. It even renders HTML tables using Unicode
> box-drawing characters!
>
> http://w3m.sourceforge.net/

I tried using w3m instead of lynx -dump, and it's truly better at
rendering, but lynx used the traditional blah[1]...

[1] uri://some.url...

notation, so that I can actually fish out the links. Is that possible
in w3c as well?

-- 
[a]
Received on Thu Aug 26 2010 - 12:39:33 CEST

This archive was generated by hypermail 2.2.0 : Thu Aug 26 2010 - 12:48:02 CEST