Re: Question about MIME and HTML Email

From: Byron C. Darrah <bdarr_at_sse.FU.HAC.COM_at_hypermail-project.org>
Date: Mon, 30 Nov 1998 11:12:52 -0800 (PST)
Message-Id: <199811301912.LAA19931_at_pepperoni.pizza.hac.com>

Alright, I put together a little something that I think will make a good start for an HTML filter. You can download it from:

     http://www.cs.ucla.edu/~darrah/html_filter.tgz

Here's a little description of how it works:

  1. Comments, SGML commands, and unrecognized HTML tags are removed.
  2. Unmatched close tags are removed.
  3. The list of recognized tags is configurable, in a header file called filter_config.h.
  4. Recognized tags can be supressed. ie: removed.
  5. Recongized tags which are containers can cause all contained text to be supressed.
  6. Close tags are generated for unclosed containers.
  7. In the case of 2 or 7, a comment is emitted into the output, denoting the problem.

The current version has a very small list of recognized tags. We need to expand that.

In order to gurantee no buffer overflows, the html_filter uses the dynamic_strings_t module that I offered to Kent (by way of this mailing list) a while back. So I think the current unreleased Landfield beta version of hypermail probably already has this module in it.

If you want to integrate this filter with a version of hypermail (or other program) that uses different code for handling arbitrary length strings, you may want to either change html_filter or hypermail so that they use the same code for this.

This is my first cut at this, so there may be bugs :-).

--Byron Darrah



Date: Thu, 19 Nov 1998 09:32:13 -0800 (PST) From: "Byron C. Darrah" <bdarr_at_sed.hac.com>

Hmm, okay. I'll dig up that code I mentioned, and see if I can get something pretty close to what we want without too much effort. Then I'll post a follow-up to this list and we can work from there.

You should be hearing from me soon.

--Byron



Date: Thu, 19 Nov 1998 11:56:12 +0100 (MET) From: Daniel Stenberg <Daniel.Stenberg_at_sth.frontec.se> cc: hypermail_at_landfield.com

On Thu, 15 Oct 1998, Byron C. Darrah wrote:

> > > I actually have some code sitting around that would probably be easy for
> > > me to modify to get it to do this. Kent, if you like, I could do this
> > > and give you the result.
> >
> > I'd be happy to assist if there's a need for it.
>
> Great! I'd like to at least hear from Kent before doing anything on
> this, but if he says "go" then I'll probably be able to use some help.

Now its been a month. I think you should go ahead with this work if you're still interested. We can't wait for Kent.

-- 
   Daniel Stenberg    http://www.fts.frontec.se/~dast     0708-317742
   ech`echo xiun|tr nu oc|sed 'sx\([sx]\)\([xoi]\)xo un\2\1 is xg'`ol
Received on Mon 30 Nov 1998 09:16:51 PM GMT

This archive was generated by hypermail 2.3.0 : Sat 13 Mar 2010 03:46:11 AM GMT GMT