character set question

From: Craig A Summerhill <craig_at_cni.org_at_hypermail-project.org>
Date: Fri, 12 Mar 1999 05:11:12 -0500 (EST)
Message-Id: <9903121011.AA21159_at_a.cni.org>


Hypermail people,

I probably should know the answer to this, but I don't

In this example:

   ftp://ftp.cni.org/members/hypermail/0046.txt
   http://www.cni.org/hm2a15/0046.html
   http://www.cni.org/hm2a15/0046.txt

I have a question about character set treatment in the hypermail HTML markup... The data in the RFC821 From: field contains MIME encoding, and is labeled as ISO-8859-1. The thing is the guy's name is Pena (with a tilde over the n). When I view the HTML source, or the HTML file, it appears that the ntilde has been written directly into the HTML.

The ntilde character isn't ASCII, but I guess it probably is Latin-1.

In early versions of HTML (1.0 and 1.1), there were conventions for extended characters like this using the &; convention (e.g. &ntilde;).

Some questions:

   Although it seems to display fine in Netscape 4.05, I wonder whether    this action is really kosher?

   Does the "<!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML//EN">" DTD defintion    statement in the HTML affect the browser's expectations in terms of    character sets?

   What version of HTML does hypermail claim to support? Is the output    HTML 3.2? HTML 4.0?    Are there some assumptions about character sets here which I should know?

-- 

   Craig A. Summerhill, Systems Coordinator and Program Officer
   Coalition for Networked Information
   21 Dupont Circle, N.W., Washington, D.C.   20036
   Internet: craig_at_cni.org   AT&Tnet (202) 296-5098
Received on Fri 12 Mar 1999 12:18:26 PM GMT

This archive was generated by hypermail 2.3.0 : Sat 13 Mar 2010 03:46:11 AM GMT GMT