<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
     xmlns:admin="http://webns.net/mvcb/"
     xmlns:content="http://purl.org/rss/1.0/modules/content/"
     xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
	<channel> 

	<title>Comments on: Why does my Vietnamese text only display correctly in the US? </title>
	<link>http://ask.metafilter.com/39223/Why-does-my-Vietnamese-text-only-display-correctly-in-the-US/</link>
	<description>Comments on Ask MetaFilter post Why does my Vietnamese text only display correctly in the US?</description>
	<pubDate>Wed, 31 May 2006 04:13:43 -0800</pubDate>
	<lastBuildDate>Wed, 31 May 2006 04:13:43 -0800</lastBuildDate>
	<language>en-us</language>
	<docs>http://blogs.law.harvard.edu/tech/rss</docs>
	<ttl>60</ttl>

	<item>
		<title>Question: Why does my Vietnamese text only display correctly in the US? </title>
		<link>http://ask.metafilter.com/39223/Why-does-my-Vietnamese-text-only-display-correctly-in-the-US</link>	
		<description>How does a web browser decide whether it can display a numerical character reference? &lt;br /&gt;&lt;br /&gt; I have a web page that is available in English and Vietnamese. I use numerical character references for the Vietnamese characters that fall outside the ISO-8859-1 character set (which is the document character set). On my machine at home, this seems to work fine. But I hear from users in Vietnam that it&apos;s all messed up. I know they have fonts capable of displaying the characters.&lt;br&gt;
&lt;br&gt;
Due to database issues beyond my control, I cannot store the Vietnamese text directly in UTF-8. If I change the output document encoding to UTF-8, the characters from character references get mangled even on my machine. What gives? I thought character references were independent of a document&apos;s encoding, referring directly to HTML&apos;s ISO-10646 roots? &lt;br&gt;
&lt;br&gt;
Is there anything I can do, given that these characters must be stored as references?</description>
		<guid isPermaLink="false">post:ask.metafilter.com,2006:site.39223</guid>
		<pubDate>Wed, 31 May 2006 03:16:49 -0800</pubDate>
		<dc:creator>Nothing</dc:creator>
		
			<category>HTML</category>
		
			<category>characters</category>
		
			<category>characterencodings</category>
		
			<category>encoding</category>
		
			<category>utf8</category>
		
			<category>iso</category>
		
	</item> <item>
		<title>By: Nothing</title>
		<link>http://ask.metafilter.com/39223/Why-does-my-Vietnamese-text-only-display-correctly-in-the-US#605727</link>	
		<description>I am trying sending the pages with an encoding of VISCII and it seems to be working, at least on one test case. But why is that necessary?</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2006:site.39223-605727</guid>
		<pubDate>Wed, 31 May 2006 04:13:43 -0800</pubDate>
		<dc:creator>Nothing</dc:creator>
	</item><item>
		<title>By: beerbajay</title>
		<link>http://ask.metafilter.com/39223/Why-does-my-Vietnamese-text-only-display-correctly-in-the-US#605731</link>	
		<description>Did you use a default font that they likely have, but which might not have Vietnamese characters? Find out a standard display font for Vietnamese characters and use that as your first choice and whatever you have now as a fallback font?</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2006:site.39223-605731</guid>
		<pubDate>Wed, 31 May 2006 04:19:52 -0800</pubDate>
		<dc:creator>beerbajay</dc:creator>
	</item><item>
		<title>By: xueexueg</title>
		<link>http://ask.metafilter.com/39223/Why-does-my-Vietnamese-text-only-display-correctly-in-the-US#605865</link>	
		<description>It would help to have a link to either the page in question, or some sample characters/entities that aren&apos;t coming through properly.  At least an excerpt of a page that isn&apos;t working right.  If you have trouble pasting that excerpt usably into the MeFi textarea, send it to my email address in plaintext.&lt;br&gt;
&lt;br&gt;
Some things that I check with these problems on my own sites (though I don&apos;t use entities):&lt;br&gt;
&lt;br&gt;
- does the server&apos;s announced Content-type encoding match the header&apos;s encoding declaration? (validator.w3.org will tell you, as will &lt;tt&gt;lynx -head -dump http://yoursite.mil/yourpage.xml&lt;/tt&gt;)  You should probably be declaring an encoding in your header.  But I don&apos;t think you or I know what that encoding should be yet.&lt;br&gt;
&lt;br&gt;
- I think the failure upon explicitly setting UTF-8 suggests that you&apos;re not actually giving UTF8.&lt;br&gt;
&lt;br&gt;
- are these entities in an 8-bit VISCIIish encoding, or UTF, or what?  Are there other encodings present on the page?&lt;br&gt;
&lt;br&gt;
- what was your reference for encoding these entities?</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2006:site.39223-605865</guid>
		<pubDate>Wed, 31 May 2006 07:05:50 -0800</pubDate>
		<dc:creator>xueexueg</dc:creator>
	</item><item>
		<title>By: Sharcho</title>
		<link>http://ask.metafilter.com/39223/Why-does-my-Vietnamese-text-only-display-correctly-in-the-US#606144</link>	
		<description>It might be that you&apos;re using Windows XP, and they&apos;re using some other operating system (e.g. Windows 98, Linux) that doesn&apos;t come with that font installed.&lt;br&gt;
&lt;br&gt;
It might be that your server sends the wrong HTTP header. Internet Explorer ignores that header, but any other browser will show it incorrectly if the header&apos;s wrong.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2006:site.39223-606144</guid>
		<pubDate>Wed, 31 May 2006 12:37:23 -0800</pubDate>
		<dc:creator>Sharcho</dc:creator>
	</item><item>
		<title>By: AmbroseChapel</title>
		<link>http://ask.metafilter.com/39223/Why-does-my-Vietnamese-text-only-display-correctly-in-the-US#606567</link>	
		<description>&lt;strong&gt;&lt;em&gt;I thought character references were independent of a document&apos;s encoding, referring directly to HTML&apos;s ISO-10646 roots? &lt;/em&gt;&lt;/strong&gt;&lt;br&gt;
&lt;br&gt;
Hmmm. I think you&apos;ve got that completely the wrong way around.&lt;br&gt;
&lt;br&gt;
This is why UTF-8 was invented, precisely because that &lt;em&gt;isn&apos;t &lt;/em&gt;true.&lt;br&gt;
&lt;br&gt;
Setting the page encoding is necessary, pre-Unicode, because there aren&apos;t enough entities to go around.&lt;br&gt;
&lt;br&gt;
So &lt;strong&gt;&amp;amp;#&amp;lt; some number&amp;gt;;&lt;/strong&gt; means one thing in Russian, another in Farsi, and something else again in Thai. Because it means different things in different encodings, you tell the browser which one to use.&lt;br&gt;
&lt;br&gt;
Unicode doesn&apos;t have that problem -- there are enough numbers that Vietnam can keep a set of them all to itself and not have to share.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2006:site.39223-606567</guid>
		<pubDate>Wed, 31 May 2006 21:41:22 -0800</pubDate>
		<dc:creator>AmbroseChapel</dc:creator>
	</item>
	</channel>
</rss>
