<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
     xmlns:admin="http://webns.net/mvcb/"
     xmlns:content="http://purl.org/rss/1.0/modules/content/"
     xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
	<channel> 

	<title>Comments on: Can't make sense of a Russian site</title>
	<link>http://ask.metafilter.com/32659/Cant-make-sense-of-a-Russian-site/</link>
	<description>Comments on Ask MetaFilter post Can't make sense of a Russian site</description>
	<pubDate>Tue, 14 Feb 2006 15:42:18 -0800</pubDate>
	<lastBuildDate>Tue, 14 Feb 2006 15:42:18 -0800</lastBuildDate>
	<language>en-us</language>
	<docs>http://blogs.law.harvard.edu/tech/rss</docs>
	<ttl>60</ttl>

	<item>
		<title>Question: Can&apos;t make sense of a Russian site</title>
		<link>http://ask.metafilter.com/32659/Cant-make-sense-of-a-Russian-site</link>	
		<description>Is &lt;a href=&quot;http://www.airwar.ru&quot;&gt;this&lt;/a&gt; website really Russian? &lt;br /&gt;&lt;br /&gt; Every once in awhile I run across a site like this that looks Russian but fails to use Cyrillic characters.  All I see in my browser on such &lt;a href=&quot;http://www.airwar.ru/enc/fighter/mig25m.html&quot;&gt;pages&lt;/a&gt; are phonetic Romanized spellings of what looks like Russian (i.e. &quot;bjlo povjsit&apos; vjsotnje i skorostnje&quot; using regular Roman characters).  I tried changing View &gt; Character Encoding in Firefox (Win2K) but that doesn&apos;t make it Cyrillic.  Also the Russian-English &lt;a href=&quot;http://www.online-translator.com/srvurl.asp?lang=en&quot;&gt;translator&lt;/a&gt; sites choke on the page.&lt;br&gt;
&lt;br&gt;
I did put Cyrillic in my Regional Options, and can view &quot;normal&quot; Russian sites like &lt;a href=&quot;http://www.pravda.ru/&quot;&gt;this one&lt;/a&gt; in Cyrillic just fine.&lt;br&gt;
&lt;br&gt;
My ultimate goal is to feed such pages into an autotranslator so I can read them in English.  And yes, I know the site has an English version, but it lacks a lot of content I want to see.</description>
		<guid isPermaLink="false">post:ask.metafilter.com,2006:site.32659</guid>
		<pubDate>Tue, 14 Feb 2006 15:33:34 -0800</pubDate>
		<dc:creator>shannymara</dc:creator>
		
			<category>russian</category>
		
			<category>characters</category>
		
			<category>browser</category>
		
	</item> <item>
		<title>By: shannymara</title>
		<link>http://ask.metafilter.com/32659/Cant-make-sense-of-a-Russian-site#510120</link>	
		<description>Update: Google&apos;s cache of it is all Cyrillic.  So I have that to work with, but that doesn&apos;t answer why it won&apos;t display in Cyrillic on my computer.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2006:site.32659-510120</guid>
		<pubDate>Tue, 14 Feb 2006 15:42:18 -0800</pubDate>
		<dc:creator>shannymara</dc:creator>
	</item><item>
		<title>By: Pontius Pilate</title>
		<link>http://ask.metafilter.com/32659/Cant-make-sense-of-a-Russian-site#510122</link>	
		<description>Yup, it&apos;s Russian. The title is &quot;A corner of the sky.&quot; It&apos;s showing up just fine in Russian in my IE6 and Firefox with default settings.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2006:site.32659-510122</guid>
		<pubDate>Tue, 14 Feb 2006 15:42:45 -0800</pubDate>
		<dc:creator>Pontius Pilate</dc:creator>
	</item><item>
		<title>By: jessamyn</title>
		<link>http://ask.metafilter.com/32659/Cant-make-sense-of-a-Russian-site#510123</link>	
		<description>&lt;small&gt;[fixed url]&lt;/small&gt;</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2006:site.32659-510123</guid>
		<pubDate>Tue, 14 Feb 2006 15:45:51 -0800</pubDate>
		<dc:creator>jessamyn</dc:creator>
	</item><item>
		<title>By: michaelkuznet</title>
		<link>http://ask.metafilter.com/32659/Cant-make-sense-of-a-Russian-site#510126</link>	
		<description>Yeah, Cyrillic Russian here too (Safari).</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2006:site.32659-510126</guid>
		<pubDate>Tue, 14 Feb 2006 15:49:29 -0800</pubDate>
		<dc:creator>michaelkuznet</dc:creator>
	</item><item>
		<title>By: nixxon</title>
		<link>http://ask.metafilter.com/32659/Cant-make-sense-of-a-Russian-site#510141</link>	
		<description>FWIW, I see the non-cyrillic version as well (Opera on XP).</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2006:site.32659-510141</guid>
		<pubDate>Tue, 14 Feb 2006 16:18:44 -0800</pubDate>
		<dc:creator>nixxon</dc:creator>
	</item><item>
		<title>By: gimonca</title>
		<link>http://ask.metafilter.com/32659/Cant-make-sense-of-a-Russian-site#510168</link>	
		<description>Hmm. Works in IE for me, doesn&apos;t work in Firefox.&lt;br&gt;
&lt;br&gt;
Here are the HTTP headers:&lt;br&gt;
&lt;br&gt;
HTTP/1.1&#183;200&#183;OK(CR)(LF)&lt;br&gt;
Date:&#183;Wed,&#183;15&#183;Feb&#183;2006&#183;00:29:40&#183;GMT(CR)(LF)&lt;br&gt;
Server:&#183;Apache(CR)(LF)&lt;br&gt;
Content-Type:&#183;text/html;&#183;charset=windows-1251(CR)(LF)&lt;br&gt;
Content-Language:&#183;ru(CR)(LF)&lt;br&gt;
Connection:&#183;close(CR)(LF)&lt;br&gt;
Transfer-Encoding:&#183;chunked(CR)(LF)&lt;br&gt;
(CR)(LF)</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2006:site.32659-510168</guid>
		<pubDate>Tue, 14 Feb 2006 16:49:44 -0800</pubDate>
		<dc:creator>gimonca</dc:creator>
	</item><item>
		<title>By: sbutler</title>
		<link>http://ask.metafilter.com/32659/Cant-make-sense-of-a-Russian-site#510200</link>	
		<description>Yes. The difference is probably what charsets your browser says it can accept. For example, here is a default packet capture with Camino:&lt;br&gt;
&lt;br&gt;
&lt;small&gt;GET / HTTP/1.1&lt;br&gt;
Host: www.airwar.ru&lt;br&gt;
User-Agent: Mozilla/5.0 (Macintosh; U; PPC Mac OS X Mach-O; en-US; rv:1.8.0.1) Gecko/20060214 Camino/1.0 (MultiLang)&lt;br&gt;
Accept: text/xml,application/xml,application/xhtml+xml,text/html;q=0.9,text/plain;q=0.8,image/png,*/*;q=0.5&lt;br&gt;
Accept-Language: fr,en;q=0.9,ja;q=0.9,de;q=0.8,es;q=0.7,it;q=0.7,nl;q=0.6,sv;q=0.5,nb;q=0.5,da;q=0.4,fi;q=0.3,pt;q=0.3,zh-Hans;q=0.2,zh-Hant;q=0.1,ko;q=0.1&lt;br&gt;
Accept-Encoding: gzip,deflate&lt;br&gt;
Accept-Charset: ISO-8859-1,utf-8;q=0.7,*;q=0.7&lt;br&gt;
Keep-Alive: 300&lt;br&gt;
Connection: keep-alive&lt;br&gt;
Cookie: b=b&lt;br&gt;
&lt;br&gt;
&lt;br&gt;
HTTP/1.1 200 OK&lt;br&gt;
Date: Wed, 15 Feb 2006 01:50:51 GMT&lt;br&gt;
Server: Apache&lt;br&gt;
Content-Type: text/html; charset=ISO-8859-1&lt;br&gt;
Content-Language: ru&lt;br&gt;
Connection: close&lt;br&gt;
Transfer-Encoding: chunked&lt;/small&gt;&lt;br&gt;
&lt;br&gt;
Notice how my browser says it accepts ISO-8859-1, so the site sends content in ISO-8859-1 (the non-cyrillic version). Now, here is the same site, this time with Safari:&lt;br&gt;
&lt;br&gt;
&lt;small&gt;GET / HTTP/1.1&lt;br&gt;
Accept: */*&lt;br&gt;
Accept-Language: fr&lt;br&gt;
Accept-Encoding: gzip, deflate&lt;br&gt;
Cookie: b=b&lt;br&gt;
User-Agent: Mozilla/5.0 (Macintosh; U; PPC Mac OS X; fr) AppleWebKit/417.9 (KHTML, like Gecko) Safari/417.8&lt;br&gt;
Connection: keep-alive&lt;br&gt;
Host: www.airwar.ru&lt;br&gt;
&lt;br&gt;
&lt;br&gt;
HTTP/1.1 200 OK&lt;br&gt;
Date: Wed, 15 Feb 2006 01:58:33 GMT&lt;br&gt;
Server: Apache&lt;br&gt;
Content-Type: text/html; charset=koi8-r&lt;br&gt;
Content-Language: ru&lt;br&gt;
Connection: close&lt;br&gt;
Transfer-Encoding: chunked&lt;/small&gt;&lt;br&gt;
&lt;br&gt;
As you can see, Safari did not add an Accept-Charset header! So the site sent the content in it&apos;s prefered charset (koi8-r) which uses a cyrillic alphabet.&lt;br&gt;
&lt;br&gt;
I don&apos;t know how you&apos;d get Mozilla based products to ask for a different charset. It appears that changing it in the menu only affects how an already downloaded page is interpreted (that is, it doesn&apos;t change what Accept-Charset header Mozilla sends to the server).</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2006:site.32659-510200</guid>
		<pubDate>Tue, 14 Feb 2006 18:09:43 -0800</pubDate>
		<dc:creator>sbutler</dc:creator>
	</item><item>
		<title>By: sbutler</title>
		<link>http://ask.metafilter.com/32659/Cant-make-sense-of-a-Russian-site#510209</link>	
		<description>Ahh ha! Figured out how to make Mozilla products &quot;work&quot;.&lt;br&gt;
&lt;br&gt;
1. Type &quot;about:config&quot; and filter for &quot;charset&quot;.&lt;br&gt;
2. Add to the setting &quot;intl.accept_charsets&quot; the value &quot;koi8-r&quot;.&lt;br&gt;
3. Change &quot;intl.charset.default&quot; to &quot;koi8-r&quot;. &lt;br&gt;
&lt;br&gt;
If you reload the page then it should appear cyrillic.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2006:site.32659-510209</guid>
		<pubDate>Tue, 14 Feb 2006 18:24:46 -0800</pubDate>
		<dc:creator>sbutler</dc:creator>
	</item><item>
		<title>By: gimonca</title>
		<link>http://ask.metafilter.com/32659/Cant-make-sense-of-a-Russian-site#510240</link>	
		<description>Odd that Firefox wouldn&apos;t put that under View &amp;gt; Character Encoding. None of the various settings there would work for me.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2006:site.32659-510240</guid>
		<pubDate>Tue, 14 Feb 2006 19:53:21 -0800</pubDate>
		<dc:creator>gimonca</dc:creator>
	</item>
	</channel>
</rss>
