<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
     xmlns:admin="http://webns.net/mvcb/"
     xmlns:content="http://purl.org/rss/1.0/modules/content/"
     xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
	<channel>
	  <title>Ask MetaFilter questions tagged with unicode</title>
      <link>http://ask.metafilter.com/tags/unicode</link>
      <description>Questions tagged with 'unicode' at Ask MetaFilter.</description>
	  <pubDate>Tue, 17 Nov 2009 05:33:07 -0800</pubDate> <lastBuildDate>Tue, 17 Nov 2009 05:33:07 -0800</lastBuildDate>

      <language>en-us</language>
	  <docs>http://blogs.law.harvard.edu/tech/rss</docs>
	  <ttl>60</ttl>	  
	<item>
	<title>Looking for Outlook setting.</title>
	<link>http://ask.metafilter.com/138329/Looking%2Dfor%2DOutlook%2Dsetting</link>	
	<description>Is there an Outlook setting to tell it to substitute a glyph from an alternate font if the current display font doesn&apos;t have a glyph for a particular character? For example, I send a message with left- and right-ceiling brackets (U+2308 and U+2309) to a colleague who uses Outlook. No font is specified in the message. Outlook uses Arial or whatever the default is to display the message.  The stock Arial that comes with Windows does not contain a glyph for U+2308 and U+2309, so he can&apos;t see the bracket characters. Most e-mail programs will do the obvious thing and substitute a glyph from another font; however, Outlook does not, at least not out of the box. Is there a setting down in the bowels somewhere that will get Outlook to do this?</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2009:site.138329</guid>
	<pubDate>Tue, 17 Nov 2009 05:33:07 -0800</pubDate>
	<category>font</category>
	<category>outlook</category>
	<category>unicode</category>
	<dc:creator>iconjack</dc:creator>
	</item>
	<item>
	<title>Pimp my browser.</title>
	<link>http://ask.metafilter.com/131904/Pimp%2Dmy%2Dbrowser</link>	
	<description>I&apos;m trying to get extensions to work with the chrome beta browser.  My first candidate is AdBlock and I looked &lt;a href=&quot;http://www.mark8t.com/2009/03/20/creating-google-chrome-extensions-beta-adblock-for-chrome/&quot;&gt;here.&lt;/a&gt; However, I can&apos;t seem to get it to work.  I created a manifest.json file but when I saved it as plain text, chrome complained that it had to be in utf-8.  So I saved it as &quot;unicode&quot; (in wordpad) but I still have the same error.  &lt;br&gt;
&lt;br&gt;
Have any of you tried to do a chrome extension and succeeded?  Do any of you know how to convert a text file to utf-8?</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2009:site.131904</guid>
	<pubDate>Thu, 03 Sep 2009 12:59:12 -0800</pubDate>
	<category>AdBlock</category>
	<category>chrome</category>
	<category>extensions</category>
	<category>resolved</category>
	<category>unicode</category>
	<dc:creator>Obscure Reference</dc:creator>
	</item>
	<item>
	<title>Fixed-width Arabic fonts.</title>
	<link>http://ask.metafilter.com/117351/Fixedwidth%2DArabic%2Dfonts</link>	
	<description>There are a couple fixed-width Unicode Arabic fonts out there.  The ones I&apos;ve seen are pretty hard on the eyes, though, especially at small sizes.  Any suggestions for a really &lt;i&gt;readable&lt;/i&gt; one?  (The goal, FWIW, is to be able to work with mixed Arabic/Latin-alphabet text in a terminal window without going blind or blowing the Latin text up to a ridiculous size.)</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2009:site.117351</guid>
	<pubDate>Sat, 21 Mar 2009 14:52:47 -0800</pubDate>
	<category>arabic</category>
	<category>commandline</category>
	<category>fixedwidth</category>
	<category>font</category>
	<category>monospace</category>
	<category>terminal</category>
	<category>unicode</category>
	<dc:creator>nebulawindphone</dc:creator>
	</item>
	<item>
	<title>Pokemon font?</title>
	<link>http://ask.metafilter.com/113053/Pokemon%2Dfont</link>	
	<description>Is there a dingbats font that uses Pokemon as characters or can this be done with Unicode? If there isn&apos;t can somebody here make it? C&apos;mon, how awesome would that be? (answer: _pretty_ awesome)</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2009:site.113053</guid>
	<pubDate>Sat, 31 Jan 2009 10:57:14 -0800</pubDate>
	<category>dingbats</category>
	<category>fonts</category>
	<category>pikichuichooseyou</category>
	<category>pokemon</category>
	<category>reallyawesome</category>
	<category>unicode</category>
	<dc:creator>n&#xed;mwunnan</dc:creator>
	</item>
	<item>
	<title>Deeply anal-retentive font geek question about IPA fonts.</title>
	<link>http://ask.metafilter.com/109553/Deeply%2Danalretentive%2Dfont%2Dgeek%2Dquestion%2Dabout%2DIPA%2Dfonts</link>	
	<description>Roman IPA fonts and the non-roman characters who love them. I&apos;m looking for compatible fonts &#8212; a single family, or fonts from different families that look nice together &#8212; that do all of the following:&lt;ul&gt;&lt;li&gt;Include a wide range of IPA and other Unicode Latin characters, at least in serifed roman and ideally in italic, bold and bold italic too.&lt;/li&gt;&lt;li&gt;Include monospace characters, bold monospace characters, and slanted monospace characters, at least for the plain ASCII character set.&lt;/li&gt;&lt;li&gt;Include small caps, at least for the plain ASCII character set.&lt;/li&gt;&lt;/ul&gt;&lt;br&gt;
Why, you ask?  I&apos;m a linguistics student (making the IPA and oddball Latin characters a necessity), I sometimes write software documentation (where I like to use monospaced fonts for things like user input and output) and I sometimes write about topics like metaphor and frame semantics (where small caps are conventionally used for names of metaphors or of frames).  &lt;br&gt;
&lt;br&gt;
At the moment, I&apos;m doing all three &#8212; writing documentation for a bit of linguistics software having to do with frame semantics.  We&apos;re self-publishing the result, and it would be really nice if it came out looking reasonably-professional.  &lt;br&gt;
&lt;br&gt;
Here&apos;s the fonts I&apos;ve looked at: SIL&apos;s Charis and Doulos don&apos;t have monospaced characters or small caps.  TITUS Cyberbit doesn&apos;t have anything but plain roman serifed characters.   Computer Modern Roman doesn&apos;t have bold and bold italic monospaced characters, or bold italic serifed characters.   Deja Vu doesn&apos;t have small caps.&lt;br&gt;
&lt;br&gt;
(If all else fails, I will just use Deja Vu for everything and fake the small caps by shrinking down regular capital letters.  But first, I want to satisfy my anal-retentive streak by finding out what it would take to do things The Right Way.  We use XeTeX, so either TeX fonts or Type 1/TrueType/OpenType fonts will work.  Professional fonts are probably not in our budget, but let me know if there&apos;s one or a set that&apos;ll do this anyway &#8212; I&apos;m still curious what the real solution would look like.)</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2008:site.109553</guid>
	<pubDate>Thu, 18 Dec 2008 14:56:14 -0800</pubDate>
	<category>font</category>
	<category>ipa</category>
	<category>linguistics</category>
	<category>typesetting</category>
	<category>unicode</category>
	<dc:creator>nebulawindphone</dc:creator>
	</item>
	<item>
	<title>Unicode replacement characters</title>
	<link>http://ask.metafilter.com/100601/Unicode%2Dreplacement%2Dcharacters</link>	
	<description>Is there a name for the domino like box characters with 4 characters inside that display when you don&apos;t have support for a language script?  There a few examples on this &lt;a href=&quot;http://www.i18nguy.com/unicode-example.html&quot;&gt; page&lt;/a&gt; that show up for me.  Bengal, Bhutan and Khmer for example show up as boxes with 4 characters inside.  The first two characters are often in common for all the boxes within each language example. &lt;br&gt;
In case it is system dependent - Windows XP, Firefox 3.0.1, English(US).</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2008:site.100601</guid>
	<pubDate>Mon, 01 Sep 2008 05:52:15 -0800</pubDate>
	<category>fonts</category>
	<category>I18n</category>
	<category>unicode</category>
	<dc:creator>srboisvert</dc:creator>
	</item>
	<item>
	<title>How serious are intersecting coordinates in a font?</title>
	<link>http://ask.metafilter.com/100506/How%2Dserious%2Dare%2Dintersecting%2Dcoordinates%2Din%2Da%2Dfont</link>	
	<description>How serious are intersecting coordinates in a font? I&apos;m making a Unicode font in FontCreator, using composite glyphs for the angstroms and c cedilla.  When I try to validate the font, FontCreator reports that everything is fine except for intersecting coordinates on &#xc5;, &#xe7;, and &#xc7;. &lt;small&gt;(I haven&apos;t included any Romanian glyphs, though maybe I should.)&lt;/small&gt;&lt;br&gt;
&lt;br&gt;
On those three characters, the diacritics are meant to connect.&lt;br&gt;
&lt;br&gt;
FontCreator, rather unhelpfully, says that it&apos;s up to me how many of the errors I choose to correct.&lt;br&gt;
&lt;br&gt;
Would any certain programs, esp. common word processors, have problems with the font if I leave in the errors?  e.g. I&apos;ve found mention that Flash chokes on characters with incorrect contour direction, but no mention of what problems, if any, programs might have with intersecting coordinates.&lt;br&gt;
&lt;br&gt;
Or should those characters be drawn as simple glyphs rather than composited?</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2008:site.100506</guid>
	<pubDate>Sat, 30 Aug 2008 13:45:03 -0800</pubDate>
	<category>font</category>
	<category>fontcreator</category>
	<category>fonts</category>
	<category>intersectingcoordinates</category>
	<category>unicode</category>
	<dc:creator>johnofjack</dc:creator>
	</item>
	<item>
	<title>Conditional search engine selection in Firefox?</title>
	<link>http://ask.metafilter.com/98613/Conditional%2Dsearch%2Dengine%2Dselection%2Din%2DFirefox</link>	
	<description>Conditional search engine selection in Firefox search box based on contents (in particular, character set) of the search query? I would like Firefox to use a different search engine depending on the type of string I enter into the search box. I think that regex matching, if possible, would do the trick.&lt;br&gt;
&lt;br&gt;
I like using the search box in Firefox.&lt;br&gt;
&lt;br&gt;
I like Google for most searches, but I want to use Yandex by default for any search that contains Cyrillic (Russian) characters.&lt;br&gt;
&lt;br&gt;
For example, if I&apos;m searching for &quot;keys wallet phone&quot;, Firefox should use Google, but if I&apos;m searching for &quot;&#1082;&#1083;&#1102;&#1095;&#1080; &#1073;&#1091;&#1084;&#1072;&#1078;&#1085;&#1080;&#1082; &#1090;&#1077;&#1083;&#1077;&#1092;&#1086;&#1085;&quot;, it should switch to Yandex. (Yandex should be used even if Latin characters are present, as long as at least one Cyrillic character is there.)&lt;br&gt;
&lt;br&gt;
How do I make Firefox switch search engines depending on the charset of the text I enter?</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2008:site.98613</guid>
	<pubDate>Thu, 07 Aug 2008 13:35:13 -0800</pubDate>
	<category>firefox</category>
	<category>google</category>
	<category>language</category>
	<category>regex</category>
	<category>russian</category>
	<category>search</category>
	<category>unicode</category>
	<dc:creator>qvtqht</dc:creator>
	</item>
	<item>
	<title>Why does Firefox hate hearts?</title>
	<link>http://ask.metafilter.com/91258/Why%2Ddoes%2DFirefox%2Dhate%2Dhearts</link>	
	<description>Firefox refuses to show the heart symbol (&#9829;) correctly. It shows up as a straight, vertical line instead. Is there anything I can do to fix it? Whenever I&apos;m browsing Facebook (or other websites that use the &#9829; symbol, but it most frequently appears on Facebook), Firefox refuses to display the symbol correctly. It works fine in Safari. I have a Macbook with OSX 10.5, and Firefox 2.0.0.14 (should be the most recent version -- I just updated today). Is there anything I can do to make this character display properly? Its a minor annoyance, and I have at times debated whether I really want to see &#9829;s wherever one chooses to put them, but I figure its better than seeing a screen full of lines. &lt;br&gt;
&lt;br&gt;
My friend also has a MacBook, using the same version of Firefox, and also has this problem. What gives?</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2008:site.91258</guid>
	<pubDate>Tue, 13 May 2008 00:12:13 -0800</pubDate>
	<category>Firefox</category>
	<category>Macbooks</category>
	<category>text</category>
	<category>unicode</category>
	<dc:creator>elisabethjw</dc:creator>
	</item>
	<item>
	<title>How do you deal with Chinese characters that can&apos;t be represented in 16 bits?</title>
	<link>http://ask.metafilter.com/87888/How%2Ddo%2Dyou%2Ddeal%2Dwith%2DChinese%2Dcharacters%2Dthat%2Dcant%2Dbe%2Drepresented%2Din%2D16%2Dbits</link>	
	<description>How are people dealing with &amp;gt;16 bit Unicode code points?  Specifically, in languages like Java, C# and C++, which assume 16 bit characters (I believe), how are you supporting &lt;a href=&quot;http://en.wikipedia.org/wiki/GB_18030&quot;&gt;GB 18030&lt;/a&gt;?  I would suspect that the various languages&apos; methods like substring(), charAt(), operator[], etc can&apos;t be safely used in China.   If your wstring, say, contains a Chinese string, then .size() doesn&apos;t tell you how many characters are in it, right?

On a related note, what interesting Chinese characters require more than &amp;gt;16 bits?  I&apos;m thinking about making a short presentation for my co-workers on this subject and I&apos;d like to have some interesting examples.

(Oh, and I&apos;m going to run any examples by my Chinese colleagues first, so don&apos;t bother trying to make me say &quot;penis&quot; or something in front of my co-workers :-))</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2008:site.87888</guid>
	<pubDate>Fri, 04 Apr 2008 08:48:52 -0800</pubDate>
	<category>c</category>
	<category>chinese</category>
	<category>i18n</category>
	<category>java</category>
	<category>l10n</category>
	<category>unicode</category>
	<dc:creator>bonecrusher</dc:creator>
	</item>
	<item>
	<title>Captcha? &#1050;&#1072;&#1087;&#1090;&#1095;&#1072;? &#12459;&#12503;&#12481;&#12515;?</title>
	<link>http://ask.metafilter.com/83195/Captcha%2D%3F%3F%3F%3F%3F%3F%2D%3F%3F%3F%3F</link>	
	<description>Is using a Latin-alphabet captcha accessible across languages/character sets? I&apos;m designing a site which will hopefully deal with people worldwide, across a ton of languages and alphabets. Is it realistic to use an out-of-the-box captcha solution, even though they&apos;re all based on the Latin alphabet? Are the characters universally recognizable? Easy to enter on everyone&apos;s keyboard?&lt;br&gt;
&lt;br&gt;
I don&apos;t think the site is going to be spam central or anything, and I&apos;ll gladly strip it down to numbers-only to give bots a little road bump, I&apos;d just rather  have a little protection in place rather than nothing. Alternatives I&apos;ve come across (math equations, &apos;common sense&apos; questions) mostly seem silly or more trouble than they&apos;re worth, but I&apos;m open to suggestions.</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2008:site.83195</guid>
	<pubDate>Fri, 08 Feb 2008 20:40:12 -0800</pubDate>
	<category>accessibility</category>
	<category>alphabets</category>
	<category>bots</category>
	<category>captcha</category>
	<category>captchas</category>
	<category>language</category>
	<category>unicode</category>
	<dc:creator>soma lkzx</dc:creator>
	</item>
	<item>
	<title>We are not the masters of subtitling yet</title>
	<link>http://ask.metafilter.com/78808/We%2Dare%2Dnot%2Dthe%2Dmasters%2Dof%2Dsubtitling%2Dyet</link>	
	<description>I (well, my boss) am/is inputting Chinese-language sub-station Alpha subtitles into VirtualDub for DVD&apos;s, and what comes out are, surprise surprise, piles of unicode.  The citizens of the PRC are not known to be fluent in unicode, so what could be going wrong and how do I, his designated software monkey, fix it?  Chinese language packs are installed, for the record.</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2007:site.78808</guid>
	<pubDate>Sun, 16 Dec 2007 18:39:20 -0800</pubDate>
	<category>Chinese</category>
	<category>dvd</category>
	<category>substationalpha</category>
	<category>subtitling</category>
	<category>unicode</category>
	<category>videorendering</category>
	<category>virtualdub</category>
	<dc:creator>saysthis</dc:creator>
	</item>
	<item>
	<title>How to make MD5, Java and Unicode play nicely</title>
	<link>http://ask.metafilter.com/78326/How%2Dto%2Dmake%2DMD5%2DJava%2Dand%2DUnicode%2Dplay%2Dnicely</link>	
	<description>How do I make MD5, Java and Unicode play nicely together? I don&apos;t really get unicode and character sets.  Now I need to. I&apos;m trying to reproduce the same effect in java that &lt;a href=&quot;http://pajhome.org.uk/crypt/md5/md5src.html&quot;&gt;this javascript&lt;/a&gt; gets when the &lt;code&gt;chrsz&lt;/code&gt; variable is set to 16.  When the variable is set to 8, the &lt;a href=&quot;http://pajhome.org.uk/crypt/md5/contrib/md5.java.txt&quot;&gt;java version&lt;/a&gt; gives the same result as the javascript.  I need the java version to give the same result as the javascript version does when &lt;code&gt;chrsz&lt;/code&gt; equals 16.&lt;br&gt;
&lt;br&gt;
I&apos;m not great at java.  I know enough to be dangerous.  I thought I was pretty hot stuff at javascript, but it might help if I knew what &amp;lt;&amp;lt; or &amp;gt;&amp;gt;&amp;gt; meant in javascript.  As you can imagine, that&apos;s really hard to google.</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2007:site.78326</guid>
	<pubDate>Mon, 10 Dec 2007 19:44:55 -0800</pubDate>
	<category>bitwise</category>
	<category>endian</category>
	<category>java</category>
	<category>javascript</category>
	<category>md5</category>
	<category>resolved</category>
	<category>unicode</category>
	<dc:creator>idb</dc:creator>
	</item>
	<item>
	<title>Is there any way to surf Japanese web pages on a US cell phone?</title>
	<link>http://ask.metafilter.com/77407/Is%2Dthere%2Dany%2Dway%2Dto%2Dsurf%2DJapanese%2Dweb%2Dpages%2Don%2Da%2DUS%2Dcell%2Dphone</link>	
	<description>I have a US cell phone (LG Muziq, Sprint is the provider). I would REALLY like to be able to read Japanese web pages on it, but whenever I surf to a page with Japanese text it just shows up as garbage. I&apos;ve installed Opera Mini on my phone, but that didn&apos;t help, even though the demo on their website does show Japanese characters. Is there &lt;em&gt;any&lt;/em&gt; way to get this to work? I don&apos;t need to be able to type in Japanese, or send Japanese text messages. I just want to be able to read Japanese web pages. If I can&apos;t do this on the phone directly, then maybe there&apos;s another website I could visit that would convert the Japanese text to images?</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2007:site.77407</guid>
	<pubDate>Thu, 29 Nov 2007 07:40:58 -0800</pubDate>
	<category>browser</category>
	<category>cell</category>
	<category>cellphone</category>
	<category>fonts</category>
	<category>international</category>
	<category>japanese</category>
	<category>keitai</category>
	<category>mobile</category>
	<category>phone</category>
	<category>unicode</category>
	<dc:creator>Vorteks</dc:creator>
	</item>
	<item>
	<title>Ask METARfilter: text weather to twitter?</title>
	<link>http://ask.metafilter.com/75002/Ask%2DMETARfilter%2Dtext%2Dweather%2Dto%2Dtwitter</link>	
	<description>I&apos;m looking for sources of weather data (current conditions, forecasts) that are in text formats so that I can turn them into a Twitter weather bot.  What I have in mind is either existing ASCII products (METAR, AVN MOS) or derivatives of image products that (e.g. a compact text encoding of a radar map through some kind of image analysis).  Ideas welcomed. The current weather bot is http://twitter.com/a2weather - it&apos;s only partially a bot since I have some stuff that I log to it by hand or from scripts.&lt;br&gt;
&lt;br&gt;
Some things that have been useful are Twitterfeed (RSS to Twitter, scheduled every so often) and a simple shell script for posting to Twitter via curl.  &lt;br&gt;
&lt;br&gt;
The radar piece is what I think would be the coolest - I suspect that throwing Imagemagick at a radar GIF and then counting pixels on a county by county basis would do most of the trick.</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2007:site.75002</guid>
	<pubDate>Tue, 30 Oct 2007 05:59:23 -0800</pubDate>
	<category>automation</category>
	<category>bot</category>
	<category>programming</category>
	<category>script</category>
	<category>twitter</category>
	<category>unicode</category>
	<category>weather</category>
	<dc:creator>edwardvielmetti</dc:creator>
	</item>
	<item>
	<title>Desperately seeking insular G! </title>
	<link>http://ask.metafilter.com/69890/Desperately%2Dseeking%2Dinsular%2DG</link>	
	<description>How do I upgrade unicode?
This is an &lt;a href=&quot;http://en.wikipedia.org/wiki/Insular_G&quot;&gt;insular g&lt;/a&gt;&lt;br&gt;
&lt;br&gt;
It&apos;s Unicode is U+1D79. I have Mac OS X and it&apos;s not listed in the Unicode characters on the Character Palette. How do I upgrade my unicode database to get an insular g? Thanks.</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2007:site.69890</guid>
	<pubDate>Wed, 22 Aug 2007 18:38:29 -0800</pubDate>
	<category>typography</category>
	<category>unicode</category>
	<dc:creator>quercus</dc:creator>
	</item>
	<item>
	<title>Unicode, Shunicode</title>
	<link>http://ask.metafilter.com/63266/Unicode%2DShunicode</link>	
	<description>Fonts Filter:  In MS Word 2002, I&apos;m trying to type a &quot;combining diaresis&quot; (umlaut applied to the preceding character). Sometimes it actually works; sometimes i get a square box.  I have no idea why it works when it does, or why it doesnt work when it doesnt. It does work consistently in Word 2007 on a different computer, so the easy answer is to upgrade Word. However, 2002 is supposed to be unicode compatible, and it DOES work on 2002 -- sometimes. I just dont know why it works when it does, or doesnt when it doesnt. &lt;br&gt;
&lt;br&gt;
When it doesnt work, I get a square box, and the font changes to &quot;Tahoma&quot; (from the previous arial or arial unicode).  When it does work, I get the combining diaresis correctly applied to the previous character, and the font remains what it was (I&apos;ve seen it work with times new roman, tahoma, arial, etc).&lt;br&gt;
&lt;br&gt;
To type in the diaresis (aka umlaut), i&apos;m using a temporary custom key assignment that I created using MKLC (microsoft keyboard layout creator). Using MKLC, you can assign any unicode character (in this case, the unicode for the combining diaresis) to any key combination. &lt;br&gt;
&lt;br&gt;
Any advice? I&apos;d rather not shell out 300 bucks for Office 2007, especially when i dont really know why it sometimes DOES work with 2002. &lt;br&gt;
&lt;br&gt;
I havent done anything differently as far as I can tell, to the computer, between the times when it does work and the times when it doesnt. &lt;br&gt;
&lt;br&gt;
help!</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2007:site.63266</guid>
	<pubDate>Wed, 23 May 2007 16:13:35 -0800</pubDate>
	<category>fonts</category>
	<category>unicode</category>
	<category>word</category>
	<dc:creator>jak68</dc:creator>
	</item>
	<item>
	<title>Unicode Conversion Formatting Issue</title>
	<link>http://ask.metafilter.com/57512/Unicode%2DConversion%2DFormatting%2DIssue</link>	
	<description>Unicode Filter: Converting an Office 98 document to Unicode has caused formatting problems. I can do a find and replace ok for the characters, but when I switch the font to Unicode the spacing around the Unicode characters goes haywire with gaps. Microsoft suggests using a Word Add-In called the &quot;broken fonts add-in&quot; that supposedly comes with Office 2003, but I am unable to find that on my Office 2003 disk. Any suggestions would be greatly appreciated.</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2007:site.57512</guid>
	<pubDate>Thu, 22 Feb 2007 20:19:13 -0800</pubDate>
	<category>office</category>
	<category>unicode</category>
	<category>word</category>
	<dc:creator>AArtaud</dc:creator>
	</item>
	<item>
	<title>how to make a &quot;registered&quot; symbol in unicode?</title>
	<link>http://ask.metafilter.com/49798/how%2Dto%2Dmake%2Da%2Dregistered%2Dsymbol%2Din%2Dunicode</link>	
	<description>how to make a &quot;registered&quot; (r-in-circle) symbol to go in a text file to appear in Flash we&apos;re trying to make a &quot;registered&quot; symbol appear in flash after being imported as part of a variable from a .txt file. We cant use &quot;&amp;amp; reg;&quot; b/c flash gets screwed up by the &quot;&amp;amp;&quot;. The flash is made by a 3rd party and they made a TM symbol by using &quot;%E2%84%A2.&quot;&lt;br&gt;
&lt;br&gt;
I believe this is UTF-8. we&apos;ve googled far and wide for an equivalent code for &quot;registered&quot; but nothing we can find works. We keep coming up with &quot;A2CE&quot; or something with only four characters that displays nothing. (we tried %A2%CE and various permutations but no luck)&lt;br&gt;
&lt;br&gt;
any help would be greatly appreciated.</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2006:site.49798</guid>
	<pubDate>Mon, 30 Oct 2006 19:41:02 -0800</pubDate>
	<category>flash</category>
	<category>html</category>
	<category>unicode</category>
	<dc:creator>drjimmy11</dc:creator>
	</item>
	<item>
	<title>Will an ipod display international characters?</title>
	<link>http://ask.metafilter.com/48934/Will%2Dan%2Dipod%2Ddisplay%2Dinternational%2Dcharacters</link>	
	<description>How does the new gen ipod handle international characters in ID3 tags of an mp3 file? So I&apos;m thiiis close to getting myself a 30 gig video ipod, and was wondering it&apos;ll scale to internationalized characters. My mp3 collection (all legit) is mostly world music, and I have, on prior occassions, painstakingly ensured that all non-English songs have their ID3 tags in their native languages. Which means, my iTunes collection has song titles and artists information in, among other scripts, Chinese, French, Thai, Urdu, Tamil, Telugu, Hindi and Punjabi.&lt;br&gt;
&lt;br&gt;
I asked the sales guy at the local Apple showroom about this, but I don&apos;t think he understood the question; he was polite about it, but he appeared incredulous as to why anyone would presume non-English songs wont &lt;i&gt;play&lt;/i&gt; on an ipod. &lt;br&gt;
&lt;br&gt;
I googled, of course, but I don&apos;t seem to be hitting the right keywords, which is why I&apos;m asking here.&lt;br&gt;
&lt;br&gt;
(If it&apos;s any help, I know that &lt;i&gt;itunes&lt;/i&gt; for Windows supports unicode; heck, that&apos;s where I editted most of my entries.)</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2006:site.48934</guid>
	<pubDate>Thu, 19 Oct 2006 01:35:49 -0800</pubDate>
	<category>i18n</category>
	<category>ipod</category>
	<category>unicode</category>
	<dc:creator>the cydonian</dc:creator>
	</item>
	<item>
	<title>PostScript + Unicode</title>
	<link>http://ask.metafilter.com/45691/PostScript%2DUnicode</link>	
	<description>Can I use Unicode characters in PostScript strings? Does PostScript understand Unicode? (in particular, the PostScript as implemented by GhostScript 4.6 or beyond)&lt;br&gt;
&lt;br&gt;
If so, how can I insert a Unicode character, for example U+2204,  into a string literal?</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2006:site.45691</guid>
	<pubDate>Thu, 31 Aug 2006 21:04:53 -0800</pubDate>
	<category>postscript</category>
	<category>unicode</category>
	<dc:creator>iconjack</dc:creator>
	</item>
	<item>
	<title>How can I preserve Unicode characters in `mysqdump`?</title>
	<link>http://ask.metafilter.com/45417/How%2Dcan%2DI%2Dpreserve%2DUnicode%2Dcharacters%2Din%2Dmysqdump</link>	
	<description>How can I make backups of a MySQL database (via &lt;code&gt;mysqldump&lt;/code&gt;) and preserve Unicode characters (such as em dashes)? I use &lt;code&gt;mysqldump&lt;/code&gt; and &lt;code&gt;rsync&lt;/code&gt; to make backups of my &lt;a href=&quot;http://textdrive.com/&quot; title=&quot;Welcome to TextDrive - Reliable, high performance web hosting you can trust&quot;&gt;TextDrive&lt;/a&gt; account. One of the forums I run, however, often includes unencoded Unicode entities, such as em dashes and bullets.&lt;br&gt;
&lt;br&gt;
Because these characters aren&apos;t straight ASCII, restoring from previous &lt;code&gt;mysqldump&lt;/code&gt; backups has mangled them, producing gibberish but preserving the other, straight ASCII text in the database.&lt;br&gt;
&lt;br&gt;
My current &lt;code&gt;mysqldump&lt;/code&gt; call looks like this: &lt;code&gt;/usr/local/bin/mysqldump -uuser -ppassword --quote-names --complete-insert --extended-insert --quick --lock-tables=false --skip-add-locks --all-databases | gzip &amp;gt; db.sql&lt;/code&gt;. What&apos;s necessary to preserve these Unicode characters?&lt;br&gt;
&lt;br&gt;
Thanks.</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2006:site.45417</guid>
	<pubDate>Mon, 28 Aug 2006 11:55:32 -0800</pubDate>
	<category>backup</category>
	<category>charset</category>
	<category>cron</category>
	<category>database</category>
	<category>mysql</category>
	<category>mysqldump</category>
	<category>unicode</category>
	<dc:creator>cmyers</dc:creator>
	</item>
	<item>
	<title>Mysterious characters in addresses</title>
	<link>http://ask.metafilter.com/39173/Mysterious%2Dcharacters%2Din%2Daddresses</link>	
	<description>Help me figure out these weird Spanish, Danish, and Portugese characters in an address given to me via the Internet.  I don&apos;t think what I see is what is meant. The sample from Denmark is&lt;br&gt;
   &#1100;sterbrogade&lt;br&gt;
where the first character looks like &lt;a href=&quot;http://www.webbaby.ru/det_pic/1111.gif&quot;&gt;this&lt;/a&gt;.  That&apos;s a Russian letter, so that can&apos;t be right.  What is the correct character?&lt;br&gt;
&lt;br&gt;
The Spanish address is&lt;br&gt;
   Gracia 15, 5&#xba; 2&#xaa;&lt;br&gt;
Where there is Gracia, fifteen, comma, five, then what looks like a degree symbol, then a two, then what looks like a little &quot;a&quot; in superscript.  It&apos;s rare that I see superscripts in an address so I am worried that these must be written as something else.  I actually see a lot of superscripted addresses like this come from Spain.&lt;br&gt;
&lt;br&gt;
Similarly, the Portugese address is&lt;br&gt;
  Outubro n&#xba; 198 6&#xba; &lt;br&gt;
where the &quot;n&quot; and &quot;6&quot; are followed by what looks like a degree symbol.&lt;br&gt;
&lt;br&gt;
How should these be written?</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2006:site.39173</guid>
	<pubDate>Tue, 30 May 2006 12:20:11 -0800</pubDate>
	<category>ascii</category>
	<category>characters</category>
	<category>characterset</category>
	<category>unicode</category>
	<dc:creator>chef_boyardee</dc:creator>
	</item>
	<item>
	<title>Modifying php-based web calendar to allow UTF-8 (Japanese) input</title>
	<link>http://ask.metafilter.com/35356/Modifying%2Dphpbased%2Dweb%2Dcalendar%2Dto%2Dallow%2DUTF8%2DJapanese%2Dinput</link>	
	<description>Probably easy question about PHP and unicode (UTF-8) and  RegEx. I&apos;m trying to modify a php webcalendar (VTcalendar) to allow Japanese text in calendar postings. I&apos;ve found all the variables to get the UTF-8 headers, and so japanese text manually inserted into pages appears fine. But, there&apos;s an input validation thingy I don&apos;t know how to modify. (short snippet inside) The calendar item input form rejects any Japanese text, and I think I&apos;ve traced it to the file &quot;inputvalidation.inc.php&quot; which starts with the code below. if I try to delete the part about allowable characters in line 7, I get an error about the &apos;^&apos; in the last line. can this be modified to allow UTF-8 characters?&lt;br&gt;
&lt;br&gt;
  if (!defined(&quot;ALLOWINCLUDES&quot;)) { exit; } // prohibits direct calling of include files&lt;br&gt;
 define(&quot;constValidTextCharWithoutSpacesRegEx&quot;,&apos;\w~!@#\$%^&amp;amp;*\(\)\-+=\{\}\[\]\|\\\:&quot;;\&apos;&lt;&gt;?,.\/&apos;);&lt;br&gt;
define(&quot;constValidTextCharWithSpacesRegEx&quot;,&apos;\s&apos;.constValidTextCharWithoutSpacesRegEx);&lt;br&gt;
	define(&quot;constCalendaridMAXLENGTH&quot;,20);&lt;br&gt;
	define(&quot;constCalendaridVALIDMESSAGE&quot;, &apos;1 to &apos;.constCalendaridMAXLENGTH.&apos; characters (A-Z,a-z,0-9,-,.)&apos;);&lt;br&gt;
  define(&quot;constCalendarnameMAXLENGTH&quot;,100);&lt;br&gt;
	define(&quot;constCalendarnameVALIDMESSAGE&quot;, &apos;1 to &apos;.constCalendarnameMAXLENGTH.&apos; characters (A-Z,a-z,0-9,-,.,&amp;amp;,\&apos;,[space],[comma])&apos;);&lt;br&gt;
	define(&quot;constCalendarTitleMAXLENGTH&quot;,50);&lt;br&gt;
  define(&quot;constKeywordMaxLength&quot;,100);&lt;br&gt;
  define(&quot;constSpecificsponsorMaxLength&quot;,100);&lt;br&gt;
  define(&quot;constPasswordMaxLength&quot;,20);&lt;br&gt;
  define(&quot;constPasswordRegEx&quot;, &apos;/^[&apos;.constValidTextCharWithoutSpacesRegEx.&apos;]{1,&apos;.constPasswordMaxLength.&apos;}$/&apos;);&lt;/&gt;</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2006:site.35356</guid>
	<pubDate>Wed, 29 Mar 2006 22:19:23 -0800</pubDate>
	<category>character</category>
	<category>inputvalidation</category>
	<category>japanese</category>
	<category>kanji</category>
	<category>php</category>
	<category>regex</category>
	<category>resolved</category>
	<category>unicode</category>
	<category>utf-8</category>
	<category>validation</category>
	<dc:creator>planetkyoto</dc:creator>
	</item>
	<item>
	<title>Special characters in Quark</title>
	<link>http://ask.metafilter.com/33337/Special%2Dcharacters%2Din%2DQuark</link>	
	<description>Font recommendation to solve a MS Word --&amp;gt; QuarkXPress issue? I have a lengthy book written in MS Word, pretty simple in all respects. Basic formatting, layout, etc and the font throughout is Times New Roman.&lt;br&gt;
&lt;br&gt;
The Unicode version of Times New Roman has lots of characters that I use to write the English transliteration of words from another language.  These are characters such as the letter &apos;T&apos; with a dash through it, the letters &apos;H&apos;, &apos;Z&apos;, or &apos;S&apos; with a &quot;hat&quot; accent above them, and such...&lt;br&gt;
&lt;br&gt;
However, since these characters are not in the first 256 characters of Times New Roman, QuarkXPress does not pick them up, since it does not have Unicode support.&lt;br&gt;
&lt;br&gt;
Is there a font out there that contains characters similar to these (doesn&apos;t have to be exactly the same) as well the normal characters in the first 256 spots so that I can use it with Quark, or does anyone have a better idea as to how I can solve this problem? &lt;br&gt;
&lt;br&gt;
Yes, for my next book I&apos;ll find another application, like InDesign (although I haven&apos;t researched its capabilities yet) since Quark is just so crappy...</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2006:site.33337</guid>
	<pubDate>Sat, 25 Feb 2006 15:24:34 -0800</pubDate>
	<category>font</category>
	<category>quark</category>
	<category>unicode</category>
	<category>word</category>
	<dc:creator>omair</dc:creator>
	</item>
	
	</channel>
</rss>

