<?xml version="1.0" encoding="UTF-8"?><!-- generator="wordpress/1.5.2" -->
<rss version="0.92">
<channel>
	<title>Hacklog: Blogamundo</title>
	<link>http://blogamundo.net/dev</link>
	<description>poking holes in the language barrier since approximately 1 month from now</description>
	<lastBuildDate>Thu, 15 May 2008 23:52:04 +0000</lastBuildDate>
	<docs>http://backend.userland.com/rss092</docs>
	<language>en</language>

	<item>
		<title>&#8220;Machine Translation&#8221; is a Misnomer</title>
		<description>	An ickle rant ensues.
	The public doesn&#8217;t understand how machine translation works. And generally speaking, the public doesn&#8217;t understand that machine translation couldn&#8217;t exist without human translators in the loop. 
	In other words, it&#8217;s not really &#8220;machines&#8221; that are &#8220;doing&#8221; the translating, it&#8217;s people. The machines are simply programmed to imitate ...</description>
		<link>http://blogamundo.net/dev/2008/05/15/machine-translation-is-a-misnomer/</link>
	</item>
	<item>
		<title>Aligning translations with text compression?</title>
		<description>	Dear interwebs series of tubes people:
	Random thought: 
	If you have two translations, and you perform some sort of compression on the both of them, could interesting relationships between the content of the two translations be uncovered? For instance, it seems like you might be able to get rid of non-content ...</description>
		<link>http://blogamundo.net/dev/2008/05/13/aligning-translations-with-text-compression/</link>
	</item>
	<item>
		<title>Unicode headed toward World Domination™</title>
		<description>	The Google Blog has a chart showing that there is a very clear trend toward Unicode adoption. 
	Apparently their numbers refer to UTF-8 alone (as opposed to UTF-16/UCS-2 or (haha)UTF-32/UCS-4), which again is good news. (Though one wonders if there is any uptake of UTF-16 on the web&#8230; I hope ...</description>
		<link>http://blogamundo.net/dev/2008/05/05/unicode-headed-toward-world-domination%e2%84%a2/</link>
	</item>
	<item>
		<title>Of the Media, Scientists, Word Lengths, and Colossal Squids</title>
		<description>	I&#8217;m a big fan of Pharyngula, but&#8230; he was kinna wrong about a nitpicky little detail. And this particular nitpicky little detail was about language, so, the truth must out!
	In his post As big as dinner plates?, Dr. Myers compares two articles about the recent dissection of a Colossal squid. ...</description>
		<link>http://blogamundo.net/dev/2008/05/01/of-the-media-scientists-word-lengths-and-colossal-squids/</link>
	</item>
	<item>
		<title>Worth a look: An Introduction to Opentype</title>
		<description>	I&#8217;m a Unicode geek, but I feel like I don&#8217;t really know enough about what goes on in operating systems after the encoding and decoding issues are worked out. That is, when and where does all of that glyph selection and font shaping and other black magic actually happen? How ...</description>
		<link>http://blogamundo.net/dev/2008/04/26/worth-a-look-an-introduction-to-opentype/</link>
	</item>
	<item>
		<title>Upper West Side, Zona Sul, and other tricky subdivisions</title>
		<description>	Random translation observation:
	I was translating some Brazilian Portuguese into English, and the source article was about an earthquake near São Paulo. (See? Brazil does have natural disasters!) 
	A particular phrase got me thinking: a zona leste de São Paulo meaning something like &#8220;the Eastern Zone of São Paulo.&#8221; That&#8217;s a ...</description>
		<link>http://blogamundo.net/dev/2008/04/24/upper-west-side-zona-sul-and-other-tricky-subdivisions/</link>
	</item>
	<item>
		<title>Unicode support in Ruby1.9! Yippee!</title>
		<description>	
$ cat unicode.rb
# -*- coding: utf-8 -*-
	s = &#8220;ABCあいう&#8221;
puts  &#8220;s: #{s}&#8221;
puts &#8220;s[0]: #{s[0]}&#8221;
puts &#8220;s[3,1]: #{s[3,1]}&#8221;
	puts &#8220;s.length: #{s.length}&#8221;
puts &#8220;s.reverse: #{s.reverse}&#8221;
puts &#8220;s.encoding: #{s.encoding}&#8221;
	$ ruby1.8 unicode.rb
s: ABCあいう
s[0]: 65
s[3,1]:
s.length: 12
s.reverse: ��㄁め�CBA
unicode.rb:10: undefined method `encoding&#8217; for &#8220;ABC\343\201\202\343\201\204\343\201\206&#8243;:String (NoMethodError)
	$ ruby1.9 unicode.rb
s: ABCあいう
s[0]: A
s[3,1]: あ
s.length: 6
s.reverse: ういあCBA
s.encoding: UTF-8
$ # yay!

	Big ups to Matz.

 </description>
		<link>http://blogamundo.net/dev/2008/03/31/unicode-support-in-ruby19-yippee/</link>
	</item>
	<item>
		<title>Welcome to Nowhere</title>
		<description>	And now, a wee rant.
	Check out this page:
	CityCarClub.info 
	Now, tell me where exactly in the world this carsharing business exists. This is the front page of the company!
	I was linked to this site by a friend in Helsinki (in the context of another conversation, about carsharing, as it happens). But ...</description>
		<link>http://blogamundo.net/dev/2008/02/29/welcome-to-nowhere/</link>
	</item>
	<item>
		<title>Machine translation and Open Source</title>
		<description>	Information Week blogger Serdar Yegulalp has some thoughts on the intersection of machine translation and open source:
Talk To Me, Openly - Open Source Blog - InformationWeek 
	He&#8217;s got an interesting anecdote about how he tackled studying Japanese, and it serves as an interesting intro to the idea behind bitext and ...</description>
		<link>http://blogamundo.net/dev/2008/02/27/machine-translation-and-open-source/</link>
	</item>
	<item>
		<title>Don&#8217;t sort stuff in Unicode with Bash?</title>
		<description>	Update: Okay, duh: I shouldn&#8217;t have called it &#8220;Bash&#8221;. What I meant was, &#8220;whatever the sort utility is in my default terminal.&#8221; Which, as Bryan points out in a comment below, has nothing to do with Bash: it&#8217;s GNU Sort. More updates below.
	I have a little text file with &#8220;Hello ...</description>
		<link>http://blogamundo.net/dev/2008/02/16/dont-sort-stuff-in-unicode-with-bash/</link>
	</item>
</channel>
</rss>
