2007

198 posts under this date.

Guess what language 2
0
0
7
Jan
17

I doubt someone would find this too useful but I smiled today when I found about the guess YubNub command. You feed it text, it gulps the language it’s in. A great way to showcase YubNub’s open-ended fun, courtesy of Xerox research. It would have been a godsend when I was dealing with Imagery’s multilingual rush (Oh, how GMail angered me then! Smart enough to correctly spellcheck anything I gave her, yet coyly keeping the language name to herself!). Hope I need it again soon.

For all of you that aren’t on the YubNub wagon yet, you can play with it here—but it won’t be even half as much fun ;).

And since we already seem to be on a language landslide, some months ago I found out playing with Google Translate that when you translate a website from Chinese to English (which is currently beta), you can hover on a sentence to get the original Chinese fragment in a quick popup. Mighty cool. All the more impressive a feature coming from a website. (Now let’s only hope they plan to add it to the other language pairs too…)

Google's Chinese Beta Popup

Final language tidbit: translate “Hello, how are you?” to Spanish with Google. Your immediate response is “¿Hola, cómo eres?,” sucking the life out of even the hardiest machine-translation enthusiast.

Star
The TTOEFL: The Turing Test of English as a Foreign Language 2
0
0
7
Jan
17

Turing!

Here’s a (controversial) idea for a language test inspired by the famous Turing test for artificial intelligenceWP:

a native speaker of language X engages in conversation with two other parties, one a native speaker of language X and the other a student of language X as a foreign language; if the judge cannot reliably tell which is which, then (and only then) can the student be said to speak language X.

The test could be easily constrained to test for more specific capabilities: one could test for written command of language X by only permitting written communications, test only for accent by limiting “communication” to the spoken repeating of the judge’s written sentences, and so on.

It is simply stated but almost a “thought test”WP—it could be done, but there would be a myriad practical complications and scaling would be a bitch. What’s important about it, though, is that it is a valid test to demand of (foreign) language learning: passing it should at least be its hypothetical goal.

The problem is that ridiculously few people would pass it if it where applied today. And because it seems impossibly difficult most people turn away, dismiss the test as wrong or irrelevant, and sink their heads in the sand (“what shouldn’t be, can’t be right”). Which only highlights the current sorry state of language education. It is NOT asking too much. It is not asking for exceptional performance—it doesn’t ask of you to be a Nobel-prize, a literati, or a rapper. It’s merely demanding average, pick-a-guy-from-the-street native-speaker capabilities. Why isn’t that a valid goal to ask of language education?

You could say that most people don’t need native-speaker level to start benefiting from a foreign language and that’s entirely true. But it is just as true that not reaching it is a serious, frustrating, even painful hurdle to communication. A hurdle that will plague ever more people the more the world shrinks. Some of the world’s smartest people can’t get their r’s right hard as they try. And we mock them for it. (Soon, we will be the mocked ones for not getting our intonations right.)

Well looked, Turing level is perhaps even a modest goal. We all possess it already in the language we are born into and we all contained within us the same language potentiality at birth. So it should be perfectly achievable and shouldn’t take nearly as much time as starting from zero.

Yes, I know. We are nowhere near knowing how to reach such a level efficiently. It’s too hard and too long a goal—currently. But we should at least strive for it. (And be honest with students on what the status quo of our language technology is: no more “Learn to speak Chinese in 21 days!”—for now.) Languages are some of the most complex and powerful artifacts we have created. It’s only to be expected that their learning is one of the most complex and difficult challenges we face.

But it is also one of our most rewarding (and valuable) experiences. I want to commoditize it.

Chances are we are on the brink of Turing level language translationELZR. Why aren’t we even close to practical Turing level language learning? I’d still want it.

Star
My very first webcomic: Excuse 2
0
0
7
Jan
16

It was a very simple idea—a girl and a boy, in the subway—and yet actually drawing it was a nightmare. There are any number of things I would do different for my next webcomic. I guess that means the effort was worth it: much was learned.

So, again, the idea is a girl and a boy in the subway (I used this Flickr photo to “remember” the subway). None of them can muster the courage to speak to each other, none of them can come up with a clever excuse for starting the conversation. Until the girl realizes there’s no bad excuse for meeting someone. And that is her excuse. That’s it.

Reminds me a lot of an elevator sign I scrawled years ago.

Crocs! 2
0
0
7
Jan
16

Crocs

I bought them New Year’s Eve at VallartaWP and they’re ugly and cliché, I know, but they’re uncannily comfy (almost like a second skin or a squishy crust) and I’m still amazed at how happy some silly Naussica-ish shoes can make me.

Jicama! 2
0
0
7
Jan
16

Jicama!

Making posters has been a hobby of mine since I can remember. At high school I tried to start a series of posters on moral values but only finished one on gayness and another on licentiousness. At college, I made one on Esperanto and several for the cinema club I ran with some friends.

This one, my latest, is about the Jicama fruitWP and it plays on a joke by Friends’s Chandler: “Cheese. It’s milk that you chew.” The funny thing is that for Jicamas it’s almost true, from the fruit’s pedia:

Jícama is high in carbohydrates in the form of dietary fiber. It is composed of 86-90% water; it contains only trace amounts of protein and lipids. Its sweet flavor comes from the oligofructose inulin (also called fructo-oligosaccharide), which the human body does not metabolize; this makes the root an ideal sweet snack for diabetics and dieters.

It's not the post, but the sequence 2
0
0
7
Jan
16

The problem with abandoning a blog is not the lost posts but the lost sequence. I’ve learned so much these last weeks and yet written so little that what I’ll now post may or may not make sense but will undoubtedly feel broken and out of place. Alas, I have lost the path that took me here and while I’ll try to mention it tangentially it will only be a pale sketch of what it really was. The emotions have cooled and forgotten are most of the shameful and silly detours, dead-ends, and retracings that led me to today. Which is a shame, because they were so much anguished fun.

So I apologize. But this blog is back on track. On steroids and with several weeks of bulging backlog. Après cet post-ci, le deluge.

Star
Blogs are comics (wikis are movies) 2
0
0
7
Jan
16

Scott McCloud begins Understanding ComicsAM wrangling to create a definition of comics, contrasting it with pictures and movies. Taken individually, a picture is merely that, a picture. Arranged sequentially however, pictures are transformed into something else. If the sequence is temporal—pictures alternating in time, fixed in space—we call that art movies. If the sequence is spatial—pictures adjacent in space, “juxtaposed”—we call it comics.

So that’s that. His is a great book and you won’t regret reading it, but for now hearken back with me to the web before blogs. There was, in such olden times, something called the «personal homepage»: primeval websites where people put their disjointed personal trivia for the world to admire. (See An Exploratory Profile of Personal Home Pages: Content, Design, Metaphors.) They were usually boring, disorienting, and self-flattering (think photo albums) but there had been nothing like that before, not on this scale—you could learn about fascinating people you’d otherwise never meet and there were some unbelievable things out there. But back to the photo album metaphor. Personal homepages were piles of scattered, assorted personal paraphernalia—they were, in a way, photo albums.

A photo heap. Intriguing at first but quickly unmanageable and unwieldy.

Then the blogs started to appear. It took a while to notice anything had changed. The diary metaphor obscured as much as it enlightened. With some hindsight it’s easy to pinpoint what happened—and to marvel at how simple yet radical a change it was. The blog era is when websites learned about sequence, spatial sequence. They stopped being fractal trees of buried content and became, yes, comics—post became the new panel.

By spatial, we mean that blogs, just as comics, unfold in space. There’s a post and right below it there’s another, further below, another, and so on. It makes as much sense to talk of a “blog strip” as of a “comic strip”—about the only difference is that the former goes from top-to-bottom while the latter from left-to-right (or right-to-left).

And then there’s sequence. Sequence brought context, interface and development to websites, it gave them personality, motion, and tension, made them subject to change and thus to evolution. Sequence brought time.

Every page in a blog has a natural context: it comes after the previous post and before the next. A blog’s homepage is simply a broad sweep of the most recent panels posts in the strip—an easy way to glimpse the website’s personality and recent happenings. If you’re faithful (or diligent), you can see the writing and the themes evolve through time. The mind fills in the gaps, the bleeds, and the continuity that emerges can feel as real and intense as reality itself. Spatial interface is a brilliant interface in its almost ridiculous simplicity.

(Blogs are the web’s poster boy for spatial sequence but they’re by no means the only (or the first) web form to avail itself of it. Comments, within blogs and beyond, are also spatially sequenced, are also comics. So are forum threads.)

A scene from Hayao Miyazaki’s NausicaaWP manga.
Such is the magic of comics. Such is the magic of blogs.

What, then, is the equivalent of movies in the web? What web form is temporally sequenced—”pictures” alternating in time, fixed in space? The hard part, really, was coming up with the question. The answer is obvious: wikis.

Consider a popular pediaELZR (Wikipedia article). It’s the product of a myriad interventions (most of them rather trivial) from several Wikipedists, but you are only seeing the most recent frame when visiting the pedia’s URL. If you visit in a couple of hours, chances are someone corrected a typo, reverted vandalism, or added a sentence, and you will thus see a slightly different frame—the pedia has “moved”! Unknown to a surprising amount of users, every pedia has a history where a log is kept of even the most minute change—the movie’s celluloid film. Wikis are movies.

Time-lapse bloom.
Compare with the Time-lapse pedia and its history.

Finally, a mac! 2
0
0
7
Jan
04