Kyle Machulis

Weird things happen when I get bored.

Luckily, they're usually things that make both me and others less bored.

I've had a 3Com Audrey sitting on my computer desk for a few weeks now, staring at me, begging me to do something with it. Unfortunately, all it can really do is run a web browser. A web browser from the late 90's at that. No CSS love for me, nor even flash or anything else. Nope, only the basics of HTML.

It seemed like it'd be a neat picture frame, at least. However, I didn't really want to just cycle random flickr pictures on it, because that's boring. There had to be some more interesting way to produce stuff for it, that would actually make me want to look at it.

Thus, Fwiktr was born.

The Fwiktr Creation Process

This is all done using python, because it seemed like a fun way to start picking up more parts of the language.

Here's the ingredient list for everything that's not included with python these days:

The idea behind Fwiktr is simple. In its current form, it's bascially a Chinese Room artist. It pulls the last 20 lines from the public timeline of Twitter, runs them through a parts of speech generator, and discards everything but the nouns. It then uses those nouns as basis to do tag searches on flickr. This is currently an "ANY" search, meaning that it will return pictures with any of the tags listed. It pulls a list of up to 100 images from flickr, picks one at random, dumps the information to a file, and uploads it to http://www.30helensagree.com

Right now, the Parts of Speech tagging happens through an external call to Treetagger. This is because before this project I'd had zero experience working with natural language processing, and I'm still working on figuring out exactly how one makes a nicely trained Brill Tagger in nltk. But I'll get to the future plans in a bit.

So, for all intents and purposes, Fwiktr is nothing but superglue right now. There's no real AI to speak of, it's a purely reactive behavioral system. Anything it does right is emergent, anything it does wrong is your fault for not understand the art it produces.

Now, I'm not the first person to do a twitter/flickr ma... mas... Ok, I can't bring myself to say that word. I'm not the first person to do a twitter/flickr combination before, there's some other neat ones out there:

Twitter+Flickr via Yahoo Pipes
Flussgeist 1 - Waiting - Twitter and Flickr and Video thru Flash. Warning: Will give you a nasty case of the ADDs
WPF Screensaver - If you run Vista...

What's Happened So Far

Here is the very first file Fwiktr ever put out (File format: Flickr Message, Parts of Speech Output, Tag list, First 100 pictures returned by tag search)

That file cost me a good 30 minutes of time, just because I didn't format it nicely and found myself cutting and pasting URLs everywhere just so I could see what was going on.

The first message:

"Got my new Russian neighbor drunk on margaritas. USA! USA! USA!"

The first URL gave me this image:

Oh yeah, this was gonna be good.

Second message:

"Laughing at Thomas' outburst in the maxipad section of the drug store: "Look Mum! All the vaginas are lined up in a neat row!" AH, 3. OY."

Image?

Ok, so, 2 for 2, but there was still the human element there, with me having to cut and paste the URLs. If Fwiktr was gonna work right, it would have to decide for itself which picture went with the message. So, it chooses a number between 1 and min(number of pictures returned, 100) and uses that image.

First automated message:

"Lovely spam, wonderful spa-a-m, Lovely spam, wonderful S Spam, Spa-a-a-a-a-a-a-am, Spa-a-a-a-a-a-a-am, SPA-A-A-A-A-A-A-AM, SPA-A-A-A-A-A-A-A"

First automated image:

I now firmly believe that God is communicating with me through Fwiktr.

Fwiktr has the capability to upload results on its own now, and is running a new image every 30 seconds at 30helensagree.com. The current file format is:

Message Text
Image
Treetagger Output
Tag string sent to flickr
Random index selected and total result count
First 100 images returned

It's definitely had some hits as well as some more artistic moments.

The Future of Fwiktr

Fwiktr has a long, long way to go. Right now, I'm content to revel in its emergence and work on a better interface for the display. But, I do have feature plans:

Use Python Image Library to overlay the message on the image
Far future: Do this in some interesting, artistic way.
Language Processing Tasks
Vary parts of speech used for tags
- Obviously, tags are not just nouns. People use verbs, adjectives, and other parts of speech. These should get picked up while still dropping parts of speech that are rarely used for tags.
Use neural nets and manual feedback (via rating system on the site) to train tag combination system
- When we pass tags to flickr, right now the only data Fwiktr knows is "I've got a pile of nouns, you do somnething with them!". If we could possibly say "Ok, I've got this set of nouns and this set of verbs and they go together with these boolean operations", we lose our chance for crazy emergence but possibly gain contextual search
An example of all of the above: In the first text file, one of the twitter messages was "At board walk. Got an ice cream.". Fwiktr did a good job of pull tags, and gave us back the list "board,walk,ice,cream". Unfortunately, when this pile was passed to flickr, it saw "board OR walk OR ice OR cream", and gave us back ANY images with tagged with board, walk, ice, or cream in them, which means, that we got lots of winter pictures of iced over trees, too. If Fwiktr knew to pass "boardwalk AND icecream", then we'd have gotten back 48 completely contextual images versus many thousand hits and misses. Of course, the AI between "board,walk,ice,cream" and "boardwalk AND icecream" is a mighty step.
Train up specific tagger for Fwiktr versus using treetagger
- Natural Language Processing programs are trained off of what are called "Corpora", which is the plural of "corpus". Basically, it's text that's been marked up to show different grammatical structures and parts of speech to make it easy to process. Unfortunately, most of the big, usable corpora are stuck with HUGE, unusable costs to get to, because they're used by academic institutions who can afford them. Now, I'm not sure I really need to get able to hit all of treebank to get this working correctly, but I don't know if I'm gonna have much luck working with just the Brown Corpus either. Still lots to learn in this area, but I'm rather eager.

So, that's it for right now. Fwiktr is slaving away throwing new files up on the site, so keep an eye out. I hope to have a nice auto-reloading interface up by tomorrow.

Off to sunny Georgia, where I'll be speaking at the GDX Conference on Creativity in Second Life. Should be a fun trip!

DesktopMap

Originally uploaded by qdot76367.

In the interest of seeing what happens when I build my own usability interfaces, I put my daily development setup into a mindmap, documenting keystrokes required to get to things, number of sub-contexts each application forms (either fixed, range, range+ which means range is normal but can be more, or ? which means god only knows).

This setup seems completely natural to me, but looking at it this way, it's interesting to see what could be changed and optimized to either be lower in the tree or accessed differently.

I suppose one post every 6 months isn't too bad. All settled in at Linden Lab.

Conferences in the near future:

Game Developers Conference 2007, Moscone Center, San Francisco, CA, March 5-9, 2007
SXSW 2007 Interactive, Austin, TX, March 9-13, 2007
GDX, Savannah, GA, April 26-27, 2007
SIGCHI 2007, San Jose, CA, April 28-May 3, 2007

IMG_4298 Originally uploaded by qdot76367.

Weekend project, while not quite as done as I wanted it to be, was still a success!

Made a USB reference implementation using the USBTiny firmware only USB 1.1 Low-Speed driver. Haven't actually gotten to do much with it outside of putting the board together and running a couple of python (which I'm learning to love quickly) test scripts. Still, it's nice to have some sort of USB connection without having to go through FDTI, though I suppose I could always do something zany like move to a decent, higher speed 32-bit platform too. But for now, this works.

usbref Originally uploaded by qdot76367.

Fwiktr v0.1

Off to GDX!

I thought about my space, and it really got me down

I'm alive, I think

USB Reference Implementation Using LEDs