Silverlight

Microsoft’s Silverlight has hit their 1.0 RC milestone. In case this one has slipped under your radar, here’s how MS describes the technology:

Silverlight is a cross-browser, cross-platform plug-in for delivering the next generation of Microsoft .NET–based media experiences and rich interactive applications for the Web.

From what I gather, it’s essentially an alternative to Flash. Surprisingly for a product from MS, it’s available for Mac OS X and eventually for Linux as well as Windows.

I’m not going to claim the slightest expertise here. This is a case of a technology floating around and me hearing about it without paying much attention until suddenly it gets in my face. I hope to dig into it a bit more and if/when I do I’ll report my findings.

Another day, another widget…

Today I’m testing out Lijit. It’s kind of a personal search engine that gathers content from your blog, all kinds of social networking sites you might be a part of, bookmarking sites, etc.

At least, that’s my understanding. I generally sign up for stuff like this first, then go back and understand it later.

Of course, I’ve got all kinds of links on here to places I signed up for then never used… for instance my del.icio.us bookmarks haven’t been updated in years.

Lijit Search

Well, it’s interesting but certainly not perfect. I tried searching on “Skellig” which is the name of a book I reviewed here some time ago. Search came up empty. Going to google and searching on site: dragonchasers.com skellig does return results so Google has spidered me.

Ah well, what do you want for nothin, right?

Med.ium Widget

I just signed up for Me.dium, which is kind of part social network, part cyber-stalker-tool. I jest, of course, but what it does is let you see where on the web your friends are, and let’s you chat with them.

Honestly I’m not sure what the experience is going to be like, so I won’t talk a lot about it yet. But I got a “Widget” centered around this site and I wanted to try it out, so here it is:

OK, so there it is. But what does it mean? What is the relationship that Med.ium is finding between this site and the ones that ‘surround’ it?

More research required…

Blueprint CSS

Warning: Incoming Geek Post

Blueprint is a CSS Framework that’s supposed to help you jump-start the design of a web page. As I’ve spent most of my web development years doing back-end code and not much worrying about display issues, it looks to be a good tool for me to use. Haven’t tried it yet, though.

Darren from gamemakker.co.uk has a post about his initial experiences with Blueprint that is well worth reading.

Captchas again

A while back I wrote about an article on using CAPTCHAs to digitize books.

Well it looks like the technology has been launched!. I just encountered it when signing up (last one into the pool as always) for a Facebook account.

Anyway in my original post I speculated on how this would work. Well, this new post is just to pat myself on the back, because this is from the ReCaptcha site, and it’s pretty close to what I’d guessed.

But if a computer can’t read such a CAPTCHA, how does the system know the correct answer to the puzzle? Here’s how: Each new word that cannot be read correctly by OCR is given to a user in conjunction with another word for which the answer is already known. The user is then asked to read both words. If they solve the one for which the answer is known, the system assumes their answer is correct for the new one. The system then gives the new image to a number of other people to determine, with higher confidence, whether the original answer was correct.

There’s a lot of smart people in the world but I’m generally not one of them, so when I get something right its cause for celebration! 🙂

Crunch time

So I’m in crunch time for the first time in a long, LONG time. I worked on a project from 9 am to 11 pm today, and tomorrow will be pretty much the same.

And y’know, it feels great. After so long of being a fixer or a cleaner-upper, I’m working on a project that I totally own from start to finish. Of course, it has a crazy short development cycle, ergo the crunch time. But I think I set myself up to work best under pressure way back when I was a teenager working in restaurant kitchens. I learned to love the pressure then, and I think because of that I still do my best work when the chips are down and the deadline is looking impossible.

My biggest problem is the part of me that procrastinates; I think its my subconscious trying to create those high-pressure situations.

Anyway, rambling. But today was the first time in a long time that I truly felt good about something I was building, since I’m building it my way. No one to blame but me if it fails. No one to take the credit if its a hit. 🙂

Another article published

I’ve been lax about posting when I get stuff published. Not that I’ve had a lot lately, but anyway…

Life beyond Google: Do alternative search engines measure up?, published at Computerworld.com.

I never know what’s going to happen when I submit to CW. Some of the editors give my stuff the Computerworld treatment which tends to bleach out any of my personality I’ve put into it, others leave my weirdness intact. I understand the bleaching process… they’re trying to project a consistent voice. But it’s strange to read one of these pieces and think “Wait…I wrote this? This doesn’t sound like me!”

Anyway, this one is more or less intact… probably not a thrilling read for most of my friends, though.

Putting CAPTCHAs to good use

Much of the info in this post is from an Associated Press article I read at CNN: Web registration tool digitizes books

So y’know those CAPTCHA things? Where you’re registering for a website, or adding a comment to a blog, and you have some squiggly letters that you have to type in to prove that you’re a human and not a bot?

Well, in a quasi-twist on Folding@Home and other distributed computing applications, the folks at Carnegie Mellon University are working on a way that will put your CAPTCHA typing to good use. Let’s call it a distributed keyboarding application. 🙂 They’ve estimated that 60 million CAPTCHAs are typed in every day, at an estimated 10 seconds per CAPTCHA. Do the math and it come out to 166,667 hours/day spent typing these things in.

Meanwhile, over at the Internet Archive they’re busily scanning images of book pages for import via OCR. But Luis von Ahn, assistant professor of computer science at Carnegie Mellon and one of the developers of the CAPTCHA system, says that some books can’t be read by OCR systems, due to their age or the condition of the text.

So the new idea is to scan in the pages of these books, use software to break those images up into many tiny images each containing a word, and using these images as the CAPTCHA ‘test images’. Track the results as users type in the resulting word, and when enough of them agree, the computer accepts that this particular image represents this particular word. Over time, the text of an unscannable book will be rebuilt by people registering for web sites. They’re calling this the “reCAPTCHA” system.

And that’s where the article leaves off, but I’m still trying to figure out how this would work. If I’m sending out these unscannable images, how does the registration system know the user is typing in the right word? My best guess is that the article is wrong and the images aren’t of single words, but of pairs of words, one of which has been deciphered (or more likely, the CAPTCHA displayed to the user is 2 ‘words’ long, one of which is provided by the CAPTCHA system and the other is the unknown word). The ‘Turing test’ to see if it’s a real person only uses the first word. The second word is used by this new system to try to scan in books. If this is the case, we’re not really harnessing energy already being expended, but instead adding to the work done by CAPTCHA users.

The only other system I can imagine is one where the CAPTCHA input is sent back to a central database in real time. As a new word/image goes out, it lets everyone in…the input test is in effect a bluff since there’s no data on what word the image represents. After, say, 500 people have responded to that word/image, the system starts to get a good idea of what the word is. At least it’ll be seeing some common letter positions at that point, and then it can start doing a pass/fail on the input from the user. Of course, using this method, a system that gets a ‘fresh’ image from the reCAPTCHA system isn’t really being protected from bots or spammers. On the other hand, the bot/spammer doesn’t know its a fresh image. (Do bots & spammers even try to spoof CAPTCHA systems, I wonder?)

So, assuming that the much-smarter-than-me people at Carnagie Mellon haven’t come up with a better system, the new reCAPTCHA system either adds a bit to the workload of CAPTCHA users, or it slightly compromises the security of the systems using it. But in either case the drawbacks are pretty minimal, and the good work being done is pretty significant. I’m looking forward to the day the system gets put into practical use!

Review: Netgear’s Dual-Mode Skype Phone

My most recent published article. Computerworld edits to present a consistent voice, which means reading it doesn’t really sound like me, but since I haven’t had much to post at the blog lately I figured I might as well mention it.

Review: Netgear’s elegant VoIP/land-line hybrid phone

I was pretty impressed with this phone. If I didn’t already have Vonage I’d consider signing up for a year of “SkypeOut” calling and do away with my landline. If you haven’t tried Skype, well, it really rocks. It makes services like TeamSpeak and Ventrillo seem like child’s toys.