The state of the union is strong.
I’m pretty excited. I’ve been heads-down the past few weeks working towards the official 1.0 release of the Serendeputy application. Probably a few more weeks to go.
Serendeputy is really four applications. Each of them are in pretty good shape.
The librarian understands the world. Its role is to read all the news sites, parse them and figure out what they’re talking about and what’s relevant right now. This has been running stably for eighteen months now, but I’m most excited about something I put in a couple of months ago — the sidecar.
The sidecar allows me to attach snippets (or pages) of code to any element in the librarian system, be it a source, an author, a feed, a tree or a topic. This lets me do little things like fixing the author tags on the New York Times articles, and bigger things like programmatically disambiguating the different meanings of “windows.” This sidecar functionality is really letting me fine-tune the librarian, which makes the overall results appear much more on-target for all the users. The machine learning pieces of the librarian are tremendously helpful, but nothing’s better than being able to hand-tune when I need to perfect one particular thing.
Hooray for Ruby Metaprogramming.
The deputy understands you. The deputy is the real personalization engine that takes all the gestures you give the system and builds them into a profile. It then reads all the librarian’s indexes to find just the right articles for you. The core deputy technology has been in place for more than a year, so I’ve mostly been fine-tuning it.
I’ve been mostly focusing on timeliness boosts and sinks. If you read something on Bill Belichick, the deputy will interpret that as an interest for you and build it into your master profile. The key knob I’m twiddling now is how much should I boost it based on the fact that you’re reading it now? How long should this boost last? Should this end up swamping some of your longer-term but not recent interests? I’m having a fun time modeling all the different ways of doing this, but I’m still working on finding a reasonably optimal configuration.
The web application (at Serendeputy.com) is getting tweaked substantially over the next few weeks. I have a few interface things I’ve wanted to get at, and I’m bundling them into a big release. (Firefox + jQuery + way too many DOM nodes = sluggishness). I’m also building out the site-wide meta lists (most popular for everyone, etc.) and building out the interface for the tree navigation of the sources and topics. Ideally, these improvements will make it easier for people to find what they’re looking for.
The API application is almost done. This is the functionality that helps publishers take advantage of Serendeputy’s personalization engine on their own site. I’m working with several alpha/beta customers right now to get the right balance of functionality. It’s a lot of fun talking to folks back in the industry, and I’m glad that I have a decent amount of cred from my time at the Boston Globe, Abuzz and Amazon.
This will be one of the core revenue drivers of the site, so I’m excited to see it coming together. Revenue is helpful.
So, lots to do, but I think I’m making good (and fast) progress. Never underestimate the ability to understand the entire system. And make decisions quickly. Not a lot of bureaucracy here at Serendeputy world headquarters.
Mostly, I’m very excited that this thing I’ve built is started to really resemble what I had in my head.
If you have any comments or suggestions, please drop me a line!