kottke.org home archives + xml about kottke.org contact me
kottke.org - home of fine hypertext products

Web technology

Facebook is the new AOL

Earlier in the week, I made a comment in passing in a post about Vimeo:

you do know that Facebook is AOL 2.0, right?

A few people picked up on it and speculated what I might have meant by it. In reading those posts and poking around a bit, I found a post that Scott Heiferman made just after Facebook Platform launched in May:

While at Sony in 1994, I was sent to Virginia to learn how to build a Sony "app" on AOL (the #3 online service, behind Compuserve & Prodigy at the time) using AOL's proprietary "rainman" platform.

Fast forward to Facebook 2007 and see similarities: If you want access to their big base of users, develop something in their proprietary language for their people who live in their walled garden.

Scott pretty much nails it here. I've no doubt that Facebook is excited about their new platform (their userbase is big enough that companies feel like they have to develop for it) and it's a savvy move on their part, but I'm not so sure everyone else should be happy about it. What happens when Flickr and LinkedIn and Google and Microsoft and MySpace and YouTube and MetaFilter and Vimeo and Last.fm launch their platforms that you need to develop apps for in some proprietary language that's different for each platform? That gets expensive, time-consuming, and irritating. It's difficult enough to develop for OS X, Windows, and Linux simultaneously...imagine if you had 30 different platforms to develop for.

As it happens, we already have a platform on which anyone can communicate and collaborate with anyone else, individuals and companies can develop applications which can interoperate with one another through open and freely available tools, protocols, and interfaces. It's called the internet and it's more compelling than AOL was in 1994 and Facebook in 2007. Eventually, someone will come along and turn Facebook inside-out, so that instead of custom applications running on a platform in a walled garden, applications run on the internet, out in the open, and people can tie their social network into it if they want, with privacy controls, access levels, and alter-egos galore.

Update: I've clarified my AOL vs. Facebook thoughts here.

Twitter vs. Blogger redux

Regarding the Twitter vs. Blogger thing from earlier in the week, I took another stab at the faulty Twitter data. Using some educated guesses and fitting some curves, I'm 80-90% sure that this is what the Twitter message growth looks like:

Blogger vs. Twitter cumulative messages

Twitter cumulative messages

These graphs cover the following time periods: 8/23/1999 - 3/7/2002 for Blogger and 3/21/2006 - 5/7/2007 for Twitter. It's important to note that the Twitter trend is not comprised of actual data points but is rather a best-guess line, an estimate based on the data. Take it as fact at your own risk. (More specifically, I'm more sure of the general shape of the curve than with the steepness. My gut tells me that the curve is probably a little flatter than depicted rather than steeper.)

That said, most of what I wrote in the original post still holds, as do the comments in subsequent thread. Twitter did not grow as fast as the faulty data indicated, but it did get to ~6,000,000 messages in about half the time of Blogger. Here are the reasons I offered for the difference in growth:

1. Twitter is easier to use than Blogger was and had a lower barrier to entry.
2. Twitter has more ways to update (web, phone, IM, Twitterific) than did Blogger.
3. Blogger's growth was limited by a lack of funding.
4. Twitter had a larger pool of potential users to draw on.
5. Twitter has a built-in social aspect that Blogger did not.

And commenters in the thread noted that:

6. Twitter's 140-character limit encourages more messages.
7. More people are using Twitter for conversations than was the case with Blogger.

What's interesting is that these seeming advantages (in terms of message growth potential) for Twitter didn't result in higher message growth than Blogger over the first 9-10 months. But then the social and network effects (#5 and #7 above) kicked in and Twitter took off.

kottke.org banned from Technorati top 100?

Since swearing off Technorati a couple of years ago, I've been checking back every few months to see if the situation has improved. The site is definitely more responsive but their data problems seemingly remain, at least with regard to kottke.org; Google Blog Search gives consistently better results and easy access to RSS feeds of searches.

Technorati recently introduced something called the Technorati Authority number, which is a fancy name for the number of blogs linking to a site in the last six months. Curious as to where kottke.org fell on the authority scale, I checked out the top 100 blogs list. Not there, so I proceeded to the "Everything in the known universe about kottke.org" page where a portion of that huge cache of kottke.org knowledge was the authority number: 5,094. Looking at the top 100 list, that should put the site at #47, nestled between The Superficial and fishki.net, but it's not there. Technorati also currently states that kottke.org hasn't been updated in the last day, despite several updates since then and my copy of MT pinging Technorati after each update.

Maybe kottke.org has been intentionally excluded because I've been so hard on them in the past. Or maybe it's just a glitch (or two) in their system. Or maybe it's an indication of larger problems with their service. Either way, as the company is attempting to offer an authentic picture of the blogosphere, this doesn't seem like the type of rigor and accuracy that should send reputable media sources like the BBC, Washington Post, NY Times, and the Wall Street Journal scurrying to their door looking for reliable data about blogs.

Update: As of 3:45pm EST, the top 100 list has been updated to include kottke.org. The site also picked up this post right away, but failed to note a subsequent post published a few minutes later..

Coda

Panic has released Coda, a new web development app for OS X. Panic co-founder Cabel Sasser describes it thusly:

We build websites by hand, with code, and we've long since dreamed of streamlining the experience, bringing together all of the tools that we needed into a single, elegant window. While you can certainly pair up your favorite text editor with Transmit today, and then maybe have Safari open for previews, and maybe use Terminal for running queries directly or a CSS editor for editing your style sheets, we dreamed of a place where all of that can happen in one place.

Ever since I switched to a Mac, I've been seeking a suitable replacement/upgrade for Homesite. I limped along unsatisfied with BBEdit and am finally getting into the groove with TextMate, but the inter-app switching -- especially between the editor, FTP client, and the terminal -- was really getting me down. John Gruber has a nice preview/review of Coda:

Each of Coda's components offers decidedly fewer features than the leading standalone apps dedicated to those tasks. (With the possible exception of the terminal - I mean, come on, it's a terminal.) This isn't a dirty secret, or the unfortunate downside of Coda only being a 1.0. Surely Coda will sprout many new features in the future, but it's never going to pursue any of these individual apps in terms of feature parity.

The appeal of Coda cannot be expressed solely by any comparison of features. The point is not what it does, but it how it feels to use it. The essential aspects of Coda aren't features in its components, but rather the connections between components.

Panic's implicit argument with Coda is that there are limits to the experience of using a collection of separate apps; that they can offer a better experience - at least in certain regards - by writing a meta app comprising separate components than they could even by writing their own entire suite of standalone web apps. Ignore, for the moment, the time and resource limitations of a small company such as Panic, and imagine a Panic text editor app, a Panic CSS editor app, a Panic web browser, a Panic file transfer/file browser app - add them all together and you'd wind up with more features, but you'd miss the entire point.

Panic co-founders Steven Frank and Cabel Sasser both weigh in on the launch. Has anyone given Coda a shot yet? How do you find it? I'm hoping to find some time later today to check it out and will attempt to report back.

No One Belongs Here More Than You

Miranda July, who you might remember from her film Me and You and Everyone We Know, has a book coming out in May, a collection of stories called No One Belongs Here More Than You. The book has a web site that's one of the most effective and creative I've seen in a long time. Here's a screenshot of one of my favorite pages, just to give you a taste:

No One Belongs Here More Than You

The really intriguing thing about the site is that it breaks pretty much every rule that contemporary web designers have for effective site design. The site is a linear progression of images, essentially 30 splash pages one right after another. It doesn't have any navigation except for forward/back buttons; you can't just jump to whatever page you want. July barely mentions anything about the book and only then near the end of the 30 pages. There's no text...it's all images, which means that the site will be all but invisible to search engines. No web designer worth her salt would ever recommend building a site like this to a client.

Yet it works because the story pulls you along so well; July's using the site's narrative to sell a book that is, presumably, chock full of the same sort of narrative. If you think the site sucks and quickly click away, chances are you're not going to like the book either...it's the perfect self-selection mechanism. The No One Belongs Here More Than You site is a lesson for web designers: the point is not to make sites that follow all the rules but to make sites that will best accomplish the primary objectives of the site.

pocket

Adam and David recently reminded me of pocket, an episode of 0sil8 I did back in 2001 (the second-to-last episode actually):

pocket

pocket was a broadcast mailing list for mobile phones. People signed up and then I sent them SMS messages on their phones periodically. As I recall it only lasted a few weeks before I shut it down; there just didn't seem to be anything interesting about broadcasting short messages to a group of friends and strangers.

Public and permanent

Marc Hedlund, founder of the intriguing Wesabe, recently made this interesting observation:

One of my favorite business model suggestions for entrepreneurs is, find an old UNIX command that hasn't yet been implemented on the web, and fix that. talk and finger became ICQ, LISTSERV became Yahoo! Groups, ls became (the original) Yahoo!, find and grep became Google, rn became Bloglines, pine became Gmail, mount is becoming S3, and bash is becoming Yahoo! Pipes. I didn't get until tonight that Twitter is wall for the web. I love that.

A slightly related way of thinking about how to choose web projects is to take something that everyone does with their friends and make it public and permanent. (Permanent as in permalinked.) Examples:

  • Blogger, 1999. Blog posts = public email messages. Instead of "Dear Bob, Check out this movie." it's "Dear People I May or May Not Know Who Are Interested in Film Noir, Check out this movie and if you like it, maybe we can be friends."
  • Twitter, 2006. Twitter = public IM. I don't think it's any coincidence that one of the people responsible for Blogger is also responsible for Twitter.
  • Flickr, 2004. Flickr = public photo sharing. Flickr co-founder Caterina Fake said in a recent interview: "When we started the company, there were dozens of other photosharing companies such as Shutterfly, but on those sites there was no such thing as a public photograph -- it didn't even exist as a concept -- so the idea of something 'public' changed the whole idea of Flickr."
  • YouTube, 2005. YouTube = public home videos. Bob Saget was onto something.

Not that this approach leads naturally to success. Several companies are exploring music sharing (and musical opinion sharing), but no one's gotten it just right yet, due in no small measure to the rights issues around much recorded music.

Twitter

As I mentioned the other day, I recently joined Twitter. I've been poking around its nooks and crannies ever since. Here are some observations, presented in Twitter-sized chunks:

Playing with Twitter reminds me of blogging circa 2000. Back then, all weblogs were personal in nature and most people used them to communicate with their friends and family. If I wanted to know what my friends were up to back then, I read their blogs. Now I follow Twitter (and Flickr and Vox).

The reaction to Twitter mirrors the initial reaction to weblogs...the same tired "this is going to ruin the web" and "who cares what you ate for dinner" arguments.

Also like blogs, everyone has their own unique definition of what Twitter is (stripped down blogs, public IM, Dodgeball++, etc.), and to some extent, everyone is correct. Maybe that's when you know how you've got a winner: when people use it like mad but can't fully explain the appeal of it to others. See also: weblogs, Flickr.

For people with little time, Twitter functions like an extremely stripped-down version of MySpace. Instead of customized pages, animated badges, custom music, top 8 friends, and all that crap, Twitter is just-the-facts-ma'am: where are my friends and what are they up to?

Twitter's like Flickr without the images.

When one thing (i.e. Twitter) is easier than something else (i.e. blogging) and offers almost the same benefits, people will use it.

Twitter brings back the "type words in one box and press submit" thing that made Blogger so popular back in the day. Compare with current blogging systems. To publish a post in MT, I've got to fiddle with 7-9 different text boxes and options. See immediately above.

Let's not forget Dodgeball here, which was used extensively at SXSW in 2006. (In other words, all the Twittering at SXSW 2007 was not unprecedented. Chill.) It's more focused on location and SMS though...by allowing updates in more ways and being more flexible about the type of message allowed, Twitter is attractive to a wider group of people.

If your friends are not on Twitter, I can't imagine it would be that interesting.

Twitterholic tracks the top 100 Twitter users in terms of followers. I know, let's not turn absolutely everything on the web into a popularity contest!! We already know Scoble is a big blowhard and has weak ties to lots of people...let's move on, shall we?

I wonder what the average number of followers per person is? The folks with 5 zillion followers get all the attention, but as with blogging, those posting updates for their 20 friends form the bulk of the activity.

Lists of friends and followers are presented alphabetically. Does Anil attract more friends, on average, than Veen because he always shows up near the top of the listings?

I can see why Obvious dropped Odeo for Twitter. With podcasts, you've got all that data locked up in binary format (no easy cut-and-paste) and it takes you 20 listening minutes before you can react to it (by commenting, by linking, etc.). With blogs, the reaction time to a post is 1-2 minutes, with Flickr it's 5 seconds, and Twitter is 2-3 seconds. The barrier to entry for reacting to and remixing podcasts is just so much higher.

Twitter is the first thing on the web that I've been excited about in ages. Like years. The last thing was probably Flickr. (Talk about burying the lede.) It's just so damn simple but useful. Again, reminds me of weblogs in that way.

If you're on a Mac and using Twitter, download Twitterific, a little app that sits on your desktop and displays updates from your friends. My only complaint: it doesn't completely show updates, forcing you to the web to read the last 2-3 words of a longish message. Come on...it's only 140 characters, show them all!

Twittermap displays recent Twitter messages on Google Maps. All you do is send Twitter a message with your location -- like so...the "L:10003" is the important part -- and Twittermap will pick it up.

Even more mesmerizing is Twittervision...a world tour of recent Twitter messages. Just sit back and watch the updates come in one at a time, displayed on a world map. (This is in beta and Twitter's having some downtime issues right now, so the data may be less than fresh when you go.)

Twitter seems to work equally well for busy people and not-busy people. It allows folks with little time to keep up with what their friends are up to without having to email and IM with them all day. Those with a lot of time on their hands can spend a lot of time finding new people to follow, having back-and-forths with friends all day, and updating their status 40 times a day. Too many web apps fail because they only appeal to those with abundant free time.

I'm fascinated to see where Obvious takes this app once they get their scaling issues under control.

The default display of recent messages plus your own messages is genius. Makes it feel more like a conversation. The "with friends" display is great too...perfect for discovering other people to follow.

"Friends" still isn't the right word.

Men look at crotches

Among the many interesting things in Online Journalism Review's article about using eyetracking to increase the effectiveness of news article design is this odd result:

Always look crotch

Although both men and women look at the image of George Brett when directed to find out information about his sport and position, men tend to focus on private anatomy as well as the face. For the women, the face is the only place they viewed. Coyne adds that this difference doesn't just occur with images of people. Men tend to fixate more on areas of private anatomy on animals as well, as evidenced when users were directed to browse the American Kennel Club site.

That is absolutely fascinating. I'd love to hear an evolutionary biologist's take on why that is.

I'm also heartened by the article's first featured finding: that tighter writing, more white space, and jettisoning unnecessary imagery helps readers read faster and retain more of what they've read.

Gender diversity at web conferences

Every few months, the blogosphere addresses the matter of gender diversity of speakers at conferences about design, technology, and the web. The latest such incidents revolved around the lack of women speakers at the the Future of Web Apps conference in San Francisco last September1 and the Creativity Now conference put on by Tokion in NYC last October. Each time this issue is raised, you see conference organizers publicly declare that they tried, that diversity is a very important issue, and that they are going to address it the next time around.

With that in mind, I collected some information2 about some of the most visible past and upcoming conferences in the tech/design/web space. I'm reasonably sure that the organizers of these conferences were aware of at least one of the above recent complaints about gender diversity at conferences (they were both linked widely in the blogosphere), so it will be interesting to see if those complaints were taken seriously by them.

Future of Web Apps - San Francisco
September 13-14, 2006
0 women, 13 men. 0% women speakers.

Tokion Magazine's 4th Annual Creativity Now Conference
October 14-15, 2006
6 women, 30 men. 17% women speakers.

PopTech 2006
October 18-21, 2006
8 women, 30 men. 21% women speakers.

Web Directions North
February 7-10, 2007
5 women, 16 men. 24% women speakers.

LIFT
February 7-9, 2007
10 women, 33 men. 23% women speakers.

Future of Web Apps - London
February 20-22, 2007
1 woman, 26 men. 4% women speakers.

TED 2007
March 7-10, 2007
12 women, 41 men. 23% women speakers.

SXSW Interactive 2007
March 9-13, 2007
147 women, 378 men. 28% women speakers.
164 women, 373 men. 31% women speakers. (updated 2/22/2007)
165 women, 379 men. 30% women speakers. (updated 3/31/2007)

BlogHer Business '07
March 22-23, 2007
43 women, 0 men. 100% women speakers.

An Event Apart Boston 2007
March 26-27, 2007
1 woman, 8 men. 11% women speakers.

O'Reilly Emerging Technology Conference
March 26-29, 2007
9 women, 44 men. 17% women speakers.
12 women, 79 men. 13% women speakers. (updated 3/31/2007)

Web 2.0 Expo 2007
April 15-18, 2007
17 women, 91 men. 16% women speakers.

Future of Web Design
April 18, 2007
2 women, 12 men. 14% women speakers.
4 women, 16 men. 20% women speakers. (updated 3/31/2007)

GEL 2007
April 19-20, 2007
2 women, 11 men. 15% women speakers.
1 woman, 16 men. 6% women speakers. (updated 3/31/2007)

MIX07
April 30 - May 2, 2007
0 women, 4 men. 0% women speakers.
8 women, 89 men. 8% women speakers. (updated 3/31/2007)

The New Yorker Conference 2007
May 6-7, 2007
3 women, 21 men. 13% women speakers. (updated 2/28/2007)
6 women, 29 men. 17% women speakers. (updated 3/31/2007)

Dx3 Conference 2007
May 15-18, 2007
5 women, 48 men. 9% women speakers. (updated 3/2/2007)
5 women, 70 men. 7% women speakers. (updated 3/31/2007)

An Event Apart Seattle 2007
June 21-22, 2007
0 women, 9 men. 0% women speakers.
1 women, 9 men. 10% women speakers. (updated 3/31/2007)

From this list, it seems to me that either the above concerns are not getting through to conference organizers or that gender diversity doesn't matter as much to conference organizers as they publicly say it does. The Future of Web Apps folks seem to have a particularly tin ear when it comes to this issue. For their second conference, they doubled the size of the speaker roster and added only one woman to the bill despite the complaints from last time. This List of Women Speakers for Your Conference compiled by Jen Bekman is a little non-web/tech-heavy, but it looks like it didn't get much use in the months since its publication. Perhaps it's time for another look. (If you think this issue is important, Digg this post.)

Update: To the above list, I added An Event Apart Boston 2007 and corrected a mistake in the count for GEL 2007 (they had one more woman and one less man than I initially counted.) Ryan Carson from Carson Systems, the producers of The Future of Web Apps conferences, emailed me this morning and said that my "facts just aren't correct" for the count for their London conference. He stated that the number of speakers they had control over was only 13. Some of the speakers were workshop leaders (the workshops "are very different" in some way) and others were chosen by sponsors of the conference, not by Carson Systems. I'm keeping the current count of 27 total speakers as listed on their speakers page this morning...they're the people they used to promote the conference and they're the people at the conference in the front of the room, giving their views and leading discussions for the assembled audience. (thx, erik, mark, and ryan)

Update: I added the Future of Web Design conference to the above list. (thx, jeff)

Update: Hugh Forrest wrote to update me on the latest speaker numbers for SXSW Interactive 2007 (he keeps close watch on them because the issue is an important and sensitive one to the SXSW folks)...the ones on their site were less than current. In cases where counts are updated (and not inaccurate due to my counting errors), I will append them to the conference in question so that we can see trends. I plan to update the above list periodically, adding new conferences and keeping track of the speaker numbers on upcoming ones.

[1] Sadly, when I Googled "future of web apps women" while doing some research for this post, Google asked "Did you mean: 'future of web apps when'"?

[2] All statistics as of 2/22/2007. Consider the gender counts rough approximations...in some cases, I couldn't tell if a certain person was a man or a woman from their name or bio.

[3] This conference has released only a very incomplete speaker list.

Fotolog overtaking Flickr?

Quick! Which photo sharing site community thingie is more popular: Fotolog or Flickr? You might be surprised at the answer...but first some history.

Fotolog launched in May 2002 and grew quite quickly at first. They'd clearly hit upon a good idea: sharing photos among groups of friends. As Fotolog grew, they ran into scaling problems...the site got slow and that siphoned off resources that could have been used to add new features to the site, etc. Problems securing funding for online businesses during the 3-4 years after the dot com bust didn't help matters either.

Flickr launched in early 2004. By the end of their first year of operation, they had a cleaner design than Fotolog, more features for finding and organizing photos, and most of the people I knew on Fotolog had switched to Flickr more or less exclusively. They also had trouble with scaling issues and downtime. Flickr got the scaling issues under control and the site became one of the handful of companies to exemplify the so-called Web 2.0 revitalization of the web. The founders landed on tech magazine covers, news magazine covers, and best-of lists, the folks who built the site gave talks at technology conferences, and the company eventually sold to Yahoo! for a reported $30 million.

Fotolog eventually got their scaling and funding issues under control as well, but relative to Flickr, the site has changed little in the past couple of years. Fotolog has groups and message boards, but they're not done as well as Flickr's and there's no tags, no APIs, no JavaScript widgets, no "embed this photo on your blog/MySpace", and no helpful Ajax design elements, all supposedly required elements for a successful site in the Web 2.0 era. Even now, Fotolog's feature set and design remains planted firmly in Web 1.0 territory.

So. Then. Here's where it gets puzzling. According to Alexa1, Fotolog is now the 26th most popular site on the web and recently became more popular than Flickr (currently #39). Here's the comparison between the two over the last 3 years:

Alexa - Fotolog vs. Flickr

This is a somewhat stunning result because by all of the metrics held in high esteem by the technology media, Web 2.0 pundits, and those selling technology and design products & services, Flickr should be kicking Fotolog's ass. Flickr has more features, a better design, better implementation of most of Fotolog's features, more free features, critical praise, a passionate community, and access to the formidable resources & marketing power of Yahoo! And yet, Fotolog is right there with them. Perhaps this is a sign that those folks trapped in the Web 2.0 bubble are not being critical enough about what is responsible for success on the Web circa-2007. (As an aside, MySpace didn't really fit the Web 2.0 mold either, nobody really talked about it until after it got huge, and yet here it is. And then there's Craigslist, which is more Web 0.5 than 2.0, and is one of the most popular sites on the web. Google too.)

What's going on here then? I can think of three possibilities (there are probably more):

1. Fotolog is very popular with Portugese and Spanish speakers, especially in Brazil. According to Wikipedia, almost 1/3rd of all Fotolog users are from Brazil and Chile. In comparing the two sites, what could account for this difference? Fotolog has a Spanish language option while Flickr does not (although I'm not sure when the Spanish version of Fotolog launched). Flickr is more verbose and text-intensive than Fotolog and much of Flickr's personality & utility comes from the text while Fotolog is almost text-free; as a non-Spanish speaker, I could navigate the Spanish-language version quite easily. Gene Smith noted that a presentation made by a Brazilian internet company said that "Flickr is unappealing to Brazilians because they want to the customize the interface to express their individual identities".

Cameron Marlow noticed that Orkut is set to pass MySpace as the world's most popular social networking site (Orkut is also very popular in Brazil), saying that "Orkut's growth reinforces the fact that the value of social networking services, and social software in general, comes from the base of active users, not the set of features they offer". Marlow also notes that Alexa's non-US reporting has improved over the past year, which might be the reason for Fotolog's big jump in early 2006. If Alexa's global reporting had been robust from the beginning, Fotolog may have been neck and neck with Flickr the whole time.

2. Flickr is more editorially controlled than Fotolog. The folks who run Flickr subtly and indirectly discourage poor quality photo contributions. Yes, upload your photos, but make them good. And the community reinforces that constraint to the point where it might seem restricting to some. Fotolog doesn't celebrate excellence like that...it's more about the social aspect than the photos.

3. Maybe tags, APIs, and Ajax aren't the silver bullets we've been led to believe they are. Fotolog, MySpace, Orkut, YouTube, and Digg have all proven that you can build compelling experiences and huge audiences without heavy reliance on so-called Web 2.0 technologies. Whatever Web 2.0 is, I don't think its success hinges on Ajax, tags, or APIs.

Update: You can see how much Fotolog depends on international usage for its traffic from this graph from Compete. They only use US statistics to compile their data. I don't have access to the Comscore ratings, but they only count US usage and, like Alexa, undercount Firefox and Safari users. (thx, walter)

[1] Usual disclaimers about Alexa's correctness apply. The point is that among some large amount of users, Fotolog is as popular (or even more) than Flickr. Whether those users are representative of the web as a whole, I dunno.

How design works

Michael Bierut on his design process, written in plain language that the client never gets to hear (but maybe they should):

When I do a design project, I begin by listening carefully to you as you talk about your problem and read whatever background material I can find that relates to the issues you face. If you're lucky, I have also accidentally acquired some firsthand experience with your situation. Somewhere along the way an idea for the design pops into my head from out of the blue. I can't really explain that part; it's like magic. Sometimes it even happens before you have a chance to tell me that much about your problem! Now, if it's a good idea, I try to figure out some strategic justification for the solution so I can explain it to you without relying on good taste you may or may not have. Along the way, I may add some other ideas, either because you made me agree to do so at the outset, or because I'm not sure of the first idea. At any rate, in the earlier phases hopefully I will have gained your trust so that by this point you're inclined to take my advice. I don't have any clue how you'd go about proving that my advice is any good except that other people - at least the ones I've told you about - have taken my advice in the past and prospered. In other words, could you just sort of, you know...trust me?

It is like magic. Reminds me of something Jeff Veen wrote last year on his process:

And I sort of realized that I do design that way. I build up a tremendous amount of background data, let it synthesize, then "blink" it out as a fully-formed solution. It typically works like this:

- Talk to everybody I possibly can about the problem.
- Read everything that would even be remotely related to what I'm doing. Hang charts, graphs, diagrams, and screenshots all over my office.
- Observe user research; recall past research.
- Stew in it all, panic as deadline approaches, stop sleeping, stop eating.
- Be struck with an epiphany. Instantly see the solution. Curse my tools for being too slow as I frantically get it all down in a document.
- Sleep for three days.

Like I said when I first read Jeff's piece, in my experience, a designer gets the job done in any way she can and then figures out how to sell it to the client, typically by coming up with an effective (and hopefully at least partially truthful) backstory that's crammed into a 5-step iterative process, charts of which are ubiquitous in design firm pitches.

Web 2.0, a definition of sorts

37signals recently polled the customers of their online project management application and one of the questions asked what Web 2.0 meant to them. They've posted 500 answers to that question on their site; it's an interesting read. I decided to do a quick and dirty analysis of the most frequently used words by the respondents, hoping that the result would provide a collective definition of sorts for the term Web 2.0. By the time I'd finished (with several timeouts and distractive blog-related detours), I went back to the thread and saw that Jacob Kaplan-Moss had already completed an analysis. Here are his top 15 words:

web - 348
ajax - 107
applications - 93
new - 78
user - 71
apps - 44
desktop - 40
sites - 37
people - 36
internet - 36
content - 34
think - 33
software - 31
services - 30
technologies - 29

Just for kicks, here's my top 30:

web: 347
ajax: 105
more: 99
applications: 92
new: 77
user: 69
use: 47
apps: 43
desktop: 39
sites: 38
internet: 35
people: 35
content: 33
think: 32
software: 30
using: 30
etc: 29
services: 29
next: 28
technologies: 28
interactive: 28
generation: 27
application: 25
marketing: 25
websites: 23
better: 23
social: 23
users: 22
hype: 22
buzzword: 21
interfaces: 20

For some reason (my shoddy programming skills are a likely culprit), my word counts are slightly different than Jacob's, but they're close. I also left in a few words that he removed but that I thought were relevant, like "more", "use", "using", and "etc". Here are a few more interesting words and their frequency counts:

community: 17
collaboration: 13
companies: 13
bubble: 10
ruby: 9
rounded: 9
gradients: 8
rails: 7
37signals: 6
tagging: 6
flickr: 5
wikis: 5
overused: 5
o'reilly: 5
hyped: 5
overhyped: 5

Not sure this provides much of a definition, but it's fun to play around with.

Big ol' obvious caveat: I performed a straight-up word frequency analysis which did not take into account the context of particular words (e.g. no distinction between different uses of words like "think": "I think Web 2.0 sucks" and "Web 2.0 products make users think"), phrase frequency ("web 2.0", "next generation", "rounded corners"), or anything like that. This obviously limits the utility of the analysis; hence "quick and dirty".

Update: Perhaps a better "definition" of Web 2.0 comes from the related tags at del.icio.us:

ajax
tools
web
blog
webdesign
software
design
social
programming
javascript
business

Not bad.

Update: del.icio.us did this analysis back in November 2005. (thx, maciej)

IndieKarma. Micropayments that work?

I got an email weeks ago urging me to look at a new micropayment system called IndieKarma. Pretty much every other micropayment scheme I've seen is too clunky to actually be useful, but I was pleasantly surprised with IndieKarma when I got around to checking it out. Here's how it works.

If you're a blogger or web site owner, you sign up, put a bit of JavaScript code on your site, and whenever a reader who's signed into IndieKarma visits your site, you get a penny. Seamless and easy.

If you're a reader, you sign up, put some money into your IndiePass account (with PayPal), and then as long as you're signed in, whenever you visit a site that's using the IndieKarma JavaScript, a penny is deducted from your account and into the site owner's account. Again, fairly seamless and easy.

What I love about this system is that it's passive and based on actual usage. The reader doesn't need to decide that they want to support a certain site, just that they want to support the IndieKarma-enabled sites they read often. For a reader who doesn't necessarily want to support a certain site, if they happen to click through for a visit, it only costs them a penny and then they never come back.

Financially, if a reader visits a site 60 times a month (which is not that unusual for weblogs), that's $0.60/mo. or $7.20/yr...the price of a couple lattes at Starbucks. If you've got 1000 people who read your site that are signed up through IndieKarma, that's $7200 per year, a sizable chunk of change.

So that's the good part. Here are some problems with IndieKarma and some suggested features:

  • The "dock" that's placed on the site is way too intrusive and inflexible. Ad banners and boxes are well-established as a way of delivering this type of information...why not use that format? When a reader isn't logged in to IndieKarma, the ad banner/box prompts them to do so and if they're logged in, they get a "receipt" message for their micropayment (e.g. "thanks for supporting the site). Optionally, as a site owner, I should be able to not have the banner show at all for a truly seamless experience for the reader. The easier you make it to pop into a sidebar for bloggers and site owners, the better.
  • Lack of variable pricing. As a reader, I might want to give more or less money per visit to certain sites. I may decide to spend ~$20/yr on my Waxy.org habit and so opt to give three cents per visit instead of one. As a site owner, I should be able to set a suggested and/or minimum cost/visit for my site. If I've got 1000 people giving 3 cents/visit, they each visit 60 times per month for a whole year, that's $21,600, a living wage (depending on where you're living).
  • Alternate payment methods. Readers could buy "subscriptions" to sites for a "buy now" price determined by the site's owner. Or an option for "gosh, that post/video/comic was really good today so here's an extra $5" payments.
  • You could even incorporate advertising into the mix. An advertiser could come along and say, "I'm going pay for unlimited free visits to this site for IndieKarma members for 60 days" and in exchange, the IndieKarma banner is replaced with an ad for that advertiser.

But the big problem with IndieKarma (which I hope they can overcome somehow) is that it's one of those things that's only useful when there's a lot of people using it. As a reader, if only 1 or 2 sites I read are using IndieKarma to generate revenue, I don't have much incentive to go through the sign-up process, but if there are 30 or 40 sites I read that are using it, I'd be much more likely to sign up. Same goes for site owners...if 10 of my readers are using IndieKarma, that's not good, but if 1000 of them are using it, that's something.

It's a chicken and egg problem...you need users to get sites to sign up and you need sites to get users to sign up. This would work much better for someone who already has tons of signed-in users and payment systems (Amazon, PayPal, Google, etc.), established networks of sites that have lots of potential users across many similar sites (Gawker, BlogAds, 9Rules, The Deck, etc.), or really big sites that could sign users up in 4+ digit quantities (Slashdot, MySpace, LiveJournal, Drudge, HuffPo, etc.). Like I said, I hope IndieKarma can overcome this problem because I think the basic idea has a lot of promise to provide an alternative to advertising-supported media, both from the standpoint of readers and web site owners.

BitTorrent = launching a mission to Mars

Must be something in the water today...Paul Boutin has a story on Slate today that makes the same point about BitTorrent, YouTube, and Google Video that I did this morning (although somewhat more succinctly and entertainingly):

The guys behind YouTube hit the sweet spot. Most important, they made it head-slappingly easy to publish and play video clips by handling the tricky parts automatically. Given up on BitTorrent because it feels like launching a mission to Mars? If you've sent an e-mail attachment, you've got the tech skills to publish on YouTube.

The final paragraph of the article contains this interesting bit:

The same Alexa plots that show MySpace and YouTube obliterating top sites reveal that Flickr, Digg and del.icio.us have plateaued with audiences barely bigger than Slate's. Photos, news, and other people's bookmarks just aren't as interesting as bootleg TV and checking out the hotties. The easier it gets to use, the less geeky the Net becomes, and the more it starts to look like real life.

Expect more bootleg TV and hotties from kottke.org in the future...I need some Alexa love.

BitTorrent, YouTube, and Google Video

The other day I realized that within my little online social circle, there's been a lot less mention of BitTorrent lately. It used to be that someone would link to a cool video, the site hosting the video file would go down because of high traffic, and then someone who grabbed the video before the outage would put it up on a torrent site so that everyone could see it again.

And then YouTube and Google Video came along. They offered free hosting and fast (free) bandwidth for videos so when people want to put some neat video of something on their sites, they just slapped it on YT or GV and pointed to it. And more important to the point about BitTorrent, they work completely within the browser environment. You upload videos to YT in the browser (GV has a standalone app for uploading) and the Flash-based viewer works in the browser (most Web users have Flash installed). They offered a seamless end-to-end solution to finding and watching videos all in one application.

Compare that with how you typically watch a video with BT. First you download a torrent file, then open that file up in your BT client (which you need to have previously downloaded and installed), then the file downloads, and finally you open that file in a media player, generally QuickTime, Windows Media Player, or some other player that needs to be downloaded and installed...and hopefully you have the right versions and codecs for the video in question. And that's just the viewing side of things...publishing videos via BT was even more difficult, particularly for non-technical folks.

That BitTorrent took off at all is a testament to the utility of downloading files from multiple sources simultaneously, but it's also telling that once an easier-to-use alternative came along that offered many of the key advantages of BT, people switched1...and really quickly too. Eventually BT will have to find its way into the browser (AllPeers is promising a Firefox extension that will do just that) and somehow overcome the multiple media players problem in order to find success.

[1] For videos of the type I'm talking about anyway. BT is by no means unpopular these days, particularly for feature-length movies, lossless music files, and other really large files. YT and GV are only taking BT's "marketshare" in the realm of short video.

Slashlinks

Ben Engebreth, a compadre of mine at the Eyebeam OpenLab, has released Slashlinks, a tool for automatically mirroring links from del.icio.us to your personal web site. At first glance it might sound like a simple archiving tool, a way to get your data out of del.icio.us, but what it actually does is reproduces your del.icio.us links on your web site.

Check out Ben's links for an example. If you click on a tag name, you can see that not only the links but the underlying tag structure has been reproduced locally. Once the links are on your site, you can style them how you wish (as Ben has), publish them where you want, etc. And Slashlinks will also keep your local links fresh...if you keep using the publishing tools at del.icio.us to add links, they will automagically show up on your site.

Book author to her publishing company: your lawsuit is not helping me or my book

I got an email this morning from a kottke.org reader, Meghann Marco. She's an author and struggling to get her book out into the hands of people who might be interested in reading it. To that end, she asked her publisher, Simon & Schuster, to put her book up on Google Print so it could be found, and they refused. Now they're suing Google over Google Print, claiming copyright infringement. Meghann is not too happy with this development:

Kinda sucks for me, because not that many people know about my book and this might help them find out about it. I fail to see what the harm is in Google indexing a book and helping people find it. Anyone can read my book for free by going to the library anyway.

In case you guys haven't noticed, books don't have marketing like TV and Movies do. There are no commercials for books, this website isn't produced by my publisher. Books are driven by word of mouth. A book that doesn't get good word of mouth will fail and go out of print.

Personally, I hope that won't happen to my book, but there is a chance that it will. I think the majority of authors would benefit from something like Google Print.

She has also sent a letter of support to Google which includes this great anecdote:

Someone asked me recently, "Meghann, how can you say you don't mind people reading parts of your book for free? What if someone xeroxed your book and was handing it out for free on street corners?"

I replied, "Well, it seems to be working for Jesus."

And here's an excerpt of the email that Meghann sent me (edited very slightly):

I'm a book author. My publisher is suing Google Print and that bothers me. I'd asked for my book to be included, because gosh it's so hard to get people to read a book.

Getting people to read a book is like putting a cat in a box. Especially for someone like me, who was an intern when she got her book deal. It's not like I have money for groceries, let alone a publicist.

I feel like I'm yelling and no one is listening. Being an author can really suck sometimes. For all I know speaking up is going to get me blacklisted and no one will ever want to publish another one of my books again. I hope not though.

[My book is] called 'Field Guide to the Apocalypse' It's very funny and doesn't suck. I worked really hard on it. It would be nice if people read it before it went out of print.

As Tim O'Reilly, Eric Schmidt, and Google have argued, I think these lawsuits against Google are a stupid (and legally untenable) move on the part of the publishing industry. I know a fair number of kottke.org readers have published books...what's your take on the situation? Does Google Print (as well as Amazon "Search Inside the Book" feature) hurt or help you as an author? Do you want your publishing company suing Google on your behalf?

Working offline

Back when I wrote about how a WebOS might work (basically XHTML/JS web apps that run on the desktop as well), I got a lot of responses along the lines of: with internet access becoming more ubiquitous (broadband, wifi, wireless broadband, WiMax, etc.), there will be less and less need for applications that don't need a connection to the network to function. When you can literally get a fast, cheap internet connection anywhere, you don't need a version of Gmail that works offline and so that's not going to drive the development of this WebOS thing you're talking about.

I've been thinking for several weeks about why I think that's wrong and I've come up with a couple ideas.

1. Fast, cheap internet everywhere? Hoo boy, wake me when that happens...you'll likely find me driving my hydrogen-powered hovercar with ESP to my paperless office.

2. For many people, the more you get used to having access to your applications/data/etc., the more important that access becomes. Let's say 98% of the applications you use are entirely on the web (with no offline capabilities) and you're online almost all the time wherever you go. Then the network winks out for 1/2 an hour. Or Salesforce.com is down for a couple hours. That last little inch is going to be painful. And no use telling me that sounds insane because I've seen the madness and fear in people's eyes while they clutch their Crackberries, furiously reading email mere minutes away from the office and the full-speed, full-screen experience.

3. The offline thing is a good way for companies to bootstrap the WebOS. I think most people have a sense that the apps they use in their browser are more alive, more social, more connected, even if they can't articulate that feeling. And whether it's true or not (Gmail isn't actually more "connected" than Outlook), companies can market the "aliveness" of their web apps (even when they run offline) versus the "deadness" of desktop apps.

GoogleOS? YahooOS? MozillaOS? WebOS?

Before we get going, here are some alternate titles for this post, just to give you an idea of what I'm trying to get at before I actually, you know, get at it:

  • You're probably wondering why Yahoo bought Konfabulator
  • An update on Google Browser, GooOS and Google Desktop
  • A platform that everyone can stand on and why Apple, Microsoft, and, yes, even Google will have to change their ways to be a part of it
  • The next killer app: desktop Web servers
  • Does the Mozilla Foundation have the vision to make Firefox the most important piece of software of this decade?
  • Web 3.0
  • Finally, the end of Microsoft's operating system dominance

Now that your hyperbole meter has pegged a few times, hopefully the rest of this will seem tame in comparison. (And apologies for the length...I got rolling and, oops, 2500 words. But many of them are small so...)

Way back in October 2004, this idea of how the Web as a platform might play out popped into my head, and I've been trying to motivate myself into writing it down ever since. Two recent events, Yahoo's purchase of Konfabulator and Google's release of a new beta version of Google Desktop have finally spurred me into action. But back to October. At the Web 2.0 conference, Stewart pulled me aside and said something like, "I think I know what Google is doing with Google Browser." From a subsequent post on his site:

I've had this post about Adam Bosworth, Alchemy and the Google browser sitting around for months now and it is driving me crazy, because I want all the credit for guessing this before it happens. So, for the record, if Google is making a browser, and if it is going to be successful, it will be because there is a sophisticated local caching framework included, and Google will provide the reference apps (replying to emails on Gmail or posting messages to Google groups while on the plane).

At the time, Adam Bosworth had been recently hired by Google for purposes unknown. In a blog post several months before he was hired, Bosworth mused about a "new browser":

In this entry, I'm going to discuss how I imagine a mobilized or web services browser handles changes and service requests when it isn't connected. This is really where the peddle hits the metal. If you just read data and never ever alter it or invoke related services (such as approving an expense report or booking a restaurant) then perhaps you might not need a new browser. Perhaps just caching pages offline would be sufficient if one added some metadata about what to cache. Jean Paoli has pointed out to me that this would be even more likely if rather than authoring your site using HTML, you authored it as XML "pages" laid out by the included XSLT stylesheets used to render it because then you could even use the browser to sort/filter the information offline. A very long time ago when I was still at Microsoft (1997) we built such a demo using XSLT and tricky use of Javascript to let the user do local client side sorting and filtering. But if you start actually trying to update trip reports, approve requests, reserve rooms, buy stocks, and so on, then you have Forms of some sort, running offline, at least some of the time, and code has to handle the inputs to the "Forms" and you have to think through how they are handled.

A couple weeks later, Google introduced the first iteration of their Desktop Search. To me, the least interesting thing about GDS was the search mechanism. Google finally had an application that installed on the desktop and, even better, it was a little Web server that could insert data from your local machine into pages you were browsing on google.com. That was a new experience: using a plain old Web browser to run applications locally and on the Web at the same time.

So this is my best guess as to how an "operating system" based on the Web (which I will refer to as "WebOS") will work. There are three main parts to the system:

  • The Web browser (along with other browser-ish applications like Konfabulator) becomes the primary application interface through which the user views content, performs services, and manages data on their local machine and on the Web, often without even knowing the difference. Something like Firefox, Safari, or IE...ideally browser agnostic.
  • Web applications of the sort we're all familiar with: Gmail, Flickr, and Bloglines, as well as other applications that are making the Web an ever richer environment for getting stuff done. (And ideally all Ajaxed up to provide an experience closer to that of traditional desktop apps.)
  • A local Web server to handle the data delivery and content display from the local machine to the browser. This local server will likely be highly optimized for its task, but would be capable of running locally installed Web applications (e.g. a local copy of Gmail and all its associated data).

That's it. Aside from the browser and the Web server, applications will be written for the WebOS and won't be specific to Windows, OS X, or Linux. This is also completely feasible, I think, for organizations like Google, Yahoo, Apple, Microsoft, or the Mozilla Foundation to make happen (more on this below).

Compared to "standalone" Web apps and desktop apps, applications developed for this hypothetical platform have some powerful advantages. Because they run in a Web browser, these applications are cross platform (assuming that whoever develops such a system develops the local Web server part of it for Windows, OS X, Linux, your mobile phone, etc.), just like Web apps such as Gmail, Basecamp, and Salesforce.com. You don't need to be on a specific machine with a specific OS...you just need a browser + local Web server to access your favorite data and apps.

For application developers, the main advantage is that instead of writing two or more programs for multiple platforms (one for the Web, one for Windows, etc.), they can write one app that will run on any machine with the WebOS using the same code base. Bloglines and NetNewsWire both do about the same thing and have radically different codebases (Bloglines uses HTML/JavaScript + some sort of backend programming/scripting language while NNW is a Cocoa app only for OS X), but a version of Bloglines developed for the above platform could utilize a single codebase.

You also get the advantages of locally run applications. You can use them when you're not connected to the Internet. There could be an icon in the Dock that fires up Gmail in your favorite browser. For applications using larger files like images, video, and audio, those files could be stored and manipulated locally instead of waiting for transfer over the Internet.

There are also disadvantages to WebOS applications, not the least of which[1] is that HTTP+JavaScript+XHTML+CSS+Flash is not as robust in providing functionality and user interaction as true desktop applications written in Cocoa or Visual Basic. But as Paul Graham points out, Web applications may be good enough[2]:

One thing that might deter you from writing Web-based applications is the lameness of Web pages as a UI. That is a problem, I admit. There were a few things we would have really liked to add to HTML and HTTP. What matters, though, is that Web pages are just good enough.

Web pages weren't designed to be a UI for applications, but they're just good enough. And for a significant number of users, software that you can use from any browser will be enough of a win in itself to outweigh any awkwardness in the UI. Maybe you can't write the best-looking spreadsheet using HTML, but you can write a spreadsheet that several people can use simultaneously from different locations without special client software, or that can incorporate live data feeds, or that can page you when certain conditions are triggered. More importantly, you can write new kinds of applications that don't even have names yet.

And how about these new kinds of applications? Here's how I would envision a few apps working on the WebOS:

  • Gmail. While online, you read your mail at gmail.com, but it also caches your mail locally so when you disconnect, you can still read it. Then when you connect again, it sends any replies you wrote offline, just like Mail.app or Outlook does. Many people already use Gmail (or Yahoo Mail) as their only email client...imagine if it worked offline as well.
  • A Web version of iTunes. Just like the desktop version of iTunes, except in the browser. Manages/plays audio files stored locally, with an option to back them up on the server (using .Mac or similar) as well. iTunes already utilizes information from the Internet so well (Web radio, podcasting iTMS, CDDB, etc.) that it's easy to imagine it as a Web app. (And why stop at audio...video would work equally as well.)
  • Flickr. Manage image files locally and on Flickr's server in the browser. You could even do some rudimentary photo manipulation (brightness, contrast, red-eye correction, etc.) in the browser using JavaScript or even Flash. Prepare a bunch of photos for uploading to Flickr while on the plane ride home and they automatically sync when you next connect to the Internet.
  • Newsreader. Read sites while offline (I bet this is #1 on any Bloglines user's wish list). Access your reading list from any computer with a browser (I bet this is #1 on any standalone newsreader user's wish list).
  • File backup. A little WebOS app that helps you back up your files to Apple's .Mac service, your ISP, or someone like Google. You'll specify what you want backed up and when through the browser and the backup program will take care of the rest.

I'm looking at the rest of the most commonly used apps on my Powerbook and there's not too many of them that absolutely need to be standalone desktop applications. Text editor, IM[3], Word, Excel, FTP, iCal, address book...I could imagine versions of these running in a browser.

So who's going to build these WebOS applications? Hopefully anyone with XHTML/JavaScript/CSS skills, but that depends on how open the platform is. And that depends on whose platform it is. Right now, there are five organizations who are or could be moving in this direction:

  • Google. If Google is not thinking in terms of the above, I will eat danah's furriest hat. They've already shifted the focus of Google Desktop with the addition of Sidebar and changing the name of the application (it used to be called Google Desktop Search...and the tagline changed from "Search your own computer" to the more general "Info when you want it, right on your desktop"). To do it properly, I think they need their own browser (with bundled Web server, of course) and they need to start writing their applications to work on OS X and Linux (Google is still a Windows company)[4]. Many of the moves they've made in the last two years have been to outflank Microsoft, and if they don't use Google Desktop's "insert local code into remote sites" trick to make whatever OS comes with people's computers increasingly irrelevant, they're stupid, stupid, stupid. Baby step: make Gmail readable offline.
  • Yahoo. I'm pretty sure Yahoo is thinking in these terms as well. That's why they bought Konfabulator: desktop presence. And Yahoo has tons of content and apps that that would like to offer on a WebOS-like platform: mail, IM, news, Yahoo360, etc. Challenge for Yahoo: widgets aren't enough...many of these applications are going to need to run in Web browsers. Advantages: Yahoo seems to be more aggressive in opening up APIs than Google...chances are if Yahoo develops a WebOS platform, we'll all get to play.
  • Microsoft. They're going to build a WebOS right into their operating system...it's likely that with Vista, you sometimes won't be able to tell when you're using desktop applications or when you're at msn.com. They'll never develop anything for OS X or for Linux (or for browsers other than IE), so its impact will be limited. (Well, limited to most of the personal computers in the world, but still.)
  • Apple. Apple has all the makings of a WebOS system right now. They've got the browser, a Web server that's installed on every machine with OS X, Dashboard, iTMS, .Mac, Spotlight, etc. All they're missing is the applications (aside from the Dashboard widgets). But like Microsoft, it's unlikely that they'll write anything for Windows or Linux, although if OS X is going to run on cheapo Intel boxes, their market share may be heading in a positive direction soon.
  • The Mozilla Foundation. This is the most unlikely option, but also the most interesting one. If Mozilla could leverage the rapidly increasing user base of Firefox and start bundling a small Web server with it, then you've got the beginnings of a WebOS that's open source and for which anyone, including Microsoft, Google, Yahoo, and anyone with JavaScript chops, could write applications. To market it, they could refer to the whole shebang as a new kind of Web browser, something that sets it apart from IE, a true "next generation" browser capable of running applications no matter where you are or what computer (or portable device) you're using.

So yeah, that's the idea of the WebOS (as I see it developing) in a gigantic nutshell. The reality of it will probably be a lot messier and take a lot longer than most would like. If someone ends up doing it, it will probably not be as open as it could be and there will likely be competing Web platforms just as there are now competing search engines, portals, widget applications (Konfabulator, Dashboard, Google Desktop Sidebar), etc., but hopefully not. There's lots more to discuss, but I'm going to stop here before this post gets even more ridiculously long. My thanks if you even made this far.

[1] Actually, the biggest potential problems with all this are the massive security concerns (a Web browser that has access to data on your local hard drive?!!!??) and managing user expectations (desktop/web app hybrids will likely be very confusing for a lot of users). Significant worries to be sure, but I believe the advantages will motivate the folks developing the platform and the applications to work through these concerns.

[2] For more discussion of Web applications, check out Adam Rifkin's post on Weblications.

[3] Rumor has it that Google is releasing an IM client soon (more here). I'll be pretty surprised if it's not significantly Web-based. As Hotmail proved for email, there's no reason that IM has to happen in a desktop app (although the alerting is problematic).

[4] Maybe Google thinks they can't compete with Apple's current offerings (Spotlight, Dashboard, Safari, iPhoto) on their own platform, but that's not a good way of thinking about it. Support as many people as you can on as many different architectures as you can, that's the advantage of a Web-based OS. Microsoft certainly hasn't thought of Apple as a serious competitor in the OS space for a long time...until Web applications started coming of age recently, Microsoft's sole competitor has been Microsoft.

So long, Technorati

That's it. I've had it. No more Technorati. I've used the site for, what, a couple of years now to keep track of what people were saying about posts on kottke.org and searching blogs for keywords or current events. During that time, it's been down at least a quarter of the time (although it's been better recently), results are often unavailable for queries with large result sets (i.e. this is only going to become a bigger problem as time goes on), and most of the rest of the time it's slow as molasses.

When it does return results in a timely fashion for links to kottke.org, the results often include old links that I've seen before in the results set, sometimes from months ago. And that's to say nothing of the links Technorati doesn't even display. The "kottke.org" smart list in my newsreader picks up stuff that Technorati never seems to get, and that's only pulling results from the ~200 blogs I read, most of which are not what you'd call obscure. What good is keeping track of 14 million blogs if you're missing 200 well-known ones? (And trackbacks perform even better...this post got 159 trackbacks but only 93 sites linking to it on Technorati.)

Over the past few months, I've been comparing the results from PubSub to those of Technorati and PS is kicking ass. Technorati currently says that 19 sites have linked to me in the past 6 days (and at least four of those are old and/or repeats...one is from last September, fer chrissakes) while PubSub has returned 38 fresh, unrepeated results during that same time. (Not that PubSub is all roses and sunshine either...the overlap between the result sets is surprisingly small.)

While their search of the live web (the site's primary goal) has been desperately in need of a serious overhaul, Technorati has branched out into all sorts of PR-getting endeavors, including soundbiting the DNC on CNN, tags (careful, don't burn yourself on the hot buzzword), and all sorts of XML-ish stuff for developers. Which is all great, but get the fricking search working first! As Jason Fried says, better to build half a product than a half-assed product. I know it's a terrifically hard problem, but Figure. It. Out.

As for the acquisition rumors, I don't know who'd buy such a mess, but if someone does, I look forward to them improving it to a usable level. Pretty much everyone I talk to in the industry thinks the site sucks and we've just been waiting for it to get better because, well, it would have to at some point, wouldn't it? Well, I'm tired of waiting. Goodbye, Technorati...your url will darken the door of my browser no longer.

Update: For the short amount of time I've been using it, IceRocket's blog search seems to work quite well. Thanks to Kevin for pointing me in that direction.

Tags and kottke.org

A few months ago, I began tagging my remaindered links with keywords toward some still-unspecified goal. For instance, this recent post about an interview with Ruth Reichl got tagged with "nyc food restaurants ruthreichl books interviews". As I said, I haven't figured out what to do with them yet, but the other day I whipped up a little PHP script to see how the kottke.org tagspace was shaping up. Here are a few results:

# of entries tagged: 933
total # of tags: 3960
# of distinct tags: 1376
tags per entry: 4.244

Most popular tags (#):
science (80)
nyc (80)
movies (80)
business (73)
food (68)
photography (62)
funny (57)
books (53)
lists (53)
www (43)
music (43)
weblogs (40)
art (39)
design (34)
restaurants (34)
sports (34)
apple (33)
google (32)
technology (29)
nostalgia (27)

That's a fairly accurate description of both what the site is about and what I am interested in. Two of my favorite tags are "lists" and "bestof". Here's a sampling from each of those tags:

lists:
100 people who are qualified to carry the "Bad Mothafucka" wallet besides Pulp Fiction's Jules Winfield
Photo essay of the Hubble Telescope's top ten discoveries
50 Things to Do with Your iPod
Twelve ways to think differently
Pickup Lines Used by Mario [of Mario Bros. fame]
20 things gamers want from the next generation of game consoles
Money Magazine on the 50 smartest things you can do with your money
40 things that only happen in the movies
24 different ways to lace your shoes

bestof:
Is Shaq the greatest NBA player of all time?
Spin names Radiohead's OK Computer the best album from the last 20 years
BusinessWeek Design Award winners for 2005
BBC Radio 4 poll results for Greatest Philosopher Ever!!
New bookmark: interesting Flickr photos from the last 24 hours, automagically determined

The dream is to go back and tag every single entry on the site -- currently ~8700 -- but it would take me approximately forever and I'm not sure it's worth the time and debilitating injuries to my wrists and fingers from all the typing. I've thought about a few alternative approaches (and their associated downsides):

  • Feed all my URLs into del.icio.us via the API and scrape out the tags most commonly associated with those links and posts. I literally haven't looked at the API, so I don't know if this is even possible. Also, I'm not sure I want to trust the del.icio.us community to collaboratively tag my posts and links...there would probably be a significant amount of correction and addition of tags by hand.
  • Use Yahoo's Term Extraction service to build a list of keywords based on an analysis of my posts and the content of the pages I point to within a post or remaindered links. I have no idea how well this would work in practice, especially in returning terms that make good tags. Probably a lot of hand-correction here too.
  • Getting my readers (that's you!) to tag them for me using the list of tags I've already used as a guideline. Unfortunately, you should never trust anyone over 30 or anyone who has access to a HTML textarea into which they can type anything they want. Given enough time, I could probably come up with a system that minimizes the damage a particular malcontent could do, but as with the other two options, I'm still left with a fair amount of correction by hand. A bigger problem I have with this option is there's a lot in it for me (and the site), but I'm not sure there's any real incentive for any of you to spend 20 minutes tagging kottke.org posts (I believe this chore would be the first entry in the dictionary under "mindless busywork"), so I'd feel weird about asking.
  • Some combination of the above approaches.

So yeah, that's where I am with the tagging.

The present future

Perhaps this is impossible or unfair, but can we have a discussion about where technology and user experience on the web are headed without using any of the following words or concepts:

Ajax, web services, weblogs, Google, del.icio.us, Flickr, folksonomy, tags, hacks, podcasting, wikis, bottom-up, RSS, citizen journalism, mobile, TiVo, the Long Tail, and convergence.

That all seems like the present and past, not the future, no? "Web 2.0" arrived a year or two ago at least and we're still talking about it like it's just around the corner. What else is out there? Anything? (Note: This is not an attempt to bring the current "is it really Web 2.0?" discussion (I could care less) here. I'm genuinely interesting in what's out there, if anything.)

Google attempting to patent RSS advertising?

John Battelle points to news of Google (the author is Nelson Minar) attempting to patent the idea of automating the incorporation of targetted ads into RSS files. Here's the application on the USPTO site. I've got a few questions and concerns:

Is this a joke?

Ok, bad first question since it seems unlikely that Nelson and Google would write up this application just to have a few laughs. So here's a better question: where's the prior art on this? The patent was filed on 12/31/2003. I floated the idea of embedding advertising into RSS ads in October 2002 and there was prior art then. But Google's patent application covers "targeted ads" in a "syndicated, e.g., RSS, presentation format in an automated manner". Curiously, I believe this is already covered by an older Google patent, filed in 12/2002:

The relevance of advertisements to a user's interests is improved. In one implementation, the content of a web page is analyzed to determine a list of one or more topics associated with that web page. An advertisement is considered to be relevant to that web page if it is associated with keywords belonging to the list of one or more topics. One or more of these relevant advertisements may be provided for rendering in conjunction with the web page or related web pages.

That's Google AdSense in a nutshell: inserting targeted ads into web documents in an automated manner. So what is it about RSS/Atom files that make them different than plain old web pages and hence not covered under the 2002 AdSense patent? Nothing. This vocabulary of "feeds" and "syndication" is still misleading. RSS/Atom files, especially as they are described in the 12/2003 patent application, are XML files that sit on a web server waiting for someone with a web browser to come along to read them, just like XHTML files:

So, people access documents written in a markup language that have been published on a Web server with a software application. If this seems familiar to you, it should. It's called Web browsing and has nothing to do with syndication. RSS readers and newsreaders are just specialized Web browsers...

The 12/2003 application tries to explain the difference between HTML pages and "syndicated content formats" thusly:

Syndicated content, unlike web pages which are normally stored in an HTML format, are often stored and presented in what may be described as a syndicated content format. Syndicated content formats are often XML (eXtended Markup Language) based and include structured representations of content such as news articles, search results, and web log entries. Syndicated content formats are primarily intended for providing syndicated information, e.g., news headlines, weblogs, etc. in a structured format such as a list of items, with another device, e.g., a user device, usually controlling the ultimate presentation format of the items in the list. This is in contrast to HTML which usually includes a fair amount of presentation and formatting information within an HTML document such as a web page.

That's a pretty weak explanation and sounds a lot like what a web browser (the "user device" that controls the presentation) does with XHTML files (XML-based files without a "fair amount of presentation and formatting information"). It sounds to me like Google already has this covered with their previous patent.

[Long aside: Does the prior art of embedding AdSense ads in XHTML files invalidate this patent? Patents are tricky because they don't cover ideas, they cover specific implementations of ideas. While the 12/2003 application states that "said syndicated format is an XML compliant format" it also specifies that "said syndicated format is a format for listing items corresponding to a channel, said received information including a listing of at least two items and including for each item, a title and a link". That is, the XML files they're talking about have to be RSS/Atom-ish in nature. This doesn't rule out XHTML files in theory, but it does rule out many of them in practice.

But the really tricky part with these software patents is that the implementations of ideas are written so broadly that they might as well be patents of the ideas themselves. If you look at it that way (the patent-holding companies certainly seem willing to litigate on that basis), Google has already embedded automated, targeted advertising into XML-based files. According to news.com, Google launched their AdSense service in June 2003. When the first AdSense advertisement was embedded in an XHTML file soon after that, well, there's your prior art on the very thing that Google attempted to patent 6 months later.]

NewsRank but not particularly new

I missed this April article in New Scientist about Google's plans to rank news stories according to quality and credibility of the sources:

Now Google, whose name has become synonymous with internet searching, plans to build a database that will compare the track record and credibility of all news sources around the world, and adjust the ranking of any search results accordingly.

The database will be built by continually monitoring the number of stories from all news sources, along with average story length, number with bylines, and number of the bureaux cited, along with how long they have been in business. Google's database will also keep track of the number of staff a news source employs, the volume of internet traffic to its website and the number of countries accessing the site.

Google will take all these parameters, weight them according to formulae it is constructing, and distil them down to create a single value. This number will then be used to rank the results of any news search.

The second paragraph of the story mentions that this system has been patented by Google, but I don't see how it's much different than what PageRank does or what Metacritic has been doing with film, game, and book reviews:

This overall score, or METASCORE, is a weighted average of the individual critic scores. Why a weighted average? When selecting our source publications, we noticed that some critics consistently write better (more detailed, more insightful, more articulate) reviews than others. In addition, some critics and/or publications typically have more prestige and weight in the industry than others. To reflect these factors, we have assigned weights to each publication (and, in the case of film, to individual critics as well), thus making some publications count more in the METASCORE calculations than others.

I wonder if these systems will eventually let their users tweak the credibility algorithms to their liking. For instance, it won't take long for conservatives to start complaining about the liberal bias of Google News. In the case of Metacritic, I'd like them to ignore Anthony Lane's rating when he writes about summer blockbusters and put greater emphasis on whatever Ebert has to say. In the meantime, I'm readying my patent applications for RecipeRank, PhotoRank, ModernFurnitureRank, SoftDrinkRank, and, oooh, PatentRank. I'm sure they're brilliantly unique enough to be recognized by the US Patent Office as new inventions.

Ajax and weblogs

Update: The menu is gone for now, so the rest of this probably isn't going to make much sense.

If you take a look at the front page of kottke.org, you'll notice a pulldown at the top of the content column (on the left). When you mouse over the menu, you're presented with a list of choices of what to display in that column and when you click one of those choices, the content is requested from the server and displayed using Ajax. More on the Ajax stuff in a bit.

Back to the menu (go try it out...this will all make a lot more sense if you do), this is a baby step in my desire to get rid of (or at least drastically de-emphasize) the fairly useless archive page**. From the front page, you can now see not only all the latest posts but also only the movie posts, the latest remaindered links, the latest book reviews, etc. You can also request 50 random posts (which is my favorite feature of this whole thing; it's a great way to surf the archives...and find some really horrible writing), a compilation of my favorite posts, some recent photos of mine from Flickr, and my del.icio.us inbox. It's a really fast way to get at a large chunk of the recent archives (dare I say the Long Tail of kottke.org?) and does so in a better way than the archives page.

Oh, and just for the hell of it, I whipped up a little RSS reader that pulls in posts from Boing Boing, waxy.org, Gawker, and the NY Times (cached so that feeds aren't requested every time someone looks at the page) and presents them in the familiar kottke.org format. So that's fun too.

Using Ajax for this was a no-brainer (it's better than loading all that content ahead of time and having it in hidden layers or something), but I'm going to let you in on a little secret: Ajax is a ridiculously easy technology to implement***, especially if you're using something like Movable Type which quickly and easily outputs chunks of XHTML. The XMLHttpRequest part of this took all of 10 minutes to implement, just put the following in the header of your XHTML file (code adpated from Guide to Using XMLHttpRequest (with Baby Steps)):

<script type="text/javascript">
<!--
var xmlhttp=false;
/*@cc_on @*/
/*@if (@_jscript_version >= 5)
try {
xmlhttp = new ActiveXObject("Msxml2.XMLHTTP");
} catch (e) {
try {
xmlhttp = new ActiveXObject("Microsoft.XMLHTTP");
} catch (E) {
xmlhttp = false;
}
}
@end @*/

if (!xmlhttp && typeof XMLHttpRequest != 'undefined') {
xmlhttp = new XMLHttpRequest();
}

function getMyHTML(serverPage, objID) {
var obj = document.getElementById(objID);
xmlhttp.open("GET", serverPage);
xmlhttp.onreadystatechange = function() {
if (xmlhttp.readyState == 4 && xmlhttp.status == 200) {
obj.innerHTML = xmlhttp.responseText;
}
}
xmlhttp.send(null);
}
//-->
</script>

and then you can get an XHTML chunk called "chunk2.html" from the server and put it into the "goeshere" div when you click on the "Change it!" link like so:

<p><a href="javascript://" onclick="getMyHTML('chunk2.html','goeshere')">Change it!</a></p>
<div id="goeshere">Hey, this text will be replaced.</div>

That's it...you don't need to write a whole bunch of JavaScript to parse XML data files and use the DOM to stick each bit of information into the right spot or anything like that. Even the RSS feeds I'm pulling in are processed on the server and pulled into the browser as a whole XHTML chunk. And like I said before, Movable Type, Wordpress, TextPattern, or any other CMS that can output XHTML can be used to generate files that you can then swap in. It's pretty easy to get MT to publish XHTML files containing the last 30 Trackbacks to your site, the 50 most recent comments, a list of the most commented on entries, or dozens of other options.

So that's the easy part. The interesting aspect of Ajax is not the technology but how to apply it sensibly (i.e. using it to solve design problems). One of the problems with Ajax is that the data you're bringing in dynamically often cannot be bookmarked and it breaks the back button. Something like the front page of kottke.org is a perfect place for it though. All the posts on the front page are permalinked and no one links to a specific post that's shown on the front page because that information will scroll off the page in two weeks or so; people link to the post's permanent location instead. When you select different types of content with the pulldown, all the posts returned are permalinked and there's a link to the permanent location for each type of content as well (if you're viewing the latest movies, there's a permanent link to the movies page). Plus, the page still works just fine if the JavaScript doesn't work on your browser or mobile device and you can still get to everything via the archives page.

It's all a bit rough and needs more refinement than most of the stuff I usually launch on kottke.org (there's several things I don't quite like about it), but I wanted to get it out there and get some feedback. Bug reports are especially welcome, but please note that if you're not on IE 5.5+, Firefox, or Safari, it might not work at all. (The line-height on the pulldown is a little bit screwy on IE on the PC and there's an odd flicker on Firefox on the PC...any ideas?) And apologies in advance if any of the above code is wrong or confusing...like I said, this is all a bit more slapdash than usual and I'll correct as necessary.

** I've touched on this before, but most weblogs' archive pages are pretty useless, mine included. They should help people discover and find the non-current posts on a site and a list of links to each month's worth of posts isn't that helpful. The categories are more helpful, but it's still a lot of clicking around. The jury is still out on tag clouds. Two current archive page favorites are Binary Bonsai's (Live Archives section, lists of latest entries, most commented on, personal favorites) and Subtraction's (category descriptions, # of posts in each archive, list of post titles for each of the last few months). Like I said above, my goal is to get rid of the archive page altogether.

*** Obviously there's a lot more you can do with Ajax that is more complicated, but for a lot of applications, the "assemble the data on the server and then just shove it into an id'd div" approach works pretty well, even if it does require a little more bandwidth.

The fundamental unit of the web

Much like the shift from molecules to atoms to subatomic particles (protons, neutrons, etc.) to quarks to (potentially) tiny vibrating strings as the most fundamental unit of physical matter that we can find, the fundamental unit for content on the web has been getting smaller as well:

1. The site. You'd see references on sites or in emails like "check out this cool hotwired.com site" or "go to Bobaworld, scroll down, and click on the 'cool links' link". This quickly gave way to:

2. Individual pages. People learned that the web was all about the page. The X-Files Episode Guide page, your Geocities home page, the product page for that new Thinkpad with the fold-out keyboard.

3. But eventually content producers started gathering several chunks of content on the same page and came up with the post/permalink combination. The idea is that several bits of content might be on this page right now, but may be gone when you come back, so here's a permanent link to it so you can find it at some later time. Weblogs are the best example of this, but there are others...Google Maps gives you a way to permalink the particular map you're on for later reference.

4. And now it seems that there are several efforts underway to cut the fundamental unit down to the phrase or word. Online bookmark managers like del.icio.us and Furl and scores of bloggers doing remaindered links blogs link to things with just a few words to describe them. Sites supporting tagging (del.icio.us, Flickr) are creating vast collections of stuff for single words and short phrases. Wikipedia is working on making any word or phrase linkable to an array of information about that word. Linking words or phrases to a Google search result is always an option as well. The result is something like: the Sun is a large ball of gas that gives off energy to Mother Earth.

Note: the fundamental unit of matter metaphor works ok, but is obviously still a metaphor. In particular, the shift in the fundamental unit for content on the web is not one of discovery (linking a single word to a search result was obviously possible many years ago) but of a very rough consensus of perception. Still, something fun to think about.

A whole new internet?

After reading Janice's piece, It's a Whole New Internet, I didn't really know how to feel about it. It is an exciting time on the web right now, but it doesn't seem any more exciting than 2-3 years ago. At that time, blogs were really taking off, people were paying more attention to structured data in the form of RSS & XHTML/CSS, and using web services to paste together various apps and bits of data from around the web into new and useful services. But after thinking about it for a couple of days, what bothered me about it was echoed by Andre Torrez, who puts it a tad stronger (and funnier) than I would have:

Anyway, yes, there's more money that seems to be available for people who have been building these apps, but the suggestion that people who make these sites are only now springing to life when money is available is kind of disappointing. I hate the equation that $1 million in funding == EXCITING OPPORTUNITIES. It's how you fools lathered yourselves into the last bubble.

If your focus is on the neat technology shoehorned into some idea to make money then you're going to be up to your ass in sock puppets again.

When the dot com economy was crumbling in 2000 and 2001, I remember thinking at the time that although everyone I knew was out of work (myself included), that is was a good thing for the long term. One of the more pleasant side effects of the dot com boom was that billions of dollars were spent training indivduals how to design web sites, program, write, etc. In the years following the bust, when all those people were suddenly unemployed or stuck in high-paying jobs that they didn't care for very much but needed to pay the bills, they responded by starting to tinker around with all sorts of neat things, just for the hell of it. Because they could, because they wanted to, not because they had an artificial deadline to reach or some arbitrary client requests to satisfy.

They made apps and services that they wanted to use or thought that others would like to use, not only apps for which there was money available to build. There was no pressure...these people had nothing to lose and everything to gain. Out of this period came All Consuming, Movable Type, Amazon Light, millions of blogs, thousands of very active blog communities, the first consumer-grade newsreaders, Wikipedia (and thousands of other wikis), Firefox, FilePile, lots of social software (admittedly much of it of dubious value), Muxway (which became del.icio.us), a huge push toward XHTML/CSS-only sites, and a billion other things I'm forgetting, all when no one was putting any money into anything.

If you're buying low and selling high, the time to buy optimism was two to four years ago, not now. That was when a small group of friends looked at a horrible economy and saw an opportunity to educate their clients and the rest of us about the value of user-centered design. When a husband and wife decided to build their own blog tool in their spare time because they wanted to use it. When an entreprenuer gambled that you could make money publishing weblogs. When a few folks decided that people needed a place to share their photos with friends. When a loose collective of designers showed us the possibilities of semantically correct standards-based web design. There's still lots of opportunity these days, but it's more expensive with less return.

Now that the money is back, the focus will necessarily shift even though, as Janice notes, we'll be a little wiser about it this time around. There will be less innovation and activity from individuals because they'll be snapped up by companies to work on their projects for their customers. The information flowing out of companies, even those that are pretty open, will be limited because of competitive and legal concerns. A person who -- when she was unemployed 3 years ago -- could spend a couple weeks in releasing a neat web app for anyone to use because she wanted to or could say what she wanted on her blog will now be putting all her coding energies into an application that serves a few customers & needs to be cash-flow positive and won't have the time to post anything to her blog (and can't say much about what she's working on anyway unless all her readers want to sign NDAs). (Not saying this is bad...this is just what companies are for. But what's good for companies, their shareholders, and their customers isn't necessarily what's good for environment those companies inhabit. On the other hand, everyone I know has more work than they know what to do with and that's a good thing too.)

Consider Six Apart as an example of what I'm talking about. 6A is like a black hole for creative people. Folks who, a year or two ago, were among the leading voices in the discussion of how weblogs were changing our culture, were coding all sorts of useful plug-ins for Movable Type, or were pushing the edges of web design are now focused on making software that generates revenue and aren't saying a whole lot about it. (Sort of ironic that working for 6A kills the weblogs of their employees, isn't it?) That's great for them, for Six Apart, their customers, and their partners, but it kinda sucks for the community as a whole.

(And just to head off some of the obvious criticism here, of course companies contribute to the common good (some more than others), competition creates opportunity, investment isn't evil, Ajax is cool, innovation is still happening, etc., etc.)

Adobe Macromedia acquisition roundup

As you might expect from a story with design, media, and technology angles, Adobe's impending acquisition of Macromedia has resulted in much reaction from a big chunk of the blogosphere. Here are what some technologists, designers, and pundits have had to say about the deal so far:

Mike Chambers, a Product Manager at Macromedia, had a few things to say about the acquisition on his blog. He can't say too much because of legal constraints around the deal, but he specifically mentions Macromedia's "culture of openness and participation" as one of the reasons that Adobe was interested in the company.

Kevin Lynch, Chief Software Architect for Macromedia, posted what sounds like a press release about the deal on his site. He's very hopeful about the future of the combined company.

Macromedia MX evangelist John Dowdell notes that Macromedia is "doing a lot of hiring these days" and points to Google Video's coverage of the deal.

(Note: if you're keeping score, that's three employees of Macromedia chiming in about the acquisition on their blogs. And many more MM employees keep active blogs, so I'm sure we'll be hearing more from that side of the fence (although because of the legal stuff, it looks like posts about this need to be approved). On the other side, I've never heard of an Adobe employee that keeps a blog. Anyone?)

Marc Canter, one of the founders of the com