Sunday, January 01, 2006

New feature: Tag-based recommendations

Tag-based recommendations.
I have added a second set of book recommendations, this time based on the tags people apply to books. So if a book is tagged mostly "constitution" and "american history" it will match up with other books about the Constitution, favoring historical ones. The algorithm is quite complex, weighing the relative frequency of tags by book and overall, book popularity and more. I think you'll find it's quite solid.

Here are some examples:
The new system joins the existing system "people who own X also own Y." More than solid this has the potential to provide truly inspired recommendations, crossing genres to get at what really binds people together around a book. For example, Brooks' The Mythical Man Month triggers other software project-management books. But it also triggers a Dilbert book. People who do software project management have a lot of Dilbert books, and for good reason. Dilbert is, of course, a software engineer and half the jokes are about the stupidies of cubicles and "pointy headed bosses." Or take the first Harry Potter. Amazon suggest five other books—all Harry Potters! LibraryThing also suggests Philip Pullman, Tolkien and—my favorite—Madeline L'Engle.

Connection uncertain.
Connection apparent.
Unfortunately, the system can backfire. Take Thomas a Kempis' The Imitation of Christ, the spiritual reflections of a 15th century monk. This currently triggers Clifford the Big Red Dog. In this case duplicate copies are the problem, something that will shortly be fixed. But there's a general problem when a book is held by only a few people. The results are statistically true, but not necessarily meaningful.

So there are now two systems, providing 25 recommendations in all (to Amazon's five). They will certainly get better as users add more tags and books, but I would argue LibraryThing's recommendations are already better than Amazon's—richer, more inspired and based on what's good to read NOT what's selling this month. Community is, it seems, more powerful than commerce, even for commerce.

Finally, I have a third way of calculating recommendations, different from the other two. I don't think I can talk how it works now, but it's somewhere in between them in terms of noise vs. inspiration, and it works well when there are only a few books. The trick is that it's very expensive—it takes forever to calculate! I think it will have to wait until I have a dedicated "thinking" machine. (If LibraryThing thing keeps growing, I'm forsee a stack of Mac Minis thinking all day.) I also plan to allow users to cheer and boo recommendations—Clifford boo!—and perhaps add their own.

And finally finally, I added another major new feature you may spot—RSS and HTML feeds all over the place. But I don't want to step on my own toes and I have some quirks to work out before I announce it. Intrepid explorers are invited to let me know what you think in advance.

PS: Do you think I can get away with calling tag-based connections between books "contaguinity"?

UPDATE: Between yesterday and today Clifford slid off the Thomas a Kempis recommendations list, like a toaster that isn't broken when you take it to the repair shop. [Yes, I know, repair shops vanished 30 years ago.] So I added them both to my library to force the connection. I'm NOT going to go looking for another example!

10 Comments:

Blogger Dennis said...

Two minor things:

It says that a book needs 5 tags for this feature to work, but I think it means that it needs tags from 5 people.

I've noticed that tag recommendations on a book I updated several hours ago stille rely solely on the old (not updated) tags. Is this something that's calculated periodically, say once a day or once a week? Or is there a bug?

1/01/2006 4:33 PM  
Anonymous Anonymous said...

If you get to call it 'contaguinity,' I'm going to miss the invocation of sanguine blood-ties between these books: but I won't argue their existence. It's excellent. The combination beats the pants off Amazon - and it's surprising how often one want either one or the other of the lists. I'm glad we now have both.

(Thanks!)

1/01/2006 5:51 PM  
Anonymous Anonymous said...

> RSS and HTML feeds all over the place

I saw that we now have three feeds listed on our profile pages (and from our reviews page). I would like the Comments field to be included (in addition to the Review field) in one of the RSS outputs, preferrably the "Recently-added books" one.

I do like 'contaguinity', though I also hope there's a use somewhere for 'contaguous'. (;

1/01/2006 6:20 PM  
Anonymous Anonymous said...

Thanks a lot for this feature, I've been looking forward to it!

I've noticed that tags users combine still appear separately on the Social Data page. Do you think this could be changed?

1/01/2006 7:36 PM  
Anonymous Anonymous said...

Thank you for the system and the improvements. I do have one request. When I go to the author page for Catherine Bishir I do not have an option to combine it with another name that she uses (Catherine W. Bishir). This also happens with Samuel R. Watkins and Sam R. Watkins (same author). This is just a minor thing against all that you have done, but it would be nice.
KC Gordon

1/01/2006 9:18 PM  
Anonymous Anonymous said...

Here's another funny bug to add to the queue.

When you're on the Add Books page and you search for a title, it brings up a list of books that match your query, with the little trangle switch to the left that lets you see more information. Often the switch displays information from the wrong item.

For example, go to the Add Books page and do an LC search for "forms, garfinkel". Turn the switch beside the first item (Forms of Explanation) and you'll see it display the additional data for the other book (on immigration), not Forms of Explanation. This doesn't alter the data when you save the record - it's just a display bug; but it happens very often. (One book about birds that I entered today showed subject headings for Dwight Eisenhower. ;-)

RJO

1/01/2006 10:33 PM  
Anonymous Anonymous said...

KC Gordon,

In your first example there was only one book by each author and they were not the same title. I suspect the author combining thing needs more than this to come up.

In the 2nd example, none of the titles were exactly the same so maybe this is also a factor, or maybe just the difference between "Sam" and "Samual".

1/02/2006 6:41 PM  
Anonymous Anonymous said...

Re: "contaguinity", I say, coin it, claim it, and ride that puppy to your 15 minutes of fame! :)

1/05/2006 1:38 PM  
Anonymous Anonymous said...

A problem with normal recomendations.

I buy an annual book on cricket every year "Wisden" unfortuately nine of these are thought to be the same book. I have editted the titles so they include the year and they have different ISBN's.

The result is that if I own a book and look at its recomendations Wisden is often there with 9/9 of its owners (all me) having the book I looked up.

Just wait till I have all 30 editions catalogued! (and I would like to buy the remaining hundred or so!)

1/06/2006 11:29 AM  
Anonymous Anonymous said...

ringman: my advice is not to edit the titles but to just wait for Tim to implement the title combining and disambiguation feature we all want so badly

1/09/2006 11:12 AM  

Post a Comment

<< Home