6,631,372

Paul Graham 2006-03-01

6,631,372

6,631,372 2006年3月,修订于2009年8月

几天前,我惊讶地发现我被授予了一项专利。它在2003年颁发,但没有人告诉我。如果不是几个月前访问雅虎时,我碰巧遇到了一个我在90年代末在那里工作时认识的大人物,我现在还不知道这件事。他提到了一个叫做收入循环(Revenue Loop)的东西,这是Viaweb在被他们收购时正在研发的东西。

这个想法基本上是,你不按文本”相关性”(如搜索引擎当时那样)排序搜索结果,也不按广告商出价(如Overture那样)排序,而是按出价乘以交易数量来排序。通常你会对购物搜索这样做,但事实上,我们方案的一个特点是它会自动检测哪些搜索是购物搜索。

如果你只按出价排序结果,你可能会使搜索结果变得无用,因为前面的结果可能被那些出价最高的蹩脚网站占据。但如果你按出价乘以交易数量排序,远不是出卖,你得到了一个更好的相关性衡量标准。还有什么比访问网站并购买某物更能表明某人对搜索结果满意呢?

当然,这个算法自动最大化搜索引擎的收入。

现在每个人都专注于这种方法,但在1998年很少有人这样做。1998年都是关于销售横幅广告。我们不知道这一点,所以当我们发现看起来是进行购物搜索的最佳方式时,我们非常兴奋。

当雅虎考虑收购我们时,我们在纽约与杨致远见了面。我现在意识到,对他来说,这应该是那些会议之一,你去考察一个你已经基本决定收购的公司,只是确保他们是不错的人。我们被期望的不过是聊天,显得聪明和合理。当我跳到白板上开始展示我们令人兴奋的新技术时,他一定感到沮丧。

当他看起来完全不在意时,我也同样沮丧。当时我想,“天哪,这家伙真是扑克脸。我们向他展示的必须是排序产品搜索结果的最佳方式,而他甚至不好奇。“直到很久以后我才明白他为什么不在意。1998年,广告商为网站广告支付的费用过高。1998年,如果广告商支付流量对他们的最大价值,雅虎的收入就会减少。

当然,现在情况不同了。现在这种事情很流行。所以当几个月前在雅虎食堂遇到我从前的雅虎高管时,他记得的第一件事不是(幸运的是)我与他所有的争吵,而是收入循环。

“嗯,“我说,“我想我们确实为此申请了专利。我不确定我离开后申请发生了什么。”

“真的吗?那将是一个重要的专利。”

所以有人调查了一下,果然,那项专利申请在之后的几年里继续进行,最终在2003年获得批准。

实际上,阅读时让我感到震惊的主要是,律师在某个时候搞乱了我清晰明了的写作。某个聪明的拼写检查用户将其中一部分变成了禅宗般的不可理解:

此外,常见的拼写错误将趋于修复。例如,如果搜索”compact disc player”的用户最终在提供compact disc players的网站上花费了大量金钱,那么这些页面对于该搜索短语将具有更高的相关性,即使短语”compact disc player”不存在于那些页面上。(那个”compat disc player”不是拼写错误,伙计们。)

想要看到原始的优美散文,请参见1998年2月的临时申请,那时我们还是Viaweb,负担不起付钱给律师把每一个”a lot of”变成”considerable”。


6,631,372

6,631,372 March 2006, rev August 2009

A couple days ago I found to my surprise that I’d been granted a patent. It issued in 2003, but no one told me. I wouldn’t know about it now except that a few months ago, while visiting Yahoo, I happened to run into a Big Cheese I knew from working there in the late nineties. He brought up something called Revenue Loop, which Viaweb had been working on when they bought us.

The idea is basically that you sort search results not in order of textual “relevance” (as search engines did then) nor in order of how much advertisers bid (as Overture did) but in order of the bid times the number of transactions. Ordinarily you’d do this for shopping searches, though in fact one of the features of our scheme is that it automatically detects which searches are shopping searches.

If you just order the results in order of bids, you can make the search results useless, because the first results could be dominated by lame sites that had bid the most. But if you order results by bid multiplied by transactions, far from selling out, you’re getting a better measure of relevance. What could be a better sign that someone was satisfied with a search result than going to the site and buying something?

And, of course, this algorithm automatically maximizes the revenue of the search engine.

Everyone is focused on this type of approach now, but few were in 1998. In 1998 it was all about selling banner ads. We didn’t know that, so we were pretty excited when we figured out what seemed to us the optimal way of doing shopping searches.

When Yahoo was thinking of buying us, we had a meeting with Jerry Yang in New York. For him, I now realize, this was supposed to be one of those meetings when you check out a company you’ve pretty much decided to buy, just to make sure they’re ok guys. We weren’t expected to do more than chat and seem smart and reasonable. He must have been dismayed when I jumped up to the whiteboard and launched into a presentation of our exciting new technology.

I was just as dismayed when he didn’t seem to care at all about it. At the time I thought, “boy, is this guy poker-faced. We present to him what has to be the optimal way of sorting product search results, and he’s not even curious.” I didn’t realize till much later why he didn’t care. In 1998, advertisers were overpaying enormously for ads on web sites. In 1998, if advertisers paid the maximum that traffic was worth to them, Yahoo’s revenues would have decreased.

Things are different now, of course. Now this sort of thing is all the rage. So when I ran into the Yahoo exec I knew from the old days in the Yahoo cafeteria a few months ago, the first thing he remembered was not (fortunately) all the fights I had with him, but Revenue Loop.

“Well,” I said, “I think we actually applied for a patent on it. I’m not sure what happened to the application after I left.”

“Really? That would be an important patent.”

So someone investigated, and sure enough, that patent application had continued in the pipeline for several years after, and finally issued in 2003.

The main thing that struck me on reading it, actually, is that lawyers at some point messed up my nice clear writing. Some clever person with a spell checker reduced one section to Zen-like incomprehensibility:

Also, common spelling errors will tend to get fixed. For example, if users searching for “compact disc player” end up spending considerable money at sites offering compact disc players, then those pages will have a higher relevance for that search phrase, even though the phrase “compact disc player” is not present on those pages. (That “compat disc player” wasn’t a typo, guys.)

For the fine prose of the original, see the provisional application of February 1998, back when we were still Viaweb and couldn’t afford to pay lawyers to turn every “a lot of” into “considerable.”