Pete Cashmore, Paul Montgomery , and Mitch Ratcliff are having a real good conversation regarding the collective intelligence of “humanity” versus “algorithms.” A few random thoughts . . . as I dont have too much time for “coherentness”
1. Reminds me of the debate in the financial industry on quant hedge funds (program based trading) versus investing in broad indexes such as the S&P 500 (ie algorithms vs. collective intelligence). (BTW the third option, investing in human managed funds, would be considered the web 1.0 model . . . stupid, slow, dumb . . . I’m exagerating but you get my point.) The problem with algorithms (in both finance and editorial content) is that it is based on a priori model of the world which assumes that certain unknown variables (not in the model) remains constant. So as long as major shocks to the system do not appear (such as major earning disappointment, corporate fraud etc) , algorithms are much much faster than humans can at detecting market efficiencies, forming a view on market movements, and capturing value by exploiting such an inefficiencies. These models will make money slowly but consistantly until something goes wrong and the fund could potentially lose everything (read Long-Term Capital). As such, most quant hedge funds built into the system ways to for humans to intervene (such as turning off the trading program during earnings week or triple witching hour) as well as ways to add value into the model. The co-dependency implementation is not perfect but it beats losing your shirt. Perhaps thats where we are headed on the web in the not too distant future.
2. This is another key difference between Yahoo and Google. Many people have pointed this out before. Essentially Yahoo’s historic background in directories and Google’s in algorithms has pited the two company in two of major camps in their vision of the future. Yahoo has fully embraced the whole web 2.0 peer economy model. MyWeb is a historic attempt at “PeerRank” while many of its other products like Yahoo Local relies on the contribution of users. Google on the other hand is still taking an aggregated approach to web 2.0. Essentially it is gathering up reviews and content contributed by users of OTHER websites and presenting it to its users. Google believes that it has superior parsing and ranking algorithm to recognize valuable content and create “semantics” around unstructured data. Look at the relaunch of Froogle and its vendor and product reviews. . . all crawled from other content providers. Both model has its merits . . . Google has a much easier time building critical mass and generating value for its OWN users in the early phase of the product lifecycle. . . but eventually, it is at the whims of content providers (see craigslist and oodle). Yahoo, on the other hand will have a much harder time generating network effects, but once it does, its position in the industry is much more defensible because it “owns” much of the content on its own site. I wonder who will win out in the future . . .





Excellent summation of Google v Yahoo, Will. If you hadn’t guessed, I’m on Yahoo’s side on that argument. Yahoo will survive after the #1 search engine mantle gets passed along as it does every couple of years. After that, Google will have to admit that it is an advertising agency first and foremost.
Comment by Paul Montgomery — October 31, 2005 @ 8:53 pm
It’s all about content ownership–if you rely on scraping you are vulnerable to be shut off. Googlebase is an attempt to “own” content. And get it for free. And thus devalue the content in favor of the index. It’s a content v index battle emerging.
Comment by Tom Foremski — November 1, 2005 @ 1:36 pm
Paul- what content Yahoo is all that useful? The only service that I know people use on a regular basis is the finance site. Whatever Yahoo creates in the context of the entire internet is so minuscule that I would argue Yahoo’s content-owning strategy has little promise. It’s like NBC owning a bunch of content - so what. there are about 500 other TV channels one can turn to.
Comment by Frat Party Crasher — November 1, 2005 @ 3:24 pm
Nice analogy in the financial industry. I particularly agree with this:
“As such, most quant hedge funds built into the system ways to for humans to intervene (such as turning off the trading program during earnings week or triple witching hour) as well as ways to add value into the model. The co-dependency implementation is not perfect but it beats losing your shirt. Perhaps thats where we are headed on the web in the not too distant future.”
I think co-dependency might well be a good direction to follow with search - I seem to remember a while ago that Google were considering adding a “Report Spam” button next to their search results. Not sure what became of that - on the other hand, maybe I just imagined it…
Anyway, Wink.com is probably the closest thing to a co-dependency model right now - it lets you rate and tag Google search results (as well as marking them as spam), and seems like a promising idea.
Comment by PeteCashmore — November 1, 2005 @ 5:41 pm
Hi Frat Party Crasher. NBC is not just like 500 other channels, jsut the same as Yahoo is not the same as the other millions of Web sites. Apart from finance, Yahoo has some very healthy inhouse sidelines like Games, Groups, My Yahoo and Sports (including fantasy sports). Then there’s the huge acquisitions, like Geocities and Flickr.
Content ownership is not a dead concept. The Internet has shown time and time again that while corporate understanding of how to exploit their ownership of content has to change to fit new circumstances (e.g. copyright), once that understanding has percolated through to the lawyers’ pin-sized brains, the corporations discover to their delight that the Net actually increases their turnover, instead of cannibalising it (e.g. music).
Comment by Paul Montgomery — November 7, 2005 @ 4:40 am