Recently, a story has been making its way around the blogosphere concerning the results of searching for the word "Jew" with Google. The number one result when the tale began was an anti-Semitic website run by a neo-Nazi organization. In an attempt to get this off the top of the list, a number of bloggers have been linking to the "Jew" entry at Wikipedia. Initially, Google defended itself on the grounds that the ranking was done automatically, and that they absolutely, positively would not interfere with the sacredness of their page-ranking algorithm. Then, it turned out that Google had in fact fiddled with their algorithm to deal with the fact that child pornography turned up near the top for searches of the word "Chester", a fact which bothered the denizens of Chester, England.
However, Google is now not above fiddling with the "Jew" results, not to fix them, but to post a disingenuous disclaimer that only pops up when you search for "Jew". Since SC doesn't wish to distort Google's statistics on searches for Jews any more than they do, here's a direct link to the explanation.
First, Google makes a comment that SC doesn't actually disagree with, although he doesn't have any statistics to back it up:
If you use Google to search for "Judaism," "Jewish" or "Jewish people," the results are informative and relevant. So why is a search for "Jew" different? One reason is that the word "Jew" is often used in an anti-Semitic context. Jewish organizations are more likely to use the word "Jewish" when talking about members of their faith. The word has become somewhat charged linguistically..."
This strikes your host as plausible. Searching for "the Jew Sharon", "the Jew Wolfowitz", "the Jew Perle", and "the Jew Kristol" all turned up either anti-Semitic websites, quotes of same, or satirical work intended to mimic anti-Semitic behavior. SC does not refer here to people writing in disagreement with the aforementioned people's beliefs (which is not to say that there are no anti-Semites who disagree with them), but rather to writings insinuating the existence of a sinister cabal ([time to brush up on your Kabballah -- ed.]). So your host will accept this as valid -- but it's also sort of irrelevant.
The Google explanation goes on to state:
Someone searching for information on Jewish people would be more likely to enter terms like "Judaism," "Jewish people," or "Jews" than the single word "Jew." In fact, prior to this incident, the word "Jew" only appeared about once in every 10 million search queries. Now it's likely that the great majority of searches on Google for "Jew" are by people who have heard about this issue and want to see the results for themselves.
This is offered as explanation for why "Jew" turns up an anti-Semitic site. But it's completely off-point. Google's ranking algorithm doesn't rank pages by how often terms within them are searched for, it ranks them by how often they're linked to, and how often the term actually appears. The fact that "Jew" may only be searched for once in 10 million queries doesn't tell us why the offending site is ranked so highly. Aside from that, though, without any comparable statistics on the frequency of searches for "Jewish", "Judaism", etc., there's no way to tell whether or not 1 in 10 million is an unreasonably small number of searches to deal with. Google's frequency statistics going by simple number of documents returned do bear out the claim to some extent -- there are 1.8 million hits for "Jew" and 13.2 million for "Jewish". But again, without any kind of statistics to provide context about the number of searches overall, the number of searches for other Judaism-related words, and maybe some time series data on these points as well, there's no way to tell if searches for "Jew' are effectively getting the anti-Semitic site in front of people or not.
Google isn't doing anybody any favors by trying to recast this as a minor problem because of the number of searches performed. Let's stipulate that it's economically infeasible -- and from a free speech standpoint, undesirable -- to have a team of editors do nothing but check that search terms bring back only inoffensive results ([so if they did other things as well, it would be OK? -- ed.]). Let's also stipulate that while the moral lines are pretty clear in this case, in many other cases, it's a lot harder to decide if a result ought to count as offensive (imagine if the New York Times decided to demand that Google remove all links to Andrew Sullivan's site, on the grounds that his criticisms offend their employees). Even granting these facts, their explanation simply has nothing to do with the actual mechanics of how the Web is structured (or perhaps more accurately, how their algorithms assign structure to it). If the standard is now that searches which occur frequently enough merit action, then it will not be long before activists of all stripes launch campaigns to boost the profile of particular searches for a long enough time to force Google to do something about them. If it's true that enough pages with "Jew' in them link to the offensive site, and their algorithm is doing its job, then they ought not handle any requests, like the city of Chester's, or treat them all equally in some other way. If SC was advising Google, he'd tell them to just quietly handle these requests on an ad hoc basis -- a little common sense could probably go a long way towards avoiding a lot of bad PR, much of which is now deserved due to Google's inconsistent behavior.
UPDATE: After additional discussion with Radagast, as well as actually reading the original Chester site (warning: it's disgusting), your host feels moved to clarify what he means by "handling". We'll start by paraphrasing an argument that the Wall Street Journal recently made (link courtesy of Seth Friedman, who turns up near the top of searches for Chester in this context now). It's not at all clear that the site is illegal, as Google claims; while the content is repellent, it may not actually violate most pornography laws (there are no pictures, real or simulated, and it's arguably satirical in nature). It might be more clearly illegal under anti-obscenity laws, but those vary by jurisdiction; by playing the "the law made us do it" card, Google opens themselves to vulnerability on the question of whose laws they're claiming to be bound by. Given that Google hasn't made it impossible to locate the "Chester" page -- all they've done is remove an association in their database between a keyword and a URL -- it's fair to say that "Bad Chester" hasn't actually been censored.
So what's an equitable solution? Ideally, it should: 1) not involve censorship, 2) preserve Google's reputation as a neutral arbiter of searches, 3) be legally defensible, and 4) minimize the use of googlebombs as a response to this sort of problem (assuming that Google agrees that googlebombing is detrimental to their goal of accurately reflecting the relevance of content). SC is not above telling people -- himself included -- to suck it up when they encounter speech they dislike. Therefore, his original thought was that Google ought to just run complaints by a lawyer or ethicist, and then perhaps engage in a little unannounced delinking of specific keywords here and there, as they clearly have done before. However, while such a solution would largely meet the tests laid out above, perhaps the policy needs to be more cut and dried in order to keep their PR department from turning into a permanent crisis center. So here's SC's stab at a formal policy: given that the intent of googlebombing is to either raise or lower a specific link in the rankings, and in this case, the goal is to lower it, simply lower the weight of the page ranking for a sufficiently hotly disputed page. Google has shown a willingness to allow legal opinions to influence their judgment; if their legal team judges that a page's appearance is clearly grounds for prosecution -- delete the reference in regard to the specific keyword as they did a la Chester, or the German version of Google (see the WSJ article for details). If the page's appearance is clearly not illegal, simply lock it out of being the first returned listing, which seems to spawn particular ire. Irritating pages may well continue to show up as result #2, but freed from the emotional baggage of seeing something offensive as #1, users might well recognize that searches for a particular term are likely to bring back a diverse group of pages, including ones they don't like. If that fails, announce that the experiment is over, and go back to a policy of strict neutrality, undoing all non-algorithmically-derived rankings in the database.
It's easy to foresee a string of stories coming out for an indefinite, but long, period of time, demonstrating that Google is even more involved in censorship than is already known. A policy like the above represents an effort to reconcile the reality of what they've already done with the goals of running a reasonably transparent operation and preserving their reputation. Is it ideal? Absolutely not. But they're the ones who chose to proclaim one policy before while observing another, so they might as well end the charade and reestablish some clear user expectations. Radagast thinks it might be realistic for them to simply fess up and go straight back to an uncensored database. I'm not sure they're in a position to do that, though, because they've tried to sell that story before. That's why your host thinks they might as well experiment with another policy -- if it keeps Google from being the target of further embarrassment, then they can call it a success and stick with it. If it fails, then they've got an irrefutable argument that they've tried to be responsive to complaints, and that the resuls are simply intolerable.
So we'll close with an observation from Southern California talk-show host Larry Elder:
Q. What is the Elvis Factor?
I once read that 10 percent of the American people think Elvis still alive, and 8 percent believe that if you send him a letter, he will answer it. That's the Elvis factor. You have to remember that, no matter what, 10 percent of the people are probably not capable of clear, rational thought.
As with Elvis, so with complaints. If a transparent policy of database editing proves unworkable, but the critics still won't let up, then at some point they just have to be ignored. Unfortunately, their own behavior has foreclosed that option for now. It's a shame that Google has put itself in the position of making these criticisms credible; no matter what policy they end up adopting (including sticking with the present one), they've done a possibly irreparable hatchet job on their reputation for being the most reliable, neutral guide to Web content available.
(Edited at 2:09 a.m. on 4/20/04 to include additional content.)