Friday, January 09, 2009

Finally, someone said it

What's behind this disparity? Word processors and search engines have different goals. The latter have to field queries as broad and varied as the Internet itself, so they need a very large vocabulary in order to differentiate spelling mistakes from legitimate search terms. Word processors are much more conservative, limiting their lexicons to words that are definitely legitimate. This way, a program like Word can catch virtually every typo, even if it means misidentifying some proper names and newer words. In other words, search engines put breadth first and spelling accuracy second, while word processors are the other way around. If you type in Monkees, Google will assume you're searching for the band; Word will give you a red squiggly line, thinking you've screwed up the word monkeys.
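To make the contrast concrete, here's a minimal sketch of the two philosophies in Python. The word lists, frequencies, and cutoff below are made up for illustration; they're not either product's actual dictionary or algorithm.

    # Word-processor style: a small, curated lexicon; anything outside it gets flagged.
    curated = {"monkeys", "band", "search", "the"}

    # Search-engine style: a broad lexicon mined from real-world text, with frequencies.
    seen_in_corpus = {"monkeys": 120_000, "monkees": 45_000, "band": 300_000}

    def word_processor_check(word):
        return "ok" if word.lower() in curated else "red squiggly line"

    def search_engine_check(word):
        # Accept any term the corpus has seen often enough to look legitimate.
        return "ok" if seen_in_corpus.get(word.lower(), 0) > 1_000 else "did you mean...?"

    print(word_processor_check("Monkees"))  # red squiggly line
    print(search_engine_check("Monkees"))   # ok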

Not surprisingly, search engines and word processors build their dictionaries differently. A search engine's lexicon is typically put together using words gathered from Web pages or old search queries—a huge corpus of real-world data that constitutes a list of valid words and their frequency in the language. Word-processing lexicons are more heavily chaperoned, and the pace at which new terms enter the dictionary is much slower.
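As a rough sketch of how such a frequency-based lexicon might be assembled, assuming a plain-text corpus of pages or old queries (the file name and the cutoff of five occurrences are assumptions for the sake of the example):

    import re
    from collections import Counter

    # Count how often each word turns up in a corpus of Web pages or old queries.
    counts = Counter()
    with open("corpus.txt", encoding="utf-8") as f:   # "corpus.txt" is a stand-in
        for line in f:
            counts.update(re.findall(r"[a-z']+", line.lower()))

    # Anything seen often enough is treated as a legitimate term; a word-processor
    # dictionary, by contrast, would be edited and expanded by hand, much more slowly.
    lexicon = {word: n for word, n in counts.items() if n >= 5}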
