Learning About Full-text Search
An anonymous reader writes "Tim Bray who's known for XML and has been /.'ed once or twice for that kind of stuff, actually seems to be a search geek and has been writing this endless series of essays on search technology since summer. He says he's finished now - it's like a textbook on searching."
If Google can cache pages and put them online, so should Slashdot. People say copyright issues would be a problem, but in that case, why is Google's online cache any better?
Martin
From the google cache...
searching for words isn't really what you want to do. You'd like to search for ideas, for concepts, for solutions, for answers.That is why he is going in an XML direction. The relational approach is rectilinear and requires that the information be framed in a highly normalized fashion. Generalized semantic searching is highly non-normalized because, well, humans are highly non-normalized.
I think that he should look at some work by a different Tim, the Semantic Web.
And they considered incorrectly.
Try reading the articles/essays. Knuth's vol 3 is about comparison search, not full-text search.