Ask Slashdot: What Features Would You Like In a Search Engine?
New submitter nicolas.slusarenko writes Nowadays, there is one dominant search engine in the world among few alternatives. I have the impression that the majority of users think that it is the best possible service that could be made. I am sure that we could have a better search engine. During my spare time I been developing Trokam, an online search engine. I am building this service with the features that I would like to find in a service: respectful of user rights, ad-free, built upon open source software, and with auditable results. Well, those are mine. What features would you like in a search engine?
Next to working well, maybe the assurance that not all your search queries were logged and sold to third parties or used for advertisement?
Confidentiality
Search for what I type in, now what you think I want. I'm so sick of having to change every search to "verbatim" because my search terms are being ignored. I'd switch to someone else but they seem to be carbon copies.
What made Google so great when it was still relatively new was the results were more relevant, i.e. they weren't just a bunch of advertisements. With the rise SEO that is less the case now, and looking for something on Google for me now means adding "-buy -purchase -price -shop" automatically.
And stop providing results that fail to have ALL of the search terms.
Amen, brother. Similarly, I switched from Lycos a decade+ ago because they dropped Boolean searching (some of us are power users!). I used Yahoo! next, but it was painful on dial-up with all the extra junk on their home page. Then I came across this new, misspelled site called "Google". I loved it; but lately it has been wearing on me as it panders more and more to the masses.
Note to Google: We nerds might be in the minority, but it is WE who direct the non-nerds as to how to set up their digital devices, avoid online trouble, choose their search engines, etc. Don't ruin it for us. I already started to keep one eye open for another search place, because I fear it'll only get worse.
Make it easy for me to specify I am looking for technical information, or looking to buy something, or what have you. All too often I am trying to do a search for technical information, but if that acronym has also been used by Beiber lately I am SOL. I would love it is I could weed out the pop culture hits when I wanted to omit them.
Similarly I would like a search engine that I could easily specify if I also want hits for related words, or just EXACT match, and whether to ignore capitalization or not. It is maddening when an acronym also happens to be a common word and I get flooded with useless crap.
Search features I depend on:
* Non-English characters. Handle multiple encodings of web pages and URL-encoded characters in search queries.
* site: to search only within a domain. This is often a national domain, such as "site:co.uk" to search only British sites.
* Minus: Begin able to block certain words, or sites.
* Plus: A word prefixed with a plus is required.
* Quotes/hyphen: Searching for exact phrases. "Java class file" is different from "Java File class".
Where current search engines are lacking:
* If there is a period between the words then they do not belong to the same phrase. (A search for "Hello Google" should not return "Say Hello. Google for it." as its top result)
* Use word order in search query to weigh how important a search term is. Rank pages higher wihen those words are closer together.
* Don't correct my spelling by default, assuming that my search query is in US-English. (I am speaking to you Duck-Duck Goo!). I can spell, and I do not always write English. If I misspell then that is my mistake, and sometimes I search for a brand name that was misspelled intentionally.
* When indexing a web page, identify what is the important text on the page and ignore the rest. For instance, on an internet news site, the text in the articles is most important. On a forum text inside the comments. On this forum, articles followed by comments. What people have written in their signatures is not important. Slashboxes are not and ads are definitely not.
It is aggravating when you use Google on a collecting site and you get every other page on that site in every search result because members have listed their collections in their signatures.
If I search for the word "review", I don't want every page on every web store that has a Reviews tab.
Pages on a site often follow a certain pattern - find that pattern to find which text on each page that is the most unique.
"We mustn't be caught by surprise by our own advancing technology" -- Aldous Huxley
Think of Google's search as your type as 1-dimensional suggestion list. I'd like as I type to see around the search bar a matrix of categories: news, videos, documentation, blogs etc. Then as I hover over a category with a mouse I zoom into a matrix of subcategories for that category using the mouse wheel. I zoom out back one level if that's not the branch I'm thinking of.
In addition, I don't want to click until the very end, and maybe not even then. Hovering over a set of results shows me what's at the deeper level, and when I'm looking at a one or a handful of pages that match the criteria as I refine further, it is also shown as a cell. Hovering over it will give me a preview -- from the search engine, not my browser fetching an actual page. Only when I'm certain I want to go there, I'll click.
That would be a search engine of the future. Or, idea #2: make it like google, but when I control-clik on the link for the page it opens a sanitized copy of the page, provided by your server, so I know there are no scripts or malware and crap. And if possible give me that sanitized preview when I hover over the page so if I'm lucky I don't have to click on anything at all.
I know sites wouldn't like it but just saying what I'd like to see that I think is technically possible. Thanks for listening!