New Search Engine Takes "Dyve" Into the Dark Web
CWmike writes "DeepDyve has launched its free search engine that can be used to access databases, scholarly journals, unstructured information and other data sources in the so-called 'Deep Web' or 'Dark Web,' where traditional search technologies don't work. The company partnered with owners of private technical publications, databases, scholarly publications and unstructured data to gain access to content overlooked by other engines. Google said earlier this month that it was adding the ability to search PDF documents. In April, Google said it was investigating how to index HTML forms such as drop-down boxes and select menus, another part of the Dark Web."
this will help me get more porn, how?
The company partnered with owners of private technical publications, databases, scholarly publications and unstructured data to gain access to content overlooked by other engines.
I know why the other engines don't index these documents: they're behind pay walls. As the second link points out, Google already indexes (some) PDFs, but that doesn't help if the site doesn't want me to see the PDF. There are lot of topics, such as disability rehabilitation and linguistics, that I can't search for without Google returning a bunch of results from sites that require a subscription but to which my county library doesn't subscribe. (A tip-off for these results is that "Cached" doesn't show up.)
This will certainly defeat the practice of obfuscating links with e-mail addresses in them, by using a picture link or "click here."
"Click here" still works: use a web form to send e-mail instead of disclosing an e-mail address that doesn't use a whitelist. And AFB has reported that a picture of an address doesn't work even for legitimate users of speech or braille browsers.
I don't know about you guys but I prefer not to have to sign up or use the "pro" version for my web searching needs.
In fact why do I have to sign up to web search anything?
Besides this thing looks like it just gets in your way.
Thanks, but it's not a google killer.
They're also looking into indexing images based on whether they contain boobies.
You mean like these boobies? What about these great tits? And would you tap that ass?
Just what I needed: 40 million NEW search results to sift through; I already have to deal with the first 5 pages being useful, followed by 60 pages of, let's say 'pokemon glitch' that is really someone's blog that has 500 words slapped on the bottom (nothing quite as useful as finding out a website that came up on the search says at the very bottom of the blog 'boobs anal pokemon glitch asians etc'
Good news is I can finally PAY to be annoyed.
It's apparently not working right now. But give it all your personal information now, and they will get back to you.
basically it's like cavity search for the internet.
Login? to search a "dark net".
You are fucking kidding?
I was right about Tesla crashing. I'll make another prediction.
Deap Dyve out of business in 1 year.
Cheers,
Kilgore Trout
P.S. : get the Cyrillic fonts enabled. Russia is invading the U.S.S.A. Finally !!!
And I need to login even if I want to search wikipedia???
Nice way to shoot yourself in the foot, guys.
At least you should offer a checkbox on the search page so that registered users get the payed content and anonymous users get what's out there for "free".
Help! I'm a slashdot refugee.
Consider that at least 75% of the browsing public never gets much deeper than a Google image search for "cute puppies," or some such nonsense.
Yes, but you have to realize that some of that 75% is going to want to see MORE cute puppies they couldn't with just google image search.
The summary is a bit misleading. Google has been indexing the textual parts of PDFs for a long time. According to the article they have now started indexing scans inside of PDF files, which requires OCR.
Google has been doing that for catalogs for a while now, but OCRing large numbers of scans obviously requires a lot more resources.
I'll wait for Google to assimilate DeepDyve before I'll check it out.