I believe some OCR packages do things like this, as do some search engines. However then ones we have tested have not been commercially viable.
Some require more OCR machines than variant-based search (lots more) to do the load we do. This would mean more space, bigger air con, lots more cash for little gain.
Some will not give information out in a way our current system can use, so we would have to rebuild large chuck of our system, or scape products and/or work flow procedures.
I work in the Media Monitoring industry. What we do is scan in newspapers (we have some 4000 publications), OCR them, throw 'em in a search engine and do some bloody complicated searches on that dataset before sending out hits.
roughly OCR a document and store keywords or snippets of text in metadata or an index?
Lots. You could be OK with GOCR and Apache Lucene if you do not require zoning (working out blocks of text and columns).
OCR is not good enough
Oh it is. You will need to add "variants" to your searches. E.g. if you are looking for Microsoft you would search for "M[i1]cr[o0]s[o0]ft". Some search engines can do this for you, others can say "max of two errors".
What formats allow an easy mix of image and text data (without formatting)?
XML (hehe). PDF can. Most systems would have the image as file somewhere on your file store, and the text in a database.
Unless I've miss read (or/. has miss reported) "VIA has pledged that atmospheric carbon released during generation of the power needed to run the chip throughout its expected life-cycle will be offset by regional conservation, reforestation, and energy programs initiated or contributed to by VIA." they mean "Carbon Neutral"
I'm getting 404s on the links above so I can not see the pictures, but I have a HTC Wizard, it has tiny keys but it is fine for typing on. I've written a couple of fair sized documents with out any major issues.
You have a shuffle mode that goes over your complete CD collection? How does that work? I'm envisaging it having a really bad text-to-speech chip and saying "Please Insert "Telytubbies" by "Telytubbies" now.
I have a HTC Wizard. Due to its sucky stylus holder I don't have a stylus. For phone features this is fine, press the green "phone" hardware button. Bash the screen as the number buttons are a fair size. Press the green "phone" hardware button again.
I was lucky, the device I wanted had sold out, and I had to buy the model up (The HTC Wizard, with the side out keyboard). I tried to do a few days using the handwriting recognition to see what the £50 had got me.
I was impressed with the recognition, I could scribble out text, and it would be 80-90% right. But the component selection was a real PIA.
Re:Wait... why does this make them evil?
on
Microsoft Sued Over WGA
·
· Score: 2, Interesting
I have no problem with it checking home on install. But daily dumps of unknown data (look through/. it looks be: User names on the computer, process lists, BIOS information). Why daily, can you magically turn your (legal at install) software into priate software?
Why user names? What good is that? Why process lists?
WTF?
How is your normal consumer going to do that?
I believe some OCR packages do things like this, as do some search engines. However then ones we have tested have not been commercially viable.
Some require more OCR machines than variant-based search (lots more) to do the load we do. This would mean more space, bigger air con, lots more cash for little gain.
Some will not give information out in a way our current system can use, so we would have to rebuild large chuck of our system, or scape products and/or work flow procedures.
Lots.
You could be OK with GOCR and Apache Lucene if you do not require zoning (working out blocks of text and columns).
Oh it is. You will need to add "variants" to your searches. E.g. if you are looking for Microsoft you would search for "M[i1]cr[o0]s[o0]ft". Some search engines can do this for you, others can say "max of two errors".
XML (hehe). PDF can. Most systems would have the image as file somewhere on your file store, and the text in a database.
Unless I've miss read (or /. has miss reported) "VIA has pledged that atmospheric carbon released during generation of the power needed to run the chip throughout its expected life-cycle will be offset by regional conservation, reforestation, and energy programs initiated or contributed to by VIA." they mean "Carbon Neutral"
They did not need new überenemies. They had a galaxy in disarray, and large criminal organization to play with.
Yeap.
The matrix uses FP math.
I was very tempted when I bought my last phone (Wizard), but fails the "trouser-pocket sized" for me. Which is a shame as I wanted 3G (well Skype).
Minimo, is that any good now? I tried it a few months back and it was usability slow?
But does not include a traditional mobile phone. Only the Skype software.
The problem is pocket-size + good sized screen. HTC Universal is close, but it just fails on "pocket sized" for me.
:read: :shrug:
Mmm, it looks like a cut down version of my HTC Wizard.
I can buy a PSP with that? Why not just release a UMD with Skype/other IM software?
I'm getting 404s on the links above so I can not see the pictures, but I have a HTC Wizard, it has tiny keys but it is fine for typing on. I've written a couple of fair sized documents with out any major issues.
k -9100-caught-naked/
http://www.engadget.com/2005/07/27/htc-wizard-qte
I really don't think many people care about home brew games.
Mushroom Mushroom!
You have a shuffle mode that goes over your complete CD collection? How does that work? I'm envisaging it having a really bad text-to-speech chip and saying "Please Insert "Telytubbies" by "Telytubbies" now.
I have a HTC Wizard. Due to its sucky stylus holder I don't have a stylus. For phone features this is fine, press the green "phone" hardware button. Bash the screen as the number buttons are a fair size. Press the green "phone" hardware button again.
It will be used by people at work, where they can access the GMail website, but not install IM clients.
But not used at all at home.
If you have an uber screen of gold-plated doom, why the fuck are you getting pirated stuff?
Web 2.0, like the Web but cool (in the same way that being a member of the Linux Club at school is cool).
It got me too, but this is a good read on ROM/RAM in the WM5 world
I was lucky, the device I wanted had sold out, and I had to buy the model up (The HTC Wizard, with the side out keyboard). I tried to do a few days using the handwriting recognition to see what the £50 had got me.
I was impressed with the recognition, I could scribble out text, and it would be 80-90% right. But the component selection was a real PIA.
I don't get that comment at all.
id go with once an update (month).
Still, does the background change?
I have no problem with it checking home on install. /. it looks be: User names on the computer, process lists, BIOS information).
But daily dumps of unknown data (look through
Why daily, can you magically turn your (legal at install) software into priate software?
Why user names? What good is that?
Why process lists?