See below from Virage's marketing material. I have used this before and it works. You run a video through and it spits out an xml file with time-stamps and metadata about each time stamp: What is being said, on-screen OCR, speaker changes, face recognition, etc. Simply use this metadata file in conjunction with a good pattern matching based search algorithm (Autonomy's IDOL) and you have a great solution for identifying video clips by similarity of the meta-data. QED
IANAP, but I don't think the energy is coming from the gravitational field between the earth and the moon. If it was, the moon would slowly be getting closer to the earth as we sucked up the juice. I would venture that the power is coming from the earth's rotational energy - ie: days will slowly become longer as we suck up more juice...
I think your a bit off-base here. I started off as a Solaris admin, and moved to programming and design.
Now I work in (oh, the horror) sales. That's right, and it's not so bad - I'm on your side, and excel at my job because of it. My company provides enterprise apps for big iron to banks and insurance companies. Someone mentioned that most software is done in house, and that it generally, well, sucks. He was right. Many vendors at the enterprise level develop very specialised software, and that's all they do. Not only is it better than the home-grown stuff, but it's much cheaper too. We sell our wares to hundreds and hundred of clients, so we can afford to spread the development costs accross them.
Most vendors who provide nich software, which, I think accounts for much more of the market than you are aware of, listen to thier customers and try to solve actual problems, simply because the number of customers they have is so few, so it is important to keep them happy. My company's entire development plan is based on user requests, and has been since the begining. In fact, that's the only way new features get into our code.
P2P has been 'adopted outside the Greek Elite'. Even my 65 year old crazy aunt Betty downloads clasical music from Kazza. It has been building for a very long time - even before we had the bandwith and processor power to do it with digital music. I used to trade Phish bootlegs tapes over usenet.
It needs to be analized in much more detail, but when it gets easier for people to pirate music, they will do it more. Human nature. The point is that it can't be good for the recording industry. They are just looking out for themselves, and who can blame them? There are going to be big changes in showbiz - and its already started to happen. They just want them to be in thier favor as much as possible.
They get people to pay because the content of the WSJ is specialized and targeted. The NYT has a much broader audience and, in general, the readers will not be using the information contained therein to make a profit directly. Not quite the case with the WSJ.
If everyone was like you, people who really needed help with issues that were *not* in the FM would never get it. Forums like usenet, irc, and even ask/. are there to discuss novel issues, not to re-hash old topics over and over. People who ask questions before they RTFM are a detriment to the online community. We are *all* fortunate that people put with the likes of you.
When you *do* RTFM, not only do you get better at manual reading, but you also pick up additional information, so you may be less clueless the next time you have a problem
How about strapping mini-GPSes to ally cats, or maybe even cockroaches? Better yet, politicians. Could get interesting.....
Anyhow, as far as I recall, and my memory of Amsterdam is a little hazy (forget why, though), there *are* no subways. The public transport is electric streetcars that kind of look like subways trains, but are much shorter and do not go under the ground. Warning: these things are actually very dangerous as they basically run on the sidewalks, have a tendency to sneak up on you, and need considerable room to stop. Be careful.
If they use a CRC, it could be difficult to get something to the same checksum. Even if it's only a 32 bit CRC, there are a lot of numbers between 0 and 2^32 - especially when they are the result of some unknown hash function.
That's not to say it couldn't be done - the idea is akin to the 'The Club'(TM).....
Bandwidth to graphics cards has been a giant bottleneck up until a few years ago. Back in the ISA days it didn't matter how many pixels your GPU could crunch - the problem was getting the info to it. Either the bus was too slow or the CPU couldn't generate it fast enough. Now that so much functionality has been offloaded to graphics processors and they can be fed information fast enough, it make sense to have processing done there. A new-fangeld GPU has about as many transistors as a newfangeld CPU - why shouldn't they in the same ballpark when it comes to price?
Re:Abby FineReader...
on
Accurate OCR?
·
· Score: 2, Informative
I'd have to agree. I work for a document management software company and we sometimes work with a third party company called Kofax. They provide scanning and OCR. It just so happens that they license their OCR engine from the same people who make Omnipage (scansoft?). We have some clients that are using that engine to scan and OCR 100,000 documents a day. While people do report problems, it works very well for the most part, and there will always be some problems with ocr - at least for the foreseeable future.
I used to use HID (High Intensity Discharge) lamps for growing, uh, tomatoes, in college. I can say that after doing LOTS of research, these lights have the lowest heat/lumens ratio you will find unless you get into some really exotic / expensive stuff.
Advantages of HID lights are: You can mount the ballast remotely - this help because the ballast generates almost as much heat as the bulb. The bulbs are relatively small, though certainly not as small as a halogen. HID's make lots of light for the amount of power they draw - ie: a 400W HPS (High pressure sodium makes 50,000 lumens, while a 100W incandescent makes about 2,000 lumens - that's 125 watt/Watt for the HPS and 20 lumens / Watt for the incandescent).
The three popular types of HID lamps are metal halide, high-pressure sodium, and mercury vapor.
Only drawback here is that the color temp stinks. You will have trouble finding bulbs that do better than 2500k. You might want to look into something called Sun-Argo, which is a HPS / MH hybrid kind of thing. You also might want to look into something called compact fluorescents - these provide a better color temp, but unfortunately, they are florescent, so they are kinda big.
See below from Virage's marketing material. I have used this before and it works. You run a video through and it spits out an xml file with time-stamps and metadata about each time stamp: What is being said, on-screen OCR, speaker changes, face recognition, etc. Simply use this metadata file in conjunction with a good pattern matching based search algorithm (Autonomy's IDOL) and you have a great solution for identifying video clips by similarity of the meta-data. QED
.
Virage VideoLoggerTM
Index and Automate Rich Media With Unparalleled Ease
The award-winning Virage VideoLogger(TM) is the keystone product in Virage's
product offering, forming a critical component in any Enterprise Rich Media
strategy. Virage VideoLogger is the critical first step in any video production
or distribution strategy, scaling to match any environment with an
unparalleled ease of use. Performing multipletasks in real-time, VideoLogger
cuts production costs and transforms video into a highly accessible asset.
As video is being encoded, VideoLogger's advanced capture and analysis
technology works in real time to automatically create a structured index of
the content. The index enables immediate, accurate search and retrieval
of assets. By making video easy to identify, locate and re-use, VideoLogger
allows pieces to be assembled and repurposed faster and with greater
accuracy than ever before.
Virage VideoLogger is currently enabling efficient Enterprise Rich Media management at the UK Department of Trade and
Industry, Italiane Poste, the Canadian National Archive, Case Western University, Cisco and the Library of Congress to name
a few.
Index Video and Other Time coded Information in Real Time
VideoLogger uses advanced technology to form a conceptual understanding
of an analog or digital video signal and to create a structured video index.
VideoLogger automatically extracts information from the signal including
visual storyboard, closed captioning and teletext. Used in combination with
Virage Media Analysis Plug-Ins, it can additionally recognize faces, voices
and types of sounds in the video, identify on-screen text and numbers, and
convert spoken words to text.
VideoLogger can also extract information from external, time coded sources
such as statistical feeds, slide presentations, EDLs, etc. VideoLogger
allows users to further enrich the video database by adding manual
annotations, such as in/out points, titles, descriptions and much more.
Time-synchronized to each encoded copy and to the original content, the
information-rich database provides searchable, pinpoint access to the video.
Once indexed, VideoLogger can be configured to automatically send an email
message to designated persons as an instant alert to the existence of
specified information.
Create Multiple Encodes and Bitrates, Ingest Multiple Formats
Simultaneously during the indexing process, VideoLogger controls the encoding of multiple formats of source content at multiple
bit rates. For example, you can create multiple copies for browsing or publishing to the Web (e.g. QuickTime®, RealVideo®,
Windows Media®, MPEG-1) and, in parallel, for generating program-quality content (e.g. MPEG-2). In addition to
being able to ingest analog video, VideoLogger also supports real-time ingest of a wide variety of digital
video formats, such as those listed above.
Virage VideoLogger delivers advanced capture and analysis for
real-time indexing, processing and managing of content
Copyright © 2006 Virage. All rights reserved. Other trademarks are registered trademarks and the properties of their respective owners.
[VIR DS] 10.08.06 Product specifications and features are subject to change without notice. Use of Virage software is under license.
Virage Inc. (USA)
One Market, Spear Tower, 19th Floor,
San Francisco, CA 94105, USA
Tel: +1 415 243 9955
Fax: +1 415 243 9984
Email: info@us.autonomy.com
Virage Inc. (UK)
C
IANAP, but I don't think the energy is coming from the gravitational field between the earth and the moon. If it was, the moon would slowly be getting closer to the earth as we sucked up the juice. I would venture that the power is coming from the earth's rotational energy - ie: days will slowly become longer as we suck up more juice...
BP
I think your a bit off-base here. I started off as a Solaris admin, and moved to programming and design.
Now I work in (oh, the horror) sales. That's right, and it's not so bad - I'm on your side, and excel at my job because of it. My company provides enterprise apps for big iron to banks and insurance companies. Someone mentioned that most software is done in house, and that it generally, well, sucks. He was right. Many vendors at the enterprise level develop very specialised software, and that's all they do. Not only is it better than the home-grown stuff, but it's much cheaper too. We sell our wares to hundreds and hundred of clients, so we can afford to spread the development costs accross them.
Most vendors who provide nich software, which, I think accounts for much more of the market than you are aware of, listen to thier customers and try to solve actual problems, simply because the number of customers they have is so few, so it is important to keep them happy. My company's entire development plan is based on user requests, and has been since the begining. In fact, that's the only way new features get into our code.
bp
P2P has been 'adopted outside the Greek Elite'. Even my 65 year old crazy aunt Betty downloads clasical music from Kazza. It has been building for a very long time - even before we had the bandwith and processor power to do it with digital music. I used to trade Phish bootlegs tapes over usenet.
It needs to be analized in much more detail, but when it gets easier for people to pirate music, they will do it more. Human nature. The point is that it can't be good for the recording industry. They are just looking out for themselves, and who can blame them? There are going to be big changes in showbiz - and its already started to happen. They just want them to be in thier favor as much as possible.
--bp
They get people to pay because the content of the WSJ is specialized and targeted. The NYT has a much broader audience and, in general, the readers will not be using the information contained therein to make a profit directly. Not quite the case with the WSJ.
Your right, I had *meant* to flame (bit harsh of a word I think) Lord Bitman, but I got sloppy with the mouse. My apologies.
If everyone was like you, people who really needed help with issues that were *not* in the FM would never get it. Forums like usenet, irc, and even ask /. are there to discuss novel issues, not to re-hash old topics over and over. People who ask questions before they RTFM are a detriment to the online community. We are *all* fortunate that people put with the likes of you.
When you *do* RTFM, not only do you get better at manual reading, but you also pick up additional information, so you may be less clueless the next time you have a problem
-BP
How about strapping mini-GPSes to ally cats, or maybe even cockroaches? Better yet, politicians. Could get interesting.....
Anyhow, as far as I recall, and my memory of Amsterdam is a little hazy (forget why, though), there *are* no subways. The public transport is electric streetcars that kind of look like subways trains, but are much shorter and do not go under the ground. Warning: these things are actually very dangerous as they basically run on the sidewalks, have a tendency to sneak up on you, and need considerable room to stop. Be careful.
Dan
I hear you. It is also interesting that sexyKO only posts on the 11th of each month......
Easier said than done.
If they use a CRC, it could be difficult to get something to the same checksum. Even if it's only a 32 bit CRC, there are a lot of numbers between 0 and 2^32 - especially when they are the result of some unknown hash function.
That's not to say it couldn't be done - the idea is akin to the 'The Club'(TM).....
Bandwidth to graphics cards has been a giant bottleneck up until a few years ago. Back in the ISA days it didn't matter how many pixels your GPU could crunch - the problem was getting the info to it. Either the bus was too slow or the CPU couldn't generate it fast enough. Now that so much functionality has been offloaded to graphics processors and they can be fed information fast enough, it make sense to have processing done there. A new-fangeld GPU has about as many transistors as a newfangeld CPU - why shouldn't they in the same ballpark when it comes to price?
I'd have to agree. I work for a document management software company and we sometimes work with a third party company called Kofax. They provide scanning and OCR. It just so happens that they license their OCR engine from the same people who make Omnipage (scansoft?). We have some clients that are using that engine to scan and OCR 100,000 documents a day. While people do report problems, it works very well for the most part, and there will always be some problems with ocr - at least for the foreseeable future.
Dan
I used to use HID (High Intensity Discharge) lamps for growing, uh, tomatoes, in college. I can say that after doing LOTS of research, these lights have the lowest heat/lumens ratio you will find unless you get into some really exotic / expensive stuff.
Advantages of HID lights are: You can mount the ballast remotely - this help because the ballast generates almost as much heat as the bulb. The bulbs are relatively small, though certainly not as small as a halogen. HID's make lots of light for the amount of power they draw - ie: a 400W HPS (High pressure sodium makes 50,000 lumens, while a 100W incandescent makes about 2,000 lumens - that's 125 watt/Watt for the HPS and 20 lumens / Watt for the incandescent).
The three popular types of HID lamps are metal halide, high-pressure sodium, and mercury vapor.
Only drawback here is that the color temp stinks. You will have trouble finding bulbs that do better than 2500k. You might want to look into something called Sun-Argo, which is a HPS / MH hybrid kind of thing. You also might want to look into something called compact fluorescents - these provide a better color temp, but unfortunately, they are florescent, so they are kinda big.
Good Luck.