I read a few of the summaries and they aren't bad at all. Much better than other attempts in the past. Here is a description of one of their projects. Now I don't know if TIDES is used in newsblaster, but it's still interesting.
In the TIDES project, we will develop a practical, multilingual and multidocument information tracking and summarization system. Our design features the integration of robust, statistical techniques, shallow linguistic approaches and machine learning to achieve scalability within languages and portability across languages. To realize these goals, we will develop methods for summarization across documents using information fusion and identification of key differences, summarization across languages relying on identification and translation of terms, and new methods for identification, expansion and translation of terms. Unlike most other approaches, rather than relying on sentence extraction, our work uses information fusion of similar information, merging together repetitive phrases into a single phrase allowing dramatic reduction of information across many articles. Our work will focus on characterizing types of differences to include in a summary, which is an unexplored direction in multi-document summarization. We will develop difference operators to identify new information, contradictions, trends, multiple perspectives, and different topics. Our approach will minimize reliance on full machine translation, instead using identification, expansion and translation of terms where possible. We will begin work with a language such as Spanish, but quickly expand to include Asian languages and other non Indo-European languages.
From the look of it, NLP (natural language parsing) seems to be evolving nicely. It used to be that NLP required processing the entire document and understanding the sentences by mapping heirarchies of valence/word order.
I did some research into this for a pet project of my own. I wanted to write an application to crawl the web and get information. After a couple months of research, I realized how big of a problem it is.
the application needs to be able to determine the relevance of the provided text
to do so, it needs to determine the relative importance of the sentences and words
it has to be able to compose new sentence to write a summary
not all documents follow good structure or grammer
how do you account for spelling/grammar mistakes
From my research, there appears to be two primary methods of performing this kind of processing:
natural language parsing
statistical parsing
Of the two, statistical parsing is more popular these days because it doesn't require knowledgebase, expert system shells, grammar modeling and extensive dictionary. One of the primary method of determining the relative importance of words in a sentence is valence. The main challenge with natural language parsing and statistical technique is it depends on the training dataset. The more specific the dataset is, the better it will perform.
Statistical analysis can also use expert system shells and other AI technologies to improve accuracy, but it doesn't have to.
From my understanding (which is limited), it stems from a principle from linguistics. By counting the frequency of words or more specifically nouns, the program is able to rate each nouns importance. Once it got done, it could then look at the sentence that best describes the document by doing a comparison between the most importance words and the appearance of those words in the sentences. I remember this from my literature and linguistics classes. Congnitive science has also attempted to solve this problem, but it is very difficult.
In either case, if you dealing with well structured documents, your best bet is to grab the first 3 paragraphs assuming the author followed standard thesis/essay structure. If you're planning on summarizing new articles, it might not be that hard if the author followed the inverted pyramid, which many do not. One of the big tools of natural language parsing in the early days was prolog. It is still used a lot in academic settings for natural language processing. You're best bet is to get an intern to read and summarize for yo
what is even scarier is one of the pictures shows the guy with his kid on it. The steel doesn't look too durable to me, but then kind of hard to tell how much force the rails can support. He's got guts. Maybe he should change jobs and go work for a rollercoaster company.
You're comparing the expensive 9110 to 3410 and 7210. The detail spec page for both the 3410 and 7210 don't list memory expansion or the built in memory. Whereas the 9110 and 9210 explicitly say it supports memory cards. From my experience with CDMA chipsets and refernce boards, before 2000 the reference boards from Qualcomm only had 2 megs total including RAM and ROM. The newer reference board for CDMA came out in 2000 with 4 or 8 megs. I don't know about you, but I wouldn't pay more than 150 for a high end phone.
GSM and TDMA are considerably different, but two of the most expensive parts of a cell phone today is the screen and memory. You can't put 16megs of RAM on a phone and have it cost less than 200.
I would have to agree with the parent post. CBR is a strong candidate, if you really want to go with expert system shell. Expert systems are really challenging to build, especially if the domain is too broad or poorly defined. A general purpose tool like the one described by the original post is going to be alot harder to build than one would think.
To build a useful system like the one described in the original post, you're really talking about a couple separate domains:
programming - specific to the programming language, if multiple languages are used on the project
application domain - specific to the features, ui and configuration
user - specific to the type of user using it
Within each domain, it can be broken down to sub domains to make it more manageable. The development schedule for expert system deployment are typically 1-3 yrs depending on the project scope.
It looks like the models are not for US. I don't know how much memory are in those phones, but I'm guessing it's less than 4megs. After you subtract the memory used by all the other applications, it might only be 600K. It should be fairly easy to write a real time mapping application that uses 60-80K of memory assuming the phone can calculate it's location based on the cell.
I never claimed to be smart. Smartass yes! I hardly think he discovered anything. More like stumbled across the atlantic. Had columbus been a good astronomer and studied the motions of the night sky, he would have realized the circular motion and been able to calculate the radius of the earth. But that would have been scientific, which was not his strength. The Aztec were already capable of high precision astronomy according to some researchers. Several shows on pyramid builders have proven it mathematically. He was brave. He had a lot more courage than I do, so he should be celebrated for trying to cross an ocean without a clue and insufficient supplies. Then again, what do I know?
Well a guitar and car are very different types of purchases:) Having a bad guitar won't get me stranded in the middle of no where w/o a tow truck. I'm not that cheap, well not on important purchases.
If I'm reading you correctly, which I may not, the logic is the following:
1. Europeans believed the world was flat
2. Columbus believed he could get to india sailing acrss the atlantic ocean
3. Columbus wanted to prove Europeans are stupid
4. Columbus wanted to get rich
5. Columbus thought america was india
6. because Europeans and Columbus were both wrong and stupid, he is credited with discovering America. Even though the name was coined by Amerigo.
If that is a correct interpretation, then I ask "why are we arguing over who is more stupid?"
NOT! I love the original starwars series. Don't care for Episode I: Phantom pick-pocket. When I buy guitars (it's been a while), it has to be either cheap (100 or less) or sound amazing like an Epiphone, Paul Reed Smith, Martin accoustic, guild accoustic or have a rich sound.
No self respecting guitarist (who can jam), would buy a guitar for some stupid starwars pictures. Not some one like Reeves who used to play with a vibrator, Jimmy Page who was the first to use a bow, Steve and his 7 string guitar, Hendrix who revolutionized distortion, Brian May of Queen, robert johnson, Clapton during cream days, Al Di Meola's jazz guitar and john lee hooker. There's plenty of other great guitarists who wouldn't buy these starwars guitars. It will probably be a gift to some kid who is just learning to play.
that's a difficult question which I have no answer. It depends on a lot of things right. Normally marketing will spin a movie one way purposely to get people in the theater to see it. If that same movie is pirated and available on the net, would it affect its ability to get a audience?
I have no clue. again if it's a good movie, it will probably do well. if it's crap, well you know. as much as I would like all the movies to be great or good, I doubt it will happen. but it will mean the consumer will know sooner than later a movie is crap:). Bad for movie companies, because there's always going to be a percentage of crap movies. The good movies still have to make up for the movies that suck. I don't know what those numbers are, but maybe some one else knows.
I've been thinking about this and the problem is pretty complicated.
1. not all movies are block busters that people watch more than once and buy the dvd/vhs.
2. nitch movies like foriegn or art films may not make as much money in theaters. Most big theaters no longer play art films, unless they are produced and directed by famous people.
3. pirated version of "so-so" movies will have a harder time breaking even. Why spend 10+ bucks for a movie with no production value, which barely keeps you interested?
4. pirated version of popular or great movies tend to see a benefit.
5. pirating may affect movie budgets negatively and force movie makers to do more with less money.
6. pirating of movies before they are released to the public may kill any chance of it making money, let alone profit. Crap movies will be affected the most by this.
7. Pirating DVD disk image may become a bigger issue in the future, but for the most part it's professional pirating by organized criminals that are the biggest problem.
Just my opinion, but I think the movie execs just don't understand it and realize they need to change how they do things. In a lot of ways, art and foriegn films could see an increase in popularity if video on demand becomes reality. Someone might not spend 7.00 for a ticket, 3.00 for popcorn, 2.00 for a drink and 20 minutes to drive to the theater for an art film, but they might spend 3 bucks to see it at home. There are a lot of ways for the movie industry to re-invent itself and make more money. Now if only they would "think" instead of react, they could really see a whole new world of cinema.
I like watching short movies on the net, when they are good. I wouldn't spend 7 bucks on a questionable movie, but I would risk 1-2 bucks. As more people master the art of making short movies, the market will grow. Especially if hollywood continues to crank out formulaic junk.
I heard about this product from motorola back in 2000 doing research in sms and other wireless data. There were very few details back then, since it wasn't ready yet. I guess now I can finally see what they meant when they said there would be "a better, more advanced model with better wireless connection."
Having used both RIM blackberry and timeport product, blackberry's interface feels much more intuitive and user friendly. though client/server apps should be easier with Accompoli. The keyboard on the accompoli looks very similar to the timeport product, so typing will still be difficult. Also the form factor of the timeport was a bit bulky and not very good if someone wants a device they want to put in their pocket. RIM blackberry's slim profile was better than timeport, though the balance between screen size and form factor is a tough decision.
I'm so glad my hard earned tax dollars are hard at work. Otherwise I'd might do something useful with it like donate books or PC's to a needy school. Instead it goes to a really bright judge that spends months and months to come to the conclusion video games causes violence.
It's good 1/3 of my income goes to pay for the judges drinks, golf membership and vacations.
Cnet is also reporting as other mentioned. Here is an interesting blurb in Allchin's deposition:
On the audio and video front, Houck questioned Allchin about whether consumers can obtain the most recent version of Media Player without buying XP and whether PC makers can remove the "software code" for Media Player from the OS.
Wow that is impressive. I seriously doubt that is true, since I've performed window-ectomy of one sort or another since win95. What ever happened to Microsoft's claim their software is modular and scalable? If one were to play along and say that statement is true, a logical conclusion would be "what a piece of junk." That would be like Ford saying, "You can't put a different radio in the car, because it would break the whole electrical system. You have to buy a ford approved radio for your mustang."
The one useful thing I get out of all this.NET virus article and dicussion is power comes at a price. Allowing developers write C/C++ within a.NET application provides a lot of flexibility, but it creates new ways to exploit a system. Perhaps the most interesting fact to me is the big difference in approach between MS and Unix camp. It makes me think of an analogy:
lojack is to unix as an idling car in south central LA is to microsoft
Some might say "making things easy encourages mistakes." If any two bit script kiddie can jump in and write a powerful virus, than I would argue for making it harder to write code. It's not bad to make software engineers and developers stop to think carefully about how they are doing things. Maybe then management won't be as tempted to set unreal development schedules, thereby increasing the time for QA and producing higher quality applications. Using a tool that promotes itself as "super fast and easy" will only give management more reasons to shorten development cycles and make more bad code faster:)
Now think about how Disney owns all the hardware that pixar is using to make it's movies, and is the one risking that pixar's movies will bomb
Wow, I'm glad that you know so much about pixar. So I guess the guys I know who work for pixar are mis-informed. I don't know where you get your information from, but disney doesn't own all the hardware. On the contrary, pixar acts like a division. Here is an old article in businessweek feb 24, 1997.
On Monday, Feb. 24, Eisner and Pixar founder Steven Jobs announced a $15 million deal in which Disney bought a 2.5% stake in Pixar Animation Studios, the computer animation shop that made 1995's hit Toy Story. Disney also gets warrants to buy another 2.5% of Pixar if it chooses.
Pixar doesn't need disney. Pixar had tons of studios wanting to partner with them, so the idea pixar will die without disney is ludicruous.
Why in the world is miyazaki going with disney again. Disney has absolutely no understanding of japanese anime and only wants to do crappy releases to discredit it. IMNSHO, disney needs to get out of the animation business. Pixar doesn't need disney. Everything from disney the last 10 years has sucked (I don't consider pixar's movie disney produced). Here's to hoping michael isner is fired and some one with an understanding of movies takes his place.
You're right it says that once I read through the html version of the paper, but that's no excuse for not labeling the charts. If the guy plans on submitting it to a peer review journal, he'd better have captions. Even though he makes those statements, it's still not totally clear how the charts relate to data/findings. It's the responsibility of the writer to make it easy to read and understand.
hasn't ESR noticed MS is diversifying it's business into games, tv and enterprise services? windows sales might become less important, but it's far from going away. get real.
With all these old, greedy idiots running the show, I get the feeling it's now cool to loathe the consumer and treat consumers as hostile enemies. A sane person would realize common sense says, "people buy things they find valuable." On one side of the coin, who the hell is going to buy completely new radio, tv, microphone, dvd, laserdisk, tape deck and vhs just so they can throw away perfectly good machines w/o that garbage? If it does get passed, it may be a great boon for musicians and artists. If it ever came to that, I'd stop buying new entertainment gizmos and media. Instead I'd go see a real artists, plays, or concerts. Once enough people choose to ignore the "protected" gizmo's they loose all monitary incentive to support or push digital content protection. Instead of making billions a year, they'll make a tiny fraction. Anyone thinks the hardware manufacturers will switch gears 180 degrees overnight is smoking a cocaine+herione cocktail. Look at the stupid Divx players from circuit city. Once they realized subsidizing a bad product was costing hundreds of millions every year, they stopped that non sense.
It's not possible to get rid of all the old VHS, DVD, casset and record players. It's high time the consumers teach the corporations a big lesson about "the customer is right." The CEO's running these large media companies have decided to check their brains at the door. Perhaps the share holders should kick those CEO's to the curb, or file a class action suit against the CEO's to pull their collective heads out of their asses.
In the TIDES project, we will develop a practical, multilingual and multidocument information tracking and summarization system. Our design features the integration of robust, statistical techniques, shallow linguistic approaches and machine learning to achieve scalability within languages and portability across languages. To realize these goals, we will develop methods for summarization across documents using information fusion and identification of key differences, summarization across languages relying on identification and translation of terms, and new methods for identification, expansion and translation of terms. Unlike most other approaches, rather than relying on sentence extraction, our work uses information fusion of similar information, merging together repetitive phrases into a single phrase allowing dramatic reduction of information across many articles. Our work will focus on characterizing types of differences to include in a summary, which is an unexplored direction in multi-document summarization. We will develop difference operators to identify new information, contradictions, trends, multiple perspectives, and different topics. Our approach will minimize reliance on full machine translation, instead using identification, expansion and translation of terms where possible. We will begin work with a language such as Spanish, but quickly expand to include Asian languages and other non Indo-European languages.
From the look of it, NLP (natural language parsing) seems to be evolving nicely. It used to be that NLP required processing the entire document and understanding the sentences by mapping heirarchies of valence/word order.
From my research, there appears to be two primary methods of performing this kind of processing:
Of the two, statistical parsing is more popular these days because it doesn't require knowledgebase, expert system shells, grammar modeling and extensive dictionary. One of the primary method of determining the relative importance of words in a sentence is valence. The main challenge with natural language parsing and statistical technique is it depends on the training dataset. The more specific the dataset is, the better it will perform.
Statistical analysis can also use expert system shells and other AI technologies to improve accuracy, but it doesn't have to.
From my understanding (which is limited), it stems from a principle from linguistics. By counting the frequency of words or more specifically nouns, the program is able to rate each nouns importance. Once it got done, it could then look at the sentence that best describes the document by doing a comparison between the most importance words and the appearance of those words in the sentences. I remember this from my literature and linguistics classes. Congnitive science has also attempted to solve this problem, but it is very difficult.
In either case, if you dealing with well structured documents, your best bet is to grab the first 3 paragraphs assuming the author followed standard thesis/essay structure. If you're planning on summarizing new articles, it might not be that hard if the author followed the inverted pyramid, which many do not. One of the big tools of natural language parsing in the early days was prolog. It is still used a lot in academic settings for natural language processing. You're best bet is to get an intern to read and summarize for yo
oops posted too soon. on page 5 the guy says they didn't send the kid up.
what is even scarier is one of the pictures shows the guy with his kid on it. The steel doesn't look too durable to me, but then kind of hard to tell how much force the rails can support. He's got guts. Maybe he should change jobs and go work for a rollercoaster company.
Well you wouldn't need more than 80K for one game or application, but what if you want to put 10 apps on it that use j2me? 10 * 80 = 800K :)
GSM and TDMA are considerably different, but two of the most expensive parts of a cell phone today is the screen and memory. You can't put 16megs of RAM on a phone and have it cost less than 200.
To build a useful system like the one described in the original post, you're really talking about a couple separate domains:
- programming - specific to the programming language, if multiple languages are used on the project
- application domain - specific to the features, ui and configuration
- user - specific to the type of user using it
Within each domain, it can be broken down to sub domains to make it more manageable. The development schedule for expert system deployment are typically 1-3 yrs depending on the project scope.It looks like the models are not for US. I don't know how much memory are in those phones, but I'm guessing it's less than 4megs. After you subtract the memory used by all the other applications, it might only be 600K. It should be fairly easy to write a real time mapping application that uses 60-80K of memory assuming the phone can calculate it's location based on the cell.
It good looking and has a camera or what looks like a camera. Sony sure knows industrial design.
I never claimed to be smart. Smartass yes! I hardly think he discovered anything. More like stumbled across the atlantic. Had columbus been a good astronomer and studied the motions of the night sky, he would have realized the circular motion and been able to calculate the radius of the earth. But that would have been scientific, which was not his strength. The Aztec were already capable of high precision astronomy according to some researchers. Several shows on pyramid builders have proven it mathematically. He was brave. He had a lot more courage than I do, so he should be celebrated for trying to cross an ocean without a clue and insufficient supplies. Then again, what do I know?
Well a guitar and car are very different types of purchases :) Having a bad guitar won't get me stranded in the middle of no where w/o a tow truck. I'm not that cheap, well not on important purchases.
1. Europeans believed the world was flat
2. Columbus believed he could get to india sailing acrss the atlantic ocean
3. Columbus wanted to prove Europeans are stupid
4. Columbus wanted to get rich
5. Columbus thought america was india
6. because Europeans and Columbus were both wrong and stupid, he is credited with discovering America. Even though the name was coined by Amerigo.
If that is a correct interpretation, then I ask "why are we arguing over who is more stupid?"
No self respecting guitarist (who can jam), would buy a guitar for some stupid starwars pictures. Not some one like Reeves who used to play with a vibrator, Jimmy Page who was the first to use a bow, Steve and his 7 string guitar, Hendrix who revolutionized distortion, Brian May of Queen, robert johnson, Clapton during cream days, Al Di Meola's jazz guitar and john lee hooker. There's plenty of other great guitarists who wouldn't buy these starwars guitars. It will probably be a gift to some kid who is just learning to play.
I have no clue. again if it's a good movie, it will probably do well. if it's crap, well you know. as much as I would like all the movies to be great or good, I doubt it will happen. but it will mean the consumer will know sooner than later a movie is crap :). Bad for movie companies, because there's always going to be a percentage of crap movies. The good movies still have to make up for the movies that suck. I don't know what those numbers are, but maybe some one else knows.
1. not all movies are block busters that people watch more than once and buy the dvd/vhs.
2. nitch movies like foriegn or art films may not make as much money in theaters. Most big theaters no longer play art films, unless they are produced and directed by famous people.
3. pirated version of "so-so" movies will have a harder time breaking even. Why spend 10+ bucks for a movie with no production value, which barely keeps you interested?
4. pirated version of popular or great movies tend to see a benefit.
5. pirating may affect movie budgets negatively and force movie makers to do more with less money.
6. pirating of movies before they are released to the public may kill any chance of it making money, let alone profit. Crap movies will be affected the most by this.
7. Pirating DVD disk image may become a bigger issue in the future, but for the most part it's professional pirating by organized criminals that are the biggest problem.
Just my opinion, but I think the movie execs just don't understand it and realize they need to change how they do things. In a lot of ways, art and foriegn films could see an increase in popularity if video on demand becomes reality. Someone might not spend 7.00 for a ticket, 3.00 for popcorn, 2.00 for a drink and 20 minutes to drive to the theater for an art film, but they might spend 3 bucks to see it at home. There are a lot of ways for the movie industry to re-invent itself and make more money. Now if only they would "think" instead of react, they could really see a whole new world of cinema.
I like watching short movies on the net, when they are good. I wouldn't spend 7 bucks on a questionable movie, but I would risk 1-2 bucks. As more people master the art of making short movies, the market will grow. Especially if hollywood continues to crank out formulaic junk.
Having used both RIM blackberry and timeport product, blackberry's interface feels much more intuitive and user friendly. though client/server apps should be easier with Accompoli. The keyboard on the accompoli looks very similar to the timeport product, so typing will still be difficult. Also the form factor of the timeport was a bit bulky and not very good if someone wants a device they want to put in their pocket. RIM blackberry's slim profile was better than timeport, though the balance between screen size and form factor is a tough decision.
It's good 1/3 of my income goes to pay for the judges drinks, golf membership and vacations.
On the audio and video front, Houck questioned Allchin about whether consumers can obtain the most recent version of Media Player without buying XP and whether PC makers can remove the "software code" for Media Player from the OS.
Wow that is impressive. I seriously doubt that is true, since I've performed window-ectomy of one sort or another since win95. What ever happened to Microsoft's claim their software is modular and scalable? If one were to play along and say that statement is true, a logical conclusion would be "what a piece of junk." That would be like Ford saying, "You can't put a different radio in the car, because it would break the whole electrical system. You have to buy a ford approved radio for your mustang."
Anyone think Allchin is lying through his teeth :)
lojack is to unix as an idling car in south central LA is to microsoft
Some might say "making things easy encourages mistakes." If any two bit script kiddie can jump in and write a powerful virus, than I would argue for making it harder to write code. It's not bad to make software engineers and developers stop to think carefully about how they are doing things. Maybe then management won't be as tempted to set unreal development schedules, thereby increasing the time for QA and producing higher quality applications. Using a tool that promotes itself as "super fast and easy" will only give management more reasons to shorten development cycles and make more bad code faster :)
Wow, I'm glad that you know so much about pixar. So I guess the guys I know who work for pixar are mis-informed. I don't know where you get your information from, but disney doesn't own all the hardware. On the contrary, pixar acts like a division. Here is an old article in businessweek feb 24, 1997.
On Monday, Feb. 24, Eisner and Pixar founder Steven Jobs announced a $15 million deal in which Disney bought a 2.5% stake in Pixar Animation Studios, the computer animation shop that made 1995's hit Toy Story. Disney also gets warrants to buy another 2.5% of Pixar if it chooses.
Pixar doesn't need disney. Pixar had tons of studios wanting to partner with them, so the idea pixar will die without disney is ludicruous.
Why in the world is miyazaki going with disney again. Disney has absolutely no understanding of japanese anime and only wants to do crappy releases to discredit it. IMNSHO, disney needs to get out of the animation business. Pixar doesn't need disney. Everything from disney the last 10 years has sucked (I don't consider pixar's movie disney produced). Here's to hoping michael isner is fired and some one with an understanding of movies takes his place.
Mod up the parent.
You're right it says that once I read through the html version of the paper, but that's no excuse for not labeling the charts. If the guy plans on submitting it to a peer review journal, he'd better have captions. Even though he makes those statements, it's still not totally clear how the charts relate to data/findings. It's the responsibility of the writer to make it easy to read and understand.
hasn't ESR noticed MS is diversifying it's business into games, tv and enterprise services? windows sales might become less important, but it's far from going away. get real.
It's not possible to get rid of all the old VHS, DVD, casset and record players. It's high time the consumers teach the corporations a big lesson about "the customer is right." The CEO's running these large media companies have decided to check their brains at the door. Perhaps the share holders should kick those CEO's to the curb, or file a class action suit against the CEO's to pull their collective heads out of their asses.