Seeking Prior Art on Markov-Based SPAM Filters?
"Not too long ago, I was discussing it with a co-worker who has a degree in Electrical Engineering. She had taken an AI class in college which mentioned the idea of negative examples for Markov models, and this was well before the year 2000. The bottom line is that I think I have a great idea that could potentially add to our collective arsenal against the ever-growing SPAM problem. I would very much like to work on it and publish it under GPL, but before I can do this, I have to protect my self against the patent and the large pocketbook of Mitsubishi.
I'm not asking for legal advice. I have already consulted an attorney, and it was suggested that I should remove the SourceForge project, which I have done. I also attempted to contact the EFF (no luck so far). I'm asking for those of you out there who are familiar with this sort of thing to help me to find verifiable prior art that dates from before the 2000 patent. I would very much like to share with the world my ideas and the code I have written, but this is standing in our way."
Youd rather don not figth with a fearless corporation...
let me send the boys round the patent office. sort this patent malarky once and for all
this sig steers like a cow. and i can prove it
Calling it a "Probabilistic Finite State Network", and denying it has anything to do with Markov models? Screw them patent bastards!
--
drug, n: A substance that, injected into a rat, produces a scientific paper
Andrei Markov himself used his models to filter text. Not e-mails obviously, but poetry.
He would both write and solicit poetry and he devised what are now know as markov models to assess poetry for aesthetic quality. Sounds like a silly idea now, but this was in the Victorian era when science and maths were blossoming. It was believed that everything could be measured and quantified; even philosophical qualities such as 'goodness' of poetry.
So,
step 1 : first search for rhyme and meter.
step 2 : then search for spamlike characteristics.
I've had this idea before, too...heh. Seemed obvious to me! What else are neural networks for? I know some companies use neural networks to sort out acceptable vs. unacceptable on the assembly line, and this is really the same thing...
--TheOrangeSquid Is it any wonder things seem so awry? We swim in a sea of confusion and don't have to think to survive
Then again, I also thought it was 'obvious'
The full term in patent law is 'obvious to one with ordinary skill in the art'.
In other words, it has to be REALLY obvious.
This is absurd. The whole point of mathematical techniques is that they work no matter what you apply them to--which is why you didn't used to be able to patent mathematics.
If this keeps up we'll start seeing patents on "counting small ovoids" and "counting imaginary woolly creatures" and people will be posting here asking if anyone can prove prior art on counting sheep.
General purpose mathematical tools should be wide open, patent free for any purpose because they are general purpose tools!
-- MarkusQ
You haven't "run into a patent" until they sue you. Seriously.
If every inventor/programmer gave up because there was a similar patent, no one would ever program or invent anything.
No one knows if you are really infringing on a patent until the judge actually hands down a ruling. Patents aren't as clear cut as trademarks or copyrights. It's impossible to say something like "this code infringes" or "this code doesn't" until it actually goes to court. Same with even whether the patent is valid.
I think this is the reason behind Linus basically saying "fuck patents". Once you get the cease and desist, then make your decision, until then, you might as well not worry about it (unless you have large bank accounts, in which case, you should already have a relationship with a lawyer).
I am not a lawyer. Get a lawyer if you want legal advice, because this isn't it.
I've had enough abrasive sigs. Kittens are cute and fuzzy.
To obtain useful prior art you have to go further back than the 2000 patent issue date. You have to show that the 'prior art' occurs before the date the invention occured, not when the patent issued. Obviously that date is before the FILING date of the patent, which in this case was Dec 1997.
Have you thought about asking the patent holders? If this is an open source/free software project it is possible that they will give limited permission to use their IP. It seems to me that if your only other option is to remove the code, why don't you at least get a feel for the inventor's attitude. It would be best if you could find the original inventor, not send some random email to the legal department. Sometimes original inventors (even in big companies) can be helpful or lead an introduction to foward thinking people in tech transfer.
-Sean
I'm not sure I fully understand what you're attempting and what prior art you are looking for, but let me try anyway.
The google groups message says the patent covers explicity "markov model discriminator using negative examples". Markov decision problems are often those associated with a set of states and a policy which transitions between them. For spam filtering, I would guess there are three states: unknown, spam, non-spam, and the policy needs only determine if the transition should go between the unknown and and either the spam or non-spam state.
The optimal policy in an MDP is static, and thus it would say "the optimal policy is to mark an email spam". This sounds useless. A Partially Observable MDP (POMDP) may follow more with your plans since it follows percepts and probability distributions over the current set of beliefs.
As for training for an optimal policy for MDPs, one algorithm ("Value Iteration") makes use of a reward function. This reward function is explicitly allowed to be negative, and probably should be for some decisions or else the problem is trivial. And "positive" or "negative" examples is just a boolean simplification of multiple categories. Why not have "pr0n" "ads" "personal" "work" and a bunch of other categories?
In this case, positive and negative are meaningless, but all (both) are *always* neeeded to define the optimal policy.
In other words, I'm not sure I understand enough of what you are attempting to help find prior art, but from what I can tell "negative" examples are inherently part of defining the optimal policy in an MDP. For a reference, Russel & Norvig, Artificial Intelligence, A Modern Approach (1995) has a ton of information about MDPs and the training of them.
Markov models have been used extensively in cognitive science, particularly the field of text comprehension, for the purpose of studying text comprehension and retension.
Spreading acitivation models are mathematically similar to Markov chains, and ha ve also been used for similar purposes, trained and/or designed with both positi ve and negative exemplars.
Further, concept categorization and discovery research in cognitive science has used Markov chains for automated categorization, once again using positive and n egative examples.
If you're serious about this, I'd check the literature in this area, particularl y journals like Discoure Processes and similar. Also check the Proceedings of t he Cognitive Psychology conference.
All of this is published, peer-reviewed work well before 2000. Your best bet is to hit a decent university library and start searching.
Heck, my dissertation work didn't use negative examples, but the spreading activ ation models were originally Markov chains.
Markov chains and their use in AI, cognitive psychology, speech/text recognition , categorization, and text comprehension have been in use for decades. With the interdisciplinary approach common in the 80's and 90's, there were many efforts that adapted things like Markov chains and neural network learning algorithms, simulated annealing techniques, and so forth. It shouldn't be hard to dig some up. Worst case, find a cognitive psychology professor specializing in text proc essing and ask. UCSD has a strong showing, as UC Boulder, Manitoba, U Memphis, U Chicago, and many, many others.
.@.
I don't know enough about MDP to know if this is feasible work but it may move the scope of the solution far enough away from the patent to allow the submitter to continue his work.
Bleh!
Publish it through freenet, or anonymously through mixmasters to a list of subscribers.
Problem solved, and you're (mostly) protected. FreeNet is a superb example of peaceful civil disobedience, and quite frankly, if we have the power of superior technology (which we do) we need to employ that to Just Say No to the corporations that are screwing us all over.
The only thing that's needed now is a system by which collaborators on a project can do so completely anonymously. FreeNet CVS, anyone?
Spam is junk e-mail. Why do you want ham filters?
IANAL, but it's my understanding that just because there is a patent, it doesn't mean you can't improve on the invention. Quote from http://usgovinfo.about.com/blpatents.htm:
Utility Patents - The most common type of patent, utility patents are issued to inventors of new devices and processes or improvements to existing devices or processes. Most utility patents are issued for inventions that improve existing devices. Inventors of the proverbial "better mousetrap," seek utility patents.
The existing patent you mention says "inappropriate web page material"
It may be possible the wording can be changed to be more specific - "unsolicited electronic mail messages" and "advertisements". I've bounced around the patent site a bit, and it seems there are a lot of patents that are given with prior art in existence, they just have to find it and say 'reference this patent' or something. If someone can get a patent for swinging on a swing, it's worth a shot if you've got the $$.
...naive bayes as used in popfile?
Large print giveth, and the small print taketh away
Researching this issue and even writing and publishing example source code ist NOT patent infringement. If RedHat includes your code in their distro and ships it in the USA, THAT is patent infringement (assuming the patent really applies here). But this is RedHat's problem not yours. You, as a researcher are not affected by the patent.
I think you've not made it clear enough to your lawyer that you're not marketing a product containing this (or are you?). Otherwise your lawyer would not have told you that you can't continue work on this.
while not related to filtering spam (though i have done a lot of research in the past few months on that, as well, and am solidly sold on the baysean method now) markov generators on spam makes good art: a series of 10 spam "collages" for your viewing pleasure. it's fun stuff ...