IBM Wants Patent For Regex SSN Validation
theodp writes "What do you get when you combine IBM contributors with the Dojo Foundation? A patent for Real-Time Validation of Text Input Fields Using Regular Expression Evaluation During Text Entry, assuming the newly-disclosed Big Blue patent application passes muster with the USPTO. IBM explains that the invention of four IBMers addresses a 'persistent problem that plagues Web form fields' — e.g., 'a social security number can be entered with or without dashes.' A non-legalese description of IBM's patent-pending invention can be found in The Official Dojo Documentation. While IBM has formed a Strategic Partnership With the Dojo Foundation which may protect one from a patent infringement lawsuit over validating phone numbers, concerns have been voiced over an exception clause in IBM's open source pledge."
Online Prior Art at the Regex Library from 2004:
Put that into your favorite Javascript regular expression object and write a stupid onChange reference to it in your HTML and ... tada! Too complicated? Here's some more prior art. Or here. A little bit of Googling must be too much for the USPTO.
Are we suddenly shocked to discover one line of code can be patented when a whole mess of code can be patented?
My work here is dung.
What is this buillshit? "A persistent problem is dashes in SSNs"???
How fucking hard is it to strip non-numeric characters from a string?
I cannot believe there could be such programmer incompetence; no, it has to be some managerial cluelessness and hard-headness.
Are you fucking kidding me? Did they just really patent the format "###-##-####"? I didn't RTFA because I didn't want my head to explode.
this is my sig
^\d{3}-\d{2}-\d{4}$
ahh thats right baby, patent infringement!
I live on the edge...
this is bullshit.
We parse SSNs all day long. I think WE may have prior art.
I see lots of comments coming up about how ridiculous this is. Maybe that's the point. Maybe the best way to bring out patent reform to to patent every simple thing there is. You have to remember that IBM is paying to patent something as simple as:
s/(^[0-9])+//g
which most certainly has prior art all over the web. Why would it be worth IBMs money and time to do such a thing? The best reason I can come up with is that they want to prove a point. There's probably quite a bit an open-source firm can gain by causing a collapse of the software patent system, and this may be the best way to do it.
Sorry, you forgot to put question marks after the hyphens. Obviously, you'll need to license IBM's patent, because that additional tweak makes it a non-obvious invention.
$_ = "wftedskaebjgdpjgidbsmnjgcdwatb"; tr/a-z/oh, turtleneck Phrase Jar!/; print
The first claim mentions the real time nature of the validation. The example regexes are for validating a completed string. This is still silly and obvious but you may have a harder time finding specific prior art for this case.
I am becoming gerund, destroyer of verbs.
You can read more about it here
excitingthingstodo.blogspot.com
put a damned example on your site, like this: nnn-nn-nnnn
You can put as many examples on your site as you want but your users will still find a way to fuck it up. You need code that checks for this and either corrects their stupidity or kicks it back and makes them re-enter it.
I want peace on earth and goodwill toward man.
We are the United States Government! We don't do that sort of thing.
We need more overly-broad patents on embarrassingly horrible user interfaces. In fact, someone ought to patent *all* the common mistakes. That way their lawyers could run around suing everyone building crap.
Program Manager: What the hell is happening?! Why is the website down?! ... what are you trying to tell me? This is an emergency, accounting said our money is leaving! ... we could remove it and then it could read back the file after waiting for a few seconds. We would have to hope that more users don't come while we are performing emergency dash extraction.
Web Programmer: It's the users, sir, one of them put dashes in their SSN on the form!
Program Manager: I don't have time for this mumbo jumbo geek jargon
Web Programmer: Well, you see the dashes are inside the string.
Program Manager: Inside? How is this possible?
Web Programmer: Well, the user must have paused to push the dash key, sir.
Program Manager: So if the dashes are inside the string, we have to get them out. Is there someone we can pay for this service?
Web Programmer: I'm afraid it's too complicated for that. But maybe if we had it write to a file and one of us kept refreshing a text editor on that file
Program Manager: Goddamnit! Why didn't testing find this?!
Web Programmer: Well, they did but to fix this bug we just removed the dash keys on their keyboards.
Program Manager: Can we do that to each of the users?
*IBM employee enters with massive box labeled "Enterprise SSN Dash Extractor"*
IBM Sales Rep: Gentlemen, let IBM solve all your SSN problems for a mere $2,000 per site license!
My work here is dung.
Patent Application 973255489
"Method of enhancing sarcasm through the intentional introduction of typographical errors within multiple exclamation marks."
Within a set of not fewer than four (4) and not more than eight (8) Exclamation Marks ("!"), an Erroneous Character from the set of characters [1, 2, @, #, ~, `] is inserted after the third or fourth Exclamation Mark. The Erroneous Character is perceived by the reader as a typographical error consistent with hurried, careless typing, reinforcing any sarcasm contained in the textual comment preceding the Exclamation Marks.
Heck a lawyer patented the method for swinging on a swing
Why not IBM patenting something stupid like this! Maybe enough of these will bring the patenet system into reform or it's destruction...
Ref:
http://www.google.com/patents?vid=6368227
http://www.freepatentsonline.com/6368227.html
http://en.wikipedia.org/wiki/Reexamination
The Truth is a Virus!!!
I'd like to assert that I've personally written prior art.
Actually, they're trying to patent "A system for providing real-time validation of text input fields in a Web page comprising:a validation-enhanced text input element configured to contain an attribute for a validation expression for a text field in a rendered Web page, wherein the validation-enhanced text input element is contained within a source code document corresponding to the rendered Web page; andan input text validator configured to validate a user-entered character of the text field against the validation expression in real-time and visually indicate invalid user-entered characters," and "A method for providing real-time validation of text input fields in a Web page comprising:receiving a user-entered character in a text field displayed in a Web page;immediately validating the user-entered character against a validation expression contained within a validation-enhanced text input element associated with the text field, wherein the validation expression defines a set of acceptable characters and character positions for the text field; andwhen the user-entered character is determined invalid, visually marking the user-entered character," and "An input text validator for validating a text field of a Web page in real-time comprising:a partial input expression generator configured to generate an expanded version of a validation expression, wherein the expanded version of the validation expression defines a set of acceptable characters and character positions for a text field of a Web page; andan invalid text highlighter configured to visually highlight a user-entered character in the text field when the user-entered character is determined as invalid for the expanded validation expression."
Remember, patents are all about the claims. You don't know what they're "trying to patent" until you have read and understand the claims.
Today's Sesame Street was brought to you by the number e.
I run into this problem with entering phone numbers into web forms. Some want them as xxxxxxxxxx, some as xxx-xxx-xxxx, some as (xxx)xxx-xxxx, and even other weirdness. Some sites take whatever I put in and mold it to their desired format; others tell me my input is invalid and make me enter it again (some even tell me the desired format). Some sites actually break it up into three input fields with appropriate limits on the number of characters.
I've seen similar cases with SSNs.
It's pretty obvious that some sites have no trouble parsing the input data and making it fit what's expected. How is this a novel concept to be patented?
Edward Burr
Having a smoking section in a restaurant is like having a peeing section in a swimming pool.
If you read the patent application, they aren't patenting just validation of a text field. They are patenting the idea of validating a string, one character at a time, as it is entered by the user. As the string is entered, when invalid characters are found using regex, a "visual change" is made to the input to let the user know they made a mistake.
An example they give is that in an email input field, as soon as the user enters a comma, the comma would change colors.
It's still not groundbreaking, but it's not quite as trivial as it sounds.
You didn't read the patent application, did you?
They are not patenting a regular expression to validate social-security numbers, they are patenting an entire validation system for web application, in which there is an API for a developer to specify a regular expression, and the framework will then validate the user input in real-time, while the front-end highlights the specific characters that caused the failure. The particular problem they are trying to solve is the user confusion when they submit a form which tells them that a field was rejected without telling them what's wrong with the input.
This is not to say that there isn't prior art for that, but as you can see it is much more than just a patent on a simple reg-exp pattern.
-dZ.
Carol vs. Ghost
Wow! All this steam and no one read the patent. It's been a while since the Slashdotter stereotype was so well validated.
The patent is for incremental validation as the characters come in. The text input widget is primed with the regex and validates each character as it is keyed, and reacts immediately if it gets an invalid-in-context character. The effect is that it's not possible to enter an invalid string.
Whether you think this is novel or not, it's not ordinary.
I'm a Programmer. That's one level above Software Engineer and one level below Engineer.
Why don't we try to get USPTO's attention over to Slashdot? Then, if they think they don't understand what's going on with a patent, they can find other peoples' interpretation of it over here. They're bound to understand at least one of a hundred different wordings of that patent in Slashdot's comments.
Any ideas?
I disagree philosophically with our current legal system allowing software patents. However it never ceases to amaze me how the internets take a patent, don't read it or understand it and then complain about things that don't even make sense in regards to the patent in question.
If you read the actual patent, it is talking about validating the text input as the characters are being typed in and highlighting the specific characters that don't match the regular expression. For example if you type in a SSN as: 1112-113-1111, then the 2 and 3 within the text field would be highlighted (e.g. highlighted red) as not matching the regular expression for a SSN. I think the key is that the error highlighting is done inside the text field. The highlighting of the text wouldn't occur until some timer expired (e.g. 200 ms without any new typing). This makes it so that the error highlights don't show up as you are typing but as soon as you stop. This is definitely more novel than the comments on this article make it out to be.
Should this or other software algorithms be patentable? No. However companies like IBM are forced to patent because if they don't then other patent troll companies sue them and win because they have trouble proving prior art. It is not illogical for companies like IBM to simultaneously pursue patent reform and continue to patent as much as possible under the current legislation. This is just taking advantage of the broken system while talking about how broken it is.