Slashdot Mirror


Falsehoods Programmers Believe About Names

Jamie points out this interesting article about how hard it is for programmers to get names right. Since software ultimately is used by and for humans, and we humans are pretty tightly linked to our names (whatever the language, spelling, or orthography), this is a big deal. This piece notes some of the ways that names get mishandled, and suggests rules of thumb (in the form of anti-suggestions) to encourage programmers to handle names more gracefully.

773 comments

  1. As the author of RFC 2100... by jra · · Score: 4, Interesting

    I found the piece very interesting.

    Though my inability to post this comment appears to have outlived the slashdotting of the site.

    1. Re:As the author of RFC 2100... by OneAhead · · Score: 3, Funny

      That doesn't make sense. I can read your comment, therefore your inability to post it has gone away. The site is still slashdotted. Ergo, the slashdotting of the site has outlived your inability to post.

      Oh wait... RFC2100...

    2. Re:As the author of RFC 2100... by TheLink · · Score: 2, Interesting

      I dunno, the guy just lists out reasons why you can't uniquely identify people by names. e.g. "some people don't have names".

      Well that's why Governments start handing out people national ID numbers[1]. Then even if you aren't who you claim you are, at least the poor data entry person has something to key in and can actually type it in on his/her keyboard ;).

      [1] As for foreigners wihtout a passport number or national ID, please wait here for those friendly guys in uniforms...

      --
    3. Re:As the author of RFC 2100... by patio11 · · Score: 2, Funny

      After Reddit got done with the site yesterday, I decided "Sure, why not upgrade to Wordpress 3.0. I'll just turn off caching for a little while and..."

    4. Re:As the author of RFC 2100... by pushf+popf · · Score: 5, Funny

      I found the article to be contrived and pointless.

      Yes, there are people and entities that do not fit into a normal name slot in a database, and no, I don't care at all because it hasn't been a problem for anything I've written in the last thrity years. When someone pops up and says "My name is this thing I drew on the sidewalk using chipmunk poop, and it doesn't fit in your database", I'll say "Yes, you're right it doesn't, then go have a beer.

      You can't handle every edge case in the universe because you'll never actually release anything.

    5. Re:As the author of RFC 2100... by Anonymous Coward · · Score: 5, Funny

      If you program like you talk, you'll never ship anyway, because it'll never compile.

      Unexpected EOF in String constant:

      "Yes, you're right it doesn't, then go have a beer.

      You can't handle every edge case in the universe because you'll never actually release anything.

    6. Re:As the author of RFC 2100... by vikstar · · Score: 2, Insightful

      I would find it more interesting if it contained approximate statistics for each type point. I will not spend time designing a system which caters for the 2 individuals having some weird exception to the detriment of millions of others which adhere to a much more useful schema. IE, sure you can just have Name and accept a 2048-length UTF-16 string to accommodate everyone, or skip a few outliers and have given and last names with certain restrictions to catch user error in the input.

      --
      The question of whether a computer can think is no more interesting than the question of whether a submarine can swim.
    7. Re:As the author of RFC 2100... by fbjon · · Score: 2

      I found the article to be insightful. It shows that there is no point in messing around with assumptions about names. Just put one field in the form that takes up to e.g. 128KB that accepts any string of data, including the empty string, and call it "Your real full name". Put in a dropdown for the encoding if you need it, another field for low-ascii transliterated name if you need something recognizable by most, and separate fields for official registered name if you really need that sort of thing.

      --
      True confidence comes not from realising you are as good as your peers, but that your peers are as bad as you are.
    8. Re:As the author of RFC 2100... by VinylPusher · · Score: 1

      Remember that many databases sit upon a Microsoft SQL server. Microsoft SQL doesn't understand UTF-8 or UTF-16. It only knows UCS-2.

      Great fun for database programmers.

    9. Re:As the author of RFC 2100... by pinkushun · · Score: 1

      Sure... unless your Government has big problems issuing duplicate ID numbers

    10. Re:As the author of RFC 2100... by riT-k0MA · · Score: 1

      Or a 16 year old European female ends up with an ID with a DOB from 1979 an a picture of an african male in his 30's.
      I LOVE South Africa :/

    11. Re:As the author of RFC 2100... by TheLink · · Score: 2, Insightful

      Fake, duplicate or not, numeric IDs are still easier to key in ;).

      As most slashdotters will know, if your data records are in a digital computer, it's pretty hard to avoid being linked to at least one number.

      Even if you don't have national ID numbers, someone could go around claiming to be you, or the "System" could still confuse you with someone else.

      At least accidental/erroneous duplicate IDs are easier to detect.

      Of course if some Big Bad Ruler/Government starts issuing citizens with Citizen Certificates that have to be renewed every year then that's a problem :).

      --
    12. Re:As the author of RFC 2100... by Top4man · · Score: 0, Offtopic

      www.TOP4MAN.ru

    13. Re:As the author of RFC 2100... by gizmod · · Score: 1

      From TFA: >> People’s names are almost globally unique. Hi my name is 2E9ADBC9-D87F-485e-9608-02251B68ECC6. Please to meet you.

    14. Re:As the author of RFC 2100... by Anonymous Coward · · Score: 0

      My name is this thing I drew ....

      Like the bullshit thing Prince pulled with that Ebola-looking thing he used for his name. And the crap about "the artist formerly known as ...."

      Your call, son of a bitch. Just have fun trying to cash your social security check.

    15. Re:As the author of RFC 2100... by Anonymous Coward · · Score: 0

      Maybe we have a system where, at birth, everyone gets a name generated in the format [whatever string you want, i.e. "Bob Smith"][timestamp of the time of birth][random number just in case the first two aren't enough to ensure a unique name] and you use that everywhere a name is asked for, so it's both your name "Hey Bob Smith!" and your unique identifier "I sentence you, Bob Smith 330170739-189503 to five year's hard labour". If you want to change your name, you can only ever change the initial string identifier, and the resulting name has to be unique. On the downside - massive database storing information about everyone on the planet. Plus side - better data management, erm... women can't lie about their age?

    16. Re:As the author of RFC 2100... by digitig · · Score: 1

      Yes, there are people and entities that do not fit into a normal name slot in a database, and no, I don't care at all because it hasn't been a problem for anything I've written in the last thrity years.

      Well, you don't know of any problems it has caused. The people who couldn't do business with you or your employer (or who could, but found it annoying) probably just quietly went elsewhere with their business. Sure, you're not going to account for every case the RA mentions, but I think he's right: not handling a particular case should be a definite design decision, not simply a lack of thought.

      --
      Quidnam Latine loqui modo coepi?
    17. Re:As the author of RFC 2100... by Anonymous Coward · · Score: 0

      made sense to me

    18. Re:As the author of RFC 2100... by Tridus · · Score: 1

      I'd have liked the article more if he came up with a practical solution instead of just complaining that names are hard. Keeping in mind that the people using the database want to be able to pull out a last name so they can send christmas cards to the 99.99% of people who fit that naming scheme in this area.

      --
      -- "So they told me that using the download page to download something was not something they anticipated." - Bill Gates
    19. Re:As the author of RFC 2100... by mypalmike · · Score: 1

      If I had mod points, you'd have a "Funny". I hope that was the intent.

      --
      There are 0x40000000 types of people: those who understand 32-bit IEEE 754 floating point, and those who don't.
    20. Re:As the author of RFC 2100... by Chrisq · · Score: 1

      Our company has a lot of Urdu speaking customers. One of the problems that I have seen is a number of people who cannot consistently spell their name in the Latin alphabet. They work it out phonetically each time , and since a lot of the sounds are between the speech sounds used in English they come up with a different version each time.

    21. Re:As the author of RFC 2100... by mcgrew · · Score: 1

      I was amused by it.

      12 People's names are case sensitive.
      13 People's names are case insensitive.

      I found the article to be a big DUH. I did hear once that there was a man who couldn't get a credit card, becase his last name was O and the system wouldn't accept a one letter last name.

      I don't know how many bills I've gotten with my name spelled Mcgrew rather than McGrew. Actually the c should be superscripted, but I have yet to see a system that allows that. OTOH typewriters even have a hard time with it.

    22. Re:As the author of RFC 2100... by mcgrew · · Score: 2, Interesting

      This one was humorous: "surely people's names are diverse enough such that no million people share the same name"

      Anybody who's ever run a mainframe database knows that's just stupid. Back in 1997 Altavista found six people with my exact full name on the internet. In 1997!

      I hate doing a name lookup on my company's database -- do you have any idea how many people in Chicago alone are named "Johnson"? I once joked that they should rename it Johnson City.

      And in the US, there are people with more than one SSN, and people who have none at all. I know a guy with two SSNs, he somehow got in the middle of a feud between the Outlaws and the Hell's Angels about twenty years ago and a judge ordered him to change both his name and SSN, so it was not only done with the government's bleasing, but on their orders.

    23. Re:As the author of RFC 2100... by nschubach · · Score: 1

      ... I'm sure there was no talking involved. Only typing. ;)

      --
      Every time I start to have faith in humanity, I ruin it by driving to work between 7 and 8 am.
    24. Re:As the author of RFC 2100... by mdwh2 · · Score: 1

      Oh, and - you can't fix every bug, so there's no point trying to fix any at all?

      Just because we can't handle everything perfectly, doesn't stop us reading and learning to avoid many of the common pitfuls, in order to minimise the chance of a problem occuring.

    25. Re:As the author of RFC 2100... by jbengt · · Score: 1

      You don't need such a large field, limit the length to something reasonable. Rather, you could provide a unique identifier, a main name field that can accept arbitrary strings, and additional "aka" field(s). Mainly, don't panic just because the name is unusual or off a little. When really in doubt, ask for it to be input again, but don't reject it outright.

    26. Re:As the author of RFC 2100... by operagost · · Score: 1

      Anybody who's ever run a mainframe database knows that's just stupid. Back in 1997 Altavista found six people with my exact full name on the internet. In 1997!

      6 1,000,000

      --

      Gamingmuseum.com: Give your 3D accelerator a rest.
    27. Re:As the author of RFC 2100... by ronocdh · · Score: 1

      I dunno, the guy just lists out reasons why you can't uniquely identify people by names. e.g. "some people don't have names".

      Yeah, I don't see anything Insightful, Informative, Interesting, or even Funny about his blog post. Still waiting for CmdrTaco to authorize the use of Slashdot mod points across the entire internet.

    28. Re:As the author of RFC 2100... by operagost · · Score: 1

      That was supposed to be 6 < 1,000,000.  I suppose that's what I get for making snide comments.

      --

      Gamingmuseum.com: Give your 3D accelerator a rest.
    29. Re:As the author of RFC 2100... by Junior+J.+Junior+III · · Score: 1

      English isn't compiled, though, it's interpreted. And the interpreters are generally very fault tolerant, making "best guess" attempts at parsing the author's intent.

      --
      You see? You see? Your stupid minds! Stupid! Stupid!
    30. Re:As the author of RFC 2100... by geekoid · · Score: 1

      ". I will not spend time designing a system "

      if this issue take more then 15 minutes of your precious design time, you need to rethink your ability to actually design worth a damn.

      If ti takes more then 5 minutes the second time you do it, please take up painting or something and get the hell out of the field.

      --
      The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
    31. Re:As the author of RFC 2100... by Maxo-Texas · · Score: 2, Funny

      One of the interesting problems a friend of mine in the food industry deals with is duplicate social security numbers combined with duplicate first and last names.

      At some restaurants every one of the half dozen servers as the same first and last name.

      Oh.. and sometimes their social security numbers are not consistent from week to week either.

      --
      She was like chocolate when she drank... semi-sweet at first and then increasingly bitter.
    32. Re:As the author of RFC 2100... by mcgrew · · Score: 1

      My point was that there were six on the internet in 1997; not just on the internet, but with web pages. There are thousands of Johnsons in Chicago alone, maybe hundreds of thousands, and hundreds (at least) of them have the same first names as well. Since the US population is 350,000,000, yes it's unreasonable to expect one in every 350 people to have the same name, but I wouldn't be surprised a bit if there was a name out there with 100,000 owners of it; maybe the author couldn't count his zeros? But that's pedantically beside the point anyway.

    33. Re:As the author of RFC 2100... by DamnStupidElf · · Score: 1

      The standard solution is to issue a single paycheck for them to divvy up.

    34. Re:As the author of RFC 2100... by Anonymous Coward · · Score: 0

      I think the point may have gone over your head, so I'll spell it out: [b]the concept of a name is abstract, not concrete.[/b] For example, n=n+1 with a base case of n=0 enumerates the natural numbers but does not enumerate any real numbers. It may be possible to enumerate a subset of names by enumerating over ASCII strings, but it by no means enumerates all names. When designing a system, it is important to keep your "name" data management very abstract. For example, we might have a name class and then inherit from it in a class called "ASCII_Name." It may prove true that in the life of our program, we wrote that tiny little inheritance for no reason, but it's also possible one day we'll need an upgrade -- and on that day, you'll be glad you spent an extra minute writing an abstract base class.

    35. Re:As the author of RFC 2100... by yakovlev · · Score: 1

      I'll bite.

      How do you handle a name that cannot be expressed using UTF? Allow an infinite-sized JPEG to be submitted in lieu of a text name?

      That is an incredibly ugly hack for something as obscure as a non-UTF name.

      As for the size, it is reasonable for the programmer to set an arbitrary size and say that NO, I won't let you use the name field as a remote disk drive. Sorry. As to whether or not 2048 UTF-16 characters (512 UTF-32 characters) is big enough for that is hard to say, as the choice of boundary is by definition arbitrary. The nice thing about 2048 UTF-16 characters is that it matches the page size of most computers, and so seems to be a reasonable boundary.

    36. Re:As the author of RFC 2100... by pushf+popf · · Score: 1

      I think the point may have gone over your head

      I understand the point completely. I just don't accept it. It's a recipe for creating data that won't inter-operate with the rest of the world.

    37. Re:As the author of RFC 2100... by Anonymous Coward · · Score: 0

      You must at least handle some cases like someone named: Bob "; DROP TABLE Student;

      I found the article to be contrived and pointless.

      Yes, there are people and entities that do not fit into a normal name slot in a database, and no, I don't care at all because it hasn't been a problem for anything I've written in the last thrity years. When someone pops up and says "My name is this thing I drew on the sidewalk using chipmunk poop, and it doesn't fit in your database", I'll say "Yes, you're right it doesn't, then go have a beer.

      You can't handle every edge case in the universe because you'll never actually release anything.

    38. Re:As the author of RFC 2100... by Hognoxious · · Score: 1

      There are thousands of Johnsons in Chicago alone

      Huh huh. You said Johnson. Huh huh. Heh heh.

      --
      Confucius say, "Find worm in apple - bad. Find half a worm - worse."
    39. Re:As the author of RFC 2100... by Anonymous Coward · · Score: 0

      Why aren't things like this encapsulated into standard types or libraries - get it right once and for all, and prevent all the errors that come with reinventing the wheel in different ways? Names, time, money - certain real-world things are hard to map into basic data types.

  2. Sounds like people need to fix thier names by h4rr4r · · Score: 3, Funny

    Who the hell has numbers in there name?

    1. Re:Sounds like people need to fix thier names by Anonymous Coward · · Score: 5, Funny

      3Jane Tessier-Ashpool, for one.

    2. Re:Sounds like people need to fix thier names by dogdick · · Score: 1, Informative

      Andre3000

    3. Re:Sounds like people need to fix thier names by Anonymous Coward · · Score: 0

      While we're at it, can we get a spell checker that handles homonyms?

    4. Re:Sounds like people need to fix thier names by ChipMonk · · Score: 2, Informative

      Chad 8 5, for another.

    5. Re:Sounds like people need to fix thier names by Khakionion · · Score: 5, Funny

      homonyms?

      Hey, learn a little tolerance, bud.

      --
      OMG! Wau!
    6. Re:Sounds like people need to fix thier names by Trepidity · · Score: 1

      Fortunately for programmers, she doesn't exist.

    7. Re:Sounds like people need to fix thier names by 0100010001010011 · · Score: 4, Informative

      Mr. Ochocinco

      For those that aren't privy to American Football. Apparently some guy with the number 85, renamed himself 85.

    8. Re:Sounds like people need to fix thier names by Anonymous Coward · · Score: 0

      Some people I know - Native Americans, East Indians

    9. Re:Sounds like people need to fix thier names by spitzig · · Score: 5, Informative

      Chinese, written in pinyin, has numbers. Pinyin is how Chinese is typed. The numbers represent tones and every word in Chinese has a tone.

    10. Re:Sounds like people need to fix thier names by LearnToSpell · · Score: 1

      You do!

    11. Re:Sounds like people need to fix thier names by Kitkoan · · Score: 2, Funny
      --
      Attention... all grammer nazi"s! Is they're anything; wrong with: my post,
    12. Re:Sounds like people need to fix thier names by notthepainter · · Score: 2, Interesting

      Bo3b Johnson

      http://www.linkedin.com/pub/bo3b-johnson/13/846/a52

      The 3 is silent. And no, I don't know him but I know someone who does.

    13. Re:Sounds like people need to fix thier names by DarrenBaker · · Score: 2, Informative

      OCHOCINCO!!!!

    14. Re:Sounds like people need to fix thier names by Anonymous Coward · · Score: 0

      Does that refer to BOOB

    15. Re:Sounds like people need to fix thier names by Anonymous Coward · · Score: 1, Informative

      No, he didn't, he renamed himself Chad Ochocinco, which any standard name field would handle just fine. Incidentally, despite legally changing his name, he claims to still primarily use Chad Johnson.

    16. Re:Sounds like people need to fix thier names by Anonymous Coward · · Score: 1, Funny

      I met that guy once. He's a fucking douchebag. Wouldn't do anything I told him.

    17. Re:Sounds like people need to fix thier names by PopeRatzo · · Score: 5, Funny

      Who the hell has numbers in there name?

      Well, for starters, Thurston B. Howell, III. Malcolm X, and Jimmy Two Times.

      --
      You are welcome on my lawn.
    18. Re:Sounds like people need to fix thier names by chooks · · Score: 1

      Simple.

      Logan 5 if you're a guy.
      Jessica 6 if you're not.

      --
      -- The Genesis project? What's that?
    19. Re:Sounds like people need to fix thier names by RedWizzard · · Score: 1

      According to this BBC article there is a New Zealander legally called "Number 16 Bus Shelter".

    20. Re:Sounds like people need to fix thier names by jedidiah · · Score: 1

      The X in Malcolm X is not a number.

      The 3rd in Thurston Howell is not part of the name. It's a generational suffix.

      --
      A Pirate and a Puritan look the same on a balance sheet.
    21. Re:Sounds like people need to fix thier names by BluBrick · · Score: 4, Informative

      Bo3b? Presumably, the 3 is silent because he wants to point out how individual he is (ironically, by rehashing a joke made over 50 years ago.)

      From Tom Lehrer's introduction to "We will all go together when we go":

      I am reminded at this point of a fellow I used to know whose name was Henry, only to give you an idea of what an individualist he was he spelt it H-E-N-3-R-Y. The 3 was silent, you see.

      --
      Ahh - My eye!
      The doctor said I'm not supposed to get Slashdot in it!
    22. Re:Sounds like people need to fix thier names by xenn · · Score: 1

      Johnny 2 Hats

    23. Re:Sounds like people need to fix thier names by Fnordulicious · · Score: 4, Informative

      You are a little confused. Please reread the Wikipedia article on Hanyu Pinyin. It normally uses diacritics - namely macron, acute, hacek ("caron"), and grave - to represent the Mandarin tones other than neutral tone. Numbers have been used by people who lack diacritics on their typewriter or input system, but using numbers is not standard in Hanyu Pinyin, instead it's a kludge.

      That said, if your input form doesn't allow some guy to type in his name with tone number suffixes on a US Windows keyboard layout where he lacks access to diacritics, then you're not a very thoughtful programmer.

      Also, people who make software with an input fields that accept Unicode but specify a particular font that has a tiny character repertoire suck.

      Oh, and Slashdot sucks even more for only supporting ASCII and stripping everything else.

    24. Re:Sounds like people need to fix thier names by Miseph · · Score: 3, Informative

      He legally changed his name because fans refer to him as "Ochocinco" and he wanted to put it on his jersey, but because the NFL hates both fans and lulz, they only allow a person's legal surname to appear there. Rather than lay down and take it, he gave them a massive middle finger by changing his name.

      The NFL actually has a surprising number of players that behave like btards, it's rather amusing.

      --
      Try not to take me more seriously than I take myself.
    25. Re:Sounds like people need to fix thier names by Anonymous Coward · · Score: 0

      Given the forum I seriously doubt the wikipedia link is needed. Either that or those damn kids need to get off my lawn.

    26. Re:Sounds like people need to fix thier names by Speare · · Score: 2, Interesting

      Love the literary reference. In a much earlier sci-fi story, This Perfect Day, every citizen has a nameber, an identifier that is part name, part number. There are only four male names, four female names, and these are combined with a multi-digit code to make the ID unique. Ever since online forums started suggesting logins like "MaryBeth131" I can't help but think of namebers.

      --
      [ .sig file not found ]
    27. Re:Sounds like people need to fix thier names by westcoast+philly · · Score: 1

      half a million Norwegians, probably...

    28. Re:Sounds like people need to fix thier names by sonamchauhan · · Score: 1

      Variables

    29. Re:Sounds like people need to fix thier names by tux0r · · Score: 1

      Who the hell has numbers in there name?

      Oh, the irony.

      --
      ( Redundancy is ) ^ n
    30. Re:Sounds like people need to fix thier names by sonamchauhan · · Score: 3, Funny

      And King James III

    31. Re:Sounds like people need to fix thier names by fishexe · · Score: 3, Informative

      Who the hell has numbers in there name?

      Former New York Times writer Jennifer 8 Lee does.

      --
      "I don't care about the Constitution!" --Bill O'Reilly, November 17, 2009
    32. Re:Sounds like people need to fix thier names by kenj0418 · · Score: 2

      The NFL actually has a surprising number of players that behave like btards, it's rather amusing.

      I'd be a bit more concerned with the Michael Vicks and Leonard Littles of the NFL than some guy who changes his name. (dogfighting and and drunk-driving-with-fatal-accident for those not in the US or otherwise not aware)

    33. Re:Sounds like people need to fix thier names by aiht · · Score: 3, Funny

      What about Arthur "Two Sheds" Jackson?
      Nah, I guess that doesn't count 'cause it's written as a word.

    34. Re:Sounds like people need to fix thier names by shutdown+-p+now · · Score: 3, Interesting

      That said, if your input form doesn't allow some guy to type in his name with tone number suffixes on a US Windows keyboard layout where he lacks access to diacritics, then you're not a very thoughtful programmer.

      Or you code in some language where Unicode support is not there by default, and you have to jump through hoops to get it working.

      Like, say, PHP. Or stable Ruby.

      Which might explain a lot of things about why so much of the Net is largely broken I18N-wise even on the most basic level, come to think of it.

    35. Re:Sounds like people need to fix thier names by fishexe · · Score: 5, Informative

      Pinyin is how Chinese is typed. The numbers represent tones...

      No it isn't. Pinyin is how Chinese is romanized. Chinese is typed using an IME to produce Han characters. Pinyin is typically only used to represent pronunciation, for example in dictionaries, and to represent names in contexts where romanization is necessary (such as international contexts, like Western media), as well as a few other limited contexts. Writing Chinese in Pinyin, even with tone marks, is often inadequate because each syllable/tone combination corresponds to several characters, and the distinction between them is easily lost in romanization. For example, Zhang Zilin and Zhang Ziyi do not have the same surname, even though both are Zhang1 in pinyin.

      --
      "I don't care about the Constitution!" --Bill O'Reilly, November 17, 2009
    36. Re:Sounds like people need to fix thier names by Excelcior · · Score: 1

      Sa5m? Seriously, am I the only one who's seen Band Slam? :-P

      --
      A small comparison of interest:
      Windows: Public School. Mac: Private School. Linux: Homeschool. Assembly: Unschool.
    37. Re:Sounds like people need to fix thier names by Kitkoan · · Score: 1

      Make something idiot proof and someone will make a better idiot...

      --
      Attention... all grammer nazi"s! Is they're anything; wrong with: my post,
    38. Re:Sounds like people need to fix thier names by arekq · · Score: 2, Informative

      Pinyin is just one way Chinese is typed.
      There are other ways to type Chinese characters, for example, Cangjie input method, which is based on the graphological aspect of the characters instead of it's sound.

    39. Re:Sounds like people need to fix thier names by deniable · · Score: 4, Funny

      Yep, there's rampant homophonia around here.

    40. Re:Sounds like people need to fix thier names by Kitkoan · · Score: 3, Funny

      What about Arthur "Two Sheds" Jackson?

      A tragic accident happened, and now he's Arthur 'No Sheds' Jackson. Very tragic.

      --
      Attention... all grammer nazi"s! Is they're anything; wrong with: my post,
    41. Re:Sounds like people need to fix thier names by bonch · · Score: 1

      People with numerical suffixes, such as "III" or "the 3rd."

    42. Re:Sounds like people need to fix thier names by mogness · · Score: 3, Interesting

      If you are a guy (not an unreasonable assumption on /.), I think it's really strange that online forums are suggesting you the name "MaryBeth131"
      What were your parents thinking?

      --
      that's teh shizzle bizzle
    43. Re:Sounds like people need to fix thier names by Anonymous Coward · · Score: 2, Funny

      Johnny 5

    44. Re:Sounds like people need to fix thier names by nacturation · · Score: 3, Funny

      What about Arthur "Two Sheds" Jackson?

      A tragic accident happened, and now he's Arthur 'No Sheds' Jackson. Very tragic.

      Last I heard he was living next to No Shed Sherlock.

      --
      Want to improve your Karma? Instead of "Post Anonymously", try the "Post Humously" option.
    45. Re:Sounds like people need to fix thier names by nacturation · · Score: 1

      Who the hell has numbers in there name?

      Girls who practice cup sharing?

      --
      Want to improve your Karma? Instead of "Post Anonymously", try the "Post Humously" option.
    46. Re:Sounds like people need to fix thier names by M.+Baranczak · · Score: 1

      Who the hell has numbers in there name?

      John Paul 2.0

    47. Re:Sounds like people need to fix thier names by Kitkoan · · Score: 2, Funny

      If you are a guy (not an unreasonable assumption on /.), I think it's really strange that online forums are suggesting you the name "MaryBeth131" What were your parents thinking?

      Maybe they were listening to 'A Boy Named Sue'?

      --
      Attention... all grammer nazi"s! Is they're anything; wrong with: my post,
    48. Re:Sounds like people need to fix thier names by Anonymous Coward · · Score: 0

      Da5id?

    49. Re:Sounds like people need to fix thier names by droopycom · · Score: 2, Informative

      The Queen of England

      God save her from programmers!

    50. Re:Sounds like people need to fix thier names by fishexe · · Score: 1

      Apparently some guy with the number 85, renamed himself 85.

      No, he didn't, he renamed himself Chad Ochocinco, which any standard name field would handle just fine.

      Legally, he changed his name to Ochocinco. In a larger sense, I think he changed it to 85. From his wikipedia page: "Ochocinco announced on his live USTREAM broadcast that he will be legally changing his last name to “Hachi Go” next season. He also held up a Customized Cincinnati Bengals Jersey with the last name "Hachi Go" on the back. Just as the words Ocho Cinco translate to 8 and 5 in Spanish, the words Hachi Go ([characters suppressed, fuck you slashcode]) translate to 8 and 5 in Japanese."
      I would say rotating through various languages' ways of pronouncing 8 5, making annual or biennial legal name changes to achieve this, might represent a personal commitment of sorts to one's name actually being a number.

      --
      "I don't care about the Constitution!" --Bill O'Reilly, November 17, 2009
    51. Re:Sounds like people need to fix thier names by Anonymous Coward · · Score: 0

      7 of 9!

    52. Re:Sounds like people need to fix thier names by russ_allegro · · Score: 1

      7 of 9

    53. Re:Sounds like people need to fix thier names by Malc · · Score: 2, Interesting

      A lot of mobile phones, including my Samsung phone, use Pinyin as a way of entering Chinese characters. For each word/syllable I enter, there's a sometimes long list of matching Chinese characters to select from.

      Pinyin is also used on things like street signs in some of the larger cities, which gives Western people at least some chance of recongnising names.

    54. Re:Sounds like people need to fix thier names by vux984 · · Score: 2, Interesting

      "Bo3b"

      Never seen that one but I've heard of a: !bo

      The leading exclamation is apparently a...lol i dunno what its called, but its apparently one of the hollow popping/clicking sounds you see in some African languages.

    55. Re:Sounds like people need to fix thier names by ChipMonk · · Score: 1

      Do you really think the spoken language makes a difference? It's just a stunt with his name, no matter how you slice it.

    56. Re:Sounds like people need to fix thier names by Anonymous Coward · · Score: 0

      And the Two in Jimmy Two Teeth is not a number, because it doesn't contain any digit, right?

    57. Re:Sounds like people need to fix thier names by Anonymous Coward · · Score: 0

      William Henry Gates III ?

    58. Re:Sounds like people need to fix thier names by Anonymous Coward · · Score: 0

      And then there is Zhuyin, used in Taiwan, commonly known as BoPoMoFo- an actual Chinese alphabet, of a sort. This is a whole separate keyboard layout.
      I don't even know what Cantonese speakers use.

    59. Re:Sounds like people need to fix thier names by snowgirl · · Score: 2, Interesting

      Funny, I actually use the Chinese IME on Windows... it is called "Chinese (simplified) - Microsoft Pinyin - New Input Style"

      And I do actually type in characters using Pinyin, because they have adaptive algorithms that guess at what the most likely character to follow is. They guess well, but it also displays 9 choices at a time, that you select with number keys.

      --
      WARNING! This girl exceeds the MAXIMUM SAFE standards established by the FDA for BRATTINESS
    60. Re:Sounds like people need to fix thier names by maweki · · Score: 1

      THX1138

      what do you think is his last name?

    61. Re:Sounds like people need to fix thier names by Anonymous Coward · · Score: 2, Interesting

      http://en.wikipedia.org/wiki/Perri_6

      This is how you become top listed in every citation index.

    62. Re:Sounds like people need to fix thier names by Anonymous Coward · · Score: 0

      Who the hell has numbers in there name?

      2girls1cup?

    63. Re:Sounds like people need to fix thier names by Anonymous Coward · · Score: 0

      http://en.wikipedia.org/wiki/Jennifer_8._Lee

    64. Re:Sounds like people need to fix thier names by Patch86 · · Score: 1

      Surely you would type his name "Ochocinco", not "85", though? Thus no problem, from a database point of view.

    65. Re:Sounds like people need to fix thier names by identity0 · · Score: 1

      For one that is not a publicity stunt/modern tomfoolery, there's Ichiro Suzuki in baseball.

      "Ichiro" means "bright" or "number 1", and has the Japanese character for "one" in it. It's a regular name, and most Japanese would not think about the fact that it has a number.

      An alternate form of "Ichiro"(as written in Japanese, but pronounced and transliterated to English the same way), which is more common, means "firstborn son", and is a common name in Japan. There is a whole series of male names that you can use to indicate their order: Ichiro (firstborn), Jiro (secondborn), Sanjiro (thirdborn), Yojiro (Fourthborn), and so on.

      More fun Japanese names with numbers:
      Admiral Isoroku Yamamoto of WWII fame, whose first name means "fifty six", because that's how old his father was when he was born.
      Junichiro Koizumi, recent Prime Minister of Japan, the "ichiro" in Junichiro means "firstborn"
      Joichi "Joi" Ito, Tech journalist and Creative Commons CEO his name contains the character for "one".

    66. Re:Sounds like people need to fix thier names by Anonymous Coward · · Score: 0

      Sa5m - http://en.wikipedia.org/wiki/Bandslam

    67. Re:Sounds like people need to fix thier names by crossmr · · Score: 1

      !xabbu was a central character in one of Tad Williams books.

    68. Re:Sounds like people need to fix thier names by dargaud · · Score: 1
      I was really surprised to discover this US tradition of naming offsprings with the same first name, and adding 'Junior' or 'III'. How arrogant is that to want your kids to be just like you even in name, feel like starting a dynasty ?!?

      Oh, and please notice that only males do that, I've never seen a Jane Junior Doe.

      --
      Non-Linux Penguins ?
    69. Re:Sounds like people need to fix thier names by warrenb10 · · Score: 1
    70. Re:Sounds like people need to fix thier names by Hognoxious · · Score: 3, Funny

      Without it he'd get three offtopic mods, one overrated, and two replies saying [citation needed]

      --
      Confucius say, "Find worm in apple - bad. Find half a worm - worse."
    71. Re:Sounds like people need to fix thier names by Anonymous Coward · · Score: 0

      You have not been online the last hum... 20 years? Or you simply aren't 1337 enough!!!1111

    72. Re:Sounds like people need to fix thier names by CrashandDie · · Score: 1

      You are a little confused. Please reread the Wikipedia article on Hanyu Pinyin.

      No, no; he was right at the time he posted his comment. But then I decided he shouldn't be, so now he isn't.

    73. Re:Sounds like people need to fix thier names by magpie · · Score: 1

      Queen Elizabeth II? You might have heard of them.

    74. Re:Sounds like people need to fix thier names by Anonymous Coward · · Score: 1, Insightful

      THX1138

    75. Re:Sounds like people need to fix thier names by bickerdyke · · Score: 1

      William Gates III.

      --
      bickerdyke
    76. Re:Sounds like people need to fix thier names by Anonymous Coward · · Score: 0

      From the Wiki article:

      "While he references his last name as two words 'Ocho Cinco', his legal last name is condensed into one word, as spelled on his name change application."

      Similarly, I knew a woman whose legal name was Earl Line, because that's the way her mother spelled Earline on her birth certificate.

    77. Re:Sounds like people need to fix thier names by Anonymous Coward · · Score: 0

      Who the hell has numbers in there name?

      Bob Jones 3, son of Bob Jones Jr.

    78. Re:Sounds like people need to fix thier names by Anonymous Coward · · Score: 0

      "Henry was financially independent, having inherited his father's tar-and-feather business, and was therefore able to devote his full time to such intellectual pursuits as writing. I particularly remember a heartwarming novel of his about a young necrophiliac who finally achieved his boyhood ambition by becoming coroner."

      Man... Lehrer consistently cracks me up. God only knows how many times I've listened to "An Evening Wasted with..."

      "Mr. Lehrer's muse [is] not fettered by such inhibiting factors as taste" - the New York Times.

      Sorry to be a fanboy but... That's gooood satire.

      - burnttoy.

    79. Re:Sounds like people need to fix thier names by delinear · · Score: 1

      You would think so, yet two facts render your point moot. 1) I've encountered people with female names online, and 2) we know for a fact that everyone online is either a guy, a guy pretending to be a girl or an FBI agent. Therefore it's highly plausible that even a percentage of /.'s users are guys generating female personas.

    80. Re:Sounds like people need to fix thier names by Hognoxious · · Score: 1

      William Gates III.

      You mention the name without any explanation. Should I have heard of him?

      --
      Confucius say, "Find worm in apple - bad. Find half a worm - worse."
    81. Re:Sounds like people need to fix thier names by delinear · · Score: 1

      It does seem that, given the option, he would rather use "85" as his name. I'm pretty sure the reason he can't is a limitation on not allowing numbers in proper names by whatever body controls name registration in the US, so it's a valid point that the data systems don't handle the names people want to use. As more people become immersed in the internet, it's not unfeasible that this will come up more often, I can imagine some people thinking it would be a hoot to rename themselves to their online handle (and then wondering why nobody will hire someone legally named l337haxx0r85).

    82. Re:Sounds like people need to fix thier names by delinear · · Score: 1

      I guess the only issue would be explaining the name over the telephone in Spanish, but even then a simple explanation that it's the number spelled out as a word would suffice.

    83. Re:Sounds like people need to fix thier names by Hognoxious · · Score: 1

      According to comments in TFA, they've extended the size of unicode character (or whatever they're called) again, and yet there are still plenty missing. Seems to me the whole thing was a futile exercise to start with.

      I wonder if writing systems that use combinations of a small number of characters, as opposed to having a very large number of symbols with more information per unit, are inherently more suited to computer applications. Or is it the case that computers (and forerunner technologies like the telegraph) were developed largely by people who used the former kind of languages?

      --
      Confucius say, "Find worm in apple - bad. Find half a worm - worse."
    84. Re:Sounds like people need to fix thier names by bickerdyke · · Score: 1

      It's the true name of him, who never shall be named, the root of all that's evil.

      Perhaps you might recognize him from his terror instilling face.

      --
      bickerdyke
    85. Re:Sounds like people need to fix thier names by digitig · · Score: 1

      Seven of Nine?

      --
      Quidnam Latine loqui modo coepi?
    86. Re:Sounds like people need to fix thier names by kauttapiste · · Score: 1

      The Whooooosh sound that just went over your head wasn't silent. Although some call it The Whoooosh with a silent 'o' in the middle..luckily Slashdot accommodates both!

    87. Re:Sounds like people need to fix thier names by Anonymous Coward · · Score: 0

      Carl the sixteenth Gustaf Bernadotte of Sweden.

    88. Re:Sounds like people need to fix thier names by digitig · · Score: 1

      Sure it counts. The question was about numbers in the name, not digits or numerals.

      --
      Quidnam Latine loqui modo coepi?
    89. Re:Sounds like people need to fix thier names by Anonymous Coward · · Score: 0

      Good thing we were talking about Japanese then eh? A language that has little to do with english.

      Thanks for showing your meagre knowledge of it though, weaboo fuck.

    90. Re:Sounds like people need to fix thier names by amRadioHed · · Score: 1

      True, but the OP was referring to western numerals. Chinese characters aren't really comparable to individual letters or numerals. As you yourself point out Ichiro is a valid Japanese word.

      --
      We hope your rules and wisdom choke you / Now we are one in everlasting peace
    91. Re:Sounds like people need to fix thier names by amRadioHed · · Score: 1

      Formally pinyin uses numbers or tone marks, but I've never seen that used with a name. Even in my Chinese textbooks the Chinese names have no tone information when written in English text.

      --
      We hope your rules and wisdom choke you / Now we are one in everlasting peace
    92. Re:Sounds like people need to fix thier names by YttriumOxide · · Score: 1

      I once had an acquaintance who changed his legal name to "Da5id" (pronounced "David" of course). The government had no objections to this, although most the people who knew him (myself included) considered him a bit silly for doing so.

      --
      My book about LSD and Self-Discovery
      Also on facebook as: DroppingAcidDaleBewan
    93. Re:Sounds like people need to fix thier names by Swampash · · Score: 1

      Who the hell has numbers in there name?

      William H. Gates III

    94. Re:Sounds like people need to fix thier names by Anonymous Coward · · Score: 0

      Seven.

      It's a perfectly good name!

    95. Re:Sounds like people need to fix thier names by Anonymous Coward · · Score: 0
    96. Re:Sounds like people need to fix thier names by Anonymous Coward · · Score: 0

      Henry V

    97. Re:Sounds like people need to fix thier names by Anonymous Coward · · Score: 0

      III is three i's capitalized, X is a character and two is spelled out. A lot of naming requirements come from governmental data handling requirements. Get the governments to agree with you and you will be halfway there. If the governmental reporting will not take your name, then you should probably pick one you chosen polity likes.

    98. Re:Sounds like people need to fix thier names by DarrenBaker · · Score: 1

      But it's not a stunt. His name is legally, actually, and in every sense that matters, Chad Ochocinco.

    99. Re:Sounds like people need to fix thier names by Gulthek · · Score: 1

      Completely incorrect. Did you get your pinyin education from a Firefly-Chinese site? Pinyin has tone markers.

      Besides, anyone with a Chinese name wouldn't be using pinyin: it's a scheme for western speakers.

    100. Re:Sounds like people need to fix thier names by Gulthek · · Score: 1

      Yeah, but then your mobile phone is translating the pinyin (with varying degrees of accuracy, depending on your phone) into actual Chinese characters.

    101. Re:Sounds like people need to fix thier names by cmdr_klarg · · Score: 1

      Mr. Ochocinco

      For those that aren't privy to American Football. Apparently some guy with the number 85, renamed himself 85.

      A couple of us sneaked a look at his personnel file the day he arrived. It's his IQ.

      --
      THE SOFTWARE, IT NO WORKY!!!
    102. Re:Sounds like people need to fix thier names by FatAlb3rt · · Score: 1

      Not completely accurate - the NFL allows you to put any name you want on a jersey, but maintains a list of blacklisted names. For example, Ron Mexico, the name that Michael Vick used when getting treatment for an STD, can't be put on a jersey. But I'm sure you could put Miseph on one if you wanted.

    103. Re:Sounds like people need to fix thier names by Anonymous Coward · · Score: 0

      Could be represented as Eighty-Five.

    104. Re:Sounds like people need to fix thier names by fishexe · · Score: 1

      A lot of mobile phones, including my Samsung phone, use Pinyin as a way of entering Chinese characters.

      Right, but that's an IME generating Han characters, and once you finish each character it isn't pinyin anymore. The pinyin never gets sent in your text message or stored in a database, it's just a temporary way of picking the character. Also, most pinyin IMEs don't accept tone number inputs anyway, they just pop up the menu based on the letters. Also, it's non-standard to type out full pinyin anyway, because IMEs tend to accept the first letter of each syllable. I usually type out "Wo hen xihuan zhongguo wenhua" where my Chinese friends would just type "whxhzgwh" and have a good chance that it picks mostly the right characters, and only have to pick one or two from the menus.

      You're right about street signs though, I have seen pinyin there.

      --
      "I don't care about the Constitution!" --Bill O'Reilly, November 17, 2009
    105. Re:Sounds like people need to fix thier names by Anonymous Coward · · Score: 0

      There's a guy at Microsoft named Bo3B Johnson (http://www.linkedin.com/pub/bo3b-johnson/13/846/a52). I think he used to work at Apple.

    106. Re:Sounds like people need to fix thier names by BarryJacobsen · · Score: 1

      Oh, and please notice that only males do that, I've never seen a Jane Junior Doe.

      This is due to the common practice of the female taking the males name upon marriage while males maintain their name. It would therefore be insinuating that she would never get married if she was a "Junior".

    107. Re:Sounds like people need to fix thier names by russotto · · Score: 1

      Oh, and please notice that only males do that, I've never seen a Jane Junior Doe

      Of course not, in the US the number goes last -- it's "Jane Doe Junior". It's pretty unusual (but not unheard of) for daughters to be named after mothers, but often they're named after grandmothers, so you can get Dorothy/Jane/Dorothy/Jane/Dorothy/Jane alternating through the generations.

    108. Re:Sounds like people need to fix thier names by geekoid · · Score: 1

      On /., you do.

      --
      The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
    109. Re:Sounds like people need to fix thier names by geekoid · · Score: 1

      Number as names ahve not been allowed by the courts. You can spell a number, of course.

      --
      The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
    110. Re:Sounds like people need to fix thier names by metamatic · · Score: 1

      Or you code in some language where Unicode support is not there by default, and you have to jump through hoops to get it working. Like, say, PHP. Or stable Ruby.

      Stable Ruby has Unicode. 1.9 is stable. We're up to 1.9.1 now.

      --
      GCHQ Quantum Insert installed. If only our tongues were made of glass, how much more careful we would be when we speak
    111. Re:Sounds like people need to fix thier names by geekoid · · Score: 1

      No, he changed his name to the spelling of 85, not to 85.

      Minor difference, but in the US numbers have been shut down by the courts.

      --
      The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
    112. Re:Sounds like people need to fix thier names by Anonymous Coward · · Score: 0

      You mean stable Rails... the rest of us are just doing fine and dandy here with 1.9.

    113. Re:Sounds like people need to fix thier names by Anonymous Coward · · Score: 0

      Worry about all these cases and you'll be fired, because it will take you years just to write the name entry field. People don't understand that in programming you have to make some basic assumptions or you can't get anything done. I'm not saying you shouldn't read that list and keep some of what it says in mind but if you followed all of their suggestions you'd end up writing a custom database for each application.

    114. Re:Sounds like people need to fix thier names by geekoid · · Score: 1

      Thats 3 'l's, and x, and the spelling of a number.

      None of which are actual numbers.

      --
      The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
    115. Re:Sounds like people need to fix thier names by shutdown+-p+now · · Score: 1

      According to comments in TFA, they've extended the size of unicode character (or whatever they're called) again, and yet there are still plenty missing. Seems to me the whole thing was a futile exercise to start with.

      I can tell you that support for my language and locale - at least insofar as the ability to input text in it, and then read that input - in apps not written in my country has increased drastically since Unicode became mainstream (largely thanks to Java and .NET pushing it as the one and only way of handling strings). I've heard similar sentiments from quite a few people from other cultures, too. So, no, it's not futile. It's not a silver bullet, and it's far from perfect or all-inclusive, but it did make the world that much better already, and it keeps developing to cover more.

      Actually, if you look at the existing Unicode coverage, the "plenty missing" characters are typically those from exotic scripts used by something on the order of a few hundred people. The only really big thing is CJK unification, which is more of a political NIH problem.

    116. Re:Sounds like people need to fix thier names by geekoid · · Score: 1

      I couldn't find any evidence that's her legal middle name and not something she just started doing.

      Anyways, in some states it wouldn't be legally excepted.

      --
      The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
    117. Re:Sounds like people need to fix thier names by Voyager529 · · Score: 1

      It's 7 of 9, tertiary adjunct of unimatrix 01. Lots of numbers in that one, really.

    118. Re:Sounds like people need to fix thier names by BluBrick · · Score: 1

      The rest of you can look it up when you get home!

      --
      Ahh - My eye!
      The doctor said I'm not supposed to get Slashdot in it!
    119. Re:Sounds like people need to fix thier names by Jake+Griffin · · Score: 1

      I actually knew someone once whose name was "La-a". I saw it written on paper and assumed it was "LAY - uh". Boy was I wrong. She got pissed off at me and told me, "it's Luh - DASH - uh". Are you freaking kidding me?! You PRONOUNCE the DASH?!?!

      --
      SIG FAULT: Post index out of bounds.
    120. Re:Sounds like people need to fix thier names by Malc · · Score: 1

      The funny thing about the road signs is it does mean Westerners can get around more easily, but it still doesn't help with pronunciation. I had an apartment for a few months on Qingdao Lu in Shanghai... you'd think I'd be able to say the name of China's most common beer properly (Tsingtao - not Pinyin, so different romanisation). I had to get by with North/South/East/West that I was more proficient at from playing Mah Jiang in order to guide taxi drivers home.

    121. Re:Sounds like people need to fix thier names by alcourt · · Score: 1

      Hen3ry

      --
      "I may disagree with what you say, but I will defend unto the death your right to say it." -- Voltaire
    122. Re:Sounds like people need to fix thier names by Anonymous Coward · · Score: 0

      a bit of an oddball, but an old telegraph transliteration system for Arabic made use of numbers and would be a way to represent Arabic names in ASCII.

    123. Re:Sounds like people need to fix thier names by Anonymous Coward · · Score: 0

      "I am reminded at this point of a fellow I used to know whose name was Henry. Only to give you an idea of what an individualist, he was he spelt it H-E-N-3-R-Y. The 3 was silent, you see."

      (Tom Lehrer, from the intro to "We will all go together when we go)

    124. Re:Sounds like people need to fix thier names by mcgrew · · Score: 1

      I take it you're too young to remember The Prisoner?

      "Who are you?"
      "I am number 2."
      "Who's number 1?"
      "You are number 6."
      "What do you want?"
      "Information."

    125. Re:Sounds like people need to fix thier names by isny · · Score: 1

      William Gates III.

    126. Re:Sounds like people need to fix thier names by careysub · · Score: 1

      John3:16 has been a somewhat popular name in the Bible Belt of the U.S. for many years. However attempting to check for its frequency of occurrence in the Social Security Administration's data base: http://www.ssa.gov/OACT/babynames/ revealed that only "alphabetic characters" are allowed!

      --
      Starships were meant to fly, Hands up and touch the sky - Nicky Minaj
    127. Re:Sounds like people need to fix thier names by Albatrosses · · Score: 1

      Hey, as a girl on the internets, I take offense to that!

      No...
      wait...
      You're right, I'm actually a guy.

    128. Re:Sounds like people need to fix thier names by Quirkz · · Score: 1
      I've never liked the tradition, either. Weirdly, my father is Charles, the fourth of his line. They'd gone through nicknames of Charles, Chas, Chuck, and Charlie for the four generations and ran out of names so when they got to me they named me Ross. I'm rather glad of that. Though with the significant variation in nickname, other than when I'm really thinking about it, it's hard to remember that Charlie is named after Chuck.

      We're also not talking any rich family dynasty here, either. We're talking four generations of solidly middle class midwestern farmers.

      Of course my family has plenty of problems with our names. Our last name is a very common woman's first name, and exceptionally uncommon as a last name. And all four Charleses have the same middle name, which I've never heard of as a middle name anywhere else, but which is a common noun. It's all very strange.

      Sometimes, only in weird moments, it does seem like it might have been fun if I could have had a V at the end of my name. But not at the cost of being called Chucky, Chaz, or Chip.

    129. Re:Sounds like people need to fix thier names by SleazyRidr · · Score: 1

      Who the hell has numbers in there name?

      Andre 3000

    130. Re:Sounds like people need to fix thier names by Anonymous Coward · · Score: 0

      There are still people in France with the name 1790.

    131. Re:Sounds like people need to fix thier names by mcgrew · · Score: 1

      Not to mention R3D2 and C3PO.

    132. Re:Sounds like people need to fix thier names by Bigjeff5 · · Score: 1

      THX 1138, that's who. And he'll run away if you don't get it right!

      --
      Security is mostly a superstition... Avoiding danger is no safer in the long run than outright exposure. - Helen Keller
    133. Re:Sounds like people need to fix thier names by Bigjeff5 · · Score: 1

      Yeah, it's those friggin antonym nut-jobs!

      --
      Security is mostly a superstition... Avoiding danger is no safer in the long run than outright exposure. - Helen Keller
    134. Re:Sounds like people need to fix thier names by sydneyfong · · Score: 1

      I'm Chinese, and deal with CJK characters on a daily basis. My pet peeve is when developers make crazy assumptions about locales -- For example, I live in Hong Kong, and every so often I get a Chinese version of a website even when my preferred language is English. Sometimes getting the site to give me English becomes a huge struggle (Google, among others, did this on me).

      Even so, correctly handling Unicode is such a PITA that I normally try avoiding it unless I really, really, really need to deal with it. And I often get the implementation wrong too. Of course, I blame the quirkiness of Unicode. UTF-8 has variable character size, and due to its backwards compatibility with ASCII you often mix up the two. UTF-16 uses 16 bits characters except that it doesn't. UTF-32 is a huge waste of space. And worst of all, most languages interpret Unicode differently, and make various wrong/bad assumptions.

      I wonder how people who have never used anything other than Latin characters are going to get the Unicode stuff right. I was there when GNOME transitioned from 1.x to 2.x. GNOME 2.x was supposed to have "full unicode" support, but it took them literally YEARS before input and display of Chinese characters actually worked.

      !@#%@%
      </rant>

      --
      Don't quote me on this.
    135. Re:Sounds like people need to fix thier names by DCFusor · · Score: 1

      Been looking at that sig a lot. I don't think you are getting your money's worth at all. GoodLuckWithThat.

      --
      Why guess when you can know? Measure!
    136. Re:Sounds like people need to fix thier names by mcgrew · · Score: 1

      Wall, he's core wrecked. Ewe can knot truss you're spill chucker.

    137. Re:Sounds like people need to fix thier names by sydneyfong · · Score: 1

      I only speak for people in Hong Kong (where Cantonese usage is most common).

      First you need to understand that there are ways to input Chinese characters that are not based on pronunciation. There are a variety of input methods that use strokes and common (visual) components of the characters. Most of them are developed in Taiwan, the most common one is "Changjie". As long as the input method doesn't rely on pronunciation, Cantonese speakers can use it.

      There are a few "Cantonese" input methods, but none are standard. Quite a few are proprietary, which means it's a pain to switch computers if you get too comfortable with them. There are a few romanization systems, see eg. http://arts.cuhk.edu.hk/Lexis/lexi-can/

      Just for your interest, since there are no official romanization systems for Cantonese, we have a wide range of ad hoc substitutes that vary from person to person, and sometimes even self-inconsistent.

      For example, the classic Mandarin "Ni hao" (i.e. Hello) could have the following romanizations in Cantonese:
      - nei ho
      - lei ho (very common mispronunciation)
      - nei hoe
      (and maybe other creative ones I can't come up with right now)

      The message usually gets across though ;-p

      --
      Don't quote me on this.
    138. Re:Sounds like people need to fix thier names by shutdown+-p+now · · Score: 1

      Even so, correctly handling Unicode is such a PITA that I normally try avoiding it unless I really, really, really need to deal with it. And I often get the implementation wrong too. Of course, I blame the quirkiness of Unicode. UTF-8 has variable character size, and due to its backwards compatibility with ASCII you often mix up the two. UTF-16 uses 16 bits characters except that it doesn't. UTF-32 is a huge waste of space.

      The representation only matters when you deal with it directly. A good framework should hide all that. Why would you care if Gtk+ uses UTF-8, and Java uses UTF-16, if string handling APIs in both do what you want them to do?

      And worst of all, most languages interpret Unicode differently, and make various wrong/bad assumptions.

      By "interpret Unicode differently", do you mean different representations?

      And what wrong/bad assumptions do you have in mind?

      I can think of one, which is exposing individual not-quite-characters (rather 16-bit values which may be characters, or may be parts of a surrogate pair) on strings via a deceivingly accessible operator[] in Java in .NET. I've seen too much code that assumes that s[i] is a character because of that.

      For .NET specifically, a major annoyance is that the stock regex library doesn't understand surrogates at all, so it effectively only supports UCS2, not UTF-16.

      Anything else?

    139. Re:Sounds like people need to fix thier names by shutdown+-p+now · · Score: 1

      Been looking at that sig a lot. I don't think you are getting your money's worth at all.

      Well, I reside in Canada, and I kinda like the arrangement - the whole "peace, order and good government" thing seems to be working fine.

      For those in other countries, YMMV. In my home country, paying taxes was definitely a complete waste of money.

    140. Re:Sounds like people need to fix thier names by sydneyfong · · Score: 1

      Pinyin is taught in Chinese primary schools. Chinese people use it, except maybe not in the same way that westerners use it.

      If I remember correctly, Pinyin was sort of the first phase of a proposed transition from Chinese characters to a more alphabet based language. The leaders and intellectuals during the revolution periods of China in early 1900s believed that using Latin alphabets to replace Chinese characters would boost literacy rates because Chinese characters were too hard to learn.

      That proposal was obviously shelved, but Pinyin has remained, and remains useful for a number of reasons. Textbooks could be annotated in Pinyin to help students learn pronunciations of new Chinese characters, Pinyin could be used as an input method for computer systems, etc.

      The fact that it would help westerners is just an added bonus.

      --
      Don't quote me on this.
    141. Re:Sounds like people need to fix thier names by Anonymous Coward · · Score: 0

      I recall in several "Peanuts" cartoons there was a character named "5". He explained that his parents had decided that since everybody was just going to be a number in the future that they might as well name him with one in the first place. His last name was all digits too, but I don't remember what they were. At one point he corrected someone saying his last name because they accented the wrong digit.

    142. Re:Sounds like people need to fix thier names by Anonymous Coward · · Score: 0

      Actually, fans refer to him as "Ocho Cinco", but he forgot to include the space on his name change paperwork. The league wouldn't let him have "Ocho Cinco" on his jersey, because it wasn't official his last name, so he got stuck with "Ochocinco".

      Management gets the last laugh. Again.

    143. Re:Sounds like people need to fix thier names by Asian+Freud · · Score: 0

      "Pinyin to help students learn pronunciations of new Chinese characters."

      Yes, piny-in -- combination of sounds.

      --
      Excellence is an attitude.
    144. Re:Sounds like people need to fix thier names by Thing+1 · · Score: 1

      Therefore it's highly plausible that even a percentage of /.'s users are guys generating female personas.

      Okay, sure, but are they doing it via spontaneous generation? (Or do they have to wait a while, like double-posting to Slashdot?)

      --
      I feel fantastic, and I'm still alive.
    145. Re:Sounds like people need to fix thier names by PopeRatzo · · Score: 1

      The X in Malcolm X is not a number.

      I guess you're right. It's a variable, isn't it?

      Drop me an email, jedidiah. I've got some extra sense of humor I could send you. I hate to see you have to go without.

      --
      You are welcome on my lawn.
    146. Re:Sounds like people need to fix thier names by Anonymous Coward · · Score: 0

      Well, you've got that mostly right, but a lot of people actually use pinyin-based IMEs. Perhaps this is what the grandparent post was referring to? That is, you type out "Zhang zi yi", or "Zhang1 zi3 yi2" and then manually select whichever Zhang character, whichever zi character, and whichever yi character you mean. When you take a single syllable like "shi", which serves as the pronunciation for 78+ commonly used characters, this selection time can be very... slow. But pinyin-based IMEs are pretty commonly used in mainland China.

    147. Re:Sounds like people need to fix thier names by Anonymous Coward · · Score: 0

      I'm praying to $DEITY that he gets traded to a team where the 85 jersey number is already taken.

    148. Re:Sounds like people need to fix thier names by PCM2 · · Score: 1

      Who the hell has numbers in there name?

      I'm surprised everybody thinks this is so strange. There are lots of examples of "weird geek names," but what about Hank Williams III? True, most people in the English-speaking world would write the Roman numeral, rather than writing Hank Williams 3... but what's stopping them? And even then, a Roman numeral is a number. You wouldn't want to run a regex on the name string, decide it was all [A-Za-z], and output his name as "Mr. III."

      --
      Breakfast served all day!
    149. Re:Sounds like people need to fix thier names by Miseph · · Score: 1

      I'm sure they'd insist upon it...

      --
      Try not to take me more seriously than I take myself.
  3. Rip out the vowels by jimmydevice · · Score: 2, Funny

    and let god sort them out...

    1. Re:Rip out the vowels by bkpark · · Score: 3, Funny

      and let god sort them out...

      If written Hebrew is any indication, God doesn't bother with vowels either, apparently.

    2. Re:Rip out the vowels by Anonymous Coward · · Score: 0

      If written Hebrew is any indication, God doesn't bother with vowels either, apparently.

      You dare befoul the name of YHWH with that ovoid abomination?!

    3. Re:Rip out the vowels by Anonymous Coward · · Score: 0

      *avoids argument of how to spell YHWH*

      How do you say that again? I've been meaning to yell it as a curse a few times...

    4. Re:Rip out the vowels by daem0n1x · · Score: 1

      Then we'll all have Polish names.

    5. Re:Rip out the vowels by tepples · · Score: 1

      *avoids argument of how to spell YHWH*

      How do you say that again?

      It's pronounced "haShem".

  4. I've been dealing with this for years. by Wonko+the+Sane · · Score: 4, Interesting

    I am fortunate enough to be the child of a professional smart-ass who intentionally gave all his children two middle names so that we would not fit into the computer systems of the era.

    When I grew up my parents used my first middle name as a "given nickname" (it's actually in quotation marks on my birth certificate). So most of the time when I give my name for something I use my "given nickname" as my first name. Unless I feel like using my legal first name as my first name in which case I use that. There are probably four or five different versions of my name attached to my SSN in various different databases.

    I've also got a sufffix: III. I don't have two ancestors with the exact same name as me, but since the various parts come from two different relatives my parents settled on III.

    1. Re:I've been dealing with this for years. by Graff · · Score: 5, Funny

      I prefer the story of this mom.

    2. Re:I've been dealing with this for years. by Anonymous Coward · · Score: 1, Informative

      It's also fun when a parent has the same first name, yet a different middle name, but the problem being that the middle name has the same first letter. So all the damn computer databases that insist on reducing the middle name to an initial are a pain in the ass. And no, I'm not interested in all this senior citizen stuff I'm not qualified for. (Give another 30 years maybe.) I also wonder if the ol' fart is getting junk mail relating to video games and electronics that he likely has no interest in. The real problem comes up in billing and city stickers and things like that.

      The only solution so far is that I put both my first and middle name in the "first name" field in cases where a space is allowed as a valid character. It's something I'll have to keep doing until enough people get a clue and changes their database conventions.

    3. Re:I've been dealing with this for years. by fishexe · · Score: 1

      I don't have two ancestors with the exact same name as me, but since the various parts come from two different relatives my parents settled on III.

      Normally, you get the parts from different relatives and don't get a suffix. In fact, that's extremely common. So common that getting various parts of your name from multiple relatives could not possibly justify adding "III". I'm guessing that's just what they told you, and the real reason was to make you feel special. Either that, or you're disinherited royalty.

      --
      "I don't care about the Constitution!" --Bill O'Reilly, November 17, 2009
    4. Re:I've been dealing with this for years. by arekq · · Score: 2, Interesting

      A similar issue happens with Chinese names.
      Most Chinese people have one word or two word names.
      If a person have a two word name and fill it in in the form: "Chow, Yun Fat", the system likely would take "Yun" as the middle and and "Fat" as the first name, or vice versa, which often reduce the name to "Chow, Yun", or "Chow, Fat".
      One way to reduce this confusion is to use hyphen to join the words, like "Chow, Yun-Fat".

    5. Re:I've been dealing with this for years. by inflamed · · Score: 1

      I name my constructs like certain aboriginal cultures name people: based on the first thing I see around my work desk when I decide to assign something to the variable, or what I will be doing when I decide to use my construct again. I am not a professional programmer though; I mainly do scientific modeling work and use scripting languages wrapped around C++ libraries.

    6. Re:I've been dealing with this for years. by identity0 · · Score: 1

      In the same vein, I once met a geek who changed his last name to "dot net", all lowercase, with an umlaut over the "o" in "dot".

      I asked him if he ever had problems with databases. He just shrugged and said "yes".

    7. Re:I've been dealing with this for years. by houghi · · Score: 1

      Then the computer system of that era was lousy. My mother has 4 names. That would mean that if she were to get to be put in some system (working, getting married, getting the US nationality) the system would not be able to cope. Having more then just a first and middle name is pretty common in many parts of the world.

      I have no middle name and I have seen places where I needed to enter a middle name or at least a letter, otherwise I could not proceed.

      --
      Don't fight for your country, if your country does not fight for you.
    8. Re:I've been dealing with this for years. by jimmux · · Score: 1

      I feel your pain.

      My parents decided to give me six names. That's right.

      Because they couldn't give me more than two middle names on the birth certificate they hyphenated them together. For example (not my actual name): First Second-Third Fourth-Fifth Last.

      My name would often not fit on official documents so I always had trouble identifying myself. It was actually impossible for me to get a driver's licence because my various forms of ID were inconsistent, so I had my name changed by deed poll to: First Second Third Last (again, not my real name).

      Incidentally, this was when I discovered that I spelled one of my middle names wrong my whole life. Now I have much less trouble, but my name still barely fits on tax forms - to the letter.

    9. Re:I've been dealing with this for years. by JaredOfEuropa · · Score: 4, Interesting

      I have an apostrophe in my surname, and you'd be surprised at how many systems break when I try to enter it... even in this day and age where character escaping and scrubbing for SQL are readily available in most languages, often even in the standard libraries. And you'd be surprised at how many systems return a response that hints at something like that cartoon being possible...

      Even worse are the systems that seem to accept the response, then break down internally. I've had some bitter arguments over reservations at car rental and airline check in counters.

      --
      If construction was anything like programming, an incorrectly fitted lock would bring down the entire building...
    10. Re:I've been dealing with this for years. by unkiereamus · · Score: 4, Interesting

      I actually knew a girl in HS who came from a very traditional Mexican family, as a result, she had 7 middle names.

      Here's the thing, in California, in order to be issued a driver's license, your full name had to appear on the card, and there was insufficient space for all of her middle names, as a result, in order to get a driver's license, she had to have her name legally changed.

      --
      I needed a sig so people would know who I am, but I was too drunk to make something witty, so you get this instead.
    11. Re:I've been dealing with this for years. by roman_mir · · Score: 1

      Hey, it's an excellent reason not to fill out tax forms at all.

    12. Re:I've been dealing with this for years. by dargaud · · Score: 1
      That's funny. I have a colleague whose parents couldn't agree on a name when he was born. The father wrote down his choice on his birth certificate and the mother was pissed after she discovered it. They each always called him by the name they chose. And even now in his 50s, his colleagues call him by his 'official' name while his friends call him by his 'mother's' name.

      But all that pales in comparison to a student I knew who came from old nobility: he had over 50 words in his name...

      --
      Non-Linux Penguins ?
    13. Re:I've been dealing with this for years. by Tim+C · · Score: 1

      Both my brother and I have two middle names, and they often don't fit on forms - printed ones don't leave enough space, computer ones reject the space between them, etc. I've also had to fill in forms that allow zero or one middle initials, not two (or more - I've known people with more). I visited the doctor for the first time in years last week, and noticed that the computer system she was using didn't have both of my middle names.

      My father has one middle name, but uses that as his first name - legally he's "Antony Michael", but he calls himself Mike, as that's what his parents always called him. (And I notice that the Firefox dictionary is rejecting "Antony" as being misspelt...)

      As you demonstrate, even perfectly ordinary English names can cause trouble.

    14. Re:I've been dealing with this for years. by soppsa · · Score: 1

      That'd almost be funny if it wasn't pasted on slashdot near daily.

    15. Re:I've been dealing with this for years. by digitig · · Score: 1

      And when a Chinese woman gets married she usually prepends her husband's family name to hers. My wife's name is Lo Chan Suk Kuen. The Lo is the Chinese form of my family name, "Chan" is her family name and "Suk Kuen" is her given name. None of them is a middle name.

      --
      Quidnam Latine loqui modo coepi?
    16. Re:I've been dealing with this for years. by amRadioHed · · Score: 1

      Two character names you mean, not two words. There's no real need to put a space or dash between the syllables. Of course many Chinese do prefer the space when writing in English, so it should be supported.

      --
      We hope your rules and wisdom choke you / Now we are one in everlasting peace
    17. Re:I've been dealing with this for years. by Graff · · Score: 1

      That'd almost be funny if it wasn't pasted on slashdot near daily.

      Except this time it's topical!

      I admit it, I went for the cheap laugh. It's not something I'm proud of but someone had to do it.

    18. Re:I've been dealing with this for years. by Wonko+the+Sane · · Score: 1

      and the real reason was to make you feel special

      No, they were just a couple of smart-asses.

    19. Re:I've been dealing with this for years. by Anonymous Coward · · Score: 1, Informative

      I do as well, and it's hilarious or maddening depending on what mood I'm in. I mean, seriously, surnames with apostrophes date back hundreds of years.

    20. Re:I've been dealing with this for years. by DarenN · · Score: 1

      That's retarded.

      I have two middle names on my birth certificate, and no-one here in Ireland cares whether I use them or not. In fact, the middle name boxes are marked "optional" on the drivers license and passport application forms. My new passport contains no middle names, because filling them out in immigration forms when travelling is a pain in the ass, although my old passport had one, and the one before that had both, and the old passport was considered enough of a proof of identity to get the new one. Same idea with the drivers license. They already know who I am, as long as I use my real name on official documentation they're not too bothered.

      Why be that anal about it? Internally on all these systems you're a number (a passport or PPS number)

      --
      Rational thought is the only true freedom
    21. Re:I've been dealing with this for years. by Anonymous Coward · · Score: 0

      Mod Parent Up

      I'm fucking sick of that comic'); DROP TABLE Slashdot; --SQLi huuuuuuuuuuuuuuuuuuuuuuuuuuurrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrr

    22. Re:I've been dealing with this for years. by metamatic · · Score: 1

      Imagine how sick of it I am, since I posted the same idea before xkcd, and now get people sending it to me as a wonderfully amusing thing xkcd thought up...

      --
      GCHQ Quantum Insert installed. If only our tongues were made of glass, how much more careful we would be when we speak
    23. Re:I've been dealing with this for years. by metamatic · · Score: 1

      I am fortunate enough to be the child of a professional smart-ass who intentionally gave all his children two middle names so that we would not fit into the computer systems of the era.

      I changed my name to a single word years ago. That screws up a lot of databases. For a while the bank had to hand-address my monthly statements. I'd get some questions about my credit card too.

      I've gone back to mostly using my family name again, but I still have no middle name, which messes with the assumptions of particularly bad software. It also gives me a handy way to track who uses which databases for junk mail, though, as I can use a different letter for different databases.

      --
      GCHQ Quantum Insert installed. If only our tongues were made of glass, how much more careful we would be when we speak
    24. Re:I've been dealing with this for years. by geekoid · · Score: 1

      You would be surprised how many system are the sames one from 30 years ago.

      About your Sig:
      Apparently people do care about my children. Like these dicks:
      http://www.parentstv.org/

      I wish some people would stop caring for my children.

      --
      The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
    25. Re:I've been dealing with this for years. by geekoid · · Score: 1

      That sounds like a lie.

      My CA Drivers license just used my middle initial as the middle name. That was 10 years ago. I don't live there any more.

      --
      The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
    26. Re:I've been dealing with this for years. by alexo · · Score: 1

      I actually knew a girl in HS who came from a very traditional Mexican family, as a result, she had 7 middle names.

      Here's the thing, in California, in order to be issued a driver's license, your full name had to appear on the card, and there was insufficient space for all of her middle names, as a result, in order to get a driver's license, she had to have her name legally changed.

      Good thing that Pablo Picasso" never had to get a Californian driver's license.

    27. Re:I've been dealing with this for years. by elrous0 · · Score: 1

      Your parents will regret little things like this when it comes time for you to choose a nursing home for them. I know mine certainly will.

      --
      SJW: Someone who has run out of real oppression, and has to fake it.
    28. Re:I've been dealing with this for years. by Pandrake · · Score: 1

      I have two middle names, which appear on my CA drivers license. They fit, sometimes on two lines, but the biggest advantage is when I got pulled over for a broken tailight and the officer started to write a fixit ticket until he saw my full name and said, "Really? That's too long for me to write out on the ticket. Just get it fixed soon, okay?"

    29. Re:I've been dealing with this for years. by Graff · · Score: 1

      Apparently people do care about my children. Like these dicks:
      http://www.parentstv.org/

      I wish some people would stop caring for my children.

      If more people cared about their children we'd need less people caring about our children!

      Unfortunately there are a lot of people who treat their children as accessories or side effects of sex and that's why those sort of sites spring up, to take place of people actually needing to care for their children. How about people actually watch TV with their children or *gasp* turn it off and go places or play some intelligent games?

      Ahh well, stepping off the soapbox now...

    30. Re:I've been dealing with this for years. by Graff · · Score: 1

      I name my constructs like certain aboriginal cultures name people: based on the first thing I see around my work desk when I decide to assign something to the variable

      Ahh, so just like this comic?

    31. Re:I've been dealing with this for years. by unkiereamus · · Score: 1

      See page 13 of this document [PDF warning.]

      --
      I needed a sig so people would know who I am, but I was too drunk to make something witty, so you get this instead.
    32. Re:I've been dealing with this for years. by Anonymous Coward · · Score: 0

      That's what this is for:
      http://goatkcd.com/327/ (warning: NSFW)

  5. Slashdotted already? by RenQuanta · · Score: 5, Informative

    After just 15 minutes of the story being posted?

    Wow, that's gotta be a personal best for /. (or, the site is a wee bit underpowered... ;)

    Here's the Google cache in the meanwhile: http://webcache.googleusercontent.com/search?q=cache:http://www.kalzumeus.com/2010/06/17/falsehoods-programmers-believe-about-names/

    1. Re:Slashdotted already? by CAIMLAS · · Score: 1

      Not so... back in the day, such a slashdotting was quite regular. Surely you remember that.

      --
      ~/ssh slashdot.org ssh: connect to host slashdot.org port 22: too many beers
    2. Re:Slashdotted already? by RenQuanta · · Score: 2, Funny

      Not so... back in the day, such a slashdotting was quite regular. Surely you remember that

      Yeah, I might, if my memory weren't failing with age. ;-)

    3. Re:Slashdotted already? by The+Wild+Norseman · · Score: 1

      Not so... back in the day, such a slashdotting was quite regular. Surely you remember that.

      He probably does remember, but I think he now prefers to be called 5ir1ey.

      --
      "A government is a body of people usually -- notably -- ungoverned." -Shepherd Book
    4. Re:Slashdotted already? by ultranova · · Score: 1

      Switch to a cloud-based storage, such as Facebook. Those are guaranteed to hold your memories forever, or at least the most embarrassing ones.

      --

      Forget magic. Any technology distinguishable from divine power is insufficiently advanced.

    5. Re:Slashdotted already? by sznupi · · Score: 1

      One day, when dementia will take hold of most of you 5-digit UIDs and below - we, 6-digit UIDs, won't look so stupid anymore... ;p

      --
      One that hath name thou can not otter
  6. Text only cache by SuperKendall · · Score: 2, Informative

    Even the cache needs tweaking to load.

    Text only version.

    --
    "There is more worth loving than we have strength to love." - Brian Jay Stanley
    1. Re:Text only cache by patio11 · · Score: 1

      While I'm busy replacing the pile of molten slag that is my VPS, you can find the whole thing here, served as all static (including assets):

      http://www.bingocardcreator.com/kalzumeus-cache/names.html

  7. Article text by Anonymous Coward · · Score: 5, Informative

    John Graham-Cumming wrote an article today complaining about how a computer system he was working with described his last name as having invalid characters. It of course does not, because anything someone tells you is their name is--by definition--an appropriate identifier for them. John was understandably vexed about this situation, and he has every right to be, because names are central to our identities, virtually by definition.

    I have lived in Japan for several years, programming in a professional capacity, and I have broken many systems by the simple expedient of being introduced into them. (Most people call me Patrick McKenzie, but I'll acknowledge as correct any of six different "full" names, any many systems I deal with will accept precisely none of them.) Similarly, I've worked with Big Freaking Enterprises which, by dint of doing business globally, have theoretically designed their systems to allow all names to work in them. I have never seen a computer system which handles names properly and doubt one exists, anywhere.

    So, as a public service, I'm going to list assumptions your systems probably make about names. All of these assumptions are wrong. Try to make less of them next time you write a system which touches names.

    1. People have exactly one canonical full name.
    2. People have exactly one full name which they go by.
    3. People have, at this point in time, exactly one canonical full name.
    4. People have, at this point in time, one full name which they go by.
    5. People have exactly N names, for any value of N.
    6. People's names fit within a certain defined amount of space.
    7. People's names do not change.
    8. People's names change, but only at a certain enumerated set of events.
    9. People's names are written in ASCII.
    10. People's names are written in any single character set.
    11. People's names are all mapped in Unicode code points.
    12. People's names are case sensitive.
    13. People's names are case insensitive.
    14. People's names sometimes have prefixes or suffixes, but you can safely ignore those.
    15. People's names do not contain numbers.
    16. People's names are not written in ALL CAPS.
    17. People's names are not written in all lower case letters.
    18. People's names have an order to them. Picking any ordering scheme will automatically result in consistent ordering among all systems, as long as both use the same ordering scheme for the same name.
    19. People's first names and last names are, by necessity, different.
    20. People have last names, family names, or anything else which is shared by folks recognized as their relatives.
    21. People's names are globally unique.
    22. People's names are almost globally unique.
    23. Alright alright but surely people's names are diverse enough such that no million people share the same name.
    24. My system will never have to deal with names from China.
    25. Or Japan.
    26. Or Korea.
    27. Or Ireland, the United Kingdom, the United States, Spain, Mexico, Brazil, Peru, Russia, Sweden, Botswana, South Africa, Trinidad, Haiti, France, or the Klingon Empire, all of which have "weird" naming schemes in common use.
    28. That Klingon Empire thing was a joke, right?
    29. Confound your cultural relativism! People in my society, at least, agree on one commonly accepted standard for names.
    30. There exists an algorithm which transforms names and can be reversed losslessly. (Yes, yes, you can do it if your algorithm returns the input. You get a gold star.)
    31. I can safely assume that this dictionary of bad words contains no people's names in it.
    32. People's names are assigned at birth.
    33. OK, maybe not at birth, but at least pretty close to birth.
    34. Alright, alright, within a year or so of birth.
    35. Five years?
    36. You're kidding me, right?
    37. Two different systems containing data about the same person will use the same name for
    1. Re:Article text by Merls+the+Sneaky · · Score: 1

      Names are random data of random length.

    2. Re:Article text by Anonymous Coward · · Score: 0

      A name by any other rose would give me a migraine.

    3. Re:Article text by Anonymous Coward · · Score: 0

      Of course, in the interest of actually getting your system off the ground, you need to not accommodate all of these... a fraction of your users, depending on how many you omit workarounds for, might need to spend a little bit of time adjusting to your system. No, sir, you cannot have an irrational number for a name, fuck off and find a pseudonym.

    4. Re:Article text by Anonymous Coward · · Score: 0

      Its a bit more complicated... a name is a time changing set of arbitrary data elements of arbitrary number (including zero, in someone lacks a name) and length (including zero, just in case someone has a name equal to the empty set). Furthermore, the symbol set is arbitrarily large since there is no legal restriction to use of some alphabet in your name - the limit is merely the human imagination which we may approximate as infinite for now. For this reason alone, we immediately recognize that comprehensive, unrestricted encoding of arbitrary name(s) is mathematically impossible. Use of standardized symbol sets (e.g., ASCII) induces a permanent and irreversible error for a subset of names. Restriction of name set size for an individual of less than or equal to n creates another error, and restriction of name length yet another.

    5. Re:Article text by feepness · · Score: 5, Funny

      Nice rules. Still wouldn't handle my name.

    6. Re:Article text by codeButcher · · Score: 1

      the human imagination which we may approximate as infinite for now

      Which is the pessimistic view. In reality, I think it approaches zero.

      --
      Free, as in your money being freed from the confines of your account.
    7. Re:Article text by Anonymous Coward · · Score: 0

      Humans can't read minds and shouldn't expect computers to be able to read minds either.

      Most of the problems in your list boil down to "I gave the computer a different sequence of tokens as my identifier today than I did yesterday, why doesn't it understand it's me?"...

      If only we could give users some kind of code instead. We could call it something like an ID number, or a login name...

    8. Re:Article text by bertok · · Score: 2, Interesting

      Reminds me of a classic database developer nightmare story that I heard:

      A local school was receiving complaints that two students were getting the exam results and the like mixed up.

      The two students? Identical twins living in the same house, with the same name.. John Smith Jnr.

      Apparently their father was John Smith Snr, and the whole "Senior / Junior" thing has been done for generations of "Johns Smiths", and it was a tradition and all, and we can't just break a tradition just because we had twin boys.. so... we'll name them both John Smith Jnr.

    9. Re:Article text by Swordsmanus · · Score: 1

      I think there's a single elegant approach to follow that allows you to avoid all those assumptions you listed.

      Imagine everyone that will be in your database does porn for a living.

      Dead serious.

    10. Re:Article text by jewelises · · Score: 1

      (Most people call me Patrick McKenzie, but I'll acknowledge as correct any of six different "full" names, any many systems I deal with will accept precisely none of them.)

      Guys! I figured out who Anonymous Coward really is!

    11. Re:Article text by JaredOfEuropa · · Score: 2, Funny

      Who complained? The parents? If so, the only proper response would have been: "Well, what did you expect, numbnuts?"

      --
      If construction was anything like programming, an incorrectly fitted lock would bring down the entire building...
    12. Re:Article text by Anonymous Coward · · Score: 0

      Aside from all that, I just found out recently (in an old Irish Gaelic grammar book), that the ' in names like O'Brien is not really an apostrophe. It is the long vowel mark (called a fada, which means long in Irish).

      The O means "descendant of".

    13. Re:Article text by digitig · · Score: 1

      No, sir, you cannot have an irrational number for a name.

      A Life of Pi?

      --
      Quidnam Latine loqui modo coepi?
    14. Re:Article text by Anonymous Coward · · Score: 0

      Humans can't read minds and shouldn't expect computers to be able to read minds either.

      I knew you were going to say that.

    15. Re:Article text by Anonymous Coward · · Score: 0

      Really? I always thought he was David Davidson.

    16. Re:Article text by geekoid · · Score: 1

      Nipple?

      --
      The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
    17. Re:Article text by geekoid · · Score: 1

      Or to ask them how Sr/Jr can be done foe more the 2 generations.

      --
      The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
    18. Re:Article text by Anonymous Coward · · Score: 0

      Please introduce me to several examples of number 40. Thanks.

    19. Re:Article text by JesseMcDonald · · Score: 1

      For this reason alone, we immediately recognize that comprehensive, unrestricted encoding of arbitrary name(s) is mathematically impossible.

      Actually, if you start with the assumption that names are either 2D visual images or sounds, both within the range of human perception, then it is possible to define a format capable of representing any single name as a finite-length bitstring. For example, the first bit could represent the type of name (visual or auditory), after which visual names could be encoded as SVG while auditory names are represented by an MP3 (or FLAC or WAV, etc., if MP3 is too lossy).

      --
      "The state is that great fiction by which everyone tries to live at the expense of everyone else." - Bastiat
    20. Re:Article text by russotto · · Score: 1

      Or to ask them how Sr/Jr can be done foe more the 2 generations.

      Several ways.

      1) Dynastic: Sr. (called that when Jr. is born), Jr., III, IV, etc
      2) Dynamic: Sr. (called that when Jr. is born), Jr. (called only when his dad is around -- later in life will be called Sr. when his son is around), Jr. (called that when his dad or grandpa are around)
      3) Alternating: Sr, Jr, Sr, Jr. Weird, but people do it.

    21. Re:Article text by Bigjeff5 · · Score: 1

      Actually he does, right here:

      This list is by no means exhaustive.

      --
      Security is mostly a superstition... Avoiding danger is no safer in the long run than outright exposure. - Helen Keller
  8. who needs vowels? by theNAM666 · · Score: 3, Interesting
    1. Re:who needs vowels? by impaledsunset · · Score: 1

      It does have a vowel, it's just not written down. When you pronounce this sequence of consonants (prst), a special mid back unrounded vowel would appear between "pr" and "st". If you try pronouncing it "prst" and "prast" (where "a" signifies this vowel), you'll notice it sounds the same. Other languages have the same word, and they have a letter for the vowel. Czech doesn't seem to have the letter, but the vowel would still be there implicitly, since you can't technically pronounce it without it (though if you try hard it will be almost inaudible). If you hear the pronunciation in Wiktionary, you'll notice that the vowel is evident there.

  9. Dumbfuck summary by oldhack · · Score: 5, Insightful

    Names of what?!

    --
    Fuck systemd. Fuck Redhat. Fuck Soylent, too. Wait, scratch the last one.
    1. Re:Dumbfuck summary by bigstrat2003 · · Score: 4, Informative

      Yeah, TFS is very ambiguous about that. Turns out that TFA is talking about names of people, and the pitfalls you can run into when allowing someone to enter their name into a system.

      --
      "16MB (fuck off, MiB fascists)" - The Mighty Buzzard
    2. Re:Dumbfuck summary by thePowerOfGrayskull · · Score: 2
      Indeed. Reading the summary, I thought it was some kind of article on how programmers can't remember names (I know I can't...)

      But basically, it's some dude whining about how - because there is no single set of rules that can be universally applied to all names - no systems handle them correctly. That seems kind of self-evident to me; computers are rules-based creations. If you can't define the rules, it sure is hard to code for them. Blaming the programmers is stupid - as his own article shows. (eg. "[don't assume] Names are case-sensitive. [don't assume] Names are case-insensitive".

      Not sure why this made it to slashdot -- it's just a rant.

    3. Re:Dumbfuck summary by sjames · · Score: 2, Informative

      Many of the systems that handle names the worst are the ones that try to be "clever", doing things like insisting on first (and only first) letter capitalized, rejecting digits, refusing to allow middle name (or initial) to be blank, always using the first letter of the Middle name and adding a period after or refusing to accept a single character as a name, and many more sins. The "dumb" systems are actually more graceful about it.

      The best policy is to accept what is entered. Even that tends to fail if someone has more than 3 names. Then there's the Spanish naming conventions.

    4. Re:Dumbfuck summary by maxwell+demon · · Score: 1

      Blaming the programmers is stupid - as his own article shows. (eg. "[don't assume] Names are case-sensitive. [don't assume] Names are case-insensitive".

      OK, so how should, in his opinion, the system be programmed? Having a check box to tell whether the name is case sensitive or not? But then, there's surely someone with a name where some part is case sensitive, while others are not.

      OK, I've got the ultimate solution for him. Instead of simply asking for a name, ask for a piece of code implementing a function which gets a name as input, and as result gives true (yes, this is a valid way to write my name) or no (no, that's not a valid way to write my name).

      OK, but there's another problem:

      11. People’s names are all mapped in Unicode code points.

      So we cannot simply accept normal strings, because whatever character set we assume, there is always the possibility of a character which isn't in it. Well, let's make a drawing area where anyone can write his name. The function above would get the drawing area and decide if what is found in there is a valid writing of the name.

      But wait, there might be names that cannot be written! Well, allow audio files as well! Actually, allow the user to come up with data any way he likes; since he is the one writing the function to recognize it, it's his problem to identify it.

      What, too hard for you to write your own matching function? Well, then simply accept that computers simply cannot handle any conceivable situation. If your name doesn't fit into Unicode, well, then transcribe it. Have several ways to write your name? Well, just chose one to use at the system and stick to it. Why should programmers try to do the impossible just because you cannot decide how to write your name?

      --
      The Tao of math: The numbers you can count are not the real numbers.
    5. Re:Dumbfuck summary by NewWorldDan · · Score: 1

      Yeah, it also turns out that a lot of times you need to be able to name any arbitrary thing, be it an object, animal, person, business, concept, or whatever. I generally try not to put any unneccesary constraints on my users, other than providing an ASCII approximation, which works so far because I only have customers in North America. Also, Unicode is an invention of the devil. Addresses are equally problematic. Often times, localities and the USPS can't agree on what is correct. The locals will insist on using the name of their little suburb while the USPS wants the major city nearby because that's where the post office is. And international addresses? Fuhgetaboutit.

    6. Re:Dumbfuck summary by EricX2 · · Score: 1

      I was trying to do something online once and I had to enter in my drivers license number. I'm not sure about other states, but in WA state the number starts with the first 5 letters of your last name, if you have less than 5 it puts * in it's place, the system said that was an invalid character.

    7. Re:Dumbfuck summary by DamnStupidElf · · Score: 1

      My real name is an infinite non-repeating sequence of symbols taken from an uncountably infinite alphabet, you insensitive clod!

  10. I don't know what the complaint is about? by jackb_guppy · · Score: 1, Interesting

    Most developers, do not get that the world is made up multiple standards and refuse to consider local vs database relationships like:

    Boolean: Yes/Si/... No/No/..

    Amounts: ,. ., 0,2,3 places from right for display, 0,2,3 places from right for value (USD: ,.22, JPY: .,20)
    Dates:: ./ 0 suppressed ISO, JPN, USA

    How do you thing the do working with phone@, addresses and names?

    Database engines fail with these simple-complex constructs because sorting and matching tests are still left hand and character set driven. A database MUST treat all of these names the same: McClean, MacClean, MCLean, Mc Clean, Mac Clean. McCleen, ...

    1. Re:I don't know what the complaint is about? by scdeimos · · Score: 4, Interesting

      A database MUST treat all of these names the same: McClean, MacClean, MCLean, Mc Clean, Mac Clean. McCleen, ...

      Are you sure? What if "Mac Clean" is actually somebody's first and last names?

      I know plenty of people whose legal name is a single word, such as "Alex", "Max" or "Virgil." Would your system put that in the first_name, middle_name or surname column? Storing names and using them sensibly is hard, as TFA acknowledges.

      You'd think that e-mail addresses by comparison would be simpler, but I have a hard time trying to register my e-mail address with sites that won't allow even simple things like "+", "-" or "." characters in the local part.

    2. Re:I don't know what the complaint is about? by Dragonslicer · · Score: 4, Interesting

      A database MUST treat all of these names the same: McClean, MacClean, MCLean, Mc Clean, Mac Clean. McCleen, ...

      I assume you left out a "not" in that sentence? I think there are quite a few people that will kindly (or maybe not-so-kindly) explain why "Mc" and "Mac" are not the same.

    3. Re:I don't know what the complaint is about? by paeanblack · · Score: 3, Informative

      You'd think that e-mail addresses by comparison would be simpler, but I have a hard time trying to register my e-mail address with sites that won't allow even simple things like "+", "-" or "." characters in the local part.

      Proper email validation is not trivial

      Check out the huge regex at the bottom of the RFC 5322 compliant validator from CPAN:

      http://cpansearch.perl.org/src/RJBS/Email-Valid-0.184/lib/Email/Valid.pm

    4. Re:I don't know what the complaint is about? by larry+bagina · · Score: 1

      that's for email addresses as used in email programs, including shit like an optional full name ("First, Last" ). Unless you're parsing mail headers, you don't need to deal with all that shit.

      --
      Do you even lift?

      These aren't the 'roids you're looking for.

    5. Re:I don't know what the complaint is about? by nacturation · · Score: 3, Informative

      A database MUST treat all of these names the same: McClean, MacClean, MCLean, Mc Clean, Mac Clean. McCleen, ...

      I assume you left out a "not" in that sentence? I think there are quite a few people that will kindly (or maybe not-so-kindly) explain why "Mc" and "Mac" are not the same.

      Read between the lines a bit. Treat them the same means: treat them as all potentially valid, not that all the names would match in a string comparison.

      --
      Want to improve your Karma? Instead of "Post Anonymously", try the "Post Humously" option.
    6. Re:I don't know what the complaint is about? by Anonymous Coward · · Score: 0

      What sites don't allow -'s and .'s? I use em all the time and have never had a problem.

    7. Re:I don't know what the complaint is about? by scdeimos · · Score: 1

      Proper email validation is not trivial

      Beg to differ.

      Most sites are sending challenge e-mails with a hyperlink that you have to click on to validate that it's a current and correct e-mail address. Why have a complex regular expression in front of it as well? Either the person gets the e-mail and clicks the link, or they got it wrong and never receive it.

      The regular expression, if one must be used, doesn't need to be any more complex than:

      ^[^@]+@[^@]+$

    8. Re:I don't know what the complaint is about? by Anonymous Coward · · Score: 1, Interesting

      >A database MUST treat all of these names the same: McClean, MacClean, MCLean, Mc Clean, Mac Clean. McCleen, ..

      Given that you used upper case for the "MUST", it would be nice if your statement wasn't total garbage. McClean and MacClean aren't the same surname. You do know that, right? They sound the same, but they are actually different. I am an HR systems developer and I can tell you that people get very upset when people spell their names incorrectly, including using incorrect capitalisation.

      If you had stated that some kinds of searches should return names with spellings that are similar to (or sound like) the specified search criteria, you might be kind of on the right track, but to say they must be treated the same is patently utter toss. When you have 40,000 employees, you are likely to have multiple people with very similar names. I'd kind of like not to treat them as if they are the same person and if I'm searching for one, I don't always want to have to put up with a bunch of inexact matches just because some bozo who has no idea has decided they all need to be treated as being the same.

    9. Re:I don't know what the complaint is about? by Anonymous Coward · · Score: 4, Funny

      A database MUST treat all of these names the same: McClean, MacClean, MCLean, Mc Clean, Mac Clean. McCleen, ...

      I assume you left out a "noot" in that sentence? I think there are quite a few people that will kindly (or maybe not-so-kindly) explain why "Mc" and "Mac" are noot the same.

      fixed that

    10. Re:I don't know what the complaint is about? by scdeimos · · Score: 2

      A lot of forums and banking sites written by noob programmers for starters. Notice I said "local part", I haven't found one that cares about -'s and .'s in the domain part.

    11. Re:I don't know what the complaint is about? by fishexe · · Score: 1

      A database MUST treat all of these names the same: McClean...MCLean...[et cetera]...

      But what if they're substantively not the same? McLean would be the son of Lean, and McClean would be the son of Clean (referring to the surnames' origins). Would you insist all databases should treat "lean" and "clean" as though they were the same word?

      --
      "I don't care about the Constitution!" --Bill O'Reilly, November 17, 2009
    12. Re:I don't know what the complaint is about? by fishexe · · Score: 4, Funny

      A database MUST treat all of these names the same: McClean, MacClean, MCLean, Mc Clean, Mac Clean. McCleen, ...

      I assume you left out a "not" in that sentence? I think there are quite a few people that will kindly (or maybe not-so-kindly) explain why "Mc" and "Mac" are not the same.

      Yeah, one goes in front of 'Donald's' and the other goes in front of 'beth'.

      --
      "I don't care about the Constitution!" --Bill O'Reilly, November 17, 2009
    13. Re:I don't know what the complaint is about? by clarkkent09 · · Score: 1

      I know plenty of people whose legal name is a single word, such as "Alex", "Max" or "Virgil."

      Ummm...really? That must not be fashionable where I live cause I've never met a single one. But your point still stands I guess, a system has to account for that possibility.

      --
      Negative moral value of force outweighs the positive value of good intentions.
    14. Re:I don't know what the complaint is about? by Shoe+Puppet · · Score: 2

      But in practice, shouldn't you just be able to copy the regex from somewhere (open source, with attribution of course) and check if it matches?

      --
      (+1, Disagree)
    15. Re:I don't know what the complaint is about? by TedRiot · · Score: 4, Informative

      True. I run into email validation problems constantly. I have a two-part first name that has "-" in the middle, so my firstname.lastname email addresses (usually work addresses) always have a "-". In addition at the moment I'm a consultant in a large company, where they put "ext-" in front of everyone who is not employed by them but works for them and has an email account from them. I also often run into problems with length, because my name is 19 characters and the last place I worked for had a 15 character company name and when you add TLD to that, you sum to an email address that is 39 characters long, which for some seems to be too much. I really don't get why you would use only 32 characters to store an email address..

      This problem very often bites in name fields, too, that don't accept "-" and two capital letters in my first name.

      And I used to live near a border of two cities, where my postal address was from one city while my real city of residence was the other one. I have had a lot of problems with that, when the guys who made the systems were trying to deduce my city of residence from my postal address. Which is also impossible in my country, because the national post office also permits addresses that have postalnumber + company (instead of city) for large companies who take their mail in one place and deliver it themselves the rest of the way.

    16. Re:I don't know what the complaint is about? by Anonymous Coward · · Score: 5, Insightful

      The regular expression, if one must be used, doesn't need to be any more complex than:

      ^[^@]+@[^@]+$

      Sending out response emails to an improperly validated address just turned you into an open relay. Spammers can use your server to send spam by embedding their entire message as the email address, trailed by '\x004@.'

      Validate your inputs. Always.

    17. Re:I don't know what the complaint is about? by VinylPusher · · Score: 5, Informative

      Wow, if you consider McLean and MacLean the same, I suggest you never visit Scotland.

      The Mc's and the Mac's consider the correct usage as a matter of extreme pride. You could end up with one or more bruises if you get it wrong and then insist that "well, they're the same anyway".

    18. Re:I don't know what the complaint is about? by beelsebob · · Score: 1

      McClean, MacClean, MCLean, Mc Clean, Mac Clean. McCleen, ...

      Coming from Scotland, I can tell you, these names are all very different. What you're suggesting is equivalent to treating Alexandra and Alexander as the same name.

    19. Re:I don't know what the complaint is about? by CarpetShark · · Score: 2, Insightful

      Read between the lines a bit. Treat them the same means: treat them as all potentially valid, not that all the names would match in a string comparison.

      I don't think that's what it meant at all. I think the author is trying to be too smart by suggesting that someone looking for MacDonald might have heard it wrong, and so might type in McDonald instead. It's probably a valid point for fuzzy searches, but to say that they should all be treated the same is wrong.

      That said, his other points, especially about the fact that not all names are properly mapped in unicode, is a good one. I just wish he'd posted citations and solutions, rather than simply pointing out the issues. But the first step in fixing a problem is acknowledging the problem.

    20. Re:I don't know what the complaint is about? by Anonymous Coward · · Score: 0

      You don't understand natural language processing. Such things are not to be decided by some programmer. That'll cause huge problems.

    21. Re:I don't know what the complaint is about? by Anonymous Coward · · Score: 0

      Step one, manage the forces on a liquid to make a constant bowl -- this allows you to make a telescope and point it anywhere you want.

      Bullshit. Those are separate and distinct names.

    22. Re:I don't know what the complaint is about? by jandersen · · Score: 1

      Computers will probably never be better than humans at handling names, although one can hope. Sometimes I despair when I read or hear somebody referring to eg. Djengis Khan as "Mr Khan" ("Khan" is a title, not a name) or even call Hu Jintao, "Mr Jintao"; you would have thought people would, by now, have caught on to the idea that something like half the world's population has the family name first.

      On a completely different subject, though: Why is it that when you travel to the US, you get asked questions like "Are you coming to the US to commit an act of terror?" Does anybody expect to hear anything other than "No" in answer? Is it simply a sort of idiotic bullying tactic - treating you like a fool by forcing you to answer silly questions that clearly nobody takes serious?

    23. Re:I don't know what the complaint is about? by Hognoxious · · Score: 2, Interesting

      Why would you be doing the validation in the database?

      If he'd meant "should treat them all as valid", then he should have written that.

      --
      Confucius say, "Find worm in apple - bad. Find half a worm - worse."
    24. Re:I don't know what the complaint is about? by Hognoxious · · Score: 1

      I think the author is trying to be too smart by suggesting that someone looking for MacDonald might have heard it wrong, and so might type in McDonald instead.

      You mean something like soundex?

      --
      Confucius say, "Find worm in apple - bad. Find half a worm - worse."
    25. Re:I don't know what the complaint is about? by Anonymous Coward · · Score: 0

      So what you're saying is, if an email address (maybe copy-pasted from an email?) is perfectly valid, and we have a good regex handed to us that'll tell us it's valid, we should implement our own homegrown validation that'll reject it because it has a '+' in it, or even (supposing we spend the time to get the part we do care about right, which doesn't usually happen...) because it has a name brought in with it?

    26. Re:I don't know what the complaint is about? by Anonymous Coward · · Score: 0

      Fuck. And I thought I had it rough. Nice to see awareness in the general developer pool.

    27. Re:I don't know what the complaint is about? by Anonymous Coward · · Score: 0

      On a completely different subject, though: Why is it that when you travel to the US, you get asked questions like "Are you coming to the US to commit an act of terror?" Does anybody expect to hear anything other than "No" in answer? Is it simply a sort of idiotic bullying tactic - treating you like a fool by forcing you to answer silly questions that clearly nobody takes serious?

      It is part of a series of question, including "Have you ever smoked marihuana?", "Have you ever had a serious illness?", "Have you ever used explosive devices of any kind?". They are checking to see if you are enough know enough of American culture to realise you have to answer 'No' to all those questions. If you fail, they will deny you access as your lack of cultural sensitivity and lying ability would put you in harms way in America.

    28. Re:I don't know what the complaint is about? by bytesex · · Score: 1

      That question is there so that, if you *do* commit an act of terror *and* you survive *and* you get caught, you can have an additional charge of 'lying' stuck on you. Right next to 'committing an act of terrorism'. Because lying is bad, Ok ?

      --
      Religion is what happens when nature strikes and groupthink goes wrong.
    29. Re:I don't know what the complaint is about? by Hognoxious · · Score: 1

      Sometimes I despair when I read or hear somebody referring to eg. Djengis Khan as "Mr Khan" ("Khan" is a title, not a name)

      It may have originally been a title, but I don't think Imran (the cricketer) Chaka (the singer) or M (the bent one) are monarchs of anywhere.

      --
      Confucius say, "Find worm in apple - bad. Find half a worm - worse."
    30. Re:I don't know what the complaint is about? by Anonymous Coward · · Score: 0

      We'd only be replicating the behaviour of most of the rest of the internet. After testing this first hand when implementing the (proper) validation a shocking number of sites refuse perfectly valid email addresses (I guess the moral is, don't have a stupid email address full of weird punctuation, but even so you'd think everyone could get this right when they've essentially told us the answer).

    31. Re:I don't know what the complaint is about? by bickerdyke · · Score: 2, Funny

      You could end up with one or more bruises if you get it wrong and then insist that "well, they're the same anyway".

      Isn't that a general risk when dealing with scotsmen? :-)

      But where does the difference with the missing 'a' come from?

      --
      bickerdyke
    32. Re:I don't know what the complaint is about? by delinear · · Score: 1

      I know we had a new developer start who's surname started with O', and while there was no problem getting him set up in the email system, his [name].o'[surname]@[address].com email address wouldn't work with a particular instance of Bugzilla we had to use (it just silently threw away the emails to him). Admittedly I've never met many people with names beginning with O', but I thought they were sufficiently common that most people would at least know of them and code accordingly.

    33. Re:I don't know what the complaint is about? by delinear · · Score: 1

      I think this is more a data entry issue - that the person entering the data might spell the name in either way, and how far should the system go in compensating for that. The trouble is always that, once you start down that path, all you add is further confusion (why should the person entering the data even bother trying to figure out the correct spelling if they assume the system will figure it out anyway, leading to a spiral of the system having ever more convoluted rules and the meat-sack doing ever less to take the strain off). Better in the long term to just educate your data entry people on the importance of getting the spelling right.

    34. Re:I don't know what the complaint is about? by Hognoxious · · Score: 0

      Spammers can use your server to send spam by embedding their entire message as the email address, trailed by '\x004@.'

      Yeah, if they can type really fast they might be able to send several per hour!

      --
      Confucius say, "Find worm in apple - bad. Find half a worm - worse."
    35. Re:I don't know what the complaint is about? by bickerdyke · · Score: 1

      you would have thought people would, by now, have caught on to the idea that something like half the world's population has the family name first.

      And to makes things worse: Sometimes people know that the other half of the world has a different name and give their name the other way round..

      --
      bickerdyke
    36. Re:I don't know what the complaint is about? by Anonymous Coward · · Score: 0

      Not to speak of different cultures where more than one surname is the common thing. In Spain, for example, we legally have 2 (the first from our father, the second from our mother). So for a guy named "Miguel Perez Gomez", "Perez" would be chosen as a middle name, when it's in fact a surname and there are no middle names there.

    37. Re:I don't know what the complaint is about? by bickerdyke · · Score: 1

      And what's with McLean and MacLean?

      --
      bickerdyke
    38. Re:I don't know what the complaint is about? by Nikker · · Score: 1

      I know that's a joke but it's kind of like getting charged for J walking on your way to rob a bank.

      --
      A loop, by its nature, continues. If that didn't make sense, start reading this sentence again.
    39. Re:I don't know what the complaint is about? by somersault · · Score: 1

      One is Irish, one is Scottish. Can't remember which is which and I don't give a toss because my second name isn't Mc/Mac :P

      --
      which is totally what she said
    40. Re:I don't know what the complaint is about? by xelah · · Score: 1

      No, because some people have invalid but working e-mail addresses. I've used open source regexes copied from somewhere before. Refusing to sell someone something from your online shop because someone has let them set up a technically invalid address leads to awkward questions. The same is true if your filter turns out to be wrong. And it's pointless....why do you care if the hostname has an underscore in it, for example?

      What's most important is to try to catch likely mistakes. You can catch a few - no @, a ',' instead of a '.' before the TLD - but most typos will pass your filter anyway. If you REALLY want to be helpful and don't mind the load, a DNS lookup would do far more.

      Besides, your customer will be a lot more understanding of not getting an e-mail because he mistyped his address than being refused service because your software disapproves of it.

    41. Re:I don't know what the complaint is about? by somersault · · Score: 3, Informative

      Just looked it up. I'm Scottish, live in Scotland and always hear people say that the difference in Mac/Mc is important because of the Scots/Irish thing, but according to this article, that's bollocks:

      http://www.scottishhistory.com/articles/misc/macvsmc.html

      --
      which is totally what she said
    42. Re:I don't know what the complaint is about? by bickerdyke · · Score: 1

      Thx. Knowing that now, I'll never mix them up.....

      --
      bickerdyke
    43. Re:I don't know what the complaint is about? by delinear · · Score: 4, Insightful

      Sometimes I despair when I read or hear somebody referring to eg. Djengis Khan as "Mr Khan" ("Khan" is a title, not a name) or even call Hu Jintao, "Mr Jintao"; you would have thought people would, by now, have caught on to the idea that something like half the world's population has the family name first.

      Oh, come now - are you seriously saying you expect every single person to understand every subtle nuance of every other culture's use of titles and names? Here are some non-English equivalents to Mr., are you seriously telling us you know all of these? Here are the various forms of address in the UK alone, do you know all of these and every other culture's equivalent? How many of these should I learn before I go from being someone you despair of to someone you feel is welcome in your titular elite?

      If half the world's population has the family name first, which half do I choose to offend when I don't know the exact rule for the home country of the person I'm speaking to? That's even assuming I know which country they're from. There's no reason to assume in this shrinking planet that someone who looks like they're from country A wasn't in fact born in country B to parents from countries A and C - a person born in Japan but with lineage in China might take great offence if I use Chinese honorifics to address him, surely it's better to be polite within the confines of my own known culture than to make such crass assumptions about his? The key thing I take from someone saying "Mr Khan" or "Mr Jintao" is that they're at least making the effort to communicate in a civil manner, which certainly causes me no despair.

    44. Re:I don't know what the complaint is about? by ultranova · · Score: 1

      Sometimes I despair when I read or hear somebody referring to eg. Djengis Khan as "Mr Khan" ("Khan" is a title, not a name)

      "Mr. President, the Great Khan is attacking us! Professor, did you forgot to turn off your time machine? Oh never minds! Captain! Arm the proton cannon!"

      --

      Forget magic. Any technology distinguishable from divine power is insufficiently advanced.

    45. Re:I don't know what the complaint is about? by Anonymous Coward · · Score: 2, Interesting

      Because no one ever automated the process of filling out web-forms right?

    46. Re:I don't know what the complaint is about? by kangsterizer · · Score: 1

      I made some algorithms for names and addresses comparisons @ work.
      Mc Clean Mac Clean Mac Cleen etc are indeed all assumed to be the same name. They're doing the same a bit everywhere. You just get less correct results if you take them all (or even some) as different names. Especially if one notes the name over the phone ;)

      What you need is simply more data to be accurate, if you combine name (phonetized) and address it's already better. Then there's quite a few other things to take into account, but anyway..

    47. Re:I don't know what the complaint is about? by digitig · · Score: 1

      On a completely different subject, though: Why is it that when you travel to the US, you get asked questions like "Are you coming to the US to commit an act of terror?" Does anybody expect to hear anything other than "No" in answer? Is it simply a sort of idiotic bullying tactic - treating you like a fool by forcing you to answer silly questions that clearly nobody takes serious?

      My understanding is that it's because the standard of proof required to deport you for lying on the entry form is considerably less than that required to convict you of any offence relating to planning a terrorist act.

      --
      Quidnam Latine loqui modo coepi?
    48. Re:I don't know what the complaint is about? by samjam · · Score: 1

      or caught fiddling taxes while committing all kinds of illegal activities which are hard to prove

    49. Re:I don't know what the complaint is about? by digitig · · Score: 1

      Historically the same, but they've diverged in time. Like "Bullen" and "Boleyn", "Rowe" and "Roe", and lots of others. Same origin, but bearers of those names are likely to get at least frustrated if you get them wrong.

      --
      Quidnam Latine loqui modo coepi?
    50. Re:I don't know what the complaint is about? by Hognoxious · · Score: 1

      There's no reason to assume in this shrinking planet that someone who looks like they're from country A wasn't in fact born in country B to parents from countries A and C

      Obviously knowing beats assuming, but that's not always possible. In that case, if you guess that someone who looks Chinese and has a name that sounds Chinese is from China you'd be right more often than you'd be wrong.

      --
      Confucius say, "Find worm in apple - bad. Find half a worm - worse."
    51. Re:I don't know what the complaint is about? by Sique · · Score: 4, Interesting

      To make things worse, it's not necessarily the family name you use to address someone politely.

      If you have to speak to Paul McCartney (of Beatles' fame), you have to formally address him as "Sir Paul". No, "Sir McCartney" is impolite, you shouldn't use it.
      If you have to speak to Vladimir Putin, you won't address him as "Mr. Putin". It's "Vladimir Vladimirovich", please!

      --
      .sig: Sique *sigh*
    52. Re:I don't know what the complaint is about? by Hognoxious · · Score: 1

      s/noot/nae/

      Dick Van Bloomin' Dyke.

      --
      Confucius say, "Find worm in apple - bad. Find half a worm - worse."
    53. Re:I don't know what the complaint is about? by Shoe+Puppet · · Score: 1

      Thanks. I thought an invalid e-mail adress wouldn't reliabily work anyway.

      --
      (+1, Disagree)
    54. Re:I don't know what the complaint is about? by amRadioHed · · Score: 1

      I've encountered at least one person with only 1 name at work. Of course the standard for Unix usernames is last name followed by first initial, so they ended up with a username "nonamer"

      --
      We hope your rules and wisdom choke you / Now we are one in everlasting peace
    55. Re:I don't know what the complaint is about? by TheRaven64 · · Score: 1

      Sometimes I despair when I read or hear somebody referring to eg. Djengis Khan as "Mr Khan" ("Khan" is a title, not a name) or even call Hu Jintao, "Mr Jintao"; you would have thought people would, by now, have caught on to the idea that something like half the world's population has the family name first.

      And some people just assume that the title is always Mister, even when they've just asked it. While on the telephone to my bank recently, I finally got around to changing the title on my cards to Doctor. This, of course, didn't stop the person in the call centre from calling me Mister for the whole of the conversation. Why not just use my name, rather than make up a title?

      On a completely different subject, though: Why is it that when you travel to the US, you get asked questions like "Are you coming to the US to commit an act of terror?"

      Shocking as it may seem, there is a good reason for this. Lying on immigration forms is a serious criminal offence. This gives one more thing to throw at these people in court if they are caught. Remember Al Capone...

      --
      I am TheRaven on Soylent News
    56. Re:I don't know what the complaint is about? by Beale · · Score: 1

      That said, his other points, especially about the fact that not all names are properly mapped in unicode, is a good one.

      I didn't quite get this - is it a thing about combination characters and accents, or about Unicode not being complete?

    57. Re:I don't know what the complaint is about? by Anonymous Coward · · Score: 0

      my name is 19 characters and the last place I worked for had a 15 character company name and when you add TLD to that, you sum to an email address that is 39 characters long, which for some seems to be too much. I really don't get why you would use only 32 characters to store an email address..

      That's down to good, old fashioned incompetence. Email addresses can have an overall length upto 254 characters; 64 characters in the local part and the max limit for domain names is 255 characters. These checks are trivial.

      I've run into problems using '+' as a delimiter when registering for sites (the form 'user+sitename'). Never have I encountered an email validation scheme that failed on hyphens. I imagine most are so badly written that they would fail on quoted strings user1+"user2@host2"@host1.domain.tld.

      My answer to validation is to perform more than one check, a simple regex will validate 90% of email address. If the first check fails I retry using a full validation routine.

    58. Re:I don't know what the complaint is about? by arth1 · · Score: 1

      A database MUST treat all of these names the same: McClean, MacClean, MCLean, Mc Clean, Mac Clean. McCleen, ...

      In some countries, like the US, rules dictate that the first letter of the last word is significant. But in many other countries, the rules are more involved:
      MacLean: M is significant
      macLean: L is significant
      I.e. the two above names are NOT the same. Theoretically, you could even have a MacLean marrying a macLean, and one of the spouses being MacLean-macLean, and the other macLean-MacLean.

      In the OP, someone gave an example of von/van/van der/de/d'/of, and how it changes between countries and languages. This is indeed the case, and yes, it causes problems. A famous example is author Charles deLint. In the US, you'll find his books under "D". In other countries with more nuanced name rules, more often under "L".

      To sum it up, names with similar spelling may or may not be identical based on the context in which the names are used. And in some cases, they are only going to be best approximations because the input system does not allow the name to be written.
      My birth name is a good example -- it contains a double pharyngeal trill, which doesn't even have an IPA designation. If a Unicode symbol is introduced one day, I'll use it, but that doesn't mean that my name has changed.

    59. Re:I don't know what the complaint is about? by gilgongo · · Score: 1

      True. I run into email validation problems constantly. I have a two-part first name that has "-" in the middle, so my firstname.lastname email addresses (usually work addresses) always have a "-".

      In the year 2010, why anyone would validate email addresses with anything other than "does it have an @ symbol?" and "do we have it on file already?" is totally beyond me. The chances of anyone inputting an invalid address by mistake (other than mistaking the email field for something else entirely) are infinitesimally lower than the chances of preventing somebody from inputting a valid one due to stupid rules.

      --
      "And the meaning of words; when they cease to function; when will it start worrying you?"
    60. Re:I don't know what the complaint is about? by Tridus · · Score: 1

      Mostly because the people coming up with airport security are by and large morons. They had an incident about that very thing in Charlottetown once. Screener asks a person "is there a bomb in your bag?"

      Realizing what a stupid question that is, he says "yes". Turns out security guys don't realize they're asking stupid questions.

      --
      -- "So they told me that using the download page to download something was not something they anticipated." - Bill Gates
    61. Re:I don't know what the complaint is about? by Migala77 · · Score: 2, Interesting

      Proper email validation is not trivial

      The regular expression, if one must be used, doesn't need to be any more complex than:

      ^[^@]+@[^@]+$

      Actually, the local part of an e-mail address can be a quoted string, containing pretty much any character, so "user@host"@example.org is a perfectly valid e-mail address, and doesn't match your regex. Most systems won't accept it, but it's valid...

    62. Re:I don't know what the complaint is about? by Anonymous Coward · · Score: 0

      Really?

      I do full validation, check against a cached copy of ICANN tlds and verify there's an MX or A record in DNS. For users, it helps identify typos. For me, it's simply more elegant than DSNs from the mailserver explaining that you@suckdonkeyballsasaprogrammer could not be delivered.

    63. Re:I don't know what the complaint is about? by Anonymous Coward · · Score: 0

      And nobody ever came up with a way of stopping them.

    64. Re:I don't know what the complaint is about? by Anonymous Coward · · Score: 0

      Mac means "Son of.." e.g. MacDonald = "son of Donald" MacKay = "Son of fire" (in Gaelic i believe)

      AFAIK - Mc means bugger all.

      So if you care about the meaning of your surname then yeah it matters - though is more common amongst the older generations. The younger ones don't seem to make the distinction unless they're really into the etymology stuff.

    65. Re:I don't know what the complaint is about? by Anonymous Coward · · Score: 0

      What? A validation email != open relay. If I can type in:

      bob@example.com
      joe@example.com
      sue@example.com
      etc@example.com

      They're still "valid" and you're still sending them emails. If I can type in:

      @!$$@example.com

      Then all I'm going to do is create a bounce that at most probably goes back to the admin, and possibly doesn't even go anywhere (ie noreply@). A validation email, which generally doesn't/shouldn't contain much user-content beyond the email address, isn't particularly valuable to a spammer.

      Of course, if your code is terrible, and I can use the "to" to overflow and write to the body of your email, that's another story. But generally the reason you do this is so you save the user a little bit of time if they make a mistake, not that you don't accidentally send out to a bad email address.

    66. Re:I don't know what the complaint is about? by Anonymous Coward · · Score: 0

      You mean, "The Mc's and the Mac's consider the correct usage as a matter of extreme obsessiveness over something trivial"

      It's the kind of thing that drunken idiots fight over.

    67. Re:I don't know what the complaint is about? by gfreeman · · Score: 1

      M (the bent one)

      Milky milky

      --
      Ceci n'est pas un sig.
    68. Re:I don't know what the complaint is about? by Anonymous Coward · · Score: 0

      If you have to speak to Paul McCartney (of Beatles' fame), you have to formally address him as "Sir Paul". No, "Sir McCartney" is impolite, you shouldn't use it.

      No, I'd call him "Macca".

      If you have to speak to Vladimir Putin, you won't address him as "Mr. Putin". It's "Vladimir Vladimirovich", please!

      "Fuckhead" would suffice. Not that the meeting or subsequent polonium laced beverage is likely to happen.

    69. Re:I don't know what the complaint is about? by Anonymous Coward · · Score: 0

      That's very clever. Does it stop me entering yousuckdonkeyballsasaprogrammer@yourdomain.com?

    70. Re:I don't know what the complaint is about? by Chrisq · · Score: 1

      I know that's a joke but it's kind of like getting charged for J walking on your way to rob a bank.

      Its fairly common for authorities to prosecute someone with a lesser offence which they can prove easily first, so they have plenty of time to work out the evidence for the real charge.

    71. Re:I don't know what the complaint is about? by DrgnDancer · · Score: 1

      How do you figure that? If they've entered a valid e-mail address (IE it's capable of actually getting to someone to spam them), then their can't be an advert in the address (else is wouldn't be valid... unless you happen to really have a Viagra ad embedded in your address). If they've entered an advert, it can't go anywhere. I suppose if you wanted to make a really inefficient spam generator you might use sites with verification e-mails and a format like: name@domain.com;buystuff@viagrasite.com but it would be trivial enough to strip semi-colons without a large complicated regex, and frankly I can see no value in it. It would be a slow as Hell way to send spam, and the only way that people would notice it is if they regularly analyze the "to" line in their e-mail headers.

      --
      I don't need a million points of light, just two points of multi-mode fiber and a 10 Gig-E router.
    72. Re:I don't know what the complaint is about? by Anonymous Coward · · Score: 0

      I'll fix it right away, Captain Picard.

    73. Re:I don't know what the complaint is about? by DrgnDancer · · Score: 1

      Or you could have just said "Gee, an exception to the normal rule. I guess I'll just use their one name as their user name."

      Nah, much better to turn their user name into some sort of "Red Badge of Different". This is the sort of thing that this article is precisely right about. I'm sure that user got a happy gushy little feeling every time he sat down at a terminal and logged in as "nonamer". Was his GECOS "Weird name that I can't be chuffed to sort out, Robert"?

      --
      I don't need a million points of light, just two points of multi-mode fiber and a 10 Gig-E router.
    74. Re:I don't know what the complaint is about? by Anonymous Coward · · Score: 0

      That's very clever. Does it stop me entering yousuckdonkeyballsasaprogrammer@yourdomain.com?

      In this instance, "yourdomain.com" would be in a blacklist and no, it's not clever, merely thorough.

    75. Re:I don't know what the complaint is about? by DrgnDancer · · Score: 1

      Nearly every Chinese person I know who has immigrated to the US reverses the form of their name to make things easier for the locals. Therefore you are MORE likely to be wrong when addressing them in the manner you suggest than otherwise, at least in my experience. There's no real right answer here. The best you can do is be polite, be gracious if corrected, and expect that if you spend lots of time dealing with internationals you're going to get it wrong sometimes. If you're obviously addressing someone from a traditional East Asian background, perhaps make a point of asking how they structure their name.

      --
      I don't need a million points of light, just two points of multi-mode fiber and a 10 Gig-E router.
    76. Re:I don't know what the complaint is about? by Anonymous Coward · · Score: 0

      Of course, if you handle the problem up front, the cat has no mouse to chase.

    77. Re:I don't know what the complaint is about? by DrgnDancer · · Score: 1

      By contrast in lots of Latin America they've picked up both the Anglo "middle name" and the Spanish "two surnames" resulting in a total of four names for most people. That screws most name systems up but good.

      --
      I don't need a million points of light, just two points of multi-mode fiber and a 10 Gig-E router.
    78. Re:I don't know what the complaint is about? by CarpetShark · · Score: 1

      It's the kind of thing that armies fight over.

    79. Re:I don't know what the complaint is about? by geekoid · · Score: 1

      Hey. don't blame the programmer, blame your fence sitting parents who couldn't make up their minds~

      --
      The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
    80. Re:I don't know what the complaint is about? by geekoid · · Score: 1

      IT's a know issue. Anyone can make a lsit of known issues. People who advance society make a list of issues and solutions.

      I don't know any programmer with more then 6 months experience who isn't aware of these problems.

      If it sin't mapped in Unicode, then the are going to be in for a tough time due to limitations with the computers at this time.

      Add to that, I think it's pretty bad idea to expect everyone to understand every language.

      --
      The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
    81. Re:I don't know what the complaint is about? by BarryJacobsen · · Score: 1

      On a completely different subject, though: Why is it that when you travel to the US, you get asked questions like "Are you coming to the US to commit an act of terror?" Does anybody expect to hear anything other than "No" in answer? Is it simply a sort of idiotic bullying tactic - treating you like a fool by forcing you to answer silly questions that clearly nobody takes serious?

      My understanding is that it's because the standard of proof required to deport you for lying on the entry form is considerably less than that required to convict you of any offence relating to planning a terrorist act.

      Which is interesting in that if they can't prove/convict you of a terrorist act, can they really say that you lied on the entry form?

    82. Re:I don't know what the complaint is about? by Anonymous Coward · · Score: 0

      What the tiny, monospaced fuck? I like your comment, .... I'm just... Am I the only one seeing this tiny?

    83. Re:I don't know what the complaint is about? by operagost · · Score: 1

      No true Scotsman has "Mc" in his name!

      --

      Gamingmuseum.com: Give your 3D accelerator a rest.
    84. Re:I don't know what the complaint is about? by operagost · · Score: 1

      Or even the greater offense of K walking.

      --

      Gamingmuseum.com: Give your 3D accelerator a rest.
    85. Re:I don't know what the complaint is about? by jeffasselin · · Score: 1

      I collided with a retarded validation system last year where I never managed to register with my real last name. The thing was filtering for "obscenities" and didn't like my last name (Asselin) because it begins with "ass".

      --
      If he explores all forms and substances Straight homeward to their symbol-essences; He shall not die.
    86. Re:I don't know what the complaint is about? by russotto · · Score: 1

      Would have been funnier if he'd had a bombe in his bag.

    87. Re:I don't know what the complaint is about? by Eunuchswear · · Score: 1

      Of course the standard for Unix usernames is last name followed by first initial

      Sez who? Or do you mean at your place of work.

      --
      Watch this Heartland Institute video
    88. Re:I don't know what the complaint is about? by Eunuchswear · · Score: 1

      Yeah, and if we call you anonymous cowheard you'd be ok with that?

      --
      Watch this Heartland Institute video
    89. Re:I don't know what the complaint is about? by mcgrew · · Score: 1

      Any database programmer worth his salt has two address fields, home address and mailing address. This list seems to be for those who really suck at database programming.

      Speaking of which, I miss NOMAD. They have me doing everything in MS Access now (ugh).

    90. Re:I don't know what the complaint is about? by melikamp · · Score: 1

      "Fuckhead" would suffice.

      It would work on Gorbachev, who knows English. Putin would understand "yeblo", "yebalo", "dolboyob", or "poyeben'".

    91. Re:I don't know what the complaint is about? by nbauman · · Score: 1

      In Afghanistan many people traditionally have only one name.

    92. Re:I don't know what the complaint is about? by nbauman · · Score: 1

      Screener asks a person "is there a bomb in your bag?"

      Realizing what a stupid question that is, he says "yes". Turns out security guys don't realize they're asking stupid questions.

      Yes, I read about a case like that. Homeland Security insisted on prosecuting the guy, for a felony. He went to trial. The jury found him innocent. It turned out juries don't like to convict people of felonies for making jokes. They changed the law to make it a misdemeanor.

    93. Re:I don't know what the complaint is about? by balbus000 · · Score: 1

      Help push all the browsers to support HTML5 and just use
      <input type="email" name="user_email" />

    94. Re:I don't know what the complaint is about? by nabsltd · · Score: 1

      I know we had a new developer start who's surname started with O', and while there was no problem getting him set up in the email system, his [name].o'[surname]@[address].com email address wouldn't work with a particular instance of Bugzilla we had to use (it just silently threw away the emails to him).

      You got lucky in being able to incorrectly set up the e-mail address in your system.

      The single quote is a delimiter in e-mail addresses, and needs to be correctly paired or escaped. Otherwise, the address is not valid according to the RFC.

    95. Re:I don't know what the complaint is about? by Frater+219 · · Score: 3, Insightful

      Check out the huge regex at the bottom of the RFC 5322 compliant validator from CPAN:

      Honestly, this sort of thing is an example of overusing regex when it's the only parsing tool they know. Regex becomes unwieldy when you put too much of it in one place -- but this is because regex is unwieldy, not because the problem of parsing email addresses is fundamentally hard. Parsing email addresses is a case for a modular parser such as Parsec (or any of its ports and imitators) ... which will give you the added advantage of useful error messages on invalid input, instead of just a match failure.

      Moreover, isn't it kind of silly to point at an example of someone already having written the code to do something as a way of saying that doing it is difficult? In code, once it's already been done once, correctly, it doesn't need to be done again. If you think CPAN's huge regex (or any other implementation) is correct, and you've tested it to your satisfaction, you don't need to reimplement it; just use it.

    96. Re:I don't know what the complaint is about? by oblio_one · · Score: 1

      Besides, your customer will be a lot more understanding of not getting an e-mail because he mistyped his address than being refused service because your software disapproves of it.

      We must have very different customers. If there's any difference between the two levels of understanding in mine, it's in the noise.\ Sales manager says the customer is always right; please fix the website and email back Joe SixPack at trueeBeleiver@aol.com when it's working properly, if he doesn't get it I'll be calling you to ask why again.

    97. Re:I don't know what the complaint is about? by Bigjeff5 · · Score: 2, Funny

      Because no one ever automated the process of filling out web-forms right?

      Pffft, the idea is absurd! You'd need a computer to do that, and what spammer has a computer?

      Oh, right. Yeah, all of them. My bad.

      --
      Security is mostly a superstition... Avoiding danger is no safer in the long run than outright exposure. - Helen Keller
    98. Re:I don't know what the complaint is about? by Bigjeff5 · · Score: 1

      Without proper address validation, the server accepts the entire message, instead of just the address. That's what the \x004@ bit was about. When an email gets sent, it's a big string of data with headers. If you do email address filtering (which is what the GP did) instead of actual validation you can send an entire message in a field that is only supposed to accept the address.

      It is literally the entire email message encapsulated in the address, and any time that address gets processed it shows up as a normal email, not just an absurd email address.

      Filtering is easy, validation is hard. Validation is necessary for email to work as intended.

      --
      Security is mostly a superstition... Avoiding danger is no safer in the long run than outright exposure. - Helen Keller
    99. Re:I don't know what the complaint is about? by amicusNYCL · · Score: 1

      Sometimes I despair when I read or hear somebody referring to eg. Djengis Khan as "Mr Khan"

      Is that guy related to Ghengis? They have the same last name.

      --
      "Our two-party system is like a bowl of shit looking at itself in a mirror." - Lewis Black
    100. Re:I don't know what the complaint is about? by Bigjeff5 · · Score: 1

      That's right, a true Scotsman would use the old "mac meic X" convention.

      --
      Security is mostly a superstition... Avoiding danger is no safer in the long run than outright exposure. - Helen Keller
    101. Re:I don't know what the complaint is about? by theaceoffire · · Score: 1

      "Proper email validation is not trivial" Why not just send a message with a random pin? If they get it, then accept it as valid. If it is a fake email, then they are hosing themselves (No password recovery, etc), and they would have given you a fake email anyway (throw away accounts, etc). Your users will appreciate the lower annoyance factor too. I use Gmail's "Me+tagname@gmail.com" and "M.....e@gmail.com" methods to activate various filters, but I will use fake email if they complain about it.

      --
      I steal signatures. This one used to be yours.
    102. Re:I don't know what the complaint is about? by Bigjeff5 · · Score: 1

      Mr. Khan and Mr. Jintao are just as valid as Mr. President. Same with Mr. General or Mr. Principal, or Mr. Secretary. Using "Mister" in front of a title is perfectly valid and very common in English.

      In other words, you're completely fucking wrong, you idiot retard.

      And "Mr." is an abbreviation for "Mister" and should be punctuated as such.

      --
      Security is mostly a superstition... Avoiding danger is no safer in the long run than outright exposure. - Helen Keller
    103. Re:I don't know what the complaint is about? by Bigjeff5 · · Score: 1

      I think there are quite a few people that will kindly (or maybe not-so-kindly) explain why "Mc" and "Mac" are not the same.

      And they'd all be completely fucking wrong.

      "Mc" is an abbreviated form of "Mac", which is a condensed version of "Meic" which is the condensed version of the phrase "mac meic", which means "son of the son of". It is found in both Scotland and Ireland because the Scots got their surname convention directly from the Irish. You don't find any O's (i.e. O'Flanigan) in Scotland simply because the "ua" (from which the O comes) form had fallen out of favor by the time the Scottish began using the Irish naming conventions.

      If more Scots knew their own ancestry it wouldn't be a problem, but that's true of almost every argument of the sort.

      The Scots and Irishmen are nothing more than brothers who fight too much.

      --
      Security is mostly a superstition... Avoiding danger is no safer in the long run than outright exposure. - Helen Keller
    104. Re:I don't know what the complaint is about? by paeanblack · · Score: 1

      Moreover, isn't it kind of silly to point at an example of someone already having written the code to do something as a way of saying that doing it is difficult?

      Not when people think their 12-character regex that they just thought up will validate an email address.

      The whole point of the article is that too many programmers think a one-liner can suffice when they should be using an existing library that has been hammered upon for 20 years.

      Email address verification is hard. "No it isn't! See! ^[^@]+@[^@]+$ "

      Handling names is hard. "No it isn't! See! SELECT FROM publishers WHERE name = 'O'Reilly' "

      It's basic laziness. You know what really is hard? Getting one-liner guy to actually sit down and think about a problem.

    105. Re:I don't know what the complaint is about? by amRadioHed · · Score: 1

      Sure, I would have just used his name. Alas, I was not the mindless automaton who set up the account.

      --
      We hope your rules and wisdom choke you / Now we are one in everlasting peace
    106. Re:I don't know what the complaint is about? by k8to · · Score: 1

      Unicode is not a complete description. That's not a problem with software handling names, though. It's a problem with unicode. Any system which tried to do something else is going to be worse.

      --
      -josh
    107. Re:I don't know what the complaint is about? by amRadioHed · · Score: 1

      Right, that would be the standard at that particular organization.

      --
      We hope your rules and wisdom choke you / Now we are one in everlasting peace
    108. Re:I don't know what the complaint is about? by Anonymous Coward · · Score: 0

      Robert '); DROP TABLE Students;-- ?

    109. Re:I don't know what the complaint is about? by Anonymous Coward · · Score: 0

      If it is a fake email, then they are hosing themselves

      1) Spammer notices you accept an arbitrary character string (that you assume is an "email address") and you will send out emails containing that string.

      2) You fail to properly validate your inputs and spammer submits a carefully crafted string to exploit you.

      3) Your server starts sending spam and ends up on all the RBLs. Now your legitimate customers aren't getting their messages.

      4) ???

      5) Loss.

    110. Re:I don't know what the complaint is about? by Anonymous Coward · · Score: 0

      In Iceland, the formal way to address someone is simply to use their first name.

      Not only that, but they don't have a family name. Most members of a given family will actually have a different last name. The last name indicates who they are the son or daughter of. For example, if Jon marries Bjork, then their son's last name will be Jonsson ("Jon's son") and their daughter's last name will be Jonssdottir ("Jon's daughter"). Occasionally, the mother's name will be used (ie. Bjorksson and Bjorksdottir). The mother's last name will not change when she gets married. She will keep her last name, which indicates who she's the daughter of.

    111. Re:I don't know what the complaint is about? by Wow8agger · · Score: 1

      And of course, Mr President. After all, what a faux pas it would be to call President Obama "Mr President" here in the US.

      Oh wait..

    112. Re:I don't know what the complaint is about? by Anonymous Coward · · Score: 0

      Spammers can use your server to send spam by embedding their entire message as the email address, trailed by '\x004@.'

      Not unless they figure out how to embed newlines into the string that *also* somehow pass validation, which will be mighty hard given that $ will typically match at the first newline in most civilized programming languages.

    113. Re:I don't know what the complaint is about? by ultramk · · Score: 1

      In Soviet Russia, you would just call him "comrade" and be done with it.

      --
      You catch enchiladas by picking them up behind the head and holding them underwater until they don't kick anymore -VeGas
    114. Re:I don't know what the complaint is about? by Thing+1 · · Score: 1

      Hi, this is off-topic, I wanted to thank you for your signature, which I googled, and found:

      http://www.washingtonpost.com/wp-srv/style/longterm/books/chap1/trying_t.htm

      Well worth the 15 minutes it took to get to the punchline, your signature, at the end of it. Thanks!

      --
      I feel fantastic, and I'm still alive.
    115. Re:I don't know what the complaint is about? by jackb_guppy · · Score: 1

      I guess you do not work with large databases and trying to find a person the first time NO MATTER how the name is spelled. It is just good customer service. with as your example an Irish person entering a Scottish name and vis-a-vers-a easily making a mistake.

      Now add in German names with a beta symbol instead of a double s. The Greek with AE combo letter vs AE. And all the other punctuations from the French, German, and ... That are NOT normally not on capitalized words.

      I did hotel software for 20yrs, these are common problems for cross boarder travel. I wrote custom search and translation software to "fix" these issues so the guest service can shine even with errors. But the cost was greater disk size and two files to read for each sorted name.

    116. Re:I don't know what the complaint is about? by st0nes · · Score: 1

      And "Mr." is an abbreviation for "Mister" and should be punctuated as such.

      Horseshit. The rule in English is that if the first and last letters of the abbreviation are the same as the first and last letters of the word abbreviated, then the full stop is omitted. In American, this rule is ignored. You idiot retard.

      --
      Tempora mutantur, nos et mutamur in illis
    117. Re:I don't know what the complaint is about? by Sique · · Score: 1

      Actually it's "tovarishtsh".

      --
      .sig: Sique *sigh*
    118. Re:I don't know what the complaint is about? by konohitowa · · Score: 1

      at the moment I'm a consultant in a large company, where they put "ext-" in front of everyone who is not employed by them but works for them

      Here's to hoping that you don't become a zzz* in the near future. (yes, I'm pretty sure I know the company -- I too have been an EXT-). And you know, God bless Outlook's inability to do a substring search in the damned address book. That always makes it fun to find someone. "Hmmm... well, that didn't work. Maybe they're an EXT-. Hmmm... nope. Ummm, typo? Gee. Did they add some new prefixes? Oh - yes. There we go. GOV-. Doh."

    119. Re:I don't know what the complaint is about? by b4upoo · · Score: 1

      Years ago they used to ask potential drivers' license renewals if they were addicted to heroin or other illegal drugs in Florida.My answer was that I rarely confessed to major felonies.

    120. Re:I don't know what the complaint is about? by jandersen · · Score: 1

      Oh, come now - are you seriously saying...

      Of course not - I am employing exaggeration as a device for conveying humour; this is something we often do in Europe, and it appears that I have inadvertently been rude to the American minority on /. by expecting them to spot this.

      ... are you seriously telling us you know all of these?

      Is that a trick question? As a matter of fact, I know, actively or passively, most of the ones used in European languages, as well as Chinese, but that is beside the point. It is actually possible to be something in the middle between Mrs Manners and GWB. If you refer to Djengis Khan as "Mr Khan" or just "Khan" you are committing the sin of trying to appear knowledgeable about the subject without having bothered to learn that fairly basic fact; you can hardly read about Djengis Khan without coming across expressions like "the Khan" which should tell you that it is not a personal or family name. The same applies for Chinese names - you hear Mao Zedong referred to as "Chairman Mao", and Hu Jintao as "President Hu"; based on that, is it then very clever to refer to "Mr Jintao"?

      The key thing I take from someone saying "Mr Khan" or "Mr Jintao" is that they're at least making the effort to communicate in a civil manner, which certainly causes me no despair.

      What I take from it is that some people think they can just free-wheel it through life without bothering to pick up even elementary knowledge on the way.

    121. Re:I don't know what the complaint is about? by Anonymous Coward · · Score: 0

      The "@." at the end would make the regex fail to match, assuming that somewhere within the message-embedded-in-address there already exists an @ symbol in the actual target address. That regex will only allow a single @ anywhere in the string.

      But trailing it with a simple \x004 seems like it would have the desired effect.

    122. Re:I don't know what the complaint is about? by jandersen · · Score: 1

      ...you can have an additional charge of 'lying' stuck on you.

      Funny enough, I have always had the impression that the defendant in any criminal process had an outright or at least an assumed right to lie.

    123. Re:I don't know what the complaint is about? by jd · · Score: 1

      The Gaelic naming convention of Mc/Mac places no rules on which you use. Mc and Mac are entirely interchangeable and having studied by Scots heritage, I can say even the same person could vary between them randomly. This creates a serious problem, because if some people insist THEIR spelling is the one true spelling and an equal number insist on the right to vary at will, you CANNOT make either approach a firm policy. There is no simple solution.

      --
      It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
  11. Article makes wrong assumption about software. by Vellmont · · Score: 5, Insightful

    Software is NOT designed to be perfect and cover every case. Have a numeral in your name? Too bad. Need some names to be case sensitive, and others case insensitive? Sucks to be you. Have a 200 character name that doesn't fit in the 100 characters the designers thought no crazy person would ever have? Tough.

    I started reading through the list, and it's just ridiculous. There's a few good points, like names don't change, or names are unique. But they're so obvious that the vast majority of the times it's not a big problem. More often it's just a matter of training the data edit/entry folks how to change someones name, or how to not assume a name is a sole identifier.

    But assuming the worst and trying to design a system that'll allow people's names to be Chinese characters when you don't do business in China, have presence in China, or ever ever plan to? That's ridiculous. Software doesn't have to be perfect out of the shoot. It should be adaptable though if some unforeseen shortcoming becomes a larger problem. Gee, I guess if you ever chose to do business in China and need Chinese character names you might have to re-write part of the damn software. Oh well, that's what software developers are FOR!

    If you don't even HAVE a name, then I submit you're crazier than the artist formerly known as the artist formerly known as Prince. At least HE had a name, though it was an unpronounceable symbol. The world can't accommodate every possibility, and software is no exception.

    --
    AccountKiller
    1. Re:Article makes wrong assumption about software. by lennier · · Score: 0, Flamebait

      Software is NOT designed to be perfect and cover every case.

      Then it's designed to fail, and probably also has security holes, and should not be deployed. At the very least it will be a block to compatibility, at worst it will become a botnet.

      Software in the Internet age NEEDS to be 100% correct because it has a potential lifetime of infinity years and 6 billion and counting potential users. If you're building a system 'to throw away' as the XP people suggest, you're also building one to be rooted unless you DO throw it away before the. And if you ever put it or any references to it on the Web, you're going to leave permanent identifiers to your system scattered all over the planet even after you throw it away - so if you're designing it to be thrown away, you're designing to leave a trail of broken data and angry users in your wake.

      Is that really what you want to do?

      --
      You are not a brain: http://books.google.com/books?id=2oV61CeDx-YC
    2. Re:Article makes wrong assumption about software. by Anonymous Coward · · Score: 0

      I 100% agree.

      Generally when building a form that asks for a name, I create a first name field and a surname field. I allow A-Z, a-z, space, - and '. Considering the company I work for is local (i.e. Australian), I don't need to waste time trying to think out all the insane ways names could be formatted in other languages. My rules might be a bit restrictive but most names would fit. Sure you might get the few odd names that contain strange characters etc, but this is not extremely common.

      Some of the rules on that page are really stupid as well, like for example saying it's wrong to have separate fields in a database for first name, surname etc. If you were to create one large field for a name that makes it even more difficult. What if, for example, you wanted to display on your webpage a message like "Welcome {first name}". With all the crazy ways the article mentions that people come up with names it would be impossible for the system to split the first name from the rest of the string.

    3. Re:Article makes wrong assumption about software. by lennier · · Score: 4, Insightful

      But assuming the worst and trying to design a system that'll allow people's names to be Chinese characters when you don't do business in China, have presence in China, or ever ever plan to? That's ridiculous.

      Or sell in New Zealand, or Australia, or anywhere else in the Pacific, or deal with immigrants, or be used by anyone who has a Chinese name?

      This is the Internet now. Welcome to it.

      --
      You are not a brain: http://books.google.com/books?id=2oV61CeDx-YC
    4. Re:Article makes wrong assumption about software. by lennier · · Score: 1

      Edit: 'unless you do throw it away before the bad guys find it.'

      Which is our current approach to patching, and is failing, hard. mmm, zero-days.

      --
      You are not a brain: http://books.google.com/books?id=2oV61CeDx-YC
    5. Re:Article makes wrong assumption about software. by Vellmont · · Score: 2, Insightful


      Then it's designed to fail

      Anything ever designed is designed to fail. This applies to bridges, the pyramids, and all software. This belief you have that software doesn't have to be maintained is as ridiculous as the idea that a bridge or any physical structure doesn't have to be maintained. Software lives and dies like anything else. Nothing lives forever.

      --
      AccountKiller
    6. Re:Article makes wrong assumption about software. by Trepidity · · Score: 5, Insightful

      Most Chinese emigrants to countries that use a Roman alphabet are perfectly capable of writing their name in Roman characters if they need to. If they weren't, they wouldn't have been able to get visas and get into the country in the first place.

    7. Re:Article makes wrong assumption about software. by PrecambrianRabbit · · Score: 3, Insightful

      You're overreacting (I know, I know, "welcome to the Internet"). Software should behave in some sane, safe manner given any input. Sometimes, the sane thing to do is to throw an error, or say "Sorry, Dave, I can't do that."

      In particular, systems don't necessarily have to shoehorn insane data into their processing. To use a relevant example, simply because Prince wants to upload a PNG in the "Name" field doesn't mean that the software has to let him. Rejecting this case does not doom said software system to "become a botnet" or "leave a trail of broken data."

    8. Re:Article makes wrong assumption about software. by jrumney · · Score: 2, Insightful

      Generally when building a form that asks for a name, I create a first name field and a surname field.

      And you fail right there. For some people, their first name is their surname. Others don't have a surname. Some of those without a surname may use a patronym or matrinym as part of their full name, but you never use it to address them without their personal name. Some people have a first name and second name that always go together, so parsing a first name out of the full name, or disallowing whitespace in the first name field is another common fail.

      Names are complex. Don't assume it doesn't matter because your database is only intended for local use, because unless you live somewhere as closed as North Korea, there are immigrants in your town that break your assumptions.

    9. Re:Article makes wrong assumption about software. by atamido · · Score: 1

      Just use UTF-8, all modern databases accept it as a field type. This isn't exactly a complex issue.

      If you can't write your name in UTF-8, it doesn't deserve to be written.

    10. Re:Article makes wrong assumption about software. by YoshiDan · · Score: 1

      Well, I'm pretty sure that a surname is a legal requirement in Australia, so even if there are immigrants that have a single name, they would have had to create a surname to immigrate here anyway. (I'm the AC btw, that was my first post here and I didn't notice the post anonymously checkbox)

    11. Re:Article makes wrong assumption about software. by yyxx · · Score: 1

      Then it's designed to fail

      You bet it is. All software can and will fail.

      and probably also has security holes

      It probably does, but not because it rejects non-standard names. In fact, having a well-defined model of what a name is and rejecting input that doesn't conform to it is necessary for security.

      At the very least it will be a block to compatibility

      No, a "block to compatibility" would be to accept arbitrary inputs as names. Compatibility requires some standardization, and that requires rejecting things that the software doesn't know about.

      Software in the Internet age NEEDS to be 100% correct

      Software can never be 100% correct; it's not even well-defined what that means. But accepting arbitrary inputs as names is probably not correct anyway.

    12. Re:Article makes wrong assumption about software. by rainmaestro · · Score: 1

      Software doesn't need to be 100% correct. It just needs to be able to handle the incorrect things gracefully. You don't need to support every possible name type (letters, pictographs, audio file containing a sequence of letters in morse code, blah blah blah), you just need it to reject non-standard formats without shitting all over the server.

      As far as lifespan, yeah, just yesterday I was using a bunch of spiffy Win 3.1 apps. And I hear those dirt farmers in Ghana are really tech savvy (though they spend all their time preaching on Usenet about anarcho-syndicalist communes). All software has a finite lifespan, Internet be damned, and it is typically very short. How many people do you see running Netscape Navigator 4.X nowadays? How about Microsoft Bob? Hell, I've got door controller software at work written in the XP era that can't even run properly in a SP2 environment, let alone Windows 7. When XP dies, that software is buried with it.

    13. Re:Article makes wrong assumption about software. by canajin56 · · Score: 3, Funny

      So, you shouldn't deploy software that doesn't, as the retarded article says, properly handle people with names that are over 65,000 characters long, where some portions are case sensitive, so if that part is lowercase instead of upper, that's a different name. But other parts are case insensitive, so its still the same name even in all caps. Oh, did I also mention that some of the letters in the name aren't part of any character set, so they can't even be typed in the first place? Because the article says that assuming names can even be text at all is wrong and your software is broken if you made that stupid assumption. (See Prince) PS, that person with the 65 thousand letter long name? He has 8,000 aliases and needs to enter all of them, better hope you allow that many aliases. Also, there is a huge subset of his name in common with a friend of his, but they are not related, it is sheer coincidence, you better not assume relation just because only two people in the world have the same last few hundred words of their name in common! Also, his brother has no name. Not, like his name is "No name" or he goes by "The artist formerly known as Prince", as in, his name is just the empty string, so your software better fucking not have name as a required field!

      --
      ASCII stupid question, get a stupid ANSI
    14. Re:Article makes wrong assumption about software. by lennier · · Score: 1

      That's true, I guess. Pinyin is a wonderful thing. But you need to at least make sure that you don't make assumptions about firstname/surname order, and allow for a separate 'preferred English name' which is different from the Pinyin Chinese first name, since many Chinese people that I know have one of those.

      --
      You are not a brain: http://books.google.com/books?id=2oV61CeDx-YC
    15. Re:Article makes wrong assumption about software. by Vellmont · · Score: 1


      Or sell in New Zealand, or Australia, or anywhere else in the Pacific, or deal with immigrants, or be used by anyone who has a Chinese name?

      This may come as a surprise to you, but the U.S. is a nation of immigrants some of which are Chinese and have Chinese names. They seemed to have figured out how to trans-literate their name into English characters.

      And if you have to support Chinese characters for a name, pony up some dough and pay someone to do it. Why do you think the software has to be perfect from day one?

      --
      AccountKiller
    16. Re:Article makes wrong assumption about software. by shutdown+-p+now · · Score: 1

      Is it so hard for you to just use Unicode, and avoid making assumptions about what goes into a name? It's not a point of you having to do something - it's a point of you not doing something - as in, not writing "validation" code that is incorrect. It's less work for you, ultimately, and that work didn't even serve any useful purpose in the first place (so you have code that checks for numbers in a name - what, exactly, is the benefit to either you or the user of your system?).

      It's not just the problem of character sets, either. You know that there are cultures out there which do not have family names at all? Are you going to tell someone to GTFO because he's an immigrant from such a country?

    17. Re:Article makes wrong assumption about software. by Draek · · Score: 5, Funny

      You're not a programmer, are you?

      Oh, don't worry, I can tell.

      --
      No problem is insoluble in all conceivable circumstances.
    18. Re:Article makes wrong assumption about software. by Trepidity · · Score: 1

      You can't "just use Unicode" and do no validation, though, unless you're perfectly fine with all sorts of bidi control characters showing up random places, or nonprintable characters causing two different names to look identical, etc., etc.

      I doubt there's code that specifically blacklists numbers. Safe practice is to whitelist "known good", not to blacklist "known bad", so it's probably just a system with a very conservative whitelist. I agree there'd be no real harm in expanding the whitelisted character ranges somewhat, but it's still something you have to write; you can't reasonably get away with doing no validation at all.

      And yes, I would say that if someone can't invent something to put in a "family name or surname" field, then too bad. They would also find themselves unable to travel to most countries, since most countries' immigration forms have such a box (and Icelanders in particular do not have a problem filling out such forms correctly).

    19. Re:Article makes wrong assumption about software. by ghostdoc · · Score: 1

      Having had to design and implement an application form processing system that deals with the general public, and had millions of applications, then you do have to deal with those edge cases. You can't just say to someone 'no you can't have a card because you've got a stupid name'. You do get Chinese people living in your (English-speaking) country wanting to apply in their own name.

      You get idiotic things like people applying with a name of (insert vulgar word here), who then go to the press and complain that you sent them a card with (same vulgar word) on it. The press joyfully print the story and suddenly you're implementing a blacklist on your data and cursing the good people of Scunthorpe.

      You get people who name their kids with the same name as themselves, so they have multiple people of the same name at one address.

      You get people who marry and have the same first name. Sam, Tony, Robin, for example, are all gender-neutral, and in our culture a wife takes her husband's name. Given enough random pairings eventually a Sam will marry a Sam. And that's without considering same-sex marriages.

      Names are not identifiers in any meaningful sense. The combination of name, date of birth, gender, and zip code/postcode/area code is about the best you can get, but even then it's trivial to think of examples that will break this (I bet there's some joker couple out there who named their identical twins with identical names).

      --
      Business/App ideas are like arseholes: everyone's got one, they're mostly shit, but very rarely they contain a diamond
    20. Re:Article makes wrong assumption about software. by Splab · · Score: 1

      What a load of BS.

      Yeah in a perfect world everything is tested and running 100% correct. That Linux you are using, *GASP*! it has bugs in it!

    21. Re:Article makes wrong assumption about software. by shutdown+-p+now · · Score: 2, Insightful

      You can't "just use Unicode" and do no validation, though, unless you're perfectly fine with all sorts of bidi control characters showing up random places

      That is output validation problem, not input validation. So go ahead and strip it on output, when (and if) you need to do it.

      or nonprintable characters causing two different names to look identical

      And why would that be a problem (anymore so than people with identical names in general)?

      And yes, I would say that if someone can't invent something to put in a "family name or surname" field, then too bad. They would also find themselves unable to travel to most countries, since most countries' immigration forms have such a box

      Having a box is not a problem. By all means, keep one. The problem is making input there mandatory. Do immigration forms make it so?

    22. Re:Article makes wrong assumption about software. by the_raptor · · Score: 1

      They have family names, they are just not consistent across generations. Which is largely not an issue for a computer based system.

      --

      ========
      CINC, 4th Penguin Legion
    23. Re:Article makes wrong assumption about software. by Anonymous Coward · · Score: 0

      Just list him by his nickname: "DoS"

    24. Re:Article makes wrong assumption about software. by Anonymous Coward · · Score: 0

      I'd argue that it's more imperative than ever to get name handling correct, or at least reasonably correct. My name isn't even weird in any way and I *STILL* come across websites that think 8 letters ought to be enough for anyone's name! (I've got 9). Honestly, p

      However, the one thing I can't figure out, is who in the world *doesn't* have a name, as stated in the article. I mean, sure, sometimes not a legal name till way later in some countries, but everyone has some kind of name they go by to everyone around them....

    25. Re:Article makes wrong assumption about software. by SEE · · Score: 4, Informative

      Is it so hard for you to just use Unicode

      Unicode doesn't cover the full set of CJK characters used for names, nor does it cover all writing systems in actual use.

    26. Re:Article makes wrong assumption about software. by Centurix · · Score: 1

      Led Zeppelin IV was a bitch. Need to apply these rules to ID3 tags.

      --
      Task Mangler
    27. Re:Article makes wrong assumption about software. by BZ · · Score: 1

      > Software is NOT designed to be perfect and cover every case.

      It really depends on the situation.

      For example, the software that handles the US passport system needs to handle anything that can appear in a legal name in the US, since the name on the passport needs to be the legal name.

      One would think that so should state drivers license systems, except the Massachusetts one, for example, doesn't allow a space in the last name. The consequence? It's technically not legal to use your MA license as ID to board a plane if you happen to have a space in your last name (because the name on the license is not your legal name). All thanks to people with attitudes like yours not bothering to think a bit more before coding in extraneous checks in their code.

    28. Re:Article makes wrong assumption about software. by shutdown+-p+now · · Score: 1

      Patronymic is not a family name. I, for one, have both.

    29. Re:Article makes wrong assumption about software. by MHolmesIV · · Score: 1

      Oh yes, the "screw you" reaction. It appears a lot of the comments on the TFA too.

      Unfortunately, if your software gets used by an official organization, like Immigration, or the IRS, you should be able to deal with a lot of the issues listed in the article. If not, it can actually destroy people's lives.

      My friend almost had to start his naturalization process all over again because one of his middle names (russian) is spelled differently in a number of government databases. In the end, they let him officially remove the middle name so he could continue.

      I've had a number of issues with web forms (Government and corporate) refusing to let me submit because I have no middle name. I've had paper forms rejected and sent back to me because I had "forgotten to fill in my middle name, and the database rejected the data entry". I have endless hassles with flying because, with only two names, I have a much greater chance of matching someone on the watch list, and I have had lots of issues with the credit reporting agencies too. If I, with two perfectly serviceable ascii names, have these troubles, think about the folks who have only one, or have characters that get filtered out etc.

      These problems are because thoughtless idiots make assumptions and then defend them with facile arguments about software not being perfect.

    30. Re:Article makes wrong assumption about software. by shutdown+-p+now · · Score: 2, Informative

      That's true also. However, Unicode covers much more ground immediately with practically no effort required from the programmer - but once you go beyond that, the complexity increases very rapidly (since you have to start dealing with multiple different encodings simultaneously etc).

      As well, new Unicode versions come out regularly which expand its reach, and new frameworks/databases update their Unicode support every now and then, so if you start using it today, it'll be much easier (in many cases, completely free) for you to expand coverage in the future in backwards-compatible way.

      In contrast, if you, say, use Latin-1 today, you'll either have to start dealing with multiple encodings much sooner, or to recode the database eventually.

    31. Re:Article makes wrong assumption about software. by ssyladin · · Score: 1

      And damnit it was invented by Americans, or at least Dan Quayle, so quit fighting it and get an American name!

    32. Re:Article makes wrong assumption about software. by Anonymous Coward · · Score: 0

      Belittling the problem doesn't solve anything. You lose.

    33. Re:Article makes wrong assumption about software. by jrumney · · Score: 1

      I couldn't find anything on Federal Law concerning names, but there is this from NSW:

      2.3 Naming systems In other cultures, including those of migrants to Australia, show a diversity of practices.6 While It is true to say the majority follow the hereditary paternal surname custom as was transferred from the British Isles, there are some significant differences, especially in Asian and Slavic communities.7 Other systems identified by the Commission do not use either parent's surname In the child's name.8 Names in Aboriginal languages use traditional forms, including skin names, but even within Aboriginal society, practices are not uniform? As it has been impossible to obtain definitive statements of naming practices In ethnic communities represented In Australia, the Commission has not sought to formulate statutory provisions to regulate the naming of children in New South Wales.

    34. Re:Article makes wrong assumption about software. by Anonymous Coward · · Score: 0

      41. I as a programmer care if my naming system is adequate. (Hint: I don't - which makes this entire list pointless.)

    35. Re:Article makes wrong assumption about software. by Vellmont · · Score: 1


      It really depends on the situation.

      No, it really doesn't. You're just not imagining all the cases where you'd never even possibly have to cover. Should the software be able to handle a character set not even invented yet? How about in 500 years when people just use their entire DNA set as their name?

      For example, the software that handles the US passport system needs to handle anything that can appear in a legal name in the US, since the name on the passport needs to be the legal name.

      So you've already put a limit on the names of character set, size, and actually being a name itself. You can't have a US legal name have Chinese characters in it. (I'm not even sure numerals are legal in a name). In any case it's most certainly not every conceivable possibility.

      All thanks to people with attitudes like yours not bothering to think a bit more before coding in extraneous checks in their code.

      No, what I actually said was software isn't designed to be perfect. If you think it can ever be, you've got a lot to learn.

      --
      AccountKiller
    36. Re:Article makes wrong assumption about software. by Anonymous Coward · · Score: 0

      I have a friend who's family is from a culture that tends to form long surnames. His full written name was 23 characters long... a bit unwieldy but not outrageous. It was also quite easy to spell.

      One time a software glitch result in him being hauled into court for unpaid taxes— various bits of government software truncated his name in different places and failed to connect together the relevant records even after multiple manual interventions by the desk drones. His story ended well enough, but a lot of time was wasted for everyone.

      With that in mind I think the length (and many of the other suggestions) are good ones. In particular you don't need to handle 200 character names but you should have a _safe_ and intentional way of failing in those cases you don't handle so that people don't get dragged to court due to your short-sightedness.

    37. Re:Article makes wrong assumption about software. by noidentity · · Score: 1

      My name cannot be represented in a 2D medium. It requires three dimensions, and some of the "words" must be made of specific atoms. Why doesn't your system allow me to enter my name? I've got the atoms right here. Your system sucks.

    38. Re:Article makes wrong assumption about software. by story645 · · Score: 1

      Having a box is not a problem. By all means, keep one. The problem is making input there mandatory. Do immigration forms make it so?

      I'm pretty sure the guy at border control who looks at the form is gonna question that part of the form being left blank. If you're lucky/look sufficiently innocent, it'll probably just be him questioning and not the security guys/homeland security agents/other people with the power to arrest you/torture you/etc.

      --
      open source modern art: laser taggi
    39. Re:Article makes wrong assumption about software. by Concerned+Onlooker · · Score: 2, Funny

      "Software doesn't have to be perfect out of the shoot."

      Are you saying software is like a bullet? Or perhaps you meant chute.

      --
      http://www.rootstrikers.org/
    40. Re:Article makes wrong assumption about software. by YoshiDan · · Score: 1

      In any case I'd like to see somebody with no surname try to sign up for any government assistance or benefits like Medicare or Centrelink. Or apply for a Tax File Number. Or even apply for utilities (phone, water, power etc). It would be impossible to live in Australia without a surname. If it's good enough for government departments/organisations to require a surname then it's good enough for me.

    41. Re:Article makes wrong assumption about software. by Anonymous Coward · · Score: 0

      Or sell in New Zealand, or Australia, or anywhere else in the Pacific, or deal with immigrants, or be used by anyone who has a Chinese name?

      Chinese people in a western country usually have a conventional romanised version of their name, or even a secondary westernised name that they invent. This has a lot in favour of it. We've got this wonderful, widespread text standard in the form of ASCII. Why can't we simply say, when creating a database: "Your name can be whatever you like - but for our purposes, we need a label for you which consists of 100 or less standard ASCII characters."? If you try to be all-inclusive, there'll still always be someone who doesn't fit into your namespace.

    42. Re:Article makes wrong assumption about software. by Anonymous Coward · · Score: 0

      As you endeavour to approach perfection, the costs associated rise towards towards perfection too. Either you've been working on the same unreleased product for your lifetime, you're not a programmer, or you've deluded yourself into thinking your product is perfect.

    43. Re:Article makes wrong assumption about software. by Anonymous Coward · · Score: 0

      > That is output validation problem, not input validation.

      Hmm? Entering invisible characters IS an error. Imagine we do it your way, and store it in a database. Ok. Now if we try to search based on that name field, we won't be able to find what we're looking for because we didn't type the invisible characters that were saved there. And if we try to sort based on that column, things may show up in an unexpected order too.

      >>or nonprintable characters causing two different names to look identical
      > And why would that be a problem (anymore so than people with identical names in general)?

      Additional potential for errors (on top of the already existing kinds). Like making two records for the same person, or searching for one person and getting the wrong one. Currently, at least a human staring at a screen has a chance of seeing typos. Obviously that doesn't work as well when the typos can be invisible.

    44. Re:Article makes wrong assumption about software. by A+Life+in+Hell · · Score: 1

      This is not true. I actually worked for a couple of years in an Australian government department with a coleague who had a mononym name (i.e. only one name). Passport and everything. Caused no end of problems with some of the databases, though I believe she enjoyed that aspect...

      --
      Commodore 64, Loading up the dance floor!
    45. Re:Article makes wrong assumption about software. by Anonymous Coward · · Score: 0

      >Software in the Internet age NEEDS to be 100% correct because it has a potential lifetime of infinity years and 6 billion and counting potential users.

      What a lot of crap. Not all software is aimed at every user on the internet. Many systems are only intended for deployment in a corporate environment and will never have "6 billion and counting" potential users.

      Also, "100% correct" is a meaningless term. Do you mean that the software will handle 100% of possible cases, is 100% implemented according to specification. I may well create software that doesn't support 100% of possible names, but my specification does not require it to do so. Do I care if there is someone out there with a name of "Xzy999@Jiz". Nope, because my company would not employ them or would require them to have a meaninful and usable identifier for themselves.

      We format names in our system on entry in order to keep reporting clean and to help prevent accidental entry of duplicates (I don't want "MCBAIN", "McBain", and "mcbain" in my system, I want "McBain"). Honestly, writing a stored procedure that can correctly validate and format the vast majority of names really isn't that hard. Sure, there might be the odd exception, but that is life - our software needs to cater for everything that is commercially relevant, not everything possible.

    46. Re:Article makes wrong assumption about software. by SharpFang · · Score: 1

      China has a perfectly working romanization system for names.
      There is really no point making a special case for it if they will be less than 0.1% of your business and can still be accepted - using romanizations...

      --
      45 5F E1 04 22 CA 29 C4 93 3F 95 05 2B 79 2A B2
    47. Re:Article makes wrong assumption about software. by Your.Master · · Score: 1

      100 characters, maybe not, but there's a lot of software that limits you to nine for your first/given name. I know it's nine because my sister has 10 characters in her name. This comes, I assume, from certain standardized government forms that give 9 boxes for the first name and a whole fuckload of boxes into which you can stick my 3-letter surname.

    48. Re:Article makes wrong assumption about software. by SharpFang · · Score: 1

      Screw the Chineese problem, they are unicoded.

      I suggest the guy follow his counter-examples and project a system to accept all variations he required, and give examples. In particular:

      - people with non-unicode characters in their names
      - people who have infinite, or undefined number of names (as opposed to a precise N names)
      - people with names of unlimited length
      - people with names that can't be reliably subjected to any consistent ordering system.
      - people with no names (and refusing/unable to pick/be assigned one)

      I say: fuck these assholes, let them get what they deserve: nothing.

      --
      45 5F E1 04 22 CA 29 C4 93 3F 95 05 2B 79 2A B2
    49. Re:Article makes wrong assumption about software. by mpe · · Score: 1

      They have family names, they are just not consistent across generations. Which is largely not an issue for a computer based system.

      But might well be an issue for the humans using the system. Including those who program it in the first place.

    50. Re:Article makes wrong assumption about software. by Anonymous Coward · · Score: 0

      Leave chatswood and go west.

      You might even meet some of these people that have in your word illegal names.

      Some of them have family lines leading a long way back living in australia.

    51. Re:Article makes wrong assumption about software. by mikael_j · · Score: 1

      In particular, systems don't necessarily have to shoehorn insane data into their processing.

      It's not just "insane" data, for us non-americans it's common to run across the following problems way too often:

      • "ASCII only please" - Because 7 bits is plenty.
      • "First name, middle name initial and surname please" - Because everyone is named John Q Smith, god forbid someone's name not fitting into that mold.
      • "I'm sorry, I couldn't find your name in my long list of acceptable names" - Because clearly if a name is not in your predefined list of names, it doesn't exist.
      • "I'm sorry, your name contains/is a dirty word" - Because keeping your staff from seeing dirty words is more important than letting your customers input their names properly...
      --
      Greylisting is to SMTP as NAT is to IPv4
    52. Re:Article makes wrong assumption about software. by JustOK · · Score: 1

      you going to filter their web surfing too?

      --
      rewriting history since 2109
    53. Re:Article makes wrong assumption about software. by dunkelfalke · · Score: 1

      Sort of. Depending on where you live. Also many family names started being just patronymics. This is true for both Russian and Scandinavian ones.

      --
      "It's such a fine line between stupid and clever" -- David St. Hubbins, Spinal Tap
    54. Re:Article makes wrong assumption about software. by Chris+Mattern · · Score: 1

      Have a numeral in your name? Too bad.

      Why? What do you gain by not letting someone enter a numeral as part of a name?

      Need some names to be case sensitive, and others case insensitive? Sucks to be you.

      This is more reasonable, since it would take some effort to accommodate (How do you determine which names are case sensitive? A flag just for that purpose? Ick.)

      Have a 200 character name that doesn't fit in the 100 characters the designers thought no crazy person would ever have? Tough.

      Dynamic memory allocation should be used as a matter of course for any strings whose sizes do not have a preset limit. In SQL, it should be ntext; though I dislike hard-coded limits on principle, one billion characters should be enough for anybody.

      But assuming the worst and trying to design a system that'll allow people's names to be Chinese characters when you don't do business in China, have presence in China, or ever ever plan to? That's ridiculous,

      Because that means you'll never do business with Chinese people. Ever. Or Japanese. Or Bulgarian. Or Russian. Or Arabic. Or... Proper Unicode handling is not that difficult and should be standard routine whenever your program handles strings. It enables your program to handle not just Chinese but *any* non-English character set. And you should NOT be assuming your program will handle only the English alphabet. See this.

    55. Re:Article makes wrong assumption about software. by dave1791 · · Score: 1

      Never ever do business in China? Ever? Can you be certain that strategy won’t change next year? Or even tomorrow? This king of lazy thinking is how we got the Y2K nonsense in the last century. You're not one of those people with an allergy to unicode, are you? ( http://www.joelonsoftware.com/articles/Unicode.html )

    56. Re:Article makes wrong assumption about software. by Sique · · Score: 1

      For people with names, that can't be reliably subjected to any consistent ordering system:

      Umlauts.

      How do you sort someone with the name "Schwärzer"?

      A naive approach would be to sort 'ä' behind 'a', so Schwarzer comes before Schwärzer, and Schwerzer is last.
      But then Schwäb would also come after Schwarzer. But Germans would like to have Schwäb before Schwarzer, because 'b' comes before 'r'.
      So you have to have a special rule for sorting umlauts. They come behind the vowel they are derived from, but are considered equivalent to the original vowel for all other sorting purposes. Try that with a database system that treats umlauts just as another character!
      But it gets worse. If your system does not support umlauts, then german spelling rules require you to replace 'ä' with 'ae'. Suddenly Schwaerzer comes before Schwarzer. So you have to create another rule: 'ae' as an replacement for 'ä' has to be treated like 'a' for all sorting purposes, except that names with 'ae' have to be sorted behind names with 'a' and the same following letter(s).
      But then there are people which have 'ae' in their names which aren't derived from 'ä', those have to be sorted differently than those with 'ae' as a replacement for the umlaut.

      --
      .sig: Sique *sigh*
    57. Re:Article makes wrong assumption about software. by SharpFang · · Score: 1

      If the rules are defined somewhere, the system can be implemented and applied consistently.
      Rules for leap year are somewhat convoluted but people implement them.
      Rules for date of easter are wildly crazy but people implement them too.

      --
      45 5F E1 04 22 CA 29 C4 93 3F 95 05 2B 79 2A B2
    58. Re:Article makes wrong assumption about software. by Anonymous Coward · · Score: 0

      upload a PNG

      PNG, my ass -- my name is 4-dimensional. If your dumb shit of a system can't handle it, then you can't arrest and book me. I'm leaving.

    59. Re:Article makes wrong assumption about software. by digitig · · Score: 2, Insightful

      But assuming the worst and trying to design a system that'll allow people's names to be Chinese characters when you don't do business in China, have presence in China, or ever ever plan to? That's ridiculous.

      No, but making a conscious design decision not to accommodate names in non-Roman character sets, and documenting that in the specification, is sensible.

      If you don't even HAVE a name, then I submit you're crazier than the artist formerly known as the artist formerly known as Prince.

      The discussion gives examples of people who don't have names, such as somebody born into slavery in the Sudan. In that case, it's not the person who is crazy. Do you need to account for that in your data entry? Well, it depends. If it's online sales then the chances are that that person will never be a customer. If you're doing a missing persons database for a relief agency, though, you probably need to find a way to account for them. So no, you don't have to address all of the cases that the author mentions, but if you're smart you'll at least consider whether you should in your particular context.

      --
      Quidnam Latine loqui modo coepi?
    60. Re:Article makes wrong assumption about software. by Anonymous Coward · · Score: 0

      Abbo's? Who gives a fuck what about their names? They don't - the only time they ever see them written down is when they're cashing their welfare checks.

    61. Re:Article makes wrong assumption about software. by Anonymous Coward · · Score: 0

      Yeah this argument is reminiscent of arguments that say you must port to every OS because some guy is using it. "This is the world of computers now," they might say.

      The argument was flawed because the world of computers later turned into a hegemony of Windows, Mac OS and Linux with Windows alone covering most of your bases unless you're in some particular niche.

      The internet is going the same way. English is the language of the internet and people in Asia must be already well used to entering a name in roman characters. I'm sure it doesn't bother them that much either.

      The idea that computers must adapt to the world is an ivory tower concept. In fact, history shows the world adapting to computers.

      I have sympathy for people with double-barrelled surnames being unable to enter them because my wife has one and this is a rookie coding error, but I expect most people will simply adapt their name to the computer systems them use.

    62. Re:Article makes wrong assumption about software. by Sique · · Score: 1

      The main problem is: Your system, to handle german sorting rules correctly, has to know about german umlauts even if it can't store them.

      --
      .sig: Sique *sigh*
    63. Re:Article makes wrong assumption about software. by Anonymous Coward · · Score: 0

      > Tough.

      But on whom? Probably on your client and, unless your client was careless with the contract, tough on you.

      By the time that the issue with the software becomes evident, the business may be bound by contract, in which case, they can't simply renege on the contract. Or if they have already provided the goods or service, they may end up having to forego payment, which they're not going to be particularly happy about.

      Also, many problems relating to "unusual" names would disproportionately affect specific racial groups; simply refusing service on such a basis would often be unlawful (and even if it isn't, it could be seriously detrimental to the company's reputation).

    64. Re:Article makes wrong assumption about software. by Hognoxious · · Score: 1

      Software in the Internet age NEEDS to be 100% correct

      By the time you've got it correct, the definition of correct will have changed. Just when you've finished extending unicrap to allow every symbol of every written language (and added on a slate applet so they can draw their own just in case) they'll discover a tribe whose names are whistled notes of precise pitches.

      If you're building a system 'to throw away' as the XP people suggest, you're also building one to be rooted

      I don't see the connect between security and name validation at all. One of us must be stupid.

      --
      Confucius say, "Find worm in apple - bad. Find half a worm - worse."
    65. Re:Article makes wrong assumption about software. by CarpetShark · · Score: 1

      If you don't even HAVE a name, then I submit you're crazier than the artist formerly known as the artist formerly known as Prince.

      Names aren't as essential or universal as you might think. For instance, I believe that a lot of Japanese peasants in medieval times were only referred to by their role -- fisherman, or cook, for instance. At some point, just like in the West, "blacksmith" became "black" and "smith" became a name rather than a job, but arguably, for a while at least, they were not names; just another index into the same data record that could benefit from a unique name/ID.

      The world can't accommodate every possibility, and software is no exception

      Why not? Humans do it every day, by assuming something until their model breaks down, then building a bigger mental model and going back to assuming with it. Software SHOULD be that adaptable. We just don't really know how to do that yet.

    66. Re:Article makes wrong assumption about software. by donscarletti · · Score: 1
      Or sell in New Zealand, or Australia, or anywhere else in the Pacific, or deal with immigrants, or be used by anyone who has a Chinese name?

      It is reasonable to expect all names in New Zealand and Australia to be written in an appropriate romanisation scheme such as Hanyu Pinyin, Wade-Giles or Yale. The reason for this is that Latin script is fairly well understood in these areas where people cannot be expected to transcribe names referring to people that they may not even need to directly communicate with.

      Latin script is an official script in all Chinese speaking territories, whereas comprehension of Han characters is low in Australia and New Zealand, even among ethnic Chinese who may speak their parent's tongue fluently, there is no reason for such databases to interpret non latin characters as anything but an error since it would diminish the utility of the identifier.

      --
      When Argumentum ad Hominem falls short, try Argumentum ad Matrem
    67. Re:Article makes wrong assumption about software. by GigsVT · · Score: 1

      7 bits is plenty.

      If your name is some fucked up mess of symbols, it's not going to kill you to come up with something 7 bit clean for computers to know you by.

      --
      I've had enough abrasive sigs. Kittens are cute and fuzzy.
    68. Re:Article makes wrong assumption about software. by russotto · · Score: 1

      And damnit it was invented by Americans, or at least Dan Quayle, so quit fighting it and get an American name!

      That's J. Danforth Quayle, so even his name doesn't fit the mold. First initial and full middle name? The horrors!

    69. Re:Article makes wrong assumption about software. by shutdown+-p+now · · Score: 1

      Hmm? Entering invisible characters IS an error. Imagine we do it your way, and store it in a database. Ok. Now if we try to search based on that name field, we won't be able to find what we're looking for because we didn't type the invisible characters that were saved there.

      Quite obviously, your search should take those characters into account. It's not that hard to do, either - indeed, if your DB and/or programming framework has proper support for internationalization, it should just work.

      For example, in .NET, all System.String methods dealing with search - Contains, IndexOf etc - will correctly handle (i.e. ignore) characters such as zero-width non-breaking space or soft hyphen, so long as you tell them to use culture-specific (or invariant culture) and not direct codepoint comparison.

      Additional potential for errors (on top of the already existing kinds). Like making two records for the same person, or searching for one person and getting the wrong one. Currently, at least a human staring at a screen has a chance of seeing typos. Obviously that doesn't work as well when the typos can be invisible.

      Clearly, either problem already exists for people with identical names, which is not a rare occurrence by any means. You just have to live with the fact that name does not uniquely identify a person.

    70. Re:Article makes wrong assumption about software. by Anonymous Coward · · Score: 0

      Correct. Several of the "assumptions" he lists are actually filters to prevent people from using crazy typography, like names without capitals (joe cool), names with capitals in bizarre places (jOe cOoL), all caps names (JOE COOL), names with numbers (Joe the Indi3vidual), names that don't follow the official capitalisation for prefixes in your country (Joe DiPamplona), names that use kanji that aren't on the government approved list (or maybe you don't want to see kanji at all), and so on and so forth. In many cases you have to put these things in place, either as re-formatters or rejecters, to get a modicum of sanity. Occasionally you might miss something (oh "van der" is a prefix too) but most of the time the complainers really have no point and are perfectly able to get their name accepted by your system once they write it down in an civilised manner.

    71. Re:Article makes wrong assumption about software. by geekoid · · Score: 1

      It would also mean the users would need to learn to read Chinese.

      --
      The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
    72. Re:Article makes wrong assumption about software. by Taevin · · Score: 1

      It probably does, but not because it rejects non-standard names. In fact, having a well-defined model of what a name is and rejecting input that doesn't conform to it is necessary for security.

      Why is it necessary for security? If you're not insane, your database is accessed via parameterized queries and text is properly escaped. I couldn't care less if you enter 'lol<script src="hackme.com"></script>' for your name. It will be happily stored and since HTML is not allowed to be embedded in that field, it will be HTML encoded before output. If someone wants to be addressed as the small shell script that they are, who am I do deny them?

      No, a "block to compatibility" would be to accept arbitrary inputs as names. Compatibility requires some standardization, and that requires rejecting things that the software doesn't know about.

      Again, why? If people would stop forcing bizarre rules on their data there would be no compatibility problem. Of course they do and will continue to for the foreseeable future so if I do have to export data to a so-clever-it's-a-failure system, I can always mangle the text.

    73. Re:Article makes wrong assumption about software. by Hognoxious · · Score: 1

      That is output validation problem, not input validation. So go ahead and strip it on output, when (and if) you need to do it.

      Generally there are less places where data is entered than displayed. Thus it's much easier & cleaner to stop it getting in to start with than to bodge it at every possible place it could be output. And that's before you have any interfaces that send to other systems.

      --
      Confucius say, "Find worm in apple - bad. Find half a worm - worse."
    74. Re:Article makes wrong assumption about software. by mikael_j · · Score: 1

      Here's the thing, the vast majority of people in the world can't write their names properly using just 7 bit ASCII. Back in the "good old days" those of us who were still using alphabets somewhat similar to the one used for 7 bit ASCII would use tricks different 8 bit codepages but there's no need for that anymore, we have unicode and with UTF-8 a lot of these problems are solved, if only developers would actually get around to using it.

      I'm lucky in the sense that I can write my name with just the regular 26 characters that americans use in their alphabet but unfortunately the order of my names is a bit different from the regular "First M Last" order that americans assume is standard (mine is "House First Middle Last" and traditionally the house name would've been more significant than the middle name).

      --
      Greylisting is to SMTP as NAT is to IPv4
    75. Re:Article makes wrong assumption about software. by david_thornley · · Score: 1

      You're not quite right. It's easy to make software reject non-standard formats cleanly, but that's not what it's for. Software that gathers names is for interfacing with the user, who is very likely a customer (i.e., somebody who gives you the money you need). People, and by that I include customers, are often sensitive about their names.

      This means that you should try to handle all your potential customers' names adequately (if not necessarily perfectly).

      Also, software is a lot more volatile than data formats, and business is getting increasingly global.

      Don't make rigid assumptions about names. At the very least, allow the possibility of using one potentially long Unicode string.

      --
      "When you have eliminated the unacceptable, whatever is left, however improbable, must be the truthiness" - Holmes
    76. Re:Article makes wrong assumption about software. by sydneyfong · · Score: 1

      That is no excuse. Who the fuck modded you up?

      You're saying we shouldn't use Unicode because there are 0.01% of the population in the world who still can't type their correct name in your system, and therefore need to keep using ASCII or ISO-8859-1 that excludes like 50% of the world's population?

      Ridiculous.

      --
      Don't quote me on this.
    77. Re:Article makes wrong assumption about software. by rabtech · · Score: 1

      Unicode doesn't cover the full set of CJK characters used for names, nor does it cover all writing systems in actual use.

      But it will some day. True, Unicode gets expanded to cover more writing systems... but the reality is that our inter-connected world is converging toward standardized writing systems and languages. Other systems are going extinct. That can have negative elements to it (lost culture/history), but it increasingly means that human beings can talk to other human beings and exchange cultures without a language barrier. That leads to a lot of good things including more peace and less war.

      In 1,000 years I expect that almost every human alive will speak a common language. (Whether that will be everyone's second/third/etc language or whether most other languages will have died out I wouldn't hazard a guess.)

      --
      Natural != (nontoxic || beneficial)
    78. Re:Article makes wrong assumption about software. by rainmaestro · · Score: 1

      Oh, I agree with you there. It is almost criminal for software to not handle Unicode nowadays. But there comes a point where you have to say, "sorry, we don't support that". The presence of a tribe somewhere that speaks in a pop-click language and has no written language doesn't mean my software needs to support audio files for names.

      Your software should seamlessly support the majority of users, and quasi-support most of the remainder, but there will always be fringe cases where you have to flat-out reject them.

      Most of the article's points should be common sense. Not supporting numbers, caring about case, not supporting Unicode, etc. That's just lazy coders. And few things piss me off more than secret questions that are case sensitive. Three years after creating the account, what's the likelihood of me remembering if my favorite teacher was written as Mrs. Smith or mrs. smith?

    79. Re:Article makes wrong assumption about software. by Anonymous Coward · · Score: 0

      Having a box is not a problem. By all means, keep one. The problem is making input there mandatory. Do immigration forms make it so?

      I'm pretty sure the guy at border control who looks at the form is gonna question that part of the form being left blank. If you're lucky/look sufficiently innocent, it'll probably just be him questioning and not the security guys/homeland security agents/other people with the power to arrest you/torture you/etc.

      You can fill out a form at the border instead of providing a passport or an ID from your home country?

    80. Re:Article makes wrong assumption about software. by Anonymous Coward · · Score: 0

      Then it's designed to fail, and probably also has security holes, and should not be deployed. At the very least it will be a block to compatibility, at worst it will become a botnet.

      Congratulation on the most crazy-ass string of non sequiturs I've ever encountered. Fucking lunatic Republicans.

  12. Yeah, article is kind of asinine by Trepidity · · Score: 5, Insightful

    He's essentially arguing that, because names vary a lot and are complex, your software should never do anything useful with them. Sorry, but that's a stupid answer. In a lot of systems, being able to sort by surname may well be more important than being able to handle people who claim they have no surname.

    Of course, you shouldn't gratuitously do stupid things, and interfaces should aim to be relatively clear. But most people can figure out how to enter their names into relatively standardized forms, and those that don't should probably figure out how.

    1. Re:Yeah, article is kind of asinine by snowgirl · · Score: 2, Informative

      I'm going to throw in my agreement here. Yes, there are people who put numerals in their names, or non-unicode point characters, or various other things, but there just isn't a reason to foist that on other people.

      There is frustration about things like, "people have N number of names", and "names don't change" which are good and valid points... but some of the things are just like "dude... seriously..."

      --
      WARNING! This girl exceeds the MAXIMUM SAFE standards established by the FDA for BRATTINESS
    2. Re:Yeah, article is kind of asinine by rve · · Score: 1

      I'm going to throw in my agreement here. Yes, there are people who put numerals in their names, or non-unicode point characters, or various other things, but there just isn't a reason to foist that on other people.

      There is frustration about things like, "people have N number of names", and "names don't change" which are good and valid points... but some of the things are just like "dude... seriously..."

      And there are immigrants from cultures where it is completely common to have a hyphen or an apostrophe in their surname, or more than one word.

      Why is the concept of allowing spaces in a surname so offensive? It's not like it's more work to allow this, you have to write validation rules to reject these otherwise perfectly valid printable ascii characters.

    3. Re:Yeah, article is kind of asinine by metamatic · · Score: 1

      In a lot of systems, being able to sort by surname may well be more important than being able to handle people who claim they have no surname.

      It's not a question of claiming. There are people who genuinely have no surname.

      If you need to sort by surname, have a surname field and allow it to be null and decide where you want null values to sort.

      However, also have a name field that doesn't assume any particular format, and use that for display.

      --
      GCHQ Quantum Insert installed. If only our tongues were made of glass, how much more careful we would be when we speak
    4. Re:Yeah, article is kind of asinine by Anonymous Coward · · Score: 0

      Well it's really only neccessary until you hire the guy who doesn't have a surname.

      Also on the subject of names being unique, the postal service thinks that I'm the other guy with the exact same name as me who lives in apartment EE201 as opposed to GG201. I got a huge bundle of his overdue bills, credit collection notices, etc.. The difference is that I have a different middle name. That's the only way to distinguish between me and him, but none of the mailing systems actually have that distinction built in. So to all you jackass programmers out there that thinks that situation never comes up and is never a problem, think again.

    5. Re:Yeah, article is kind of asinine by SilasMortimer · · Score: 1

      And there are immigrants from cultures where it is completely common to have a hyphen or an apostrophe in their surname, or more than one word.

      In the US, you don't even need to look to immigrants for names with more than one word. In fact, it's fairly common among the people for whom the word "immigrant" is least applicable.

      I guess this is another case where demographics are important. I don't imagine that someone programming for the Aryan Brotherhood is going to worry about "Red Feather" in a single field. If your pool of consumers isn't so narrowly defined, however, take shortcuts at your own risk. My given name is Constantinos. To this day, some name entry fields feel that name is two characters too long. Really? It's only twelve characters! And I'll be honest with you, if a business can't keep my full first name in their database, I'll keep an eye out for something else or skip them entirely if it's not vital. It's no big deal to them, maybe, but it is to me.

      Is no one else reminded of the Texas politician who last year told Asian Americans that they should change their names so that "American poll workers" wouldn't make so many mistakes with them? I'm sure you catch the irony in that. It's the same with my own name. My name is American. I am American and that is my name. If that's not important to you, then apparently neither is my business.

      I'm kind of surprised to see how many people in these comments dismiss the ideas in the article, though. I'd have thought that after all this time, some of that attitude would have been adjusted, especially when people have had to deal with complaints (for instance, from people like me).

      --
      Omnes tuae crepidines sunt nobis sunt. Ascendo tuum!
    6. Re:Yeah, article is kind of asinine by Trepidity · · Score: 1

      That isn't really the case in practice, though. Most people who claim they have no surname have something that functions close enough to one for computer use. For example, Icelanders use their patronymic in that box.

  13. Thanks, Prince by BlueBoxSW.com · · Score: 4, Informative

    Thanks, Prince

    1. Re:Thanks, Prince by Anonymous Coward · · Score: 0

      My wife has a friend named Sarah * Daniels.

      I know. I know. You don't believe me. It's true. I'm a geek and I have a wife.

  14. Only names you need by Alcoholist · · Score: 1

    Foo and Bar, only names you need. Besides, Foo is a pretty name for a girl!

    --
    Bibo Ergo Sum.
    1. Re:Only names you need by Anonymous Coward · · Score: 0

      Foo Kyou
      Foo Kme

    2. Re:Only names you need by shutdown+-p+now · · Score: 1

      Besides, Foo is a pretty name for a girl!

      I prefer "Defenestratia".

      (yes, I do like Python)

    3. Re:Only names you need by Anonymous Coward · · Score: 0

      Don't forget Baz. We have a threesome tonight!

    4. Re:Only names you need by eth1 · · Score: 1

      Besides, Foo is a pretty name for a girl!

      You're right... it goes well with "Quit yo' jibba jabba, Foo!" :)

  15. American journalist by Anonymous Coward · · Score: 0
  16. Irish need not log in? by thepainguy · · Score: 5, Insightful

    My last name is O'Leary and over the past 5 years web sites have not gotten any better, and arguably have gotten worse, at handling the apostrophe in my last name

    Help me Slashdot, you're my only hope.

    1. Re:Irish need not log in? by kenj0418 · · Score: 5, Funny

      You've probably compiled a lengthly list of sites vulnerable to SQL-injection. I'm sure you could sell that to someone somewhere to compensate you for your pain and suffering.

    2. Re:Irish need not log in? by shutdown+-p+now · · Score: 4, Funny

      Help me Slashdot, you're my only hope.

      Do you think they'd let you change your legal name to something like "O';DROP TABLE users"? If so, you should be all set.

    3. Re:Irish need not log in? by MyLongNickName · · Score: 1

      Slashdot help you? Go to Idle, look at the CSS "design" and tell me that Slashdot is capable of helping anyone with coding...

      --
      See my journal for slashdot ID's by year. Mine created in 2005. http://slashdot.org/journal/289875/slashdot-ids-by-year
    4. Re:Irish need not log in? by Anonymous Coward · · Score: 0

      I have the same last name, and my current state won't even recognize it.

      I'm pretty sure the website of a certain airline doesn't block out the apostrophe. I can never select a damn seat because of javascript errors. My wife was fine, until she married me.

    5. Re:Irish need not log in? by SunFireSpaz · · Score: 0, Redundant

      Remember the XKCD comic from a few years ago?

    6. Re:Irish need not log in? by fishexe · · Score: 1

      Help me Slashdot, you're my only hope.

      Help me Obi-wan CowboyNeal, you're my only hope!

      --
      "I don't care about the Constitution!" --Bill O'Reilly, November 17, 2009
    7. Re:Irish need not log in? by EvilIdler · · Score: 1

      Well, stay the fuck away from Submin (a Subversion web admin tool). There was a bug report about how it could not accept apostrophes in names. It was marked "wontfix", and people have given up. The author of that software strikes me as an arrogant prick. I don't like dicks slapping me, so he should cut that right out.

      This is a general trend among English-speaking people with boring names. They can store their own name, so why bother making a system which can accept group members with foreign names?

    8. Re:Irish need not log in? by LaurieF · · Score: 1

      It's unfortunate you're not a customer of mine - I have code which deals with your name, and MacLean (and Machine!) and Fa'atau (Samoan) and A'Court. I appreciate that a certain number of people with Mc/Mac prefixes want the next letter to be lower-case: I used the local phone book as my guide; assuming (loosely) that they all had their names presented correctly, the majority had an upper-case letter after the 'C'.

      When first and last names are stored in the same field, I often want to strip off the surname from the first name(s). This is in most cases straight forward, but is more complicated when the surname is compound, like Te Heu Heu (Maori), Ter Borg and Van Der Water (Dutch) and Le Blanc (French). If someone's first name is "Van" and his surname is "Der Water", I'm going to get it wrong. But these situations are rare. Without extra information, it's just not possible to get it right.

      So the idea is to design the data entry form such that these are minimised. Make the parts of the name distinct entry fields, and put error checking behind each one of them. If it's possible, do the mix-casing in the background code, so the user can see how it's going to appear. Then if it's wrong at that point, Mr O'Leary, it's your fault!

      But it will never be 100% right. And it will only be part of the solution. The next bit - name matching, is hard. It will always be hard. False positives and false positives will always pop up. I leave that to much clever people than me to try and fail.

    9. Re:Irish need not log in? by Anonymous Coward · · Score: 0

      *Drool* ...

    10. Re:Irish need not log in? by j_sp_r · · Score: 1

      For cases like "van der" I usually add an extra field "things between". Names like this are quite common in Dutch.

    11. Re:Irish need not log in? by Anonymous Coward · · Score: 0

      Oh, yes. Little Painguy Tables, we call him.

    12. Re:Irish need not log in? by Calinous · · Score: 1

      I really shouldn't think O'Leary would be a foreign name for a US resident - Irish and Scots would be in the US by the millions

    13. Re:Irish need not log in? by nyctopterus · · Score: 1

      My last name is O'Leary and over the past 5 years web sites have not gotten any better, and arguably have gotten worse [...]

      I think this was one of the take-home points of this article. Don't get to clever with names by sanitising and dividing, just accept fookin' input.

    14. Re:Irish need not log in? by digitig · · Score: 1

      Irish yes. Scots don't have the O' prefix unless their family earlier migrated from Ireland.

      --
      Quidnam Latine loqui modo coepi?
    15. Re:Irish need not log in? by YttriumOxide · · Score: 1

      When first and last names are stored in the same field, I often want to strip off the surname from the first name(s). This is in most cases straight forward, but is more complicated when the surname is compound, like Te Heu Heu (Maori), Ter Borg and Van Der Water (Dutch) and Le Blanc (French). If someone's first name is "Van" and his surname is "Der Water", I'm going to get it wrong. But these situations are rare. Without extra information, it's just not possible to get it right.

      You should also be careful with "Van Der Water", as either that or "van der Water" may be preferred. This also affects sort order (the former should be sorted under "V", but the latter under "W"). I myself have the common Dutch surname "de Waal", and it causes me no end of grief to find my name in large lists since invariably it'll be sorted to a different location that I first guessed (it should be under "W", but if I look there, I'll not find it and it'll be under "D" instead... if I think they'll probably get it wrong, I check "D" only to find they've got it right under "W")

      My full name is "Benjamin de Waal" (no middle name). One of my bank cards reads "B DEWAAL", and another reads "B DE WAAL", while my American Express has "BEN D WAAL" (as if "de" were a middle name). My passport and birth certificate both have "Benjamin de Waal". I definitely do feel sorry for people with any more complexity than my relatively simple name!

      --
      My book about LSD and Self-Discovery
      Also on facebook as: DroppingAcidDaleBewan
    16. Re:Irish need not log in? by Anonymous Coward · · Score: 0

      http://xkcd.com/327/

    17. Re:Irish need not log in? by tepples · · Score: 1

      Don't get to clever with names by sanitising and dividing, just accept fookin' input.

      I do this until the credit card processing gateway requires a nonempty bill_to_first_name and a nonempty bill_to_last_name.

    18. Re:Irish need not log in? by Anonymous Coward · · Score: 0

      Where I live the official spelling of your name would be (replace first name with your own) Sean o Leary, or when the last name is used by itself O Leary. (Note: the apostrophe would not be used because the "o" in your name is not a shortening of "of" but derives from "ó" (originally "ua") which means "grandson".) I write this as a reminder that when people talk about the official spelling of names, there's also a large local component to it and if you were to move, or use a website located in some foreign country, you may have to adjust, especially in cases like this.

    19. Re:Irish need not log in? by alexo · · Score: 1

      My wife was fine, until she married me.

      99% of married women have that complaint.

    20. Re:Irish need not log in? by moogaloonie · · Score: 1

      My last name is O'Leary and over the past 5 years web sites have not gotten any better, and arguably have gotten worse, at handling the apostrophe in my last name Help me Slashdot, you're my only hope.

      I can understand all of these programmers failing to anticipate Gaelic names, but Klingon too?

    21. Re:Irish need not log in? by Anonymous Coward · · Score: 0

      My last name is O'Leary and over the past 5 years web sites have not gotten any better, and arguably have gotten worse, at handling the apostrophe in my last name

      Help me Slashdot, you're my only hope.

      As I posted earlier, it's not an apostrophe. That little thing near the O is a fada -- the Irish long vowel mark. The O is not part of what follows -- it simply means "descendant of".

    22. Re:Irish need not log in? by Anonymous Coward · · Score: 0

      And the Webby for Best Version of the XKCD callback goes... to

  17. not surprising by Phoenix+Dreamscape · · Score: 2, Interesting

    Considering how many entry forms still don't allow '+' in an e-mail address (or, worse, allow it in the sign-up box but not in the unsubscribe box), and considering how many banks still restrict you to an 8-character password, does it come as any surprise that they have difficulty with something that isn't defined in an RFC?

    1. Re:not surprising by panda · · Score: 1

      I have found that most people in IT, who should know better, are blissfully ignorant of RFCs. I wish I had a dollar for every time I have had to point out RFC 2822 to a "professional" mail server administrator, and explain to them point by point why their server is broken. And these are admins at businesses, like banks, publishers, etc. These are people who are paid with the expectation that they know these things. Half of the time, the first words out of their mouth is "What's a RFC?"

      There is a lot of "cargo cult" programming and administration going on, and I suspect that is largely where problems like those outlined in the article come from.

      Oh, and after reading the above, I thought it was a complete waste of my time. The author makes no suggestions for correcting these problems. I've dealt with a lot of issues with names over the years as a database programmer. There are no universal solutions. Everything depends on the needs of the users and the specification that you're given.

      --
      Just be sure to wear the gold uniform when you beam down -- you know what happens when you wear the red one.
    2. Re:not surprising by Ash-Fox · · Score: 1

      These are people who are paid with the expectation that they know these things. Half of the time, the first words out of their mouth is "What's a RFC?"

      You wouldn't like me, I have a tendency to quote things from RFC 1796 when I don't follow RFCs in certain instances due to my own 'good reasoning' which people may oppose.

      And now, I shall quote you a piece:

      It is a regrettably well spread misconception that publication as an RFC provides some level of recognition. It does not, or at least not any more than the publication in a regular journal.

      --
      Change is certain; progress is not obligatory.
    3. Re:not surprising by geekoid · · Score: 1

      That drives me up a wall. It's a perfectly valid email character.

      --
      The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
  18. You are number 6 by Punto · · Score: 1

    You are number 6

    --

    --
    Stay tuned for some shock and awe coming right up after this messages!

    1. Re:You are number 6 by Anonymous Coward · · Score: 0

      I am not a number. I am a FREE MAN!

  19. Coral Cache FTW... by Qubit · · Score: 1

    Okay, I finally got the page to load:

    Falsehoods Programmers Believe About Names

    If only Slashcode would do a single hit on <domain> + .nyud.net + /rest/of/url/ for every link posted in an article, then it would be trivial to switch over to cached copies of the content.

    --

    coding is life /* the rest is */
  20. Yea f-ing right. What about little Bobby Tables? by Narcocide · · Score: 1, Redundant
  21. Well Duh by Saint+Stephen · · Score: 4, Insightful

    First thing I learned back in 1993 when I got started.

    1) George Foreman has five boys named George Foreman. Your database better be able to handle that.
    2) Your database better be able to handle Cher (no last name).
    3) People are not required to have Social Security numbers. (it's an optional program - you don't have to partipate).
    4) Not everyone's last name starts with a capital letter.
    5) Mexican people's names break ASCII (the tilda n).
    6) People named O'Grady have a hard time getting their name in a database sometimes and have a hard time getting their name passed via a URL sometimes and generally mess stuff up.
    7) People from Sri Lanka will break your name length limits.
    8) Some people's name is only a single letter.
    9) Some people go by their middle name god damn it! :-)

    1. Re:Well Duh by Eskarel · · Score: 2, Interesting
      1. Don't use names as a unique identifier, they're not.
      2. Cher has a last name, as most likely did Homer and Virgil and everyone else, they're last names might have been "from _____ or the ______", but they still had one.
      3. It's illegal to use SSN as a unique identifier, so don't use it as one.
      4. Who cares, don't muck around with case, and search case insensitive, more matches are better than not enough.
      5. There are conventions to get around that in ASCII, but unicode solves most of it anyway.
      6. Always properly encode and decode your data to meet the requirements of your medium.
      7. You still have to have name limits, and someone's name will always break it, using some ridiculous number of characters in your database is just going to kill your database.
      8. No ones name is a single letter in any language I've ever heard of(a single character, but that's not the same thing), and since names aren't unique or identifying this doesn't really matter.
      9. Who cares?

      Names are not meaningful except to the people who have them, and they're deluding themselves. You are not your name, and your name is not you.

    2. Re:Well Duh by SpecBear · · Score: 2

      Our software can handle eight of those. Possibly nine, I don't know how long Sri Lankan's names get.

      The company I work for gets paid to make software. If someone wants to pay my employer to support certain features, then we'll build in that support. If the client says "Anyone without a last name can suck it" (and that has happened) then the system won't support that.

      As a hired gun (keyboard?) whether what i believe about names is true or false is irrelevant. I believe what I get paid to believe.

    3. Re:Well Duh by FauxPasIII · · Score: 2

      > 8) Some people's name is only a single letter.

      A former coworker of mine is named "Hyun O". He was the only person in the company not to follow the first initial, last name standard for his email address. =)

      --
      25% Funny, 25% Insightful, 25% Informative, 25% Troll
    4. Re:Well Duh by RedWizzard · · Score: 1

      Cher has a last name, as most likely did Homer and Virgil and everyone else, they're last names might have been "from _____ or the ______", but they still had one.

      Cher had a last name at birth. But her legal name is now "Cher", with no last or middle name. She legally changed her name in 1979.

    5. Re:Well Duh by YttriumOxide · · Score: 1

      # No ones name is a single letter in any language I've ever heard of(a single character, but that's not the same thing)

      I have friend named "E". It's pronounced roughly like the dismissive "Eh" sound you might make when you want to express apathy. So yes, a single latin letter is quite possible as a name.

      and since names aren't unique or identifying this doesn't really matter.

      Absolutely correct... that's sort of the point of both the article and many of the posts here.

      --
      My book about LSD and Self-Discovery
      Also on facebook as: DroppingAcidDaleBewan
    6. Re:Well Duh by Anonymous Coward · · Score: 1, Interesting

      Also, "Teller" (from Penn & Teller) has no first name, which may or may not be an important distinction.

    7. Re:Well Duh by bjourne · · Score: 1
      Good points, except:

      You still have to have name limits, and someone's name will always break it, using some ridiculous number of characters in your database is just going to kill your database.

      It is only in MySQL where you have to specify a max length for the varchar column types. In more advanced rdbms:s, such as postgres, you do not have that limitation and can happily store as long names as you want.

    8. Re:Well Duh by Saint+Stephen · · Score: 1

      Some people assume full name is unique within a single family policy. I.e., which dependent do you want to file a claim for? They assume you won't have two kids with the same name, and the U/I can't distingish which is which.

    9. Re:Well Duh by sydneyfong · · Score: 1

      You still need limits. Otherwise good luck when you get that hacker who has a 10-Gigabyte last name.

      --
      Don't quote me on this.
    10. Re:Well Duh by Anonymous Coward · · Score: 0

      I knew a woman who worked for a company who used First Initial, Middle Initial, Last Name. Her name was something like Melanie R. Smith (not gonna give her real name), so her email address was mrsmith@company.com.

    11. Re:Well Duh by Anonymous Coward · · Score: 0

      Item 3 has not been true in the US for about 15 or 20 years; you need an SSN for an infant by the time you propose to claim them as a dependent on your taxes.

      And item 7 is only true of traditional ASCII7, anything newer, like the very common 8859-1, handles it fine.

      Item 9: there are people whose middle name is "Goddamnit"?

  22. Programmers hate my real name by SexyKellyOsbourne · · Score: 4, Funny

    My first name: "where 1=1 "
    My last name: "'; drop table users; --"

    1. Re:Programmers hate my real name by ProzacPatient · · Score: 1

      Hey, you must be Bobby Tables!

  23. Looking at it from the wrong direction by quietwalker · · Score: 1

    The article writer started out well, and then immediately ignored his own line of thought.

    "John Graham-Cumming wrote an article today complaining about how a computer system he was working with described his last name as having invalid characters. It of course does not, because anything someone tells you is their name is--by definition--an appropriate identifier for them."

    Yes, in your scope, your name is going to be accurate by whatever cultural, political, legal, optional, character-set or other restrictions or freedoms apply.

    However, in the scope of any given app, your name may very well be an 8 character, [a-zA-Z] string. That's what the app means by the name field - by definition.

    Naturally, no programmer would use any user-provided input for any sort of unique key ( err... right?) ... so really what this is about is that someone did not properly set user expectations, that they may not be allowed to arbitrarily spool data in any format or fashion into any given input field - 'name' in this case.

    Or someone already knows that fact, and is ranting because they chose today to ignore it.

    1. Re:Looking at it from the wrong direction by pjt33 · · Score: 1

      For any database which is used to communicate with you by phone or mail, the name field is expected to correspond to your name. Except, of course, that usually there are multiple fields to handle different parts of your name, and plenty of possibility for someone to stick the wrong parts in the fields.

  24. Why do programmers get the blame? by justfred · · Score: 5, Insightful

    I code to spec. The product and marketing departments write the spec (what little there is); the QA department amends the spec with overly specific test cases. I suggest that the spec is incomplete and won't handle...but I'm told, just code it to spec. I recommend changed, but we don't have time for edge cases. I point out potential problems, but we're unlikely to get any of those. I warn of potential compatibility problems but we don't care. Are you just trying to be difficult? If there's a problem QA will catch it. The project is overdue already, and by the way here are some new requirements that need to make it in, and we can't change the release date because we already promised the stockholders. Why is your code so complicated, my twelve-year-old kid could write this.

    It's not my fault. I code to spec.

    1. Re:Why do programmers get the blame? by joost · · Score: 1

      This is so true!! "We" developers know what's going on most of the time. But we are required to code to spec because the business owners have final say. So some times bad software ships, because we have been required to code it wrong.

    2. Re:Why do programmers get the blame? by Anonymous Coward · · Score: 0

      Been there. Solution: find a smarter boss. Life can be so much better.

  25. Missing in list: Single names & Initials by aneroid · · Score: 1

    Single names: I've done data conversions on a project which covered multiple countries in Asia, Africa and Latin America. A "new" thing I came across about African names was: Some people have just a single name. It's not their first name or their last name, it's just their name.Though, they're generally okay with it being considered their first name. However, most legal docs require a last name (well, maybe not in some African countries) so we had to use dummy text in the first name.
    (Almost covered in assumptions 19 & 20 but not quite.)

    Initials: Globally, there are ppl who have a single letter, or two, as their first or last name. These could be initials of their parents' names or based on something else. Point is, min. length requirement of 3 or even 2 does not work.

    Implications of the article in general:
    1. You can't do any validation on people's names. No programmer would consider this proper.
    2. You can't use someone's nationality to create validation rules, since nationality/citizenship can change - and using 'country of birth' doesn't cover re-located/migrated families, assuming your application uses that field.
    3. Most Importantly...and Obviously: Either ask the client/customer what allowances they want in the application. Or, clearly state your assumptions and have them review & refine it.

    1. Re:Missing in list: Single names & Initials by shutdown+-p+now · · Score: 1

      You can't do any validation on people's names.

      That, really, is the point of TFA. It is 100% true. You go for anything stricter than that, and you will get it wrong, so don't.

      No programmer would consider this proper.

      Programmers, like other people, have to deal with the way reality works. Constraints in your database schema do not have a magical property of changing the world to fit them.

    2. Re:Missing in list: Single names & Initials by Jack9 · · Score: 1

      You can't do any validation on people's names.

      That, really, is the point of TFA. It is 100% true. You go for anything stricter than that, and you will get it wrong, so don't.

      Wrong is a matter of perspective. Remove all non-alphanumeric. Unless you're working with a legal requirement (which will have it's own validation contract), just be safe by remaining simple and setting an understandable limit. Avoiding hurting people's feelings because they have to put in a phonetic translation or alias, does not have a good ROI.

      --

      Often wrong but never in doubt.
      I am Jack9.
      Everyone knows me.
    3. Re:Missing in list: Single names & Initials by thsths · · Score: 1

      Nationality is an interesting issue. Not only can it change, but you can also have several nationalities. And since laws governing names are national laws, you could be known under different names for your different nationalities. How would you cope with that?

    4. Re:Missing in list: Single names & Initials by Anonymous Coward · · Score: 0

      Avoiding hurting people's feelings because they have to put in a phonetic translation or alias, does not have a good ROI.

      Because nvarchar(100) is such a large investment compared to writing an entire validation function that fails on real world names?

    5. Re:Missing in list: Single names & Initials by Tim+C · · Score: 1

      Wrong is a matter of perspective. Remove all non-alphanumeric.

      One of my friends has the surname O'Reilly and would not be terribly impressed with that suggestion. For that matter, I suspect the founder of the well-known publishing company wouldn't be too happy either.

      You're also going to upset people with hyphenated names - my daughter's mother's first name, for example, is Lisa-Jane, and I've known people with double-barrelled surnames.

      My point being that "strip all non-alphanumerics" is very easy to code, but fails with even with perfectly ordinary English names.

    6. Re:Missing in list: Single names & Initials by amRadioHed · · Score: 1

      Single names: I've done data conversions on a project which covered multiple countries in Asia, Africa and Latin America. A "new" thing I came across about African names was: Some people have just a single name. It's not their first name or their last name, it's just their name.Though, they're generally okay with it being considered their first name.

      Using the terms "First name" and "Last name" are bound to cause confusion if you include single names and Asian names. Using the terms "Family name" and "Given name" solves the problem.

      --
      We hope your rules and wisdom choke you / Now we are one in everlasting peace
    7. Re:Missing in list: Single names & Initials by alcourt · · Score: 1

      Mc* and Mac* names sometimes do and sometimes do not have spaces in the name. Where I work, two people have the same first and last name. One person spells his with a space, the other without. It's part of the proper spelling of their name. Don't try and tell either one to change the spelling of their name.

      --
      "I may disagree with what you say, but I will defend unto the death your right to say it." -- Voltaire
    8. Re:Missing in list: Single names & Initials by ZombieBraintrust · · Score: 1

      It is not terribly hard to keep a sort by name. Basically you strip out all characters you can't sort and up-shift everything. Then you never show the modified name to the user. You just use it for sorting on user interfaces, finding potential duplication of data, indexing the database for search, and interactions with legacy systems.

    9. Re:Missing in list: Single names & Initials by david_thornley · · Score: 1

      Remove all non-alphanumeric.

      Congrats; you have just invalidated my wife's name (hyphenated middle name), a colleague's (hyphenated last name) and the names of a lot of people of Irish descent, and that's just in the US.

      --
      "When you have eliminated the unacceptable, whatever is left, however improbable, must be the truthiness" - Holmes
    10. Re:Missing in list: Single names & Initials by Jack9 · · Score: 1

      Congrats; you have just invalidated my wife's name (hyphenated middle name), a colleague's (hyphenated last name) and the names of a lot of people of Irish descent, and that's just in the US.

      I didn't DO anything to anyone.

      --

      Often wrong but never in doubt.
      I am Jack9.
      Everyone knows me.
    11. Re:Missing in list: Single names & Initials by Jack9 · · Score: 1

      One of my friends has the surname O'Reilly and would not be terribly impressed with that suggestion. For that matter, I suspect the founder of the well-known publishing company wouldn't be too happy either.

      You're also going to upset people with hyphenated names - my daughter's mother's first name, for example, is Lisa-Jane, and I've known people with double-barrelled surnames.

      My point being that "strip all non-alphanumerics" is very easy to code, but fails with even with perfectly ordinary English names.

      Ahem, that's a lot of specious nonsense.

      I don't care if it's impressive to someone else. It's the simplest solution.
      People getting upset is inevitable. It's Monday, people are upset. It's Wednesday, people are upset. So what.

      Your summation of "fails" and "ordinary" are arbitrary characterizations. If there's a business case to be made, fine. For the sake of "correctness", the threshold of accuracy can be set very low and still yield recognizable monikers.

      --

      Often wrong but never in doubt.
      I am Jack9.
      Everyone knows me.
  26. Eat some oats by Anonymous Coward · · Score: 0

    Eat some sheep too

  27. I can has cheeseburger? by Anonymous Coward · · Score: 0

    I can has cheeseburger

  28. it's not software, it's people by yyxx · · Score: 2, Insightful

    Software shouldn't have to satisfy every whim and excentricity. If you don't have a well-defined first name and last name that consists of extended alphanumeric characters in Unicode and starts with a letter, well, then get one, OK? And while you're at it, come up with decent Romanized and ASCII (traditional Latin) versions of your name, conformant with one of the common Romanization systems of your language; you will need that too if you want to travel internationally. Single letter names are also a potential problem because they are confusable with abbreviations, so consider using a variant spelling ("O" -> "Oh").

    This isn't because programmers have some sort of hangups about names, it's because people themselves need to be able to refer to individuals in some reasonable and standardized way, they need to be able to write your name, alphabetize it, and correct errors.

    1. Re:it's not software, it's people by shutdown+-p+now · · Score: 1

      Software shouldn't have to satisfy every whim and excentricity.

      Are you implying that cultural differences are "whim and eccentricity"?

      Software should have to satisfy real world user input, it's as simple as that. It certainly shouldn't be intentionally coded so as to reject it. The database itself doesn't care if the value of the "last_name" field for some row is blank, or doesn't start with a letter, and neither the browser when displaying the HTML input form. So why does your code sitting between the two does this silly check? What valid business case does it enable?

      If you don't have a well-defined first name and last name that consists of extended alphanumeric characters in Unicode and starts with a letter, well, then get one, OK?

      Where do you propose people who don't have a legal last name get one?

    2. Re:it's not software, it's people by jack2000 · · Score: 1

      Depends on which country you are in but I'd say you can get or change your last name wherever your legal papers are issued. Everyone should have at least two names. No excuses.

    3. Re:it's not software, it's people by Lunix+Nutcase · · Score: 1

      Everyone should have at least two names. No excuses.

      Why?

    4. Re:it's not software, it's people by yyxx · · Score: 1

      Are you implying that cultural differences are "whim and eccentricity"?

      No, but insisting on, say, putting Chinese characters into a French e-commerce application would count as a "whim and eccentricity". So would spelling your name as "3'mi1e 20la".

      Software should have to satisfy real world user input,

      It's not just about the "user input", it's what happens with it. If a Chinese user enters Chinese characters for his name, the people actually trying to fill his order won't be able to read or pronounce his name; they can't even write it down or type it. That's why the correct thing to do is to refuse accepting Chinese characters in the first place.

      The database itself doesn't care if the value of the "last_name" field

      No, but the users of the database do.

      Where do you propose people who don't have a legal last name get one?

      In some cases, they can just make one up. In other cases, there may be standard ways of dealing with the issue (e.g., Iceland). In yet other cases, it's a good idea to get one legally (e.g., when immigrating to the US).

    5. Re:it's not software, it's people by Anonymous Coward · · Score: 0

      you can get or change your last name wherever your legal papers are issued.

      Why should I have to legally change my name to suit your xenophobic restrictions?

      Everyone should have at least two names. No excuses.

      Why? There are cultures where people do not have two names. There are cultures where people do not have surnames. Just because you have never been further than fifty miles from Podunk, doesn't mean that the whole world looks like Podunk.

    6. Re:it's not software, it's people by soppsa · · Score: 1

      You must be a sheltered america. Most people who immigrate to an english speaking country take an english name, and give their children english names. This is not even for computers, but like the GP pointed out, for *people* to be able to read and understand. The dude working at the DMV isn't going to (nor should he) be able to read 20 different written languages.

    7. Re:it's not software, it's people by Anonymous Coward · · Score: 0

      Why should I have to legally change my name to suit your xenophobic restrictions?

      Because if you don't, you ain't getting in.

      Yours in the department of immigration & naturalization,
          Kilgore Trout.

    8. Re:it's not software, it's people by Anonymous Coward · · Score: 0

      Why? There are cultures where people do not have two names.

      Yes, but American culture uses two names (three actually).

      Why should I have to legally change my name to suit your xenophobic restrictions?

      You don't have to; the US will let you get away with pretty much anything. However, it's a good idea to change your name to conform to US cultural norms when you immigrate because that's how your fellow Americans will interact with you.

      Immigration means giving up your old culture and becoming a member of a new culture and society. The real question is: if you are so hostile and negative towards American culture as you seem to be, why would you possibly want to become an American? Seems to me the xenophobe is you, not the Americans.

    9. Re:it's not software, it's people by shutdown+-p+now · · Score: 1

      You must be a sheltered america. Most people who immigrate to an english speaking country take an english name, and give their children english names. This is not even for computers, but like the GP pointed out, for *people* to be able to read and understand. The dude working at the DMV isn't going to (nor should he) be able to read 20 different written languages.

      Do you mean writing system when you say "take an English name"? 'cause I sure am not changing my name to its English analog, and very few Indian or Chinese immigrants around here do so, either.

    10. Re:it's not software, it's people by michaelwv · · Score: 1

      There's such a fine line between trolling and irony.

    11. Re:it's not software, it's people by Estanislao+Mart�nez · · Score: 1

      Most people who immigrate to an english speaking country take an english name, and give their children english names.

      I don't know about the children part, but most immigrants to the USA are from Latin America, and they keep their Spanish names.

      Immigrants' handling of names, however, is very dependent on ethnicity. Chinese who come to the USA very much do tend to take an English name; for their children, they tend to give them an English first name and a Chinese middle name, but they use that middle name as the child's first name in Chinese. Japanese and Koreans do this less in my experience, but it does happen. Indians with very long names that are hard for Americans tend to simplify the names, but they don't adopt English names.

  29. ...so what? by SanityInAnarchy · · Score: 4, Insightful

    It seems to me that most misconceptions about names can be fixed by the following:

    Allow a single, Unicode-enabled field of "unlimited" length (let's say 4 kilobytes) which represents "name". Several would be defined by different roles -- "Real name", "Nickname", "login", where only login (sometimes simply an email address) is required to be globally unique.

    Now let's look at what that breaks:

    First, #1, 2, 4, and 5. How am I supposed to avoid assuming these? People should be allowed to enter an arbitrary number of names for themselves? I suppose that's possible, but it immediately kills most of the potential uses of this data. If I want to set a nickname that goes with my forum posts, say, what good is it for me to have five nicknames? Seems like the only potential use would be making people easy to find by real name -- so, a social network.

    #6 -- surely 4k is enough, but this is also not a terribly difficult assumption to change later. Annoying, but not devastating, not even as hard as changing from the first name / last name combination into one "real name" field.

    #7, 8 -- most systems would make it trivial for people to change their names.

    #9, 10 -- UTF8 is easy.

    #11 -- very, very curious to see an example. And wouldn't that be a bug in Unicode? And this is again one where I have to ask -- how do you change this? Allow arbitrary images?

    #12, 13 -- obvious solution is to make the name system case-preserving, thus allowing both case-sensitive and case-insensitive searches.

    #14 -- again, avoid by simply allowing the name to be a single opaque field.

    #15, 16, 17 -- if your name supports random unicode, no idea why these would be a problem.

    #18 -- not sure why it matters.

    #19, 20 -- again, if it's just arbitrary text, it just works.

    #21, 22, 23 -- not sure how I'd make that assumption.

    #24, 25, 26, 27 -- again, the name is just an opaque bunch of characters.

    #28 -- what?

    #29 -- opaque characters.

    #30 -- keep the original text as-is. If you want to try to split people out by naming scheme, do it later, but keep the original. This should be a "duh" concept -- always preserve the original user input. Cache transformations for speed, if you like, but they're a cache -- keep the original. Your algorithm might change.

    #31 -- bad idea to assume bad words won't cause problems in general. I currently play an MMO in which I physically can't talk about Emily Dickinson, and have occasion to more frequently than you might suspect.

    #32-36 -- why would it matter? Unless...

    #37 -- Fine, but how would I otherwise connect the same person?

    #38 -- How about unicode-equivalent? And of course, they might not -- one might make a mistake, or the name might be represented differently. But you'd have to deal with typos anyway, so this isn't exactly shocking.

    #39 -- I'm going to have to agree with the assumption, though. If I develop a system which works well for people who only follow the US standard, and I suddenly have a ton of people from China wanting to use my service -- enough that this is actually a problem for me -- that's a nice problem to have.

    #40 -- People can make up names. I guess this explains #32-36, though.

    The sense I get is that half the list is stuff you'd almost have to be stupid to run into (seriously, who doesn't use Unicode?), and the other half involves some seriously weird names and cultures that are going to have to meet me halfway, if they expect me to do anything interesting with their name. As I understand it, the only way to get this right would be to allow people to have zero or more names, each of which is either an unlimited amount of text in any encoding, or an image (raster or vector) of unlimited size. To query such a system requires insane amounts of logic just to deal with the text, and throw in some OCR for good measure.

    I think this is a case where I would much rather see people evolve to match the technology, rather than the other way

    --
    Don't thank God, thank a doctor!
    1. Re:...so what? by Culture20 · · Score: 1

      People can make up names. I guess this explains #32-36, though.

      In some cultures, you don't get assigned a real name until you're an adult and have had a sacred vision. Your true name is assigned to you, not made up by you.

      I'm not going to store your name as a bitmap

      Although in the case of people who do make up their own names, what have you got against (TM) between 1993 and 2000? Wikimedia stores his name as an SVG. Skads better than bmp.

    2. Re:...so what? by Anonymous Coward · · Score: 0

      #11 -- Prince

      And no, rockstars being publicly retarded to draw attention to themselves is not a bug in unicode, nor is it a bug in your website.

    3. Re:...so what? by M.+Baranczak · · Score: 1

      The sense I get is that half the list is stuff you'd almost have to be stupid to run into (seriously, who doesn't use Unicode?)

      In this day and age, I can't imagine any halfway-serious system that doesn't support Unicode

    4. Re:...so what? by Anonymous Coward · · Score: 0

      #11 -- very, very curious to see an example. And wouldn't that be a bug in Unicode? And this is again one where I have to ask -- how do you change this? Allow arbitrary images?

      http://en.wikipedia.org/wiki/File:Prince_logo.svg

    5. Re:...so what? by Todd+Knarr · · Score: 1

      For #s 1-5, take my father as a simple case. Depending on who's talking, he can be either "Daniel" or "Tim". "Daniel" usually means the person doesn't know anybody else in my father's family (fairly reasonable, Dad lives in Nevada and the rest of his family's in Pennsylvania (the ones who're still alive, that is)). He gets called "Tim" by anybody he grew up around, because every male of that generation of his family got "Daniel" as a first name and inevitably went by their middle names or some abbreviation thereof to avoid endless rounds of "Daniel. No, the other Daniel. No, the other other Daniel.".

      Now, think about how many variations you can get on a name taking into account just various shortenings of first and middle name and the ways they can be combined (ie. first initial plus middle name vs. first name plus middle initial), and add in nicknames ("No, I'm just named 'Dweezil', I'm called 'Ed'."). Now account for changes in name (a friend of mine's on her fourth legal first plus middle name). And we haven't even begun to touch on changes due to marriage.

      And this is for ordinary (in the US) names. We haven't even begun to get into the rest of the world, or cultures who distinguish between what you call yourself, what people who know you call you, what strangers call you and what your name is (which we even have a bit of in the US, as in what my Mom said to an official who'd annoyed her: "My friends call me Patricia. You will call me Mrs. Knarr.").

    6. Re:...so what? by Anonymous Coward · · Score: 0

      seriously, who doesn't use Unicode?

      Apparently, enough that the author felt the necessity to point it out.

      Slashdot simply strips out Unicode. Whether or not that's good is neither here nor there, the fact is that it is that way. I'd imagine there are quite a few other sites that don't support it (but have no anecdotal evidence for this as I don't generally sign up for things and even if I do, my name is fairly standard).

    7. Re:...so what? by Mycroft_VIII · · Score: 1

      Off Topic and possibly wrong, but IIRC he did that because of a dispute with a publisher(Warner bro's?) who held contractual rights on the use of 'Prince'.

      Mycroft

      --
      https://signup.leagueoflegends.com/?ref=4c3ed6600b6ea
    8. Re:...so what? by Bazer · · Score: 1

      There should be a standard for handling names in a database. What's the equivalent of an RFC for database design?

    9. Re:...so what? by sco08y · · Score: 1

      Allow a single, Unicode-enabled field of "unlimited" length (let's say 4 kilobytes) which represents "name".

      Two fundamental questions:

      1. How do you represent this name on an invoice?

      2. And how can phone support look up individuals, and greet them on the phone?

      Remember, the point of this is to do some kind of transaction with customers.

    10. Re:...so what? by Rigrig · · Score: 1

      #11 and not having a character for whatever random symbol any person thinks of could hardly be considered a bug in Unicode.

      --
      **TODO** [X] Steal someone elses sig.
    11. Re:...so what? by Hognoxious · · Score: 1

      The sense I get is that half the list is stuff you'd almost have to be stupid to run into (seriously, who doesn't use Unicode?)

      Me. William Shakespeare. Julius Caesar. Jesus Christ.

      In no particular order.

      --
      Confucius say, "Find worm in apple - bad. Find half a worm - worse."
    12. Re:...so what? by Hognoxious · · Score: 1

      In some cultures, you don't get assigned a real name until you're an adult and have had a sacred vision. Your true name is assigned to you, not made up by you.

      Hate to break it to you, but Klingons and Bajorans aren't real.

      --
      Confucius say, "Find worm in apple - bad. Find half a worm - worse."
    13. Re:...so what? by Hognoxious · · Score: 1

      Even if you did that some silly twat would choose a name that was a dance, a series of facial expressions, or a tune made of fart noises.

      --
      Confucius say, "Find worm in apple - bad. Find half a worm - worse."
    14. Re:...so what? by Anonymous Coward · · Score: 0

      Ahem. It's 4 a.m., and the medical software In use at your hospital has to handle the name of a Burmese immigrant who's not exactly at their best. So, you tell them to grow up and enter the world of the Internet?

      Just because it doesn't (usually) matter for you doesn't mean to doesn't matter at all. Jerk.

    15. Re:...so what? by jvkjvk · · Score: 1

      Hate to break it to you but those patterns were based on some Earth religions.

      Ever hear of people taking up new names given to them by their gurus?

      Yeah.

      Reagards.

    16. Re:...so what? by Anonymous Coward · · Score: 0

      Ah, so it's a bug in the legal system.

    17. Re:...so what? by Anonymous Coward · · Score: 0

      Several would be defined by different roles -- "Real name", "Nickname", "login", where only login (sometimes simply an email address) is required to be globally unique.

      Now let's look at what that breaks:

      It may already be broken at this point, if you're using email as a globally unique identifier. My parents, for example, share one email address - the one their ISP gave them. If you want both of them to be able to sign up for a site, better don't rely on email addresses being unique.

      (I imagine quite a few Slashdotters might tell my parents to simply get an extra email address from GMail or whatever, but that's missing the point: namely, that the system already can't handle everything that's out there "in the wild".)

    18. Re:...so what? by SanityInAnarchy · · Score: 1

      That's why I also mentioned vector. The issue isn't raster or vector, it's image or text.

      --
      Don't thank God, thank a doctor!
    19. Re:...so what? by SanityInAnarchy · · Score: 1

      Whoops, replied too soon...

      Also, until you get that sacred name, you have some name. Changing names is perfectly legitimate. Having no name is the part that doesn't make a lot of sense to me.

      --
      Don't thank God, thank a doctor!
    20. Re:...so what? by SanityInAnarchy · · Score: 1

      How do you represent this name on an invoice?

      Verbatim.

      how can phone support look up individuals,

      Look up by name? Tokenize it and search it, the same way you'd do any other fulltext search.

      and greet them on the phone?

      Ah, but if you've got them on the phone, you've likely already tied them to their account anyway, by all the other data you've got on them -- mailing address, phone number, etc. Each of these present their own problems, of course...

      I wasn't trying to trivialize the problem of querying this data. I was trying to point out that what you need to know, when writing an application, is to store the original text as a mostly opaque blob. Then, as we figure names out together, you can get more and more sophisticated about querying it.

      --
      Don't thank God, thank a doctor!
    21. Re:...so what? by SanityInAnarchy · · Score: 1

      I imagine quite a few Slashdotters might tell my parents to simply get an extra email address from GMail or whatever, but that's missing the point: namely, that the system already can't handle everything that's out there "in the wild".

      Actually, it's exactly the point -- your parents are an edge case, and they can easily and pro-actively deal with sharing an email address. In fact, chances are, this isn't an issue -- if they can share an email address, they can almost certainly share an account in my system.

      Now, we could just do it on login, but I think the activation-by-email route is still one of the better CAPTCHA-like systems.

      --
      Don't thank God, thank a doctor!
    22. Re:...so what? by SanityInAnarchy · · Score: 1

      An edge case. #39 on the list was the main point I was trying to refute -- that is, yes, my system is going to handle the vast majority of cases, and situations like yours -- sure, I'd like to handle those, but that's where it gets hard enough that there's no guarantees, even if I'm trying.

      Also, "immigrant" implies that they're now a citizen, or at least a resident, meaning they'll have a legal name somewhere. What the system needs is to be able to update to that name, and sync with records associated with it, once he's coherent enough to communicate it. If he's not, put whatever name we can figure out, or give him a number until then.

      --
      Don't thank God, thank a doctor!
    23. Re:...so what? by Hognoxious · · Score: 1

      Hate to break it to you, but you obviously can't spot a flippant remark when you see one. Get off your high horse already.

      In any case, such a system is totally stupid. How does it work in school? You-yes-you-the-fat-kid-in-the-third-row-second-from-left, him-on-the-back-row-with-the-dodgy-eye-and-smelly-armpits has forgotten his textbook, would you mind sharing? Then the bell goes.

      Another thing - anyone who spouts about gurus, visions, spirit guides and the like is a fucking twat - and most likely either gullible or a charlatan.

      --
      Confucius say, "Find worm in apple - bad. Find half a worm - worse."
  30. it's not as easy as you think by uremog · · Score: 1

    Databases don't work with names well because it's really freaking hard even for a human sometimes. People have funky spellings, middle initials, sometimes full middle names, phone operators mis-type or mis-hear names, or assume a different spelling. People have accents which messes up people hearing names and bad handwriting which messes up people reading names. People have names that look like words that should be cleaned from names like "Mr. John Trust". While "Trust" should usually be cleaned from a list of bank trusts, it can't be cleaned here. Some people do last name first. You want to add another language and address standard in there, good freaking luck. As far as I know, doing data processing as a job, if you want good results, you MUST look at a very small percentage of an average database manually (maybe 0.01%-1% depending on how dirty your DB is). But for any global company, that could still be tens of thousands of checks you have to make and it will be very time consuming.

  31. You make a wrong assumption about software by Anonymous Coward · · Score: 0

    I do work on knowledge base systems, and in particular, research into feeding data into knowledge representations in an automatic way from NLP systems, often from cross-lingual sources. For what I do, issues about things like the naming of persons present real matters that need to be foreseen and thought through before invalid assumptions creep into systems and create headaches down the road.

    From the tone of your post, I presume that you work primarily on databases that store stuff like customer information. So maybe you don't need to worry about this kind of thing as long as you constrain your business to a certain segment of the world.

    But your problems aren't everyone's problems; some of us contend with more complicated issues. Some of us, I imagine, do actually want to do business in more than one country and care about things like this - I'd imagine people that write software for the EU probably have to contend with these kinds of issues pretty frequently. The author of the article is pointing out something that you might not care about, but that doesn't make it stupid or useless.

    1. Re:You make a wrong assumption about software by Vellmont · · Score: 1

      Yes, there are cases where getting closer to perfect modeling of names is more important. I still maintain that no model or software is ever going to be perfect of something as complex and changeable as a name. If you think it is you're just not being creative enough.

      The point though is NOT that there aren't cases where you need a better model. The point is that the author of the article is a crazy person who thinks EVERYTHING has to model names in some absolutely perfect manner. The truth is the vast majority of software doesn't have to go to the extremes the author is talking about. Models are imperfect. If they were perfect, they wouldn't be models, they'd be reality.

      --
      AccountKiller
  32. Not quite by fishexe · · Score: 1

    He's essentially arguing that, because names vary a lot and are complex, your software should never do anything useful with them. Sorry, but that's a stupid answer.

    Actually, he's arguing that you should become infinitely wise and knowledgeable prior to doing anything useful with names, and failing that you should never do anything useful with them. Which is an even stupider answer.

    --
    "I don't care about the Constitution!" --Bill O'Reilly, November 17, 2009
  33. Admit Defeat by mtinsley · · Score: 1

    Just assign everyone a number.

    1. Re:Admit Defeat by zill · · Score: 1

      1283400 has a great point and deserves mod points.

  34. Names don't mean anything by Eskarel · · Score: 1

    They're not globally(or even locally) unique nor are they even identifying, so if you're writing a program there's really no point in trying to treat them as such.

    Even in combination with other data you can't guarantee a unique or identifying result, so why bother. The only reason we bother with them in the first place is because people would get offended if you just treated them as a number, which is exactly what every single computer program on earth does and should do. I know that people get all precious about them, but all you can really do is specify a reasonable number of characters in a reasonable character set, in a format which makes sense for the people who will be using the application(the fact that your users will be looking for a last name is more important than whether a specific person has one from the point of view of application flow, and the vast majority of cultures have a naming structure which conforms at least loosely to a first, last, other configuration even if the display order and meaning of the configuration may differ.

  35. Just more nerd bashing! by RomulusNR · · Score: 2, Insightful

    Yes. It's programmer's fault that they write applications that make poor assumptions about names -- not the people who design software requirements who are neither programmers nor usually very worldly.

    Perhaps we should have a list of "assumptions people make about developers"!
    * Developers get to design their own software.
    * Developers get to have some say in how their software is designed.
    * Developers at least can prevent really stupid things from being put in the software they write.
    * Developers aren't smart enough to know that outliers are inevitable.
    * Developers aren't smart enough to know that of course there are people with punctuation, extra words and spaces, even letters that no one has seen before.
    * Developers wouldn't rather code just one column to hold an identifier rather than two.

    --
    Terrorists can attack freedom, but only Congress can destroy it.
  36. "The Man whose Name Wouldn't Fit" by Animats · · Score: 1

    The Man Whose Name Wouldn't Fit, published in 1968.

    1. Re:"The Man whose Name Wouldn't Fit" by Albatrosses · · Score: 1

      Gotta love the stupidity with that URL turning out to be "amazon.com/man-whose-name-wouldnt-author's name/more stuff".

  37. fair assumption by zlel · · Score: 1

    Names are essentially what we are willing to be identified by, which the computing culture has decided that it must be representable in writing (or drawing for that matter). Writing is a prerequisite for computing, but assuming that everybody who is not actually computing has a name writable in some conceivable script or alphabet is setting up arbitrary rules for the game - which isn't so much different that setting up rules for "usernames" used for logins. The bottom line for now is, if your name can't be represented in UTF-8, you need to put it in ASCII. Foreign names are represented with the Japanese alphabet in Japan - by putting together characters that approximate the pronunciation of the foreign name - yet for well known people, not any approximation will do - society agrees what that foreign name should be written as. If a person can potentially have a name written in a language he doesn't know, a system that needs to represent "real" name(s) has to draw the line somewhere.

  38. Whiny bullshit by Anonymous Coward · · Score: 0

    This article sounds to me like the author got pwned in some discussion about database design, so he went off and whined about it. A few of the points are okay, but most of them are just bullshit he pulled out of his ass to try to make the list longer. The truth is, you can safely ignore most of these without trouble. Let the fucking user deal with it. You put a number in your name? Then you deserve the trouble you brought upon yourself. You don't have a last name? Well, unless you've sold a few million of something, you're just a pretentious whiny bitch and I hope you die.

  39. First hand experience. by the_raptor · · Score: 1

    I live in Australia, I deal with government forms all the time. They are almost universally Family name/Given Name/Other Given names. Information on these forms is for the use of the entity collecting it and therefore has to be usable by their staff, which makes having a system that accepts non-latin characters pointless because nearly none of the staff will be able to read them. I can see the use in the system storing Chinese characters for use in mailing but for internal use the system needs a latin character name*. This article is ridiculous as it even criticises the assumption that people will have family names. The author must have missed his history lesson explaining that family names only became popular in Western European culture when governments started tabulating people. In a rural village everyone knows that Jack the butcher is different from Jack the baker.

    That said, many programmers are terrible at designing name fields. I have two middle names (not an exceptional case for Anglo-Saxon culture) and until a few years ago many input systems would not handle this.

    *Even if the system did a conversion to a latin representation of an asian name most people can't pronounce them because they are based on different sound primitives. Which is why Asians tend to adopt westernised versions of their real names.

    --

    ========
    CINC, 4th Penguin Legion
    1. Re:First hand experience. by mpe · · Score: 4, Informative

      The author must have missed his history lesson explaining that family names only became popular in Western European culture when governments started tabulating people. In a rural village everyone knows that Jack the butcher is different from Jack the baker.

      Hence Butcher, Baker, Smith, Brewer, Tanner, Farmer, etc became "family names".

      *Even if the system did a conversion to a latin representation of an asian name most people can't pronounce them because they are based on different sound primitives.

      Such a "translation" can easily be one to many, dependent on various factors.

      Which is why Asians tend to adopt westernised versions of their real names.

      Or they adopt a regular English, German, French, Spanish, etc name to be known by.

    2. Re:First hand experience. by Anonymous Coward · · Score: 0

      *Even if the system did a conversion to a latin representation of an asian name most people can't pronounce them because they are based on different sound primitives. Which is why Asians tend to adopt westernised versions of their real names.

      Having been in a similar situation to the article author, I can tell you that having your name mangled into a foreign language does grate somewhat.

      I'm mixed European, with a Welsh first name, Dutch middle name and English surname, and I lived in Japan for a year. This necessitated getting a Japanese ID card prepared, complete with a name in Japanese. Now, the Japanese have pretty hard rules on what constitutes a valid name, in that you're allowed to pick a combination from about 1900 kanji, both complete phonetic alphabets, but supposedly not latin characters.

      Neither my first or last name go well into katakana (the phonetic alphabet used for foreign words), but can be mangled to at least be recognisable. My middle name is Jeroen, which includes a dutch rolled-R. There's nothing very similar to that in English (it's closer to the Welsh LL), let alone Japanese, and there is no way to represent that in katakana. I refused to give the official a Japanese transcription, and he wouldn't submit my application without all the names from my passport being represented. In the end, he compromised and put in a J.

      So my official name in Japanese has two butchered katakana names with a J in the middle. If they can put in latin characters, why not just put my actual name in?

  40. So names don't need to comply to these? by Dersaidin · · Score: 1
    In that case I could name my first born child the Mandelbrot set. Not "Mandelbrot" the string, not the equation, but actually use the resulting plot as a character.

    Good luck pronouncing it, or writing it accurately.

  41. Poor Mr. Cumming by SpaghettiPattern · · Score: 1

    Poor Mr. Cumming. I bet this is not so much a programming thing as it is the decency censor interfering.

    --

    I hadn't the slightest objection to his spending his time planning massacres for the bourgeoisie... (P.G. Wodehouse)
  42. Most systems don't handle most data correctly by caywen · · Score: 1

    I found the article pointless, actually. Of course people's names are far more complicated than the typical programmer realizes. I know that Chinese names are an issue (I'm Taiwanese, so I have one). But the DB systems I designed recently only need to be as good as what the credit card companies use, and only good enough for a user to login. There's just no point in pursuing someone else's incoherent sense of perfection.

  43. Not just programmers or computers by zill · · Score: 3, Funny

    This issue is pretty much universal. Even outside the binary world people still silly assumption about people's names.

    For example, numerous people have raised objections about my signature. They always give me bullshit complaints like "Sir, that is not legible." or "um... that's not your name." or even "Did you just draw a penis on the dotted line?".

    My signature does not have to be legible.
    My signature does not have to be my name.
    My signature does not have to contain my name.
    My signature does not have to contain any name.
    My signature does not have to be in the English language.
    My signature does not have to be in any human language.
    My signature does not have to consist of meaningful symbols.
    I swear if I hear one more complaint about my signature I will carry around a portable photo printer to render goatse as my signature:

    "Yes, my signature is an 600 ppi out-stretched anus. Deal with it. The law says that any mark that I make is a legally valid signature and you have to recognize it as such. You either sign the mortgage or I'm going to the next bank."

    1. Re:Not just programmers or computers by maxwell+demon · · Score: 1

      This issue is pretty much universal. Even outside the binary world people still silly assumption about people's names.

      For example, numerous people have raised objections about my signature.

      That's not an example, because

      My signature does not have to be my name.

      --
      The Tao of math: The numbers you can count are not the real numbers.
    2. Re:Not just programmers or computers by Anonymous Coward · · Score: 0

      So what is your signature, the picture of a penis or the picture of an out-stretched anus? I'm confused.

    3. Re:Not just programmers or computers by Anonymous Coward · · Score: 0

      This issue is pretty much universal. Even outside the binary world people still silly assumption about people's names.

      For example, numerous people have raised objections about my signature. They always give me bullshit complaints like "Sir, that is not legible." or "um... that's not your name." or even "Did you just draw a penis on the dotted line?".

      My signature does not have to be legible.

      My signature does not have to be my name.

      My signature does not have to contain my name.

      My signature does not have to contain any name.

      My signature does not have to be in the English language.

      My signature does not have to be in any human language.

      My signature does not have to consist of meaningful symbols.

      I swear if I hear one more complaint about my signature I will carry around a portable photo printer to render goatse as my signature:

      "Yes, my signature is an 600 ppi out-stretched anus. Deal with it. The law says that any mark that I make is a legally valid signature and you have to recognize it as such. You either sign the mortgage or I'm going to the next bank."

      So what you're saying is that all I have to do is draw an out-stretched anus, and I can have a mortgage in your name?

    4. Re:Not just programmers or computers by Hatta · · Score: 1

      Are you Kurt Vonnegut?

      --
      Give me Classic Slashdot or give me death!
    5. Re:Not just programmers or computers by zill · · Score: 1

      I try to make it unique every time. That may be the other reason why I'm getting so many complaints.

    6. Re:Not just programmers or computers by zill · · Score: 1

      Signatures are not cryptographic hashes. They can be easily forged. An out-stretched anus is probably harder to forge than a "regular" signature if you incorporate some unique patterns to it digitally.

    7. Re:Not just programmers or computers by careysub · · Score: 1

      ... "Yes, my signature is an 600 ppi out-stretched anus. Deal with it. The law says that any mark that I make is a legally valid signature and you have to recognize it as such. You either sign the mortgage or I'm going to the next bank."

      And they may say - I'm sorry you "signature" must be your AUTOGRAPH to prove that you are who you say you are. That could be anyone's photo printer. We will have to photograph your anus ourselves to use it for identification. Please drop your pants.

      --
      Starships were meant to fly, Hands up and touch the sky - Nicky Minaj
    8. Re:Not just programmers or computers by Quirkz · · Score: 1

      A little off topic, but an interesting point. I knew a guy whose signature was a stick-figure tiger. Don't think people argued with him too much, though.

    9. Re:Not just programmers or computers by zill · · Score: 1

      There's already a legal definition of signature so people can't go around redefining it for fun. As it turns out a signature doesn't have to a an autograph. It can be literally just anything.

  44. what an arrogant jackass by ILuvRamen · · Score: 0

    Not only does this author contradict or duplicate himself many times in that list but he's basically saying everyone needs to write all software to accept all characters in existence. Certain compilers, frameworks, and other technologies that I have to use don't even support that! Plus, I'd have to manually filter all SQL injection style text instead of just shrinking the allowable character set. The fact that he even put "my software will never have to accept korean names" in the list shows how out of touch he really is. I'd phrase my rule more like "I don't care if someone may attempt to use a foreign character set in my program because I'm purposely not allowing it since it's an internal program and we're based in the central US plus all asian languages have english letter counterparts." And he stated that people may have multiple names and aren't settled on just one at any given time. WHAT?! If they're using my software to enter their name, they have to pick one. Call me stubborn like that. I don't know what planet this guy is from but he and his opinions need to stay there because I'm programming back on Earth.

    --
    Google's Super Secret Search Algorithm: SELECT @search_results FROM internet WHERE @search_results = 'good'
  45. Taking time to consider. by w0mprat · · Score: 1

    Good coders shouldn't spend too long on something like picking a name, because it seems to me part of coding well is keeping a ryhthym of only spending just enough time on each element to get it right. Part of coding is not spending long on solving each individual problem, I would suggest that this method of everything being "that'll do" means when coming to a decision such as naming, the default method is to pick something and move on with your day.

    I didn't quite understand this as a student, I got higher marks in programming assignments than people who'd already learnt to code out in the wild. I was a perfectionist and took way to long to get anything done to the point I swore off a career in pure development. My impression is that it's just too difficult to take too much into consideration at a line by line level. It's near impossible to get anything done with a perfectionist attitude.

    I finally understood why so many finished projects seem to be great in concept but realised with a "that'll do" attitude.

    Because otherwise it would never ship in time.

    --
    After logging in slashdot still does not take you back to the page you were on. It's been that way for 20 years.
    1. Re:Taking time to consider. by oddTodd123 · · Score: 1

      You know, for a perfectionist, you sure did a lousy job understanding what the story was about before writing your little rant about how you couldn't cut it as a software developer.

    2. Re:Taking time to consider. by Anonymous Coward · · Score: 0

      Yes but is it really so much asked for someone to roll out this out as a general library that anybody can clone into their code?

      You might not need to be perfect but it sure as hell would be nice if we would collectively STRIVE to do so

  46. My name is...Optimus Prime. by Anonymous Coward · · Score: 0

    Well, actually no, but my real name sounds like a cross between a lawnmower and breaking glass, and would just give you a splitting headache.

  47. Random Guy's Blog...? by HighFalutinCoder · · Score: 1

    Show me some real research on the topic and I'll call it interesting. Did I mis-click and wind up on idle?

  48. I shall change my name by deed poll to by physburn · · Score: 1

    Mr quote semicolon drop table

    1. Re:I shall change my name by deed poll to by maxwell+demon · · Score: 1

      Little Bobby Tables, is that you?

      --
      The Tao of math: The numbers you can count are not the real numbers.
  49. OT: why is "all caps" in all caps? by noidentity · · Score: 1

    People's names are not written in ALL CAPS.

    Often I see people write the above, or refer to macros in programming as MACROS rather than macros, also they always refer to the canel case style as camelCase. It's a curious thing, because I don't see them putting the word "bold" in bold when they refer to bold text, etc. It's just odd because it's like the writer thinks that every mention of all caps must be in all caps, or every use of the word macro must itself be a macro or something.

    1. Re:OT: why is "all caps" in all caps? by Anonymous Coward · · Score: 0

      YEah, and whatsup with typeing all_lowerCase.LETTERS as "all lower case letters"????

    2. Re:OT: why is "all caps" in all caps? by peterpi · · Score: 1

      People write all sorts of tech-related nouns in all caps. "LINUX" suffers from this quite a lot, as does "LUA".

  50. Same goes for gender by Xtravar · · Score: 1

    For electronic medical records, gender is especially important for obvious reasons, but there are people who want it to be male, female, and a variety of mixes. Usually systems only implement "Unknown" as a third gender because it's just an edge case. But there are people out there who get offended by that. Mostly in California.

    --
    Buckle your ROFL belt, we're in for some LOLs.
    1. Re:Same goes for gender by digitig · · Score: 1

      Also there are many people who are born with ambiguous genitalia, and not all are surgically "assigned" to one sex or the other. Sure, they will have a genetic gender, but they might not know it and anyway it's not what the medical records need to show.

      --
      Quidnam Latine loqui modo coepi?
    2. Re:Same goes for gender by Taevin · · Score: 1

      That's another example of some database designer being "clever." The number of databases out there that use a boolean data type for the gender column must surely approach some uncountable number. They do that even when, like in the example you give, there is an obvious need for a third value at the very least. Cue mental giant using NULL as a "meaningful" value...

      Or you have systems like the one I work with where there are so many places where it makes me think someone was actually making an effort to have a proper database but just as the thought was coming together, someone else beat them mercilessly with a retard stick. In this database, gender is amazingly not a boolean and they even got on the right track by making it a character value before making a permanent stop at char(1) retardville.

    3. Re:Same goes for gender by adonoman · · Score: 1

      That's why our DB uses Female, Male, Neither, Other, Indeterminate, Transgendered, Unknown, and FILENOTFOUND.

    4. Re:Same goes for gender by bbtom · · Score: 1

      It's pretty simple. To represent gender, use a string. Have as default values the lower case string "male" and the lower case string "female" and run any input you get through a function to check if, having removed whitespace and lower-cased the input, it matches the string "male" or "female", you normalize it to that. If someone does not wish to specify a gender, store that as nil or as an empty string.

      If someone wishes to specify something other than male or female, allow them free text entry. That is all. Then if you need to find people who are neither male or female, you just query your database for * where gender != "male" && gender != "female".

      Same goes for sexual orientation. Pick some sensible default strings for your culture - straight, gay, bisexual etc. - and then for all the "I'm queer, not gay", "I'm asexual", "I'm a furry otherkin" or whatever - let them put in whatever they want as a short string-like token.

      --
      catch (HumourFailureException e) { e.user.send("You, sir, are a humourless idiot."); }
  51. Re:Sounds like people need to fix their names by PapayaSF · · Score: 1
    --
    Q: What does the "B." in Benoit B. Mandelbrot stand for? A: Benoit B. Mandelbrot
  52. Anonymous coward by Anonymous Coward · · Score: 0

    Most of the posters are missing the point. It is about the customer. Whenever you address a customer, you start with there name. If the first thing out of your mouth (or at the top of the page) is wrong, the customer is no longer thinking about your message, they are thinking about how you don't care about them enough to even get their name correct.

    I have a nonstandard name (unique spelling), and I know, from personal experiance how distracting it is to have my name "corrected". My mother tells a story about how excited I was coming home from school knowing how to write my name. The next day, she had to have a chat with my teacher about not "correcting" the spelling that was on her class list.

    "The most important word in any language to any person is there name"
    dale carnigie

    1. Re:Anonymous coward by Anonymous Coward · · Score: 0

      Whenever you address a customer, you start with there name.

      Here here!

  53. This is why... by Ariston · · Score: 1

    This is why nobody remembers the name of Johann Gambolputty-de-von-Ausfern-schplenden-schlitter-crass-cren-bon-fried-digger-dingle-dangle-dongle-dungle-burstein-von-knacker-thrasher-apple-banger-horowitz-ticolensic-grander-knotty-spelltinkle-grandlich-grumblemeyer-spelter-wasser-kurstlich-himble-eisenbahnwagen-guten-abend-bitte-ein-nürnburger-bratwürstel-gespurten-mitz-weimache-luber-hundsfut-gumeraber-schönendanker-kalbsfleisch-mittleraucher-von-Hautkopft of Ulm.

    --
    --Ariston
    "I'm never wrong--sometimes reality just disagrees with me."
  54. Zaphod??? by flyingfsck · · Score: 1

    Zaphod Beeblebrox III, I presume. Pleased to meet you, my dear sir.

    --
    Excuse me, but please get off my Pennisetum Clandestinum, eh!
  55. Z3r0 C00l by Anonymous Coward · · Score: 0

    I do.

  56. "People have names" by jobst · · Score: 1

    Its number forty ... even ./ knows this isn't the case and and deals with it appropriately and assigns "Anonymous Coward".

    Peter: Whats your name?
    : ""
    Peter: But people have names!
    : No.
    Peter: So how do I call you on a football field?
    : ""

    --
    to code or not to code, that is the question.
  57. There may be an easy workaround by YA_Python_dev · · Score: 1

    When sites suck so much to not accept emails longer than 32 characters or with "-" or "+" in them, they usually also suck to the point of doing these checks only on the client side, which is obviously easy to work around. The server may not validate the input at all!

    See e.g. this (scroll down to "Now the interesting technical part"): United Airlines sucks.

    --
    There's a hidden treasure in Python 3.x: __prepare__()
    1. Re:There may be an easy workaround by mini+me · · Score: 1

      Sounds like my bank. They require that passwords be limited to 0-9. Except the passwords are only validated client-side. Disabling the check allows one to set a more secure password with any character you wish. Given that it is not a limitation of the system, I cannot figure out why they would impose the restriction on anyone, especially in a context where security is essential.

    2. Re:There may be an easy workaround by SlothDead · · Score: 1

      Dealing with all the stupid customers who get confused by the overly complex passwords they'd type in would be more expensive than the occasional cracked 0-9 account.

      Also, it makes it easier to do banking using your phone.

    3. Re:There may be an easy workaround by Bigjeff5 · · Score: 1

      Dealing with all the stupid customers who get confused by the overly complex passwords they'd type in would be more expensive than the occasional cracked 0-9 account.

      That is bullshit, and is exactly why most people can't remember the complex passwords they have. I've experienced this first hand at a military base with some similar ridiculous requirements. You could walk down the hallway, and half the offices had a sticky with their account name and password stuck to the screen, because the password was too anti-mnemonic and changed to often to ever really remember.

      A 9 digit password with two upper case letters, two lower case letters, two numbers, and two special characters is just as hard to remember as a 15 digit password with the same requirements, and far harder to create a unique password for. It seriously cuts down on the number of potential combinations, and makes the password significantly easier to brute force than a system with every possibility available.

      Pass phrases are more secure than anything out there, and are far easier to remember, yet bullshit like this is why they aren't more common. Require at least three words and plain dictionary words become available and provide far more security than nine digits ever could.

      Of the single-word password systems that do not allow dictionary words, short passwords with multiple mandatory requirements are about the least secure there are, and among the hardest to create and remember. Most of these passwords the only way people can keep them straight is to write them down, particularly if they aren't used very frequently (like a bank account password).

      --
      Security is mostly a superstition... Avoiding danger is no safer in the long run than outright exposure. - Helen Keller
    4. Re:There may be an easy workaround by SQLGuru · · Score: 1

      I've found that putting the serial/date element earlier in the string (instead of the end) is enough to confound any of the "significantly different" algorithms.

      So, if your password is: C0mp@ny
      And the next password has to be "significantly different" than the previous one you can use the following (in order):
      1C0mp@ny
      2C0mp@ny
      3C0mp@ny

      And because the "significantly different" is normally based on the encrypted password (since they are usually one-way and compare the hash), these don't trigger the logic and then all you have to remember is which sequence you used. Of course, YMMV. I've also placed the changing value in the middle instead of the ends because I assume someone (besides me) will get wise to this fact and picking out the middle elements from a hashed password seems like it would be tantamount to cracking the encryption -- and if that can be done, it doesn't matter what your password is.

  58. The Belgian Crown Prince by spectrokid · · Score: 1

    is called Filips Leopold Lodewijk Maria, even though "Filips" is now considered old fashioned, and most people use "Filip" instead. Unless of course you are part of the French speaking part of Belgium, then his name is Philippe Léopold Louis Marie (note the accent).
    And don't get me started on his last name.

    --

    10 ?"Hello World" life was simple then

    1. Re:The Belgian Crown Prince by CrashandDie · · Score: 1

      Indeed. Conditional spelling of one's name is extremely confusing at times (I have it too). In dutch there's a total of 3 a's in my first name, only 1 in french, and 2 in english; but both the english version and french version are shorter than the dutch.

      Though luckily, when you've gone through a few years of slashdot, you become cynical enough that you don't care about anyone anymore, and just stick to one spelling. Usually the simplest one.

    2. Re:The Belgian Crown Prince by Anonymous Coward · · Score: 0

      He's a fucking asshat.

  59. I didn't understand by SimonInOz · · Score: 4, Interesting

    I though the article was about the inability of programmer to remember names and recognise people, Maybe I should have read the article.

    It's a real problem though - is it just me? I often know things about people (ah yes, plays squash, good at making cakes, father of that kid who rides a unicycle), but their actual name - no. It's a miracle if I recognise them at all.
    Mind you, it means if anyone says "Hello" to me, I am obliged to be polite to them as I might actually know them quite well, but haven't recognised them yet - and certainly don't know their name.

    It's a right pain. Anybody else suffer from this - and what the heck do they do about it? (I'd like a camera attachment what would whisper in my ear "that's Mrs Jones, her daughter, Kira is in the same class at school as your daughter. Likes chess and is obsessed with kayaking" - something tiny that could clip on my glasses, maybe).

    --
    "Cats like plain crisps"
    1. Re:I didn't understand by delinear · · Score: 2, Insightful

      I'm the same, faces for me just won't stick - the first few weeks going to a new employer is torture for me as I'll be bombarded with names and I just can't remember anyone, and of course the problem is multiplied because you're the new guy so everyone knows your name. I don't know if it's specifically a developer issue. I did read that people with borderline Asperger's find it difficult to recognise faces and a lot of developers I know seem to fit the patterns for that (awkward in social situations or around new people, like to collect things, etc) so maybe there's some correlation that people who fit those patterns are drawn to careers where they can focus on impenetrable logic problems and not have to deal with people too much (I know I'm making some massive generalisations here, and this is purely anecdotal, but it fits my observations).

    2. Re:I didn't understand by ifrag · · Score: 1

      I think my problem is my brain simply refuses to malloc for name strings most of the time. It's like that area of my brain that holds name strings is limited capacity and must only be used when absolutely necessary. I can usually remember some associated information like what their position is, or what project they are working on. I recognize the faces so there is no problem storing that information, but most of the name pointers just point to null. With enough repeated attempts to malloc for the same person my brain will eventually admit defeat and allow for name storage, but the first try to allocate name memory is almost always going to fail.

      --
      Fear is the mind killer.
    3. Re:I didn't understand by MrKaos · · Score: 1

      I'm the same, faces for me just won't stick - the first few weeks going to a new employer is torture for me as I'll be bombarded with names and I just can't remember anyone, and of course the problem is multiplied because you're the new guy so everyone knows your name.

      I had this problem. I fixed it with :-) THE NAME GAME (-: cause I mainly felt embarrassed to talk to interesting people and have to ask their name again.

      First someone tells you their name and you attach a key to their name, like three different Pauls I had to remember Little Paul, medium Paul, big Paul. Gorgeous Georgie, silly stuff. Since it's only for you you can have fun with it and it's never failed me. Shithead Steve, liar Lisa, Charlie cooldood, William wtf, Peter principle, Matt ertransmitter, Debbie dB etc

      Works for making good networks of people to. Hope it works for you.

      --
      My ism, it's full of beliefs.
    4. Re:I didn't understand by geekoid · · Score: 1

      I sued to. I have developed habits to help me remember.

      When I meet someone I try to use there name three times.

      Geekoid, this is Mary
      Nice to meet you mary.
      Try to work it into conversation

      At the end of the conversation I always say 'it was nice to meet you Mary.

      When I talk with them again, I also try to end the conversation with 'it was good to see you again Mary' or some such.
      It has the added bonus of seeming to be personal, and people respond better to that.

      Unless you can't remember your spouses name, or someone or kids or parents. In that case, consult a Dr.

      --
      The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
    5. Re:I didn't understand by Hatta · · Score: 1

      Try asking. People understand when you say "sorry, I'm really bad with names".

      --
      Give me Classic Slashdot or give me death!
    6. Re:I didn't understand by BeardedChimp · · Score: 1

      I also have this problem but I believe it is a genetic trait in my case. My Mum is already pretty bad and fairly regularly forgets my name.
      My Dad is similarly awful and has to ask other staff who someone is when he's dealing with a CEO etc. I think the two of them combined to give me an unholy alliance of crap memory genes.
      Having done research previously I think I have dysnomia as it isn't just names I forget, more generally nouns. I forget words like door, so I would say "could you please open that erm, entrance please" instead.

      I actaully don't tend to find the name thing a problem, I just tend to tell people that I can't remember names including my girlfriend's and family's. After that people don't seem to take offence and instead find it funny.
      People like to play 'games' with me when I'm in a group of friends because if I go round and name people in a circle, the second time round I'll forget different people.
      My girlfriend even found it fairly funny when I called her Mihir in bed, who happened to be my small, male, African house mate.

    7. Re:I didn't understand by Terwin · · Score: 1

      Nah, not a problem at all, I see a name in an email and I know someone of that name works in my company, and I see someone face to face and I know someone who looks like that works at my company. Easy.
      I just have no links between my image storage system and my name storage system.
      Sometimes I can just use my image storage system(such as if I see them with a name tag for several weeks), otherwise I have co-workers that I have worked with for years at my small company(less than 30 employees) and I can only guess at their names because they work in a different department.

      Also, I don't collect things, so in spite of all of your other accurate descriptions, I must not have Aspergers.

      Now if you don't mind, I need to go get a new bookshelf because the one I bought last week is full.

    8. Re:I didn't understand by MagicM · · Score: 1

      I hate it when people use my name when we're having a direct conversation because it invokes a "pay attention" response that is jarring and often unnecessary. But from now on I'll tell myself that it's just their way of memorizing my name. Thank you.

    9. Re:I didn't understand by easterberry · · Score: 1

      Oh man, I once went two years talking to this girl a couple times a week. And not like, small talk. We'd have 30 minute and up conversations.

      So one day, I bring one of my friends to the place we hang out and I have to introduce them. Let me tell you, there is nothing more awkward than trying to explain to one of your friends that you don't know their name (first, last, nick or otherwise) after hanging out for over 2 years because "it never came up."

    10. Re:I didn't understand by Chaset · · Score: 1

      Reducing myself to AOL-level, but Me three! I just smile and nod at everyone who says hello until I've been there a few months and the names start sticking. -- At a recent conference, I picked up a bumper sticker: "I didn't spend 4 years in computer school to talk to people" Not that extreme, but I totally get what you're saying.

      --
      -- "This world is a comedy to those who think, a tragedy to those who feel."
    11. Re:I didn't understand by DCFusor · · Score: 1
      I get faces fine, no luck with names till I've say, worked a week with you in close contact the whole time, and sometimes a month to remember last names (as when I write paychecks). Embarrassing as hell, but simple solution in my case -- my wife has no such problems and remembers names even if nothing else about someone.

      For a little while I was semi-famous as a rock drummer. People in my small town would come up to me all the time, you know, some kid showing off he knows me to his little teeny girlfriend and so on -- made it a pain to go out for groceries or a bolt at the hardware store, frankly. Sure, I was always nice and talked to them -- after all, fans are what made that job worth doing (profitable). But! Never knew their names even once. People tend to forget that when you are on stage, the lights are on you -- you don't even see faces, and of course no names. Just because they obsessed on you doesn't mean it went the other way either. As soon as they were out of earshot and I could then carry on with whatever I was out in public to do, I'd just ask my wife who the heck that was, and she always knew. With a little training, I got her to converse with these people and always use their names right up front, to hide my ignorance (which of course made me look better -- 100% of good politicians and salesmen have this name recognition talent for a reason, it's flattering to people to at least seem to remember them).

      --
      Why guess when you can know? Measure!
    12. Re:I didn't understand by sootman · · Score: 1

      When I read the first line--"Jamie points out this interesting article about how hard it is for programmers to get names right"--I thought that it was going to be about programmers who give their programs crappy names, like "The Gimp."

      --
      Dear Slashdot: next time you want to mess with the site, add a rich-text editor for comments.
    13. Re:I didn't understand by sootman · · Score: 1

      I say that all the time but I think people are starting to figure out that when I do, what I really mean is "I'm sorry, I was staring at your rack while we were being introduced and I didn't hear a thing."

      --
      Dear Slashdot: next time you want to mess with the site, add a rich-text editor for comments.
    14. Re:I didn't understand by SimonInOz · · Score: 1

      >> Try asking.

      This sounds like a great idea. But if the name STILL doesn't stick, you feel like a complete idiot by the third time. Or maybe the fourth. Or the fifth.

      And if you cannot remember if you've asked before ...

      If you say "Sorry, I'm really bad with names", most people will say "me too" and smile brightly. But they have no idea what they are talking about. What THEY mean is sometimes they forget a name. What I mean is I very nearly always forget names. Not the same.

      I sometimes say "you know that bit of brain where people store names - I do geometry in that bit". Perhaps it's true

      --
      "Cats like plain crisps"
    15. Re:I didn't understand by Anonymous Coward · · Score: 0

      I know that people do this as a mnemonic device, and I still find it annoying--especially when it is smarmy salesdroids.

  60. What's in a name? (also, what's in an address?) by cowboy76Spain · · Score: 1

    I really do not understand why it is an issue at all. From my experience, a name is just free text. Even if it has its own rules to be formed, it does not affect the IT. I live and work in Spain, where everyone usually has his/her surname and two family names (one from his/her father and one from his/her mother) and every damn system I see has two fields for family names. Why? While it is understandable to separate surname from the first family name (*), there is absolutely no need to put the two family names separately because they are always processed together. Anyway, that continues to be done, causing unnecessary trouble with people with only one family name (foreigners) or with more than two.

    I see the same issue with address. Lots of times I have to put my address as a multi-field info: Type of way (Street, Avenue, Road, etc.), the name of the way, number of the building, entrance, floor, door). And this info will just be useful to print all together (I do not think many people is doing stats based in if you live in a street or an avenue, on the B door or C door, etc.). Anyway, this is one of the things programmers over-engineer without no real reason (if you really need that information for your system, then the first thing you need is to forbid that the street name is a free text, and populate a list with all possible values).

    And for the rest of the issues, it is just a matter of knowing where your app is gonna be used. If your system needs to work only in USA, UTF-8 is fine. If I happen to go there I can stand if you put my name without accents; if I go to China I can stand that the bureaucrat puts the chinese character that sound like my name in their systems). Otherwise, if you want your systems to work in France or Spain, then you need to make sure they support accents and special characters like ç, ñ. Simple as that.

    * Because some surnames can be typed in a variety of forms, while family names only have one right spelling. That makes sorting from family names more useful.

    --
    Why can't /. have a rich-text editor? Editing your own HTML is so XXth century.
    1. Re:What's in a name? (also, what's in an address?) by Calinous · · Score: 1

      The address is kept as separated parts for reasons like:
            The One Way street was renamed to Airfield Avenue
            We want to send mail to everyone living in the following streets: ...
            As for extra information, there were cases then you had several blocks at the same street number (each block with several doors)

    2. Re:What's in a name? (also, what's in an address?) by cowboy76Spain · · Score: 1

      If you leave the street name as a free text you will always fail to update all your address.

      --
      Why can't /. have a rich-text editor? Editing your own HTML is so XXth century.
    3. Re:What's in a name? (also, what's in an address?) by Mycroft_VIII · · Score: 1

      With addresses you can have a st. a rd. an ave. and a ct. all in the same town. Do you want your mail delivered half way across town and your food all the way across town? Remember in most cases a computer is doing the routing.
      and don't even get me started on people to stupid to put apartment numbers/letters in the box clearly labeled for it and instead treat it has part of the street name or even building number.

      Mycroft

      --
      https://signup.leagueoflegends.com/?ref=4c3ed6600b6ea
  61. Sounds by Samah · · Score: 1

    They forgot to mention that websites need the ability to upload sound files...
    Fry and Laurie: http://www.youtube.com/watch?v=hNoS2BU6bbQ

    --
    Homonyms are fun!
    You're driving your car, but they're riding their bikes there.
  62. It's not just names of people by Whuffo · · Score: 1

    I'm continually amazed by the number of web forms that will not allow enough characters to spell the name of the California city I live in. And then some of them try to match city names - and their entry forms truncate them at varying lengths. Even major corporate websites fail spectacularly - I couldn't track an order at a major retailer because my city didn't match the city on the order. If I knew how many characters they were looking for I could truncate my input appropriately - but what the heck are some coders thinking?

    Here's a handy tip: if you're allowing 16 bytes for the city name you are making a stupid assumption.

    And those of you who insist that phone numbers be entered in a specific format - get a clue, would you? Databases are most useful when the data is clean and regular - but names, addresses, cities, phone numbers, etc. are not always clean or regular. As a coder, you should be doing as much data cleaning as you can when it's being input. If the database is going to be used for many years and many purposes, somebody's going to have to clean that data. If not your code, then some warm body is going to have to go over the data.

  63. German's Federal Minister of Defence: 15 words by Gunstick · · Score: 1

    is called "Karl Theodor Maria Nikolaus Johann Jacob Philipp Franz Joseph Sylvester Freiherr von und zu Guttenberg"
    http://en.wikipedia.org/wiki/Karl-Theodor_zu_Guttenberg

    I tried to post that to the comments of TFA but it does not work.

    --
    Atari rules... ermm... ruled.
    1. Re:German's Federal Minister of Defence: 15 words by Hognoxious · · Score: 1

      Army Kraut. Nine letters, one space.

      --
      Confucius say, "Find worm in apple - bad. Find half a worm - worse."
  64. Japanese websites are the worst at this by Jeeeb · · Score: 1

    I find English websites are generally fairly good at handling names. At least in comparison to Japanese websites, which assume that you have a first name and last name only and often assume that your name can be written in only a few characters. They also have a nasty habit of deciding that certain characters are illegal.

    It's always interesting trying to find ways to fit my 19 character full name into subscription forms on Japanese websites. Perhaps the worst one I encountered required me to use the initial of my first name using a full-width roman character (They banned ASCII characters so I had to use a roman character from the Japanese character set) and my last name written in Katakana.

    1. Re:Japanese websites are the worst at this by sydneyfong · · Score: 1

      I find English websites are generally fairly good at handling names

      I'm no Japanese, but I guess the vast majority of English websites would simply fail if you tried using a Japanese name (with Japanese characters). Worse, they might pretend to succeed and then get the encoding completely botched up.

      --
      Don't quote me on this.
    2. Re:Japanese websites are the worst at this by Jeeeb · · Score: 1

      I'm no Japanese, but I guess the vast majority of English websites would simply fail if you tried using a Japanese name (with Japanese characters). Worse, they might pretend to succeed and then get the encoding completely botched up

      I've never meet a Japanese person who has issue fitting their name in English language forms. They just write it in roman characters.

      The inverse is much harder when faced with arbitrary character limits and characters such as spaces being made illegal characters (particularly hard when you're trying to fit your first and middle name into a middle name field). This means often the only solution is tricks like using initials in roman characters from the wide-latin character set that is used in East Asian language computing.

    3. Re:Japanese websites are the worst at this by sydneyfong · · Score: 1

      I've never meet a Japanese person who has issue fitting their name in English language forms. They just write it in roman characters.

      Getting a bit OT, but my original point is, if you're complaining about having to write your name in Katakana, then it wouldn't be fair to say that Japanese could work around the English site by filling in English forms using Roman characters. I know most of them are used to doing this, but still.

      --
      Don't quote me on this.
    4. Re:Japanese websites are the worst at this by Jeeeb · · Score: 1

      Getting a bit OT, but my original point is, if you're complaining about having to write your name in Katakana, then it wouldn't be fair to say that Japanese could work around the English site by filling in English forms using Roman characters. I know most of them are used to doing this, but still.

      I have no problem with using Katakana. It's when you add character limits and so on on top of that when it becomes a problem.

  65. Can't agree more by DrXym · · Score: 1

    My real name is P'ying O'hanrah'O'hanrahan von Frankfurt so you can imagine the problems I have.

  66. Ahh, Little Bobby Tables by Anonymous Coward · · Score: 0

    obXKCD: http://xkcd.com/327/

  67. Another nice example by bickerdyke · · Score: 1

    http://en.wikipedia.org/wiki/The_Gods_Must_Be_Crazy

    have a look at the cast list.

    IIRC "!" is pronounced as a tounge-clicking sound

    --
    bickerdyke
  68. on insisting everyone know your bikeshed's tint by FuckingNickName · · Score: 3, Insightful

    I was born with a complicated Spanish name.
    One first name.
    Two second names.
    One hyphenated, accented surname from my father.
    One simpler surname from my mother.

    One of the first things I've done since reaching majority is to give a precise, simple, standard name to everyone who asks for it:
      Xxxxx Xxxxxxx
    where X is in A-Z and x is in a-z. Xxxxx is my first name, and Xxxxxxx is a shortened, accent-and-hyphen-free version of my father's surname.

    You know why?

    Because, in life, there are lots of things one must be "unreasonable" about in order to effect progress, but accommodation of one's name is not of them. It's a tedious, selfish expression of nothing more than ego which ultimately will land you in more trouble than others: some day you will be denied access to something thanks to some computer system not being designed to handle your name, and "computer says no" gets priority over the angry demands to the immigration officer of "Joe\0\rBlogg$ 3'); DROP TABLE citizens; -- [insert spinning cube here] Jr."

    If you and your friends/colleagues have some other name by which they call you, sure why not? But, as any cat will tell you, the world is best when you have three names:

    (i) one for communicating formally;
    (ii) one for more intimate discourse (there's no reason why this can't be the same as (i), though many people end up with peculiar nicknames); and
    (iii) one personal identification which you can keep to yourself and you can't express in words.

    If you want the sum of all your history, culture and personality as expressed in (iii) to be embodied in (i), you're both expecting others to be burdened with your ego and bad at understanding human communication. All I asked for was a couple of words I can use in a reasonably uniform way to easily pick/call you out from a small crowd - that's what (i) is for, after all.

    tl;dr The naming of cats is both a delightful poem and an insightful account of the multiple namespaces for kitty/human names and their different purposes. Don't confuse them.

    1. Re:on insisting everyone know your bikeshed's tint by Anonymous Coward · · Score: 0

      How reasonable of you!

      The thing nobody seems to understand is that English only has 26 letters. If you write your name with an accent mark, in English, then you're doing it wrong (technically).

      Websites are pretty multinational these days. But if the system is meant exclusively for English, then ASCII handles it.

  69. My surname ... by OneSmartFellow · · Score: 2, Funny

    .. is Hfuhruhur-Uumellmahaye you insensitive clod !

  70. One more special case. by idiat · · Score: 0

    You can also have names with spaces in them which is what catches my wife out a lot, she's got a double barreled surname only instead of a hyphen she has a space between the two. One last name made up of two separate words.

    There is also the celtic tradition of using your last middle name as a first name, I know quite a few people who do this, one particularly common example in Ireland is for people to have an extra first name of Patrick although that one would never be used outside of passports and other official documentation.

    --
    And remember folks, Gnu's *not* unix.
  71. I don't see why a hyphenated name... by Anonymous Coward · · Score: 0

    would be a problem. It's not such a difficult thing to consider when writing software. I'm just beginning to learn my first programming language (Java) and have already written a name validation program which accepts hyphenated names. It wasn't difficult, even with my extremely limited knowledge of Java.

    For a corporate site to allow such a problem is ridiculous. However, I wonder if the problem is really large enough to warrant a /. article about it. And linking to a blog post that links to the actual relevant article is about as stupid as the problem being discussed. Oh the irony.

  72. Wondered why Spain is about to go bankrupt? by Hognoxious · · Score: 1

    I see the same issue with address. Lots of times I have to put my address as a multi-field info: Type of way (Street, Avenue, Road, etc.), the name of the way, number of the building, entrance, floor, door)

    Different countries have different rules for formatting the address. So if you know which bit is the house number, you can put it before the street (for the UK) or after it (Germany). Knowing which part is the postcode can also be useful. I don't know if credit card validation requires the right parts to be fed in the right way. But if you have teh parts separate, you can recombine them at will. If it's just free format, you can't do that.

    Just because you can't see a reason doesn't mean there isn't one.

    --
    Confucius say, "Find worm in apple - bad. Find half a worm - worse."
    1. Re:Wondered why Spain is about to go bankrupt? by cowboy76Spain · · Score: 1

      Oh yeah, now if my house goes to live in Germany then you could format its address properly! Duh!...

      Write the address in the format in use in the country it is in. In the same language, also (v.g., if you live in "Dumb Street" I won't write that you live in "Calle de la Tontería". KISVS (the V stands for Very).

      --
      Why can't /. have a rich-text editor? Editing your own HTML is so XXth century.
  73. It's a troll by Uzik2 · · Score: 1

    His complaint is that the world won't jump through hoops to conform to him. CLUE BAT WHAP!

    --
    -- Programming with boost is like building a house with lego. It's a cool but I wouldn't want to live in it
    1. Re:It's a troll by Anonymous Coward · · Score: 0

      The problem is that coders ARE jumping through hoops to create all kinds of rules and checking names against them. Most of the time it's perfectly fine to use a single text field, or maybe two fields, which may be left empty.

  74. Get char/byte right first! by sam0737 · · Score: 1

    Let's teach programmers about char != byte and unicode is not always represented in 2 bytes (not even UTF-16)...
    before we talked about date and name right.

  75. And that attitude is the whole problem by Moraelin · · Score: 4, Insightful

    You know, attitudes like yours are IMHO the root of all that's wrong with computers today. And I'm saying that as a programmer, not as Jane Grandma. The whole idiotic OCD idea that you _must_ make up rules about everything, and that your rules are more important than what people are actually trying to do. The idea that if even someone's name doesn't fit "your" database, then you can just brush them off and have a beer.

    Here's some free clue: yes, you can't handle every edge case in the universe, but you'll find it's easier if you don't create such edge cases in the first place. If your database (actually more likely the program in front of it) can't handle last names with more than one capital letter, or with a dash in the middle, or which are more than 32 bytes long (which with UTF-8 might mean less than you'd think), then guess what? _You_ created an artificial edge case that had no reason to be there in the first place. Instead of handling every edge case in the universe, how about not creating them in the first place?

    I find that about 90% of the problems boil down to the above: some idiot put some artificial limits or rules, that really aren't needed anywhere else. Just because he has the delusion that he's some kind of Moses on the mountain and just _has_ to come down with some rules.

    E.g., he just had to define a byte limit, because he's prematurely optimizing a non-problem he doesn't understand. God forbid wasting space in the database by allowing 256 or 2000 byte strings... never mind that if he actually understood that underlying database, he'd know that a VARCHAR is not padded to that max length. If someone just entered "Alex", the same 4 bytes will be actually used in the database, regardless if the field is a defined as maximum 4, 32, 256 or 2000 characters. But nah, he has to put some restrictive number there, 'cause it looks more like he's doing some smart job.

    There is hardly any reason to even use a user name for anything other than display purposes. (You do have a primary key for that record for everything else, right?) As such there is no reason to make any assumptions about it, or enforce any particular format, or anything. There's no reason to even disallow SQL keywords (just effing quote it before using it in SQL) or angular brackets (just quote it before using it in HTML.)

    There is no reason to create any edge cases in the first place.

    And really it's not even just about names. Names are just one case where people make up BS rules just to feel more like they did the great design job. One could make the same case for the gazillion other pointless rules imposed upon the user or his work-flow or data, not because they're actually needed anywhere, but just because some OCD idiot feels like he _must_ impose some rigid structure upon things that really have none and don't need any. But he'd just feel naked without defining that kind of rigid structure, or without imposing upon humans some data structures theory that was intended only for use by programs.

    --
    A polar bear is a cartesian bear after a coordinate transform.
    1. Re:And that attitude is the whole problem by Qzukk · · Score: 1

      Frankly, I got tired of trying to predict this kind of bullshit, and stopped using CHAR(x) in my database years ago. It's all TEXT or VARCHAR() these days. Instead of input limits, I apply output filters that mangle names as needed when communicating with other systems that don't have as liberal of a view of the world.

      --
      If I have been able to see further than others, it is because I bought a pair of binoculars.
    2. Re:And that attitude is the whole problem by LoztInSpace · · Score: 1

      I can't put into words how much I agree with this!
      I am so pissed off with the whole "must not allow angle brackets, hyphens etc because of injection attacks" mentality.
      Any decent software will escape according to the requirements of the output, be it SQL, HTML, XML or whatever the next great thing is. Deal with it properly and you can accept anything you like.
      Trouble is, I run out of energy arguing this point when some one says "Oh, but who has a surname with bracket, colon, javascript....blah. I don't really have an argument against it and then have to explain why Mary OI'Toole can't log in. It's so easy to write software that does not have a problem with this stuff. Unfortunately, most web developers don't understand that there is life beyond HTML and don't get it. The end result is usually both unnecessary restrictions and badly escaped text meaning specific encoding makes it into the database or is overly escaped and fucks up when displaying it.
      Amusingly, Veracode, who proport to check software for this stuff cannot even push out their boilerplate recommendations without refering to "developer's" and the like.
      I don't get it. It's not hard.
      Ironic as well that TFA has  all over the place, presumably because whoever wrote the site can't escape text properly!

    3. Re:And that attitude is the whole problem by Anonymous Coward · · Score: 0

      as a scientist who works in companies that develop laboratory isntruments, typically 5 - 100K retail price, where software is used both to control the instrument (set instrument parameters like temperature, time, etc) and to enter sample data and to look at the output data, I find that arbitrary decisions made by engineers and software people are almost always a problem.
      time after time, i have had to go to the engineer/programmer and say, why did you limit the allowable value for run time to 5 - 100 minutes ? and they will say, well,from what I understand of backscatter resonance raman HPLC/CE spectroscopy with impedance luminescence detection, those are reasonable values.

      And I say, do you actually know anything about backscatter (etc) ?

    4. Re:And that attitude is the whole problem by russotto · · Score: 5, Insightful

      The idea that if even someone's name doesn't fit "your" database, then you can just brush them off and have a beer.

      We can. Fact is, trying to write a system which can deal with all those 40 assumptions and still do anything useful with names is impossible. Even covering most of them is impractical, if you want programmers to do anything else. It has nothing to do with OCD. The programmers aren't making the rules because of some inner desire for order, but because the requirements of the system require they be made.

      Suppose your system is some sort of order-taking system. And one of the things it must do is print your name on a mailing label. How do you handle that if the name doesn't _fit_ on the mailing label? Or if there is no name at all? Or if the mailing label printer doesn't handle the name's character set? Or if the postal service for the countries in question have standards for names which are not met?

    5. Re:And that attitude is the whole problem by geekoid · · Score: 1

      " make up rules about everything"

      really? how to you program with no rules? Please, give me an example of a system with no rules.

      " he just had to define a byte limit"
      So you write programs with infinite byte limits?

      "God forbid wasting space in the database by allowing 256 or 2000 byte strings"

      but those are rules. you don't want rules or limits.

      ", he'd know that a VARCHAR is not padded to that max length."
      it needs to have the space to be ABLE to go to the max length. You must be an hell of a programmer to not know that. Then any program used to put data into the data must also be able to deal with the limit. so those variable will need to reserve that much memory.

      You would be correct on any system with infinite storage, memory and CPU power.

      Other then that, you are a crappy programmer.

      There are very good technical and engineering reasons for limits. Are there stupid limitation on some names? yes. Are those limits sometimes imposed arbitrarily by programmers? yes. Are the usually there for a business rule imposed by someone who isn't the programmers? yes.

        But your reasons listed read likes someone who has about a week of programming experience.

      --
      The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
    6. Re:And that attitude is the whole problem by Anonymous Coward · · Score: 0

      I rigidly categorize your post into rant, which doesn't have a spot in my database. :(

    7. Re:And that attitude is the whole problem by Anonymous Coward · · Score: 0

      In varchars, remember the length of the string also has to be stored... The name Alex will use 5 bytes in an ascii charset if the varchar limit is = 255. Larger varchar limits will add additional bytes. ie, 'Alex' will use 6 bytes if stored in a varchar(300).

      There must, of course, always be a artificial limit when storing a name. No system has infinite storage space. Moreover, malicious users will punish you if you don't set sane limits.

    8. Re:And that attitude is the whole problem by suomynonAyletamitlU · · Score: 1

      Suppose your system is some sort of order-taking system. And one of the things it must do is print your name on a mailing label. How do you handle that if the name doesn't _fit_ on the mailing label? Or if there is no name at all? Or if the mailing label printer doesn't handle the name's character set? Or if the postal service for the countries in question have standards for names which are not met?

      Actually, I was just thinking about a related problem--TFA says not everyone has a family name, but if you work in an office, for example, you may need to specify for the office workers involved, "What do you call this person? Mr/Mrs/Sir/Madam !%@#!@%$ ?"

      The answer is, of course, to delegate that to the person about whom the form is being filled. They've sent themselves mail before, and they know what they want to be called. Let them enter any string they could possibly want in the actual name field, as long as they have a proper ID to back it up, and then have a field below that which has to be readable for the people doing paperwork--possibly only latin characters within a certain character limit, for example, though that's obviously US/Euro-centric.

      In the end, having a backup "This has to be formatted a certain way for logistical reasons" bonus field that only gets used in edge cases and pseudo-edge cases isn't really all that hard; the hard thing is knowing that it needs to be done, under what circumstances, and what sorts of values you might possibly be expecting. An article like TFA helps a little bit, but like you said, what about mailing addresses? Or other strange things?

      Who knows.

    9. Re:And that attitude is the whole problem by Endophage · · Score: 1

      I don't think we're talking about any of your concerns here. We're talking about the real crazies, re chipmunk poop. If somebody came along and told me they had an emoticon as part of their name I'd tell them to go get a life, not modify my system to handle emoticons in the name field. There are always going to be people who create difficulties just because they can and imo they need to grow up and stop being a pain in the ass... "Issues" you're talking about like capitals, unicode and length are really non-issues that any programmer with half a brain cell should be covering.

    10. Re:And that attitude is the whole problem by johnncyber · · Score: 1

      never mind that if he actually understood that underlying database, he'd know that a VARCHAR is not padded to that max length. If someone just entered "Alex", the same 4 bytes will be actually used in the database, regardless if the field is a defined as maximum 4, 32, 256 or 2000 characters.

      That is not entirely true. While you are correct VARCHAR does not pad the string (char does), older versions of MySQL and other RDMS had/have an upper limit of 255 characters. Newer versions do allow for varchar of a larger length, however the storage size of each value increases.

    11. Re:And that attitude is the whole problem by balbus000 · · Score: 1

      True, but a CHAR(32) will perform better when you go to retrieve the information from the database, as long as there are no VARCHAR columns in the table. It won't be significant for small databases, but it is something to think about.

    12. Re:And that attitude is the whole problem by Moraelin · · Score: 1

      Well, granted, if your program has to run on an ancient version of Oracle which lacks VARCHAR2, things could get hairier. I'm pretty sure VARCHAR2 was already there in Oracle 7 (though back then with a maximum of 2000 instead of the 4000 it has nowadays), but no idea in which version it started existing. Was it version 5 or 6?

      Still, I'm not proposing to invent a new storage layer to work around DB limitations. If you're on a pre-5.0.3 MySQL and the max string is 255, ok, then you don't count as setting your own limits in my book. They're set for you by the DB.

      But you have to admit even that doesn't account for some of the limits out there. E.g., most name or email fields are limited to something _far_ below even 255. And then there are checks like forbidding hyphens, or for that matter the apostrophes in Irish names, or enforcing some capitalization rules, or in other fields stuff like one site which didn't let me register as a non-US customer without picking one of the US states as my state. I don't think the varchar limit explains those, really. Basically it still looks to me like some people are just into making up rules, sad to say.

      Also, well, while I see your point about storage size, we're really talking about 1 byte length vs 2 byte length. It's a whole 1 byte per record we're talking about. You'd need 1 billion customers before that adds up to a whole gigabyte of HDD space. I'm thinking someone has to be pretty obsessed to optimize HDD space at that level.

      And again that still doesn't justify some of the restrictions out there. While there is that 1 byte difference above 255, there is no saving _below_ 255. Setting the limit to 32 characters or 16 or some other restrictions out there really won't produce any saving at all.

      --
      A polar bear is a cartesian bear after a coordinate transform.
    13. Re:And that attitude is the whole problem by Moraelin · · Score: 1

      Well, I guess it's hard to guess what was actually meant. You're right that any programmer with half a brain-cell _should_ not even need to think twice to cover capitals, unicode and name length. But there _are_ sites out there where an O'Maley can't register because someone enforced a "no non-alphabetic characters" rule, or where names are restricted to 32 characters. As I was saying, some people just feel they have to give rules.

      Not everyone, granted, but to put it otherwise, you'd be surprised how many programmers without half a brain cell get hired anyway at some companies.

      --
      A polar bear is a cartesian bear after a coordinate transform.
    14. Re:And that attitude is the whole problem by bmckeever · · Score: 1

      You know, attitudes like yours are IMHO the root of all that's wrong with computers today. And I'm saying that as a programmer, not as Jane Grandma. The whole idiotic OCD idea that you _must_ make up rules about everything, and that your rules are more important than what people are actually trying to do. The idea that if even someone's name doesn't fit "your" database, then you can just brush them off and have a beer.

      Your message is more than 140 characters long, and doesn't fit in my database.

      --
      Your favorite .sig sucks
    15. Re:And that attitude is the whole problem by Bigjeff5 · · Score: 1

      Meh, just give them the "Noone" surname. For such a rare surname, it's extremely common on Slashdot.

      --
      Security is mostly a superstition... Avoiding danger is no safer in the long run than outright exposure. - Helen Keller
    16. Re:And that attitude is the whole problem by SheeEttin · · Score: 1

      Then something goes wrong with the name printing, and nobody cares, because the USPS (and likely others) rarely care about the name. They deliver by address.
      (Of course, they may try and use the name if a piece of mail is undeliverable because of a nonexistent address, or carriers will sometimes deliver mail to the address of the person whose name is on it, to compensate for a typo in the street number.)

    17. Re:And that attitude is the whole problem by Golddess · · Score: 1

      If someone just entered "Alex", the same 4 bytes will be actually used in the database, regardless if the field is a defined as maximum 4, 32, 256 or 2000 characters. But nah, he has to put some restrictive number there, 'cause it looks more like he's doing some smart job.

      There once was a time (at least in MS SQL world) where each record was limited to 8000 characters max. So sure, you could set all fields in a table to varchar(8000), but if someone came along with an 8000 character name, well, you're fucked with trying to store anymore info about them in the same table.

      --
      "I'm not sure I like the fugnutish tone you used in your post!" -RogL (608926)-
    18. Re:And that attitude is the whole problem by rgigger · · Score: 1

      Many of the things you say here are true and they bug me as well. But if you read the article then you realize that many things that the author is advocating are FAR more trivial than just having a large size limit or allowing arbitrary character input. The author states that no system in the world does things right and implies that programmers should start dealing with all of those situations.

      The GP used hyperbole to explain the edge case. So we don't really know if that includes "my name is more than 8 characters" or "my name used to be Prince but now it is this symbol can only be stored as an image."

      The key here is context. If an edge case only affects 0.0000001% of your users it's probably not worth worrying about. In all of my professional experience many of the items on the authors list have fallen into that category every time. Any sane person will do at least some sort of cursory cost benefit analysis of each constraint rather than spending a whole year writing code to handle every possible scenario when it buys you almost nothing.

      It just so happens that most people are pretty sane because the author admits that no one in the world is doing what he seems to be advocating. This is not because they are stupid or have OCD. It's because doing the opposite would be a stupid, OCD, perfectionist sort of thing with vastly diminishing returns.

    19. Re:And that attitude is the whole problem by Anonymous Coward · · Score: 0

      Mailing labels are relatively easy-- what about standardized preprinted forms? For the number of people with 50-character names whining about this, I'd say it's a non-issue. I've been working on apps that are still using green screens with limited real-estate for data entry, for some 20+ years, and I've never had anyone ever tell me to make a name field wider, anywhere. More likely, to make it shorter because they want to add another column on the output. I seem to recall at least one case where we truncated the name display on the screen because it wouldn't fit otherwise and the number of characters we did show was enough to distinguish. The first 25 characters of your name aren't enough to distinguish it from something else? Next you'll be complaining that a 128x128 pixel avatar image isn't enough to distinguish you from your twin brother. Cry me a river...

    20. Re:And that attitude is the whole problem by LordArgon · · Score: 1

      How do you handle that if the name doesn't _fit_ on the mailing label?

      Well, you certainly don't restrict your *database* columns because of a *mailing* label. What happens when you get a smaller mailing label? Go back and add new length constraints to your DB? I'd hate to work in that system.

      Moraelin is mostly right. There's no need to create *arbitrary* edge cases. Arbitrarily constraining your data means you won't have a good reason when you encounter something that doesn't fit those constraints. And, more-than-likely, you're going to screw yourself by outlawing a perfectly-valid case that you didn't know you'd need down the road.

      Set something within the perf / storage bounds of your system and then handle the interface to each consumer of your data as logically as you can. Then if your constraints are violated, you at least know you have a good reason not to support that case.

    21. Re:And that attitude is the whole problem by Anonymous Coward · · Score: 0

      "How do you handle that if the name doesn't _fit_ on the mailing label? Or if there is no name at all?"
      That is not your problem or responsibility. It is the user's.
      I send an received un-named address-only mail, so systems that require a mailing name really piss me off.

      "Or if the mailing label printer doesn't handle the name's character set?"
      So detect for that and print it as an image - this is a very simple problem, you call yourself a programmer?

      "Or if the postal service for the countries in question have standards for names which are not met?"
      You actually program in cases for all 200+ countries? No, you don't.
      What if the postal service for the countries in question have standards for names which you completely fuck up by auto-"correcting"?
      Again, if the user lives in a country with obscure standards, it is their responsibility to make sure the mailing name is formatted correctly.

      While we're on the subject of dumb-ass programmers who think they know best:
      As someone who lives in a country with 4 digit zip codes: fuck all fellow programmers who require zip codes to be 5 digits - especially whoever thought it would be a good idea to ask for zip codes at gas pumps in the US and forgot all about tourists.

    22. Re:And that attitude is the whole problem by Anonymous Coward · · Score: 0

      Suppose instead of allowing 32 bytes for a name, you allow , say 1,000,000; couldn't someone sneak in a Trojan horse program or just crash your database with a million such names?

      I recall when Fortran (or was it PL/1?) could only handle 15 right parentheses. It worked okay. Maybe the next release handled 31 (etc. )... [circa 1970]

      just handle the most frequent cases.

      Sometimes limits are good. Not all doors have to accomodate the largest human. Or else all doors would be 8' or 10' tall.

      We are building a working solution that can be improved over time. Don't worry about every detail.

    23. Re:And that attitude is the whole problem by Anonymous Coward · · Score: 0

      just effing quote it before using it in SQL) or angular brackets (just quote it before using it in HTML

      Just call me:
        \" drop database master \"

    24. Re:And that attitude is the whole problem by konohitowa · · Score: 1

      You know, attitudes like yours are IMHO the root of all that's wrong with computers today.

      Thanks for holding back and only giving us the humble version of your opinion. Way to keep that arrogance in check!

    25. Re:And that attitude is the whole problem by konohitowa · · Score: 1

      Now you're just being silly. Requirements? Standards? Reality? I scoff at thee mere mortal.

    26. Re:And that attitude is the whole problem by Anonymous Coward · · Score: 0

      Thank you. Thank you. Thank you....

    27. Re:And that attitude is the whole problem by jd · · Score: 1

      Why the hell would you want to store variable strings in the same table as real data? Very inefficient. You're much better off with one or more dictionary tables for non-well-formed data and run everything off fixed-length keys into that dictionary. You almost never actually WANT the value - even a comparison is just comparing the keys, if the dictionary is guaranteed to only have unique entries in it.

      This sort of token-driven system solves 99% of your problems, improves efficiency (you make more records fixed-length rather than variable-length, and you're doing comparisons on relatively short fixed-length tokens) and enforces better abstraction.

      --
      It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
    28. Re:And that attitude is the whole problem by Golddess · · Score: 1

      Curious, what do you consider a "variable string" that is not also "real data"? Is a last name real data? First name? Address? City? State? Do you have one "Person" table and then one table for a KeyId and list of last names, KeyId and list of first names, etc? If you do, well, sounds a little inefficient, but then I'm not a DBA. Otherwise, my point was simply that there once was a time when you had to apply limits because otherwise if someone happened to come along with an 8000 character name, then you wouldn't be able to store their address.

      --
      "I'm not sure I like the fugnutish tone you used in your post!" -RogL (608926)-
    29. Re:And that attitude is the whole problem by jd · · Score: 1

      None of those are real data from the computer's standpoint. That is merely presentation information. To a computer, the only thing that is "real" is that facet of the data that the computer can directly understand. A value is a value, but to a computer a name is also a value, just a badly-stored one.

      From a computer's standpoint, you have discrete objects, the need to ensure a new object is also discrete, and the ability to select/update/delete the object(s) you want and no others. Also from a computer's standpoint, fixed-size objects are faster to handle than variable-sized objects - no garbage-collection, no fragmentation, you can hash directly to the start of a record, etc.

      If you want to compare two names - say, "Fred Blogg" and "Fred Bloggs", you could go through sequentially until you reach the last byte and see the difference. Or you could load two unsigned integers and compare those. One uses a lot more compute cycles, a lot more I/O and a lot more buffer space. The other is a single opcode on the CPU.

      How would you actually implement this? Well, the simplest method is your basic data dictionary. Two columns - a unique ID and a string object that represents a discrete object. A table of people's names and addresses would then have your key field as normal, a unique ID pointing to the name in a standardized format, a unique ID pointing to the first line of the address, etc. This is how some compilers work - tokenizing by discrete object, storing the object itself in a dictionary and then just using the token thereafter. It's only inefficient when you are doing a lot of I/O with the user, because you've so many tokens to translate. If most of the crunching is internal, it's very very fast.

      A slightly improved version would have an additional table which allows you to specify a binding between tokens. Since a street is unlikely to move very often, comparing a series of tokens for each line - whilst a HELL of a lot better than comparing each character - is still more than you need. So you now have two dictionary tables - one for atomic data and one for composite data. This allows you to improve the granularity on the atomic dictionary some (though you don't want it so fine-grained that the expense of the searches overwhelms the benefits of the tokenizing).

      There MAY be some benefits in splitting the atomic dictionary up by types of data, but probably not a whole lot. If related objects are close in the dictionary, then you can exploit that relationship by performing relatively simple SELECTs - it'll be cheaper to get back more than you want than to get each item individually. You can also reduce seek times, which have always been database killers. The only time you'd really get a benefit is if there are multiple groups of object types between which there is no relationship whatsoever. If you've totally unrelated data, then splitting the data up makes some sense. Entities and their relationships in a database should reflect -something-. If an atomic dictionary tries to pack utterly unrelated things together, then it is not representing a "something" but rather a mashed-up view of several somethings.

      Now, you'll find some of what I've written in any standard DB text. The rest you'll find scattered between programming manuals (especially when considering when to use stack vs. heap), compiler manuals and rule-of-thumb guides on how to squeeze out every last drop from the CPU and data bus.

      --
      It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
  76. Primary Keys by Ukab+the+Great · · Score: 1

    The article could be aptly renamed "40 reasons to use surrogate primary keys, and why natural keys are just asking for trouble."

  77. A Basic english regex for names by MrKaos · · Score: 1
    I was knocking up a script to handle peoples name in a ";" separated file. I put together a regex to handle

    "First Name(s)";"Surname"

    ^\"[A-Za-z '\`-]*\";\"[()A-Za-z '\`-]*\"

    It's only tested against 1000 names, but I guess it's a start.

    --
    My ism, it's full of beliefs.
  78. Straw man by mdwh2 · · Score: 1

    I.e., your response is "39. People whose names break my system are weird outliers"

    And the article says no such thing. Can you point me to where the article said "People have names which are pictures and not text"? Yes, that would be an unreasonable restriction, as you're trying to conflate text with images. But none of the points the article makes come anywhere near close.

    All of the points are reasonable cases. Are you seriously suggesting that the points are anywhere near as unreasonable as your example? Go on, give us an example that the article actually gives, rather than arguing with a straw man.

    1. Re:Straw man by pushf+popf · · Score: 1

      And the article says no such thing. Can you point me to where the article said "People have names which are pictures and not text"? Yes, that would be an unreasonable restriction, as you're trying to conflate text with images. But none of the points the article makes come anywhere near close.

      I've got bad news for you. "Text is images." The only reason you haven't needed to treat it this way is because various character sets and fonts define specific images as numbers.

      All of the points are reasonable cases. Are you seriously suggesting that the points are anywhere near as unreasonable as your example? Go on, give us an example that the article actually gives, rather than arguing with a straw man.

      Sure. 32 - 40 are "problems" with people who either have no name or have a name that changes, making it useless for identification. There are also a smattering of other whiny complaints that more or less mean that a name can not be used to identify a person because it's big or changes over time or contains unpleasant characters that some lazy programmers choose to not handle properly.

      You can feel free to get wrapped around the axle about non-problems. I see an academic job in your future.

    2. Re:Straw man by mdwh2 · · Score: 1

      I've got bad news for you. "Text is images." The only reason you haven't needed to treat it this way is because various character sets and fonts define specific images as numbers.

      What an earth does that have to do with what I said? You are the one who made a point dependent on images being separate to text.

      Sure. 32 - 40 are "problems" with people who either have no name or have a name that changes, making it useless for identification.

      Right, so you agree that 1-31 are valid points. End of story - even if 32-40 were flawed, there are still 31 valid useful points the article makes. The whole point of the article is that you should rely on a fixed name for identification, due to these problems. You do realise that people change their names for all sorts of reasons, e.g., marriage?

      I see an academic job in your future.

      I'm a professional programmer - and it's we who have to deal with these sorts of practical issues. I hope you're not doing a non-academic job, if your response is to tell customers who break your system that they're the ones with the problem.

    3. Re:Straw man by pushf+popf · · Score: 1

      I'm a professional programmer - and it's we who have to deal with these sorts of practical issues. I hope you're not doing a non-academic job, if your response is to tell customers who break your system that they're the ones with the problem.

      My clients are very happy and have been since the early 1970's, which judging from your picture is right around the time you were born.

      They're happy, because together, we work out system requirements that fit their business needs and define their data validation and storage in ways that are useful for all their business processes, past present and whatever part of the future we can predict.

      This does not generally allow for all possible forms of naming, especially those that tend to be poorly defined, move around a lot or aren't made up out of letters from commonly used character sets. And it has not been a problem. However if you would like to make it a problem, please feel free.

  79. My name is \000 by Anonymous Coward · · Score: 0

    These are not the droids you're looking for.

  80. My name is That's not my name by craighansen · · Score: 1

    Who's on first?

  81. Hyphens by orsty3001 · · Score: 1

    I hate it when some websites I sign up for can't handle a hyphen in my last name. Then you try a space which some of them seem to be ok with but others don't like. This isn't that big of a deal for most websites but when you are trying to do something important like order something and your name on your debit card doesn't match what they will allow you to type it. Then it gets annoying.

  82. You do not have to go as far as chinese/japanese by Frederic54 · · Score: 1

    My first name is Frédéric and even in QC, Canada, I cannot enter it on some goverment web site without it being rejected for "invalid character", so for "Revenu Québec" (local IRS) I am "Frederic", and it bugs me, and if I want to correct it I have to fill papers and send proof and whatever.
    Sometimes, websites accept it when I type it, but when it displays it, it is garbled as Frdric, Fridiric, Fr%E9d%E9ric, etc
    I cannot believe we are in 2010 and so much websites (especially from USA) do not accept accentuated characters.

    --
    "Science will win because it works." - Stephen Hawking
  83. Better question is: Why do you care? by Moraelin · · Score: 1

    Who the hell has numbers in there name?

    I know you're probably just joking, but you provide the perfect example of the kind of BS made-up rule I was railing against in another post.

    Instead of asking who'd have a name that breaks your rules, or asking them to fix their name, people should be asking themselves if they actually need that rule at all. Why would you care that the users don't have numbers in names? What other part of the program relies on making that assertion about the input data, and why?

    Any programming language or database imaginable can store any combination of letters, numbers and punctuation in a string. It's a string. Virtually any standard library out there (unless you program in something like Brainfuck or Shakespeare) can deal with the string "4L3X" just as well as it can deal with the string "Alex". It can search for it, sort it, index it, print it, etc. As far as the computer is concerned, a digit is just another byte.

    That's the kind of thing I don't get: why do some people make up rules for their users, that really are just unneeded in the first place?

    (Maybe not _you_ specifically, but you did provide an example of something that does happen.)

    --
    A polar bear is a cartesian bear after a coordinate transform.
    1. Re:Better question is: Why do you care? by Anonymous Coward · · Score: 0

      The other day we had an article about IT administrators and managers who feel that they need to enforce computer usage rules by setting up monitoring software to watch everyone. I can't help but think that there is a connection between that kind of thinking and the idea that you need to enforce proper names for your users.

  84. Aliens... by DarthVain · · Score: 1

    You just wait, when Aliens land, and their representative is named "@hjsju7ub+_^b" as their literal translation to English, and when we rename it to "Bob" in our ET database, and they see that as an assault upon their culture, and wipe us all out with their highly advanced magic weapons, your be sorry.

  85. ridiculous rules by silicondope · · Score: 1

    "# People have exactly N names, for any value of N." Sure, please let me tie your uncountable, Gaussian curve of names to that paycheck. We'll both be happier.

  86. Huh, wrong article by Anonymous Coward · · Score: 0

    Here I thought they were going to give pointers about how to come up with good variable names.

    Thought it would be useful.

  87. Postal Addresses by yakovlev · · Score: 1

    Postal addresses are a whole other problem.

    I used to live across the street from a fairly large apartment complex at:

    12345 Lamplight Village.

    I had a friend who lived there, and I can't count the times she complained that mail would be returned to sender saying it was an invalid address. The worst part is that it was often returned by the LOCAL post office, who should have known better.

    Just because a number (or a name) "looks" fake doesn't mean it is.

    1. Re:Postal Addresses by SQLGuru · · Score: 1

      So, you lived near the police station, then? (ATX!)

  88. Ah yes by Moraelin · · Score: 1

    Really? how to you program with no rules? Please, give me an example of a system with no rules.

    There is a difference between no rules, and less rules, or even "only strictly the absolutely needed assertions." Making a BS dichotomy between excessive rules and no rules is exactly the kind of OCPD I was talking about.

    " he just had to define a byte limit"
    So you write programs with infinite byte limits?

    More BS sophistry, or you genuinely don't know that a VARCHAR already has a 4000 byte limit in Oracle -- and YMMV in other databases -- hence not setting your own means that default, not "infinite"?

    But really, I just see more false dicohotomy between setting your own restrictive limits and "infinite". If you genuinely can't see anything in between, it's your shortcoming, not mine.

    but those are rules. you don't want rules or limits.

    Right. We can also add sarcasm to things that go right over your head.

    it needs to have the space to be ABLE to go to the max length. You must be an hell of a programmer to not know that. Then any program used to put data into the data must also be able to deal with the limit. so those variable will need to reserve that much memory.

    You pick on my programming ability, yet judging by the above paragraph you never heard of dynamic allocation. You don't need to pre-reserve space for the maximum possible name, lemming. If more people realized that, not only we'd have less naming restrictions, we'd also have less buffer overflow exploits.

    And space in the database... are you serious? You're proposing to restrict names just to limit the maximum size of the table? Do you also have a maximum number of customer rows, or what? Because you can run over the maximum size for the database that way too,. You do know tablespaces can be extended, right?

    And, generally, it's a stupid optimization in so many ways it's not even funny. For a start, again, each individual row is _not_ saved with 3996 extra bytes if someone named "Alex" is saved in a VARCHAR(4000) field. The extra space is only used for people who literally have a name that large. Which also means you _don't_ need to pre-allocate space in the database for the case that every single customer will have a name that large. Most still won't. Someone called "Wang" will still be called "Wang". Their name won't change from "Wang" to "Wallawallabingbingdingdong" just because you set your limits higher.

    It's also stupid because you're optimizing a cheap resource. If every single customer did have 4000 byte long names, and you had a hundred million of them in the database, we'd be talking about four hundred gigabytes. It's peanuts nowadays. The space in memory is a matter of how many concurrent requests you expect. Assuming actually 1000 requests in-flight at the same time on the same machine (which means tens to hundreds of thousands logged into that machine), if every one of them had 4000 byte names, we'd be talking about a whole 4MB of RAM taken up with names. Whop-de-effing-do.

    And again, both are for an unrealistic worst case scenario: for most users you _don't_ actually save even a byte by defining it as VARCHAR(32), because their name doesn't grow or shrink with the size of that field.

    You would be correct on any system with infinite storage, memory and CPU power.

    I was mostly correct so far on systems with a realistic amount of all three. It's funny what you can do when you actually measure and calculate, instead of applying knee-jerk premature optimizations.

    Other then that, you are a crappy programmer.

    There are very good technical and engineering reasons for limits. Are there stupid limitation on some names? yes. Are those limits sometimes imposed arbitrarily by programmers? yes. Are the usually there for a business rule imposed by someone who isn't the pro

    --
    A polar bear is a cartesian bear after a coordinate transform.
  89. Re:You do not have to go as far as chinese/japanes by geekoid · · Score: 1

    You are not that important or interesting. Let it go and move on.

    --
    The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
  90. There are limits by Anonymous Coward · · Score: 0

    How about if my name consists of the planet Jupiter. How do you fit that in the database.

    Numerals are just that. They oughtn't to be used as components in names (use what you like in email addresses).

    You can moan and whine that your name is a streak of dog turd located on a street in Boise Idaho but the rest of us don't have to accept that. For names expressed in English, my database is good enough.

    Got an exclamation mark in your name or you've upper cased the fifth letter? Get stuffed.

  91. A Note for Web Application Programmers by humphrm · · Score: 1

    Please, for the love of G-d, would you get it through your thick, Mountain Dew-addled brains that any key I can type on my keyboard is a valid character in a legal name. For instance, someone who's name is hyphenated - the hyphen is part of the name, you frigging morons.

    Yes, I have issues. ;-)

    --
    -- "In order to have power, I must be taken seriously." -Mojo Jojo
  92. Summary: Don't make assumptions by Locke2005 · · Score: 1

    In Russian, you (sometimes?) take your husband's last name, but with a different ending (instead of using Mr. and Mrs.) This led to my Russian coworker having 2 ID cards with different last names -- an Americanized one with her husband's last name, and another one with her last name as used in Russia.

    --
    I've abandoned my search for truth; now I'm just looking for some useful delusions.
  93. Assumptions by nlawalker · · Score: 1

    I understand the original gripe about not allowing punctuation in a name. However, the author of the linked article goes overboard.

    Everything relies on assumptions, especially computer systems. The blog author assumed I could read English and had a browser that could navigate hyperlinks (to anyone who wants to respond "well you didn't have to read it" - well, no one has to do anything. The author of the original linked article didn't have to use the system that wouldn't take his name.). It's only an assumption that my telephone has digits 0-9 on the keypad. It's only an assumption that my computer monitor is rectangular. It's only an assumption that I live on Earth.

    Any design of any system relies on assumptions. We try to pick good ones. "Names don't have any punctuation" is probably a bad assumption, but "People have names" is probably OK. Of course, it's all a value judgement, but if you find that "people have names" to be too broad of an assumption, have fun designing the rest of your system. I hope the author hasn't designed any systems that assume I use base-10 numerals, or that I don't use script that flows from the bottom to the top of a page on oddly-numbered dates.

  94. Not am issue by alexo · · Score: 1

    John Graham-Cumming wrote an article today complaining about how a computer system he was working with described his last name as having invalid characters. It of course does not, because anything someone tells you is their name is--by definition--an appropriate identifier for them. John was understandably vexed about this situation, and he has every right to be, because names are central to our identities, virtually by definition.

    The article writer confuses two different concepts: names as identities and names as identifiers.

    The software shouldn't care if your "real" name is {love symbol #2}, it only needs to deal with a limited subset of monikers. Government sites/databases should accept anything that you can legally put on a passport, driver's licence, etc. of that country and nothing more. Financial ones should accept names and addresses that can appear on your bank account or credit card. Private sites can set their own rules.

    I suggest that, at the minimum, accept whatever is required to accurately deliver send a letter to persons in all countries you need to support. I would expect that most post offices can deal with names/addresses rendered in Latin alphabets; if not - either don't support the corresponding countries or be more flexible.

  95. Whine whine whine bitch bitch bitch by pugugly · · Score: 1

    A large number of those assumptions seem, to me, to fall under 'Acceptable Breaks from Reality'. Yes, they're going to be wrong some percentage of the time, but trying to design for the use-case in which you're dealing with that small percentage case will result in a design that has traded have functionality that is useful an arbitrarily small percentage of the time for a system that is utterly unwieldy the rest of the time.

    I mean, pointing out that a unicode point may not represent a letter in a name? Seriously? Listen, if your name requires an SVG code snippet to represent it, you get called "The artist formerly known as Prince". Live with it Jackass.

    Pug

    --
    An Invisible Entity of Vast Power whose existence must be taken on faith alone: Liberal Media
  96. Did somebody say Johnson? by reboot246 · · Score: 1

    Ahh, ya doesn't has to call me Johnson! You can call me Ray, or you can call me Jay, or you can call me Johnny or you can call me Sonny, or you can call me RayJay, or you can call me RJ... but ya doesn't hafta call me Johnson!

    Raymond J. Johnson Jr.
    (Bill Saluga)

  97. Probably would't be a problem. by Estanislao+Mart�nez · · Score: 1

    Good thing that Pablo Picasso never had to get a Californian driver's license.

    However, under the normal format for names in Spanish, the guy would be the perfectly manageable Pablo Ruiz Picasso, where Pablo is the first name and Ruiz Picasso are the last names. California issues drivers' licenses for names like these correctly--I know it because I have one.

  98. Parsing Names is even harder by ZombieBraintrust · · Score: 1

    I know someone at a big firm and they have been in parsing hell for the last two months. They have legacy data for mailing addresses. A 6 by 30 character blob of text. From this they are trying to parse out First Name, Middle Name, Last Name, suffix, prefix, organization names, street address, city, state, zip, attention, in care of, and post office box.

  99. Because we don't need it? by dajalas · · Score: 1

    I've been a coder since junior high school. (8th grade) The inability to remember names and the ability to recognize people appear separate. I can recognize most faces just fine. But remembering names is tough.

    My guess is that social things aren't seen as important enough by geeks to spend effort on until they need the abilities.

    If a coder went into technical sales, I expect their ability to remember names would, by necessity get a lot better over time.

  100. You're kidding, right? by dajalas · · Score: 1

    Patriotism is no more bigotry than is favoring one sports team over another.

    There are, of course, obvious exceptions to this. But I want avoid invoking Godwin's Rule.

    1. Re:You're kidding, right? by Hatta · · Score: 1

      Favoring one sports team over another is a form of bigotry so narrow in scope that it's negative effects are negligible. But fundamentally it's the same. Patriotism is much more dangerous.

      --
      Give me Classic Slashdot or give me death!
    2. Re:You're kidding, right? by dajalas · · Score: 1

      I look forward to examples to help understand your thinking behind this. Successfully making the general case seems quite difficult.

      I note that Americans generally see patriotism as more benign that do Europeans. The history of the 20th century helps explain this.

  101. Do as the Romans do by Pseudonymus+Bosch · · Score: 1

    Ancient (rich) Romans had slaves called "nomenclators" who would accompany their masters whispering the names of the people they met.

    --
    __
    Men with no respect for life must never be allowed to control the ultimate instruments of death.
    GW Bu
  102. Not all words have tones by Anonymous Coward · · Score: 0

    Some grammatical words, like "de" and "ma" don't exactly have a tone, unless you consider the neutral tone to be a tone. But if you do, then it's still the case that not every word in pinyin Chinese would have a number, because the neutral tone (5 in Mandarin) is usually not written.

  103. Obligatory Monty Python by rleibman · · Score: 1

    I can't believe nobody's mentioned Johann Gambolputty de von Ausfern- schplenden- schlitter- crasscrenbon- fried- digger- dingle- dangle- dongle- dungle- burstein- von- knacker- thrasher- apple- banger- horowitz- ticolensic- grander- knotty- spelltinkle- grandlich- grumblemeyer- spelterwasser- kurstlich- himbleeisen- bahnwagen- gutenabend- bitte- ein- nürnburger- bratwustle- gerspurten- mitz- weimache- luber- hundsfut- gumberaber- shönedanker- kalbsfleisch- mittler- aucher von Hautkopft of Ulm

  104. What a coincidence! by valkenar · · Score: 2, Funny

    So is mine!

  105. PhpGedView by ggpauly · · Score: 1

    >>>I have never seen a computer system which handles names properly and doubt one exists, anywhere

    The open source genealogy software PhpGedView (http://www.phpgedview.net/) handles names in an extremely flexible manner. It explicitly supports 30 languages, including several asian languages, and has certainly been used with names from many more cultures (because of immigrant ancestors). Multiple names, aliases, married names, no known name - I have never seen or heard of a situation it cannot handle.

    --
    Verbum caro factum est
  106. Gah! Scots-Irish, indeed! by zooblethorpe · · Score: 1

    My name is McMac, you insensitive clod!

    Cheers,

    --
    "What in the name of Fats Waller is that?"
    "A four-foot prune."
  107. Paper Forms by Anonymous Coward · · Score: 0

    These assumptions aren't unique to computer programs. I've filled out many paper forms for work, school, etc... that expected you to have a first name, last name, and middle initial. Sometimes they would give you little boxes for the letters, thus limiting you to a finite number of characters.

    Clearly, they need to add a check box that says "I don't have a name."

  108. Pointless by parse.here · · Score: 1

    He's somehow defending a belligerent flavor of multiculturalism with this interminable rant about namespaces. That's what they are: simply placeholders. They are the key to the hash-table. Although through time and experience we value the key, it still remains the key to a respective value. It serves a lingual and communicative function.

    I am not intolerant of different cultures, just of over-doing it.

  109. Alien Names by WED+Fan · · Score: 1

    Completely unrelated, but somehow related.

    What Every Sci Fi author thinks make an alien name:

    The combo of a consonant(s) followed by an apostrophe then a string of consonants and more apostrophes makes a good alien name: Kj'Arp'Tan.

    Aliens can't have names that when pronounced sound like Bob, Ted, Alice.

    Many aliens are like Cher and have only one name: Spock

    Warrior races naturally name themselves like Vikings: K'arg Son of B'Jark (K'arg B'jarksonn).

    Warrior races will use the Spanish naming P'llk'nrm'stk y S'nchz y G'tr'ez y M'rls y J'hnsn

    You could never pronounce my family name, your adams apple would explode, says the alien with oddly humanoid outfitted vocal arrangement.

    --
    Politics is the art of looking for trouble, finding it everywhere, diagnosing it incorrectly and applying the wrong fix.
  110. Problems are more with users... by Anonymous Coward · · Score: 0

    I worked in registration in an emergency room in college, and I'd frequently have to spend my trying to find people in our systems because a lot of these "special cases" couldn't figure out which name they wanted to go by. Annita Marilyn Anderson-Michaels (made up) would tell me she was everything from Ann Michaels to Mary Anderson depending on her mood, then she'd get divorced, not tell us, and get angry when our system didn't have her. People would have completely different names on their drivers licenses than we had on file. Keep in mind, this was in an emergency room so people tended to not be at their most rational. As a grunt, I spent inordinate amounts of my time resolving these issues. Forget the Chinese breaking ASCII, if people here just decided how to write their name and understood our database isn't magical I would've gotten a lot more work done.

  111. In Soviet Afghanistan... by beanluc · · Score: 1

    ...one named YOU!

    --
    Say it right: "Nuc-le-ah Powah".
    1. Re:In Soviet Afghanistan... by nbauman · · Score: 1

      I am Radek.

  112. Really Hard Problems by Buzz_Litebeer · · Score: 1

    The biggest thing about names, which does not seem to be addressed, is that naming in systems (at least for America) means a LOT because of the need to remove duplications from data sets.

    This is a very hard problem, and in the end everything the author states is correct to a point, but he misses a larger point.

    It does not matter, in the end, what naming scheme is being used, as long as you treat the parsing of the name in some non destructive generic way that is consistent.

    It doesn't matter that MacLean and McLean, or Mac Lean are all structurally different, all that matters is that when you are doing your matching they all are all managed in the same way.

    IE Bob Mac Lean, parses similarly to Robert Mac Lean, which parses similarly to Robert McLean.

    And you can play games with how you are doing the matching by simply creating them as full names as well, and doing other aspects of data hygiene/harmonization (whatever nomenclature you want to use).

    Here is a caveat that is implied. Names only matter in matching, names that do not matter in matching are for display purposes and you just use some common method of capture. The Name Prefix, First Name, Middle Name, Last Name, Suffix format is always correct for this if your system has to be multi national. Multi national users of your site are going to be used to putting some consistent-ish thing in these fields, so dont worry about it.

    --
    If you don't vote, you don't matter, so don't waste your time telling me your opinion
  113. Srsly... what did you expect? by jvonk · · Score: 1

    Considering how many entry forms still don't allow '+' in an e-mail address (or, worse, allow it in the sign-up box but not in the unsubscribe box)

    Ah... I am fond of GMail aliases. Naturally, many times I have encountered the unsubscribe issue you describe; however, I have often emerged victorious (victory!)

    You will often find that the unsubscribe links (or a GET-based form submission) will result in a URL. Workaday morons often forget to consistently URL-encode their input. They build an URL encoded query string, but they take your email address as a string literal. Bizarre.

    So, your "foo+bar@gmail.com" will often be misinterpreted as "foo bar@gmail.com", thus confusing their system (space character encodes as +). Fortunately, you can take the high path with these drooling simps by simply adding the missing URL encoding feature for them: '+' encodes as %2B.

    Change the salient part of the URL to be "foo%2Bbar" and you should expect a warm feeling of superiority to wash over you. Oh, yeah, and it will finally kill that fucking spam those morons keep sending you.

    HTH. Don't forget to send nastygrams to their customer service departments about their moronic devs.

  114. Fascninating or Neurotic by Anonymous Coward · · Score: 0

    This guy seems obsessed about names with regard to them being entered into various systems. I think the author is a kook or cuc or KoOk or cooc.

  115. "Run, Forrest... RUN!" by Anonymous Coward · · Score: 0

    Your Master "goes down for the count"@ the URL below:

    http://it.slashdot.org/comments.pl?sid=1687452&cid=32609448

    Your Master ran, rather than disproving the points he was challenged to do so against ...

    I.E.-> When he is asked to disprove points in rebuttals others send back his way? Well, take a peek over there, and see Your Master "run, Forrest, RUN..." (lol, TOO easy!).

    APK

    P.S.=> Well - I'll be waiting over there for you to show up & disprove all of the points I put YOUR WAY, and Kalriath's as well, here -> http://it.slashdot.org/comments.pl?sid=1687452&cid=32589278 ... Good LUCK (you'll NEED it)... apk