Slashdot Mirror


AI Can Scour Code To Find Accidentally Public Passwords (qz.com)

An anonymous reader shares a report: Researchers at software infrastructure firm Pivotal have taught AI to locate this accidentally public sensitive information in a surprising way: By looking at the code as if it were a picture. Since modern artificial intelligence is arguably better than humans at identifying minute differences in images, telling the difference between a password and normal code for a computer is just like recognizing a dog from a cat. The best way to check whether private passwords or sensitive information has been left public today is to use hand-coded rules called "regular expressions." These rules tell a computer to find any string of characters that meets specific criteria, like length and included characters.

47 comments

  1. Just Look for Camel case plus numbers and symbols by goombah99 · · Score: 2

    There's realtively few instances where mixed capitals, symbols and numbers are valid syntax. yes there are, but few. sounds like we just made it easy to spot thepassword.

    --
    Some drink at the fountain of knowledge. Others just gargle.
  2. Regex == AI? by Anonymous Coward · · Score: 0

    The best way to check whether private passwords or sensitive information has been left public today is to use hand-coded rules called "regular expressions." These rules tell a computer to find any string of characters that meets specific criteria, like length and included characters.

    So now "m/frist psot/gi" counts as AI? How much more diluted can we make the term?

  3. Let's use a regex! by Anonymous Coward · · Score: 0

    Some people, when confronted with a problem, think "I know, I'll use regular expressions." Now they have two problems.

  4. Rules called regular expressions by Anonymous Coward · · Score: 0

    How many people here don't know what regex is? And how is regex considered AI now?
    Slashdot is desperately trying to make AI posts right now it seems...

    1. Re:Rules called regular expressions by Anonymous Coward · · Score: 0

      "regular expressions."

      yes, tell me more about these newfangled "regular expressions"

    2. Re:Rules called regular expressions by Anonymous Coward · · Score: 0

      It's all the silly faces the characters make in a cartoon called "Regular Show".

  5. I do this sometimes by houghi · · Score: 2

    I do this from time to time myself. I just do the following:
    # grep -r Pa55W0rd $HOME
    Note the space before the grep. That way it does not end up in .bash_history and cause an issue there.
    I have found some from time to time.

    I am the only person on my PC, but security is a mentality.

    --
    Don't fight for your country, if your country does not fight for you.
    1. Re: I do this sometimes by nghate · · Score: 1

      This wouldnâ(TM)t scale very well if you started using unique passwords per site as you should....

    2. Re:I do this sometimes by Junta · · Score: 1

      Actually, not all bash setups are configured to ignore things that start with spaces.

      $ echo $HISTCONTROL
      ignoredups

      ignorespaces has confused so many people that I think a lot of distributions have stopped putting that in HISTCONTROL.

      --
      XML is like violence. If it doesn't solve the problem, use more.
    3. Re: I do this sometimes by Anonymous Coward · · Score: 0

      Sure it does. Pa55W0rd1, Pa55W0rd2, Pa55W0rd3, Pa55W0rd4, ... that grep finds them all.

    4. Re:I do this sometimes by Anonymous Coward · · Score: 0

      If I'm really needing to have password type material on the command line, I set HISTFILE=/dev/null and then don't need to worry about the problem until the end of that session.

    5. Re: I do this sometimes by Guybrush_T · · Score: 1

      A funny answer to a troll. Love it.

    6. Re:I do this sometimes by Junta · · Score: 2

      Another option is to use the read command to store it in an environment variable, never having it on the CLI in the first place. This lets history still show you everything you did, but without the password and such in it.

      --
      XML is like violence. If it doesn't solve the problem, use more.
  6. I think they are saying AI is better than regex by raymorris · · Score: 1

    I think what they are trying to say is this:

    Researchers have a new way using AI.
    In currently deployed, publicly available systems, the best way is regex. The new AI way may be better.

    While regex is a reasonably good tool for the task, I don't know that it's the BEST way currently used. A small, simple routine built specifically for the task may be better because regex takes characters in order. It's difficult (and slow) to build a really good regex for this because you mostly don't care what order they are in. You care that you have groups of upper case, lower case, numbers, and certain punctuation. Regex is good for finding this OR that, but not so good at this AND that AND that, in any order.

    1. Re:I think they are saying AI is better than regex by Anonymous Coward · · Score: 0

      I think what they are trying to say is this:

      Researchers have a new way using AI.
      In currently deployed, publicly available systems, the best way is regex. The new AI way may be better.

      Re-reading TFS several more times.. I think you might be right. Coulda been written better though. Such as the way you wrote it, that made it instantly clear that they were not calling regex AI.

  7. You know where they should be looking? by bobstreo · · Score: 2

    Google Search. site:Domain and the word password.

    You'd be dismayed at how stupid some people are. Or maybe just not surprised.

  8. Third problem by goombah99 · · Score: 2

    Some people, when confronted with a problem, think "I know, I'll use regular expressions." Now they have two problems.

    and if it becomes self-aware regex then they have three problems, two of which don't matter anymore

    --
    Some drink at the fountain of knowledge. Others just gargle.
    1. Re:Third problem by Zontar+The+Mindless · · Score: 1

      Regexes are not self-aware.

      XSLT is self-aware.

      --
      Il n'y a pas de Planet B.
  9. When all you've got is a hammer.. by mccalli · · Score: 1

    Sounds like another "let's force everything into TensorFlow's image prediction system" thing.

    1. Re:When all you've got is a hammer.. by rkordmaa · · Score: 1

      Well sorta, but in their defense AI type systems seem to be very versatile as far as hammers go and we certainly haven't found every application they can do. Chucking AI at any random problem might not get you the best of all possible solutions, but in many cases it can give you a solution nobody has tried before, well worth the experiment.

  10. Can it scour the front page of Slashdot... by Anonymous Coward · · Score: 1

    ...to find clickbait articles about AI?

    1. Re: Can it scour the front page of Slashdot... by Anonymous Coward · · Score: 0

      I want to know who this person I keep hearing about named AI is and where they live, so I can kick their ass.

  11. Regular Expressions by jbwiebe · · Score: 3, Insightful

    Have we really reached the point on a 'News for Nerds' site where we need to explain the term 'regular expression'?

    1. Re:Regular Expressions by religionofpeas · · Score: 1

      Yeah, it's not even a special expression, just plain old regular.

    2. Re:Regular Expressions by Anonymous Coward · · Score: 0

      My diet -- rich in fruits, vegetables, and whole grains -- assures regular expressions.

    3. Re:Regular Expressions by Anonymous Coward · · Score: 0

      It’s even worse than that. According to the summary, regular expressions are now “AI”.

    4. Re:Regular Expressions by mrwireless · · Score: 1

      And even on Slashdot algorithms are "AI" now..

  12. Easy? by frank_adrian314159 · · Score: 2

    telling the difference between a password and normal code for a computer is just like recognizing a dog from a cat.

    Well, unless the code is PERL - then it looks like a password that has been spread over however many lines.

    --
    That is all.
  13. Re:Just Look for Camel case plus numbers and symbo by Anonymous Coward · · Score: 0

    That only works for hardcoded complex passwords, which I think most are not. Looking for password-like named variables and checking what they're set to would likely catch more. Or find the primary encryption/decryption function call and trace all calls to it. I read the short article. They created an numeric encoding of a string's metadata then they trained a NN on that data. That's all they did.

    Both their NN and their regular expression approaches will miss entire classes of passwords. They will also miss all hard coded, generated passwords (ex: "admin".concat("123")). Number only passwords. Simple passwords. Passwords which look like variable names. Empty passwords. Normal text passwords. Etc... Seems like they wanted to toss machine learning at a problem rather than thinking about the most effective way to solve it.

    Well, thinking about it for another minute. Tracing calls to specific functions is difficult to do on projects you haven't downloaded. Their system is for data mining all published code rather than checking specific projects.

  14. rot13 by petes_PoV · · Score: 1

    best way to check whether private passwords or sensitive information

    Easily defeated

    --
    politicians are like babies' nappies: they should both be changed regularly and for the same reasons
  15. FTFY by 3vi1 · · Score: 1

    > ...is to use hand-coded rules called "regular expressions."

    You mean: ...is to use hand-coded rules called "^regular expressions\.$"

  16. Re:Just Look for Camel case plus numbers and symbo by ichimunki · · Score: 2

    Good luck avoiding those "relatively few instances" in a Perl script.

    --
    I do not have a signature
  17. You just made my point I think by goombah99 · · Score: 1

    every language is parsable. Parsing rule sets can be written out in YACC. the rules tend to be incredibly simple and simply use recursion for deep nested cases. As a result it's not a terribly hard task to decide if a small fragment could be expanded to legal code or if it's not legal code.

    Perls use of sigils actually is actually there to improve both to simplify parsing as well as to make it human readable. Yeah yeah... human readable jokes about perl. Ha Ha. But really you can look at perl and tell what catergory a variable is from the sigils-- it's actually giving you information. And as result constrains the parse.

    --
    Some drink at the fountain of knowledge. Others just gargle.
    1. Re:You just made my point I think by angel'o'sphere · · Score: 1

      But really you can look at perl and tell what catergory a variable is from the sigils--
      Actually you cant. For obscure reasons the sigils sometimes change when accessing arrays or hashes.

      --
      Cost free eBook I read (by iBook/Kobo/Amazon/ObookO/Gutenberg etc.): "The Green Odyssey" by Philip Jose Farmer.
    2. Re: You just made my point I think by Anonymous Coward · · Score: 0

      You're right, it's not deterministic, the Perl interpreter just guesses.

  18. No the AI is probably writing the regexs by goombah99 · · Score: 1

    If I were doing it I'd have the AI discriminate what regexes will extract passwords most efficiently.

    --
    Some drink at the fountain of knowledge. Others just gargle.
  19. Re:Just Look for Camel case plus numbers and symbo by Oswald+McWeany · · Score: 1

    There's realtively few instances where mixed capitals, symbols and numbers are valid syntax. yes there are, but few. sounds like we just made it easy to spot thepassword.

    When you're talking about b00B$ but don't want to hit on any keyword censors at work?

    --
    "That's the way to do it" - Punch
  20. Re:Just Look for Camel case plus numbers and symbo by UnknownSoldier · · Score: 1

    /sarcasm Ah, cool, another alternative to for:

    // HACK, co-worker was a

    // B(.)(.)B

  21. Any better source? by Anonymous Coward · · Score: 0

    I was about to forward this on to some of my team members that do a lot of code analysis and are always looking to ways to get better, but then the follow-up article happens to be about where to go and get high on 4/20, so nope.

  22. I air gap any computer with important information by Anonymous Coward · · Score: 0

    Internet facing computers boot from cdrom and have no storage capability.

  23. Re:No solution required... by Anonymous Coward · · Score: 0

    "accidentally public sensitive" Chris you Mongoloid.

  24. Re:No solution required... by Anonymous Coward · · Score: 0

    SWOOOOOSH!

  25. Re: Just Look for Camel case plus numbers and symb by Anonymous Coward · · Score: 0

    So can my notepad ++. And probably some grep afters linux is done molesting children.

    No AI needed. Retards only believe this.

  26. Re:No solution required... by Anonymous Coward · · Score: 0

    There you are shit posting with yet another fake account, you revenue stream hogging disgusting fat sexist tube of lard, Christopher Dale Reimer!

    You can be sure I will be watching this fake account too. I know this is you because you told me you were working on your freepass 11 file server and you are so dumb that you can't even masquerade yourself properly.

    Now, I told you I was out of meds last week and you didn't even care to contact me you lazy fucker.

    How many times do I have to express the emergency of the situation??????

    The python click script you wrote for my pheromone revenue stream web site suddenly stopped to work!!!!!!

    You fucking incompetent python script writer!!!

    When it works, I get 4000+ clicks a day on my pheromone revenue stream web site but only 5 or 6 without it!!!!

    Now, it seems like you dont care and that you have abandoned me you heartless fucking pig!

    Bonus:
    Here is a story that creimer told me when convincing me what a hard life he had:

    The tree was him and the tree knot was his butt hole!

    So, his uncle packed his fat ass with lard and with his cock! Not that it makes much of a difference but anyway, there it is!

    Signed:
    Ethell, The girl that used to love you and now hates you, burn in hell where you belong you sexist pig!

  27. AI - regular expressions ? by Anonymous Coward · · Score: 0

    Did I get it wrong. Did someone put AI and regular expressions in the same sentence ?
    Is everything under the sun AI now ?

  28. Re:Just Look for Camel case plus numbers and symbo by Anonymous Coward · · Score: 0

    I like how someone comes up with a couple of algorithms that rely on each other to produce an accurate result and it suddenly is called "AI" and then they charge a couple million for their salary.