Perl Features of the Future - Part 1

← Back to Stories (view on slashdot.org)

Perl Features of the Future - Part 1

Posted by chrisd on Wednesday February 12, 2003 @07:33PM from the perl-is-a-nice-little-language dept.

Kevin writes "This story highlights some of the features being included in Perl 6. "There will be substantial changes in the move from Perl 5 to Perl 6. We've been hamstrung for a while by the need to maintain backward compatibility all the way back to Perl 1. There are some things we want to remove, because they seemed like good ideas when they were introduced but they're more trouble than [they're worth] now."

61 comments

Min score:

Reason:

Sort:

Goodbye "my", hello UTF-8? by GCP · 2003-02-12 20:55 · Score: 1

Every time I look at Perl and see all local variables having to be declared as "my $foo" and "my $bar" I think, that must be pretty embarrassing for the language designer.

In serious programming, most variables are local in scope, while few or none are global. Perl variables default to global, so you end up having to fight the default with every variable you create: "my $computer", "my $documents", "my $images", "my $ms_bob", (sorry)....

I hope Perl 6 will turn its back on past booboos and make the things that clearly ought to be default the default, even at the cost of backward compatibility.

And if they really have guts, I hope they'll do what Wayne Gretzky always advocated and "skate to where the puck will be" by making "use UTF-8" the default from now on, and "use bytes" the exception (legacy mode). Windows, Mac, and the major Linux distributions are all converting to Unicode pretty fast now.

--
"Those who have never entered upon scientific pursuits know not a tithe of the poetry by which they are surrounded."
1. Re:Goodbye "my", hello UTF-8? by Anonymous Coward · 2003-02-12 21:05 · Score: 1, Interesting
  
  If not for "my", what else would you use to declare variables? Nearly all structured languages require you to declare your variables, and it's just good programming practice.
  
  Maybe "my" is an poor choice of a word to use (same with "use vars", "our", or "local"), but automagically creating variables is asking for trouble -- really unreadable trouble too.
  
  Also, perl 5.8 (and I'm pretty sure earlier versions) have full UTF-8 support.
2. Re:Goodbye "my", hello UTF-8? by orthogonal · 2003-02-12 21:55 · Score: 0, Flamebait
  
  If not for "my", what else would you use to declare variables?
  
  By prefacing the variable name wiith the type:
  int foo ;
  
  Oh.
  
  Oh, wait, it's perl. No types!
  
  "int foo;" plus some line noise probably implements DeCSS. Or does an rm -f. Who knows? Hey, it's perl. It's not like you can read it.
  
  --
  Opinions on the Twiddler2 hand-held keyboard?
3. Re:Goodbye "my", hello UTF-8? by Naikrovek · 2003-02-12 23:04 · Score: 4, Interesting
  
  Hey, it's perl. It's not like you can read it.
  
  I'm so f**king tired of hearing how perl is hard to read.
  
  ITS NOT HARD TO READ UNLESS YOU MAKE IT HARD TO READ!!! and this is true for ANY and EVERY language out there. I can read perl all day long without problems, as long as it wasn't meant to be hard to read, but if you put a C program in front of me it might as well be some made up language that doesn't work, i woudln't be able to tell the difference.
  
  I've come to the conclusion that those of you that say perl is hard to read either a) don't have a single solitary clue about perl at all, b) are trying to stir people up, or c) are trying to convince everyone else that your favorite language is "better" for each and every circumstance, which isn't true of any language at all, not even perl.
  
  Hey, its Slashdot. I'm voting for all three.
4. Re:Goodbye "my", hello UTF-8? by duffbeer703 · 2003-02-13 01:54 · Score: 2, Interesting
  
  The whole idea of perl is that you can write quick scripts or real applications without learing a new language without having a deep understanding of programming.
  
  The only problem is when people who write quick scripts decide to try and make real programs, and the result is unreadable spaghetti. The same thing used to happen when sysadmin "gurus" strung together unreadable combinations of shell, sed, and awk...
  
  If you to do "serious" programming, "use strict;" is your friend. Or waste time with C or Java.
  
  --
  Conformity is the jailer of freedom and enemy of growth. -JFK
5. Re:Goodbye "my", hello UTF-8? by Kailden · 2003-02-13 03:46 · Score: 4, Insightful
  
  I agree.
  
  I think every computer scientist worth thier pay should realize that language advocacy stinks. Every language has its niche--a reason for why some guy sat back and said, "I need to write a shortcut language to do this" or "wouldn't it be great if I could have a better correlation between the way I think/design and the language I write in" and it evolves from there. Perl was designed as a glue language heavily modeled after awk/sed and other unix tools and the concept of following natural language and "having more than one way to do/say it" so you need to have a good feel for the language pieces as you do when you become a master of english and understand different connotations and methods of stating something.
  
  That being said, it is unfortunate that because there are large groups of people who either A) get religious about the language they choose or B) choose not to learn other languages to a point of knowing thier true niche we take every language and bloat it out and take it out of its scope. And in turn that makes it that much harder to grasp each new languages niche because you have to sort through a bunch of crap that trys to make every language the universal language.
  
  Well, it's human nature I guess. Easy to point out as a problem but not easy to fix...but remember that the next time you are about to tell your coworker that they should "write that in _____" instead of answering thier question. Or, be careful when you complain that "____ is bloated or is too hard to understand" because you are just adding fuel to the fire my friend. It is better by far to state why you chose a certain language on a certain project than to be a universal advocate of "_____".
  
  Advocacy is a clear mark of inexperience.
  
  --
  I need a TiVo for my car. Pause live traffic now.
6. Re:Goodbye "my", hello UTF-8? by josephgrossberg · 2003-02-13 06:53 · Score: 3, Insightful
  
  I meet none of a, b and c. Here's my opinion:
  Python is easier to read than the equivalent Perl code, even if the latter well-indented. Now before you close your mozilla tab, or mark this as "Flamebait", please hear me out.
  Yeah, I agree that bad code is bad code, regardless of the lang.
  But compare these two equivalent statements of good code:
  pythonNumber = 1
  my $perlNumber = 1;
  Do the "$", the "my" and the ";" look necessary, or extraneous and confusing? They are the latter to me.
  What about $_ and @_ ? Those don't seem very sensible. Nor does "<>", the backtick "`" or several other common Perl paradigms such as the fact that it makes a big difference whether you have single quotes or double quotes around a string.
  Yes, I agree that some of Perl's "hard-to-read" reputation is deserved, but not all of it.
  Why do reasonable people think Perl is hard to read? Because it has lots of unneeded, non-alphanumeric characters and there are lots of conventions that don't make sense (e.g. '<>' meaning a line of standard input).
  
  --
  
  Joe
  http://www.joegrossberg.com
7. Re:Goodbye "my", hello UTF-8? by bill_mcgonigle · 2003-02-13 08:13 · Score: 1
  
  The whole idea of perl is that you can write quick scripts or real applications without learing a new language without having a deep understanding of programming.
  
  It's true, just be careful. I happen to know c and assembly and (name a bunch of other archaic langauages) pretty well, and use perl frequently and effectively. However, I've seen people who first learned perl use perl, and, wow, they can do some ugly stuff. Sometimes they don't know what an O(n^2) algorithm means, so they use them - alot.
  
  As usual, perl gives you plenty of rope to hang yourself (not that I know of a language that stops bad algorithms).
  
  --
  My God, it's Full of Source!
  OUTSIDE_IP=$(dig +short my.ip @outsideip.net)
8. Re:Goodbye "my", hello UTF-8? by Dave2+Wickham · 2003-02-13 09:48 · Score: 1
  
  use strict;
  I always use strict, because it stops me writing extremely sloppy code without noticing. It forces you to properly initiate all variables. Does it really take too long to type two or three letters extra to declare your variables?
9. Re:Goodbye "my", hello UTF-8? by Anonymous Coward · 2003-02-13 10:19 · Score: 0
  
  Fine. It's really loosely typed. But why don't people also complain about JavaScript and PHP? What, is "var $foo" any more descriptive than "my $foo"?
  
  Why does Perl always get such a hard time, while PHP can do no wrong? Especially in this particular case.
10. Re:Goodbye "my", hello UTF-8? by Anonymous Coward · 2003-02-13 12:24 · Score: 0
  
  While I agree entirely with what you're saying, I think there should be a distinction made between "aesthetically pleasing" and "hard to read".
  
  IMO, "hard to read" means "hard to understand the semantics". And some Perl one-liners are definitely semantically nightmarish.
  
  However, structured, thought-out, and commented code should be easy to read regardless of language. For example, x86 assembler is really quite (aesthetically) ugly, however once it's commented and has decent labels for everything, it's not so bad.
  
  So yes, while superfluous characters make for "ugly" ASCII messes, quality code will be understandable (i.e. readable) to an experienced programmer in that language.
  
  Chinese characters aren't very aesthetically pleasing (IMO, compared to Japanese), but does that make the poetry any less eloquent?
11. Re:Goodbye "my", hello UTF-8? by Anonymous Coward · 2003-02-14 09:46 · Score: 0
  
  excepting of course C/C++, which is the best language for every circumstance. See, the C++ is faster, has a smaller footprint, support arbitrarily high level contstructs as well as low level idioms.
  
  The only way in which it's not better is a bunch of whiners saying it's too hard.... to easy to make mistakes... iow, they complain they don't know what they are doing and would like us to please tolerate their massive, inneficient, slow, bloated programs because they can't handle a language that *gasp* lets you program computers the way computers work... with things like pointers and *evil* memory!!!
  
  - C++ Troll
12. Re:Goodbye "my", hello UTF-8? by TheLink · 2003-02-15 02:32 · Score: 1
  
  Well maybe Perl is harder to read than Python. That doesn't make it a hard to read language though.
  
  Anyway isn't it hard to read tabs vs spaces in someone else's Python code? Distinguishing different white space characters from each other can sometimes be a pain (telnet, vi?).
  
  I find Perl easier to read than Java.
  
  Compare the source code:
  http://developer.java.sun.com/developer/qow /archiv e/184/index.jsp
  http://use.perl.org/article.pl?si d=02/09/16/144824 6
  
  With Perl I can stare at a few lines of text till it makes sense (and that's usually fast if the programmer hasn't purposely obfuscated stuff).
  
  With Java you often have to scroll up and down for a program that does the same thing (and then if you're familiar with other languages you'd ask: why so many lines to do this?).
  
  The scrolling is not that big a problem if the program works. But if you're looking for a bug in someone else's program having to scroll a lot makes it harder.
  --
  
  Too many replies beneath your current threshold
13. Re:Goodbye "my", hello UTF-8? by Sharkeys-Day · 2003-02-15 13:21 · Score: 2, Insightful
  
  my $perlNumber = 1;
  
  "my" makes it a local variable. It's only necessary if you want that.
  
  "$" is used to denote a scalar variable. Technically, it is extraneous. (It's part of perl's shell heritage.) On the other hand, some programmers intentionally add several extraneous characters to every variable (hungarian notation), so it must be a taste thing.
  
  ";" is necessary in languages which don't consider whitespace to be significant. Some people consider significant white space to be annoying and/or dangerous.
  
  What about $_ and @_ ? ... "<>"
  
  "$_" and "@_" are pronouns. They usually disappear in the same way "self" does. (Actually, in perl6, "self" is spelled "$_".) Perl, imitating natural language, has more pronouns than most computer languages.
  
  "<>" is an idiom. In natural languages, commonly used idioms tend to get shorter. "<>" is an idiom for "read the next line from the file(s) named on the command line, or from standard input if there are no files on the command line." How much code would it take you to write that in python?
  
  Perl is refreshingly concise and expressive.
14. Re:Goodbye "my", hello UTF-8? by Ed+Avis · 2003-02-16 23:01 · Score: 1
  
  [unnecessary syntax in 'my $num = 1;']
  
  OK, perhaps the semicolon is unnecessary. The language could use end of line as a statement terminator. Concede that one to Python.
  
  The dollar sign, well it does have its uses, consider string interpolation:
  
  print "The number is $num.\n";
  
  which is not nearly as concise in Python, especially for interpolating a large number of variables into a string. The $ also means that Perl functions can be called without parentheses, as in the example above. In Python, every function call must have () around its arguments, and you could argue this is 'unnecessary' or 'extraneous'. (There is some special case sugar for 'print' in Python but I don't think you can use it for your own functions.) In Perl you have some extra syntax for variables but it allows you to use less syntax for function calls and string interpolation: swings and roundabouts.
  
  The 'my' part is not just extra syntax, it is semantic. It declares the scope of the variable. One of the biggest problems with Python IMHO is the impossibility of specifying the scope of a local variable - I don't know whether this has been fixed in the latest Python releases.
  
  Partly agree about the other stuff, but I think that having two different quote characters mean different things is sensible. You wouldn't argue that C is a broken language because it makes a big difference whether you have () or {} around bits of code.
  
  --
  -- Ed Avis ed@membled.com
15. Re:Goodbye "my", hello UTF-8? by Ed+Avis · 2003-02-16 23:09 · Score: 1
  
  Er, so how would you propose declaring local variables? What keyword would you use?
  
  Or do you think that variables should not need any declaration? If so, how does the language decide what the scope of the variable should be?
  
  --
  -- Ed Avis ed@membled.com
16. Re:Goodbye "my", hello UTF-8? by WWWWolf · 2003-02-16 23:57 · Score: 1
  
  pythonNumber = 1
  my $perlNumber = 1;
  Do the "$", the "my" and the ";" look necessary, or extraneous and confusing? They are the latter to me.
  
  The counter-question: exactly what is the python variable you mentioned? What is its scope? What kind of information it contains? foo = 1 tells it's a variable assignment somewhere in the program. my $foo = 1 says it's a new, local and scalar variable.
  This isn't a black-and-white issue, you know.
17. Re:Goodbye "my", hello UTF-8? by danpbrowning · 2003-02-19 05:19 · Score: 1
  
  Those "$" aren't ugly, I've decided, after lots of thought. They allow you to do things that you could never dream of in Java or other $-deficient languages:
  $html = qq{ <B>$name</B> <TABLE><TR><TD> $address <A HREF="$homepage">$Anchor</A> </TD></TR></TABLE> };
  Others...
  html = "<B>" + name + "</B>"; html += "<TABLE><TR><TD>"; html += $address; html += "<A HREF=\"" + $homepage + ">" + $Anchor + "</A>"; html += "</TABLE></TR></TD>";
  I'm telling you... the "$" is a feature, not a bug . I know that you can use XML libraries to generate intermediate displays that aren't so ugly, but perl doesn't force you to do that.
  
  --
  Daniel
18. Re:Goodbye "my", hello UTF-8? by Anonymous Coward · 2003-02-20 15:15 · Score: 0
  
  You're confusing two different things. Perl's use of meta-characters for variable reference is NOT related to string interpolation. Simply because the syntax (sometimes) looks the same (in Perl) doesn't mean they are the same thing.
19. Re:Goodbye "my", hello UTF-8? by Strigiform · 2003-02-27 03:20 · Score: 1
  
  Hey, it's perl. It's not like you can read it.
  To quote Larry Wall: "You can write assembly in any language."
  Remember, the Obfuscated Perl competition took its' inspiration from the Obfuscated C competition. If you want to talk really obfuscated you should consider INTERCAL! ;-)
Code parsing by flonker · 2003-02-12 21:02 · Score: 1

"They've grown to encompass full recursive descent parsing, so you can use the same syntax to parse source code or complex file formats as you would use to pull apart fields from a string."

Does this mean that I will be able to parse stuff out like HTML tags, and nested parenthesis?

Or even catching VBScript strings, with the "" inside a string representing a single ", so I'll be ale to parse out something like
"""this is a ""test"""""
which currently is incredibly annoying to parse, especially if all you want to do is catch the comments at the end of the line.
1. Re:Code parsing by Anonymous Coward · 2003-02-12 21:15 · Score: 1, Interesting
  
  If existing software can parse HTML or VB with its lexical parser, what makes you think any other parser would be unable to do the same?
  
  Although strictly speaking from a computer science perspective, matching parenthesis cannot be done with regular expressions (finite automata). But you can do it in perl with backreferences.
2. Re:Code parsing by Anonymous Coward · 2003-02-13 05:14 · Score: 0
  
  : Does this mean that I will be able to parse stuff out like HTML tags, and nested parenthesis?
  
  Yes, in perl 6, you can write a complete HTML parser with just regular expression. This is well covered in one of the Apocalypses and the corresponding Genesis. Search for them on www.perl.org in you're interested.
3. Re:Code parsing by Anonymous Coward · 2003-02-13 05:16 · Score: 0
  
  Just use a real programming language.
4. Re:Code parsing by Anonymous Coward · 2003-02-13 05:31 · Score: 0
  
  I wrote a little piece of code for you that I can't post here because of the lameness filter. To extract the comment from a VB line, check out this file
5. Re:Code parsing by babbage · 2003-02-13 05:55 · Score: 2, Informative
  
  Yes. In fact, you can do this sort of thing using Damian Conway's Parse::RecDescent module, which lets you parse a linear input (like say any file or string) into a tree structure on which you can do arbitrary manipulation. -- including HTML/XML parsing, handling balanced parens, or yes your VBScript quoting example.
  Parse::RecDescent is available for the 5.x series (I think it's part of the core download for 5.8, and was optional for 5.6, but am not 100% sure about that), but it seems like it's going to become a more core component when 6 arrives. If you want to be able to do this stuff, look into it -- you don't have to wait for Perl6 to start using this. It's available now, and it's *great*. :)
  
  --
  DO NOT LEAVE IT IS NOT REAL
balanced parens: NO YOU CAN'T by Anonymous Coward · 2003-02-12 21:22 · Score: 0

here
Can I use Perl regular expressions to match balanced text? toc

Although Perl regular expressions are more powerful than "mathematical" regular expressions, because they feature conveniences like backreferences (\1 and its ilk), they still aren't powerful enough -- with the possible exception of bizarre and experimental features in the development-track releases of Perl. You still need to use non-regex techniques to parse balanced text, such as the text enclosed between matching parentheses or braces, for example.

An elaborate subroutine (for 7-bit ASCII only) to pull out balanced and possibly nested single chars, like ` and ', { and }, or ( and ) can be found in http://www.perl.com/CPAN/authors/id/TOMC/scripts/p ull_quotes.gz .

The C::Scan module from CPAN contains such subs for internal usage, but they are undocumented.
1. Re:balanced parens: NO YOU CAN'T by m_ilya · 2003-02-12 21:51 · Score: 2, Informative
  
  This faq entry is a bit outdated. There is experimental extended regexp syntax which allows to match balanced expressions. There is even an example in perlre (look for documentation on (??{ code }) syntax construct).
  Also there exist CPAN module Text::Balanced which does balanced expressions matching.
  
  --
  --
  Ilya Martynov (http://martynov.org/)
$my $computer by Lord+Sauron · 2003-02-12 21:22 · Score: 1

> "my $computer", "my $documents",

Out of curiosity, are you a MS programmer ?
Perl 6 is a mistake by keesh · 2003-02-12 21:52 · Score: 2, Flamebait

I've been using perl pretty much constantly since the Pink Camel, and believe me, Perl 5 is an extremely good language for quick scripting things. That's what it was designed for. Sure, you can do big projects in it, but it's not exactly ideal. Recently I've started using Ruby as well, and I intend to move my department over to it instead of wasting time with Perl 6.
One of the goals of Perl 6 is to make non-trivial projects possible. That's good. The way it's being done is bad. Perl was once a lightweight, extremely flexible language. Now it's become a huge ugly monster. People wanted OO, so a nasty hack was bolted on top to allow some semblance of it. Now this nasty hack is being expanded. Sure, the code's different, but the basic form is the same. Kludge upon kludge upon kludge; I'd much rather have a nice, clean, pure language (and not one with loads of irritating whitespace thank you very much).
The same goes for the syntax. All the switching between $, @ and % is really irritating (ask a newbie how to get at the length of the keys array of a hash inside a hash, for example), and the changes proposed for 6 are just making this worse -- it seems that Larry, in his infinite wisdom, wants to prefix every data type with a different hard-to-type character. Perl was only designed for the three data types, and adding more is a mess.
Perl 6 is a complete rewrite, but it keeps all the mess which has accumulated over the previous versions. This is not good. Sure, my const int $var = 27; may look neat (in the same way that, say, Pascal does), but $var isn't entirely constant, or entirely an integer, it's just a hack which makes it sort of behave like one. The whole thing is an exercise in pseudo-computer science masturbation with little real purpose except to please the managers who dislike the one thing that makes Perl special.
On a similar note is regexes. I'm an avid fan of regular expressions simply because a nondeterministic finite automata is far more flexible than linear code. However, Larry must have been smoking that cheap $2 crack when he wrote this . Does he want Perl 6 to be flex or something?
I won't be going on to use 6. It's a nice idea, but it's completely unnecessary. It won't make large projects any easier to manage (the language is still, at heart, an almighty hack -- an impressive one, but still a hack). It won't make OO any cleaner. It won't make development any faster. To put it bluntly, Perl scripts will still look less beautiful than our friend Mr Goatse. I'd prefer to use a language which has always been pure synthesis of science and engineering, not some half-baked imposter.
Perl 6 will be nice, but I'm guessing it will be the end of Perl. It can't do what it wants to do whilst still being based upon a nasty mess. There are now other options, which provide all of Perl's power and none of the mess. Sorry, but BSD^W Perl is dying. Larry is buggering it up the ass without lubricants, just like Shoeboy is doing to Larry's daughter.
1. Re:Perl 6 is a mistake by orthogonal · 2003-02-12 22:05 · Score: 2, Insightful
  
  [snip: long and thoughtful critique of perl and its future.]
  
  Wow! He managed to make an allusion to "BSD is dying" and a legitimate use of the Goatse man!
  
  I would have skipped the gratuitous and tasteless slam at Larry Wall's daughter he ended with, though.
  
  --
  Opinions on the Twiddler2 hand-held keyboard?
2. Re:Perl 6 is a mistake by Captain+Pedantic · 2003-02-13 00:01 · Score: 2, Funny
  
  You are obviously not old school enough to recognise the Shoeboy stalking Heidi Wall reference.
  
  While I'm here, what the hell does "nondeterministic finite automata" mean, and is Larry's $2 crack the same stuff that moderators are often on?
  
  All in all, a top class post, worthy of being modded way beyond the +3 it is currently on.
  
  --
  
  None are more hopelessly enslaved than those who falsely believe they are free. Johann Wolfgang von Goethe.
3. Re:Perl 6 is a mistake by duffbeer703 · 2003-02-13 02:01 · Score: 2, Insightful
  
  I'm afraid you're right about Perl 6... breaking lots of compatability features with no real goal just doesn't strike me as a good idea.
  
  --
  Conformity is the jailer of freedom and enemy of growth. -JFK
4. Re:Perl 6 is a mistake by Anonymous Coward · 2003-02-13 02:32 · Score: 0
  
  Outstanding Post.
  Please, take a bow.
5. Re:Perl 6 is a mistake by msouth · 2003-02-13 02:42 · Score: 1
  
  I know, I know, IHBT, IHL, HAND. I have wondered (some of) the same things when reading the Apocalypsen. I can't help but wonder a couple of other things, though.
  
  One, I'll bet a lot of people said a lot of the same stuff about Perl 5, no? I know that people still despise the OO stuff, but, hey, some people will hate any OO implementation until it's C++, when what they should really be doing is hating it until it's Smalltalk. But I wouldn't be surprised if at the Perl 4->5 transition people were complaining that Perl was losing what it was good for/at, and I think it survived that pretty well. Past performance blah blah blah, and maybe this transition is completely different (well, ok, it is pretty unarguably completely different in a lot of ways), but I think the burden of proof is on the detractors.
  
  Two, I learned pattern matching with Perl. In time I learned to use the pattern matching in things like grep and vi, and only then did I learn the extreme usefulness of Larry's inspiration to make the special chars require escaping to be non-special, rather than the other way around.
  
  Given that he has one huge win in pattern-matching reform, I think I'll give him the benefit of the doubt on what he's talking about doing with the new pattern matching stuff.
  
  But the most important thing is that the design goal stays the same--"how can I make this language easier to get things done with?". I don't care enough to dig into all the flamewar on Perl 6, but I really don't think it's going to be The End Of Perl as people so often predict. It may be The End Of Perl As We Know It, but as long as it's still Larry asking the same "how can I make easy things easy and hard things possible" question, I still have faith that I will Feel Fine.
  
  --
  Liberty uber alles.
6. Re:Perl 6 is a mistake by Zapman · 2003-02-13 02:55 · Score: 4, Interesting
  
  NFA's (and Deterministic FA's. They're mathmatically the same thing) are the math backend that define regular expressions.
  
  They're state machines. They're in a given state, and they know how to go to the adjacent states. So given the string 'abc', if you're currently looking at the 'b' (having already seen the 'a'), you know that you'll have a valid match iff the next character is a 'c'. If it's not, you have no match. if you have 'ab[cd]', and are looking at the 'b', you know you have a match if the next char is a 'c' or a 'd', you've got a match. 'c' and 'd' then are the 2 next valid states.
  
  The nifty thing (and the limiting thing) is that true RE's require no memory. Just the knowledge of what state they're currently in. For this reason, no true RE can be written to see if a given string is a palendrome (you can write a RE to match a specific palendrome, but not an arbitrary one).
  
  The difference between a NFA and DFA is that NFA's allow 'null transitions'. This basically says that there are more than 1 state that you can leap out of when you see the next character, because you can go to these special adjacent states without seeing a character, and then leap out. There's also a proof out there that any NFA can be written as a DFA.
  
  All of that said, Perl's extended RE's are not true DFA's. They have some features that can not fit into the DFA model. This is one of Larry's reasons for wanting to make Perl's RE's into true CFG's (context free gramers).
  
  This model is much more powerful than RE's, but at a greater cost, since you have to have memory too. The mathmatical definition of a CFG is a state machine that drags around a stack of memory. The state machine may at arbitrary times push data onto the stack, and later pop it off. It must be done in order though (to match the math model. If you add a second stack, you have the definition of a 'turing machine' (aka the computers on our desk)).
  
  A CFG can be written to match arbitrary palendrome's for example (just push each letter onto the stack, and when you hit the middle, start poping off, and matching each letter. Yes, this is over simplified. The true algorithm is left as an exercise for the reader)
  
  --
  Zapman
7. Re:Perl 6 is a mistake by Anonymous Coward · 2003-02-13 05:14 · Score: 0
  
  > The nifty thing (and the limiting thing) is that true RE's require no memory.
  
  Not true, because you have to store the state space and (if you want good performance) a vector of currently active states. If you use DFAs you can get rid of the vector in favor of a single pointer/index, but your DFA might be exponential in the size of the NFA!
  
  Mostly your post is almost correct, but shows signs that you don't really understand what you're talking about. By the way, "palindrome" and "mathematics" are the correct way to spell those words.
8. Re:Perl 6 is a mistake by josephgrossberg · 2003-02-13 06:50 · Score: 1
  
  If "annoying" whitespace is the worst complaint you can level against Python:
  
  * I'll take that as a compliment
  * Your Perl or Ruby code probably could use some tidying up before someone else tries to read it
  
  --
  
  Joe
  http://www.joegrossberg.com
9. Re:Perl 6 is a mistake by IpalindromeI · 2003-02-14 07:07 · Score: 1
  
  Except that you're speaking of a different kind of memory than he was. You're speaking of physical computer memory, which obviously you need some of to do anything useful. He's speaking in abstract about not having to keep track of anything as you go. Not having to "remember" anything. Having no "memory". You see?
  
  --
  
  --
  Promoting critical thinking since 1994.
10. Re:Perl 6 is a mistake by Anonymous Coward · 2003-02-14 11:51 · Score: 0
  
  come on people, this is a total troll
11. Re:Perl 6 is a mistake by Ed+Avis · 2003-02-16 22:53 · Score: 1
  
  I'm used to programming in Perl and when using Python the main two annoyances are:
  
  - No decent closures. They say that in Python 2.2 proper closures have been added, but it still doesn't seem possible to construct them, because the 'lambda' operator only allows an expression, not a statement, so you can't do anything non-trivial inside your lambda expression. For example, I often write Perl code passing round functions, eg:
  
  my $count;
  my $f = sub { ++$count };
  my $g = sub { print "hello\n" if $count == 5 };
  
  But there doesn't seem to be any way to construct these closures in Python.
  
  - The other thing I miss is Perl's labelled loop blocks, so that instead of 'next' and 'last' always referring to the inner loop you can say 'next LABEL' and 'last LABEL'. This can often make code more readable and eliminate the need for Pascal-like condition variables and contorted code to check them. Also I think 'next' and 'last' are clearer names than C's 'continue' and 'break', but that's a matter of taste :-).
  
  --
  -- Ed Avis ed@membled.com
12. Re:Perl 6 is a mistake by Ed+Avis · 2003-02-16 23:07 · Score: 1
  
  Apocalypsen? You can't just stick -en on the end of any word to make a plural, it has to be reasonably Germanic-sounding (like Vaxen, compare hexen). Apocalypse comes from Greek apokalypsis, so the plural has to be apocalpyses, which is correct both by the classical Greek rules (I think) and by the English rule of adding -s.
  
  --
  -- Ed Avis ed@membled.com
13. Re:Perl 6 is a mistake by etcshadow · 2003-02-18 10:14 · Score: 1
  
  Minor note, because I'm being anal, I guess.
  
  CFG's are not implementable as a DFA with a stack. They are actually an NFA with a stack, something that has no direct tie-back to any sort of deterministic automaton. (Whereas, as you noted, NFA's can be expressed as DFA's with an exponential growth in state-space, and Nondeterminstic Turing Machines can be rewritten as TM's with an exponential growth in time-complexity (I'm not saying that they *must* incurr an exponential growth, mind you, just that they can... see P=NP?)... however, a non-deterinstic push-down automaton cannot be rewritten as a deterministic PDA at all.)
  
  --
  :Wq
  Not an editor command: Wq
Perl section by Tom7 · 2003-02-13 05:08 · Score: 1

There is a section for perl! This doesn't belong "programming"!
Seconded by metamatic · 2003-02-13 05:49 · Score: 3, Interesting

Maybe I won't get invited to the PERL hacker parties any more, but I have to agree. PERL 5's hacks for object oriented programming have always seemed unnecessarily complicated and ugly, and I don't see things getting any better in PERL 6.

I too looked at Python. Like you, I decided that basing your language's syntax on differing amounts of whitespace was a really, really bad idea, not because it's ugly, but because I have enough trouble keeping tab damage under control on a single platform.

So I'm looking at Ruby. In fact, the only thing stopping me ditching PERL for Ruby tomorrow is lack of time for re-learning, given all the other new stuff I'm learning right now (J2EE, Objective-C, Cocoa, OpenGL, ...)

--
GCHQ Quantum Insert installed. If only our tongues were made of glass, how much more careful we would be when we speak
1. Re:Seconded by John+Fry · 2003-02-13 06:47 · Score: 2, Interesting
  
  I recently ditched Perl for Ruby and I'll never go back. Ruby is a joy to use: simple, elegant, and consistent.
2. Re:Seconded by IpalindromeI · 2003-02-14 07:43 · Score: 1
  
  Perl is not an acronym. See the last sentence of the perlfaq1 question: What's the difference between perl and Perl?
  
  --
  
  --
  Promoting critical thinking since 1994.
3. Re:Seconded by metamatic · 2003-02-14 08:35 · Score: 1
  
  I'll consider abandoning the acronymic usage when they correct the man page.
  
  --
  GCHQ Quantum Insert installed. If only our tongues were made of glass, how much more careful we would be when we speak
4. Re:Seconded by chromatic · 2003-02-14 15:25 · Score: 1
  
  Consider calling ls LS then.
  
  --
  how to invest, a novice's guide
5. Re:Seconded by metamatic · 2003-02-14 15:45 · Score: 1
  
  LS(1) OpenBSD Reference Manual LS(1)
  
  NAME
  ls - list directory contents
  
  I see no acronym, dumbass.
  
  --
  GCHQ Quantum Insert installed. If only our tongues were made of glass, how much more careful we would be when we speak
6. Re:Seconded by chromatic · 2003-02-14 20:34 · Score: 1
  
  The only place it's spelled PERL is in the header -- the same place as the LS you quoted. My sincerest apologies for taking your post at face value.
  
  Still, if you're going to be that pedantic, why not use ldc instead of ls?
  
  --
  how to invest, a novice's guide
7. Re:Seconded by metamatic · 2003-02-15 03:17 · Score: 1
  
  It's nothing to do with the spelling. It's the fact that the man page says right at the top:
  
  "perl - Practical Extraction and Report Language"
  
  Supporting the acronymic derivation of the name.
  
  The man page for ls, in contrast, doesn't claim that it stands for anything.
  
  --
  GCHQ Quantum Insert installed. If only our tongues were made of glass, how much more careful we would be when we speak
Hard to read Perl [5] by SeanAhern · 2003-02-13 08:21 · Score: 3, Informative
[Perl 5 is] NOT HARD TO READ UNLESS YOU MAKE IT HARD TO READ!!!

If it's not hard to read, then why are the designers of perl 6 making a lot of efforts to make it a lot easier to read than perl 5?

Quoting Larry Wall from the Apocalypses:
- In fact, regular expression culture is a mess, and I share some of the blame for making it that way. Since my mother always told me to clean up my own messes, I suppose I'll have to do just that. [emphasis mine]
- But Perl has often been tagged as a language in which it's easy to write programs that are difficult to read, and it's no secret that regular expression syntax that has been the chief culprit. [emphasis mine]
- there's a lot of regex culture that needs breaking.
- [Read all of Apocalypse 5 to learn exactly why perl 5 sucks to read. Even the extended syntax ain't really the most readable syntax.]
- As a specific example, there are various ways things could improve if we muster the courage to break the ``weird'' relationship between @foo and $foo[]. ... the botch that in Perl 5 requires us to distinguish $foo[] from $foo->[]
- I think length(@array) should be equivalent to @array.length(), so if there's a length method available, it should be called.
- Legacy Perl $pkg'var Should Die.
  I agree. I was unduly influenced by Ada syntax here, and it was a mistake.
- odd looking constructions like: $foo->[1][2][3]
- We're definitely killing Perl 5's slice syntax
- Various special punctuation variables are gone in Perl 6
- Typeglobs are gone.
- I'd like to get rid of the gratuitously ugly \E as an end-of-scope marker.
- I've always thought qw() was kind of ugly, so I'd like to replace it with something prettier.
- Angle Brackets Should Not Be Used for File Globbing. Indeed, they won't be. In fact, angle brackets won't be used for input at all, I suspect.
- This allows us to simplify the special case in Perl 5 represented by the _ token, which was always rather difficult to explain.
- The basic underlying question is "What exactly do those curlies mean?" For Perl 5 and earlier, the answer to that question was, "Too many things". Or rather, too many things with inconsistent rules.
- curlies are so extremely overloaded in Perl 5
- The old use integer pragma was a hack.
There's more, but I got tired of skimming the Apocalypses.

Just for a point of reference, I'm a perl programmer who doesn't fit your categories (a), (b), or (c), but still finds perl code hard to read fairly often.

With all that said, I'll close with one more quote from the Wall:
- Perl 5 does a lot of things right, and we're not terribly interested in ``fixing'' that.
1. Re:Hard to read Perl [5] by Anonymous Coward · 2003-02-13 10:14 · Score: 0
  
  The new syntax should make it harder to write hard-to-read code / The new syntax should make it easier to write easy-to-read code.
  
  Perl5 (and even C) give you far too much room to be a really, really, really sloppy programmer. But it's still the programmer's fault!
  
  Lisp makes it easy to write functional (vs. procedural) code. Java makes it easy to write OO code. But that's not to say you can't make procedural Lisp or Java! The syntax just makes it more intuitive to do it a particular way.
  
  IMHO, anyway.
Perl 6'll be neat by togofspookware · 2003-02-13 13:40 · Score: 1

I, too, ditched Perl 5 for Ruby. I hated the OO and the way you had to expliitly make references to make lists of lists.

But I think I'll probably check out Perl 6 when it comes out. It looks like the OO and references'll be cleaned up, the new regex stuff looks kinda neat, and hopefully Perl and Ruby and Python will all be able to coexist peacefully when they're all ported to Parrot. Need to do some fany regexing from a Ruby program? Just write your regexing function in Perl 6 and link to it from your Ruby program!

Plus, exploring Perl is just fun :-) You can learn all the interesting things about Ruby in a day, but in Perl, there's always something new to discover.

--
Duct tape, XML, democracy: Not doing the job? Use more.
1. Re:Perl 6'll be neat by Ed+Avis · 2003-02-16 22:45 · Score: 1
  
  I too hated that you had to use references to make nested data structures. But I've come to appreciate the fact that the language distinguishes between value semantics and reference semantics. So @a = @b will do a (shallow) value copy, while you can do $a = \@b if you want to take a reference. This is analogous to 'vector' versus 'vector *' in C++. It's nice to have both.
  
  What annoys me is that Perl has no way to just store one list inside another. If I want to make a list of lists, and have value semantics, why can't I just say $a[5] = @b? It might not be that efficient to program this way, just as a vector is not normally that sensible in C++, but it would make things a lot more consistent. You could choose to work with values or with references, instead of being forced into references just to make nested data structures.
  
  I don't know whether perl6 will support this; probably not. It might end up using only references but without the -> operator that reminds you that you are dereferencing something.
  
  --
  -- Ed Avis ed@membled.com
OT Re:Perl 6 is a mistake by msouth · 2003-02-17 02:33 · Score: 1

s/can't/shouldn't/ :)

that was just for fun. (Well, actually it was also promptes by the fact that 'Apocalypses" sounds weird, and so I just grabbed another pluralization rule.)

But I didn't really know the real place to use that rule, so it was nice to have the chance to learn something. Thanks.

ps In case anyone is wondering, if you are talking about a _lot_ of Apocalypses, the correct rendering is "Apocaloodles".

--
Liberty uber alles.
Not that hard, man by etcshadow · 2003-02-18 10:19 · Score: 1

"If I want to make a list of lists, and have value semantics, why can't I just say $a[5] = @b?"

It's called:

$a[5] = [@b];

how hard was that? Once again, it's nice to have *control* over value/reference semantics.

--
:Wq
Not an editor command: Wq
1. Re:Not that hard, man by Ed+Avis · 2003-02-18 10:29 · Score: 1
  
  You're right, you can make an explicit copy to emulate value semantics, but it's inconsistent with what happens for other data types. For example
  
  $a = 55;
  $b = $a;
  $b = 66;
  print $a;
  
  Clearly the original value of $a has not been changed. You don't need any voodoo with square brackets or reading 'man perltoot' to get this.
  And if you do the same thing with lists, it's fine too:
  
  @a = (55, 56);
  @b = @a;
  push @b, 'hello';
  print join(', ', @a);
  
  Again the original object is unchanged. Now, what if instead of variables I have another data structure (say, a hash) storing these values?
  
  $h{a} = (55, 56);
  
  Well there's the first problem, you can't do that, it has to be a scalar. Well, okay,
  
  $h{a} = [ 55, 56 ];
  $h{b} = $h{a};
  push @{$h{b}}, 'hello';
  print join(', ', @{$h{a}});
  
  And bam, it's different. You don't have to take references to strings to store them in hashes, why should you have to take references to lists to store them? If I want a hash of lists, why can't Perl manage it?
  
  Of course you should be able to make explicit references if you want, but they shouldn't be forced on you just for simple things like the above.
  
  'Easy things should be easy' - I wish Perl would follow its own motto in this area. It's good that there is documentation such as perltoot to guide the new programmer through this stuff, but rather odd that none of it seems to acknowledge the possibility that this stuff is anything other than bleeding obvious.
  
  --
  -- Ed Avis ed@membled.com