Domain: cpan.org
Stories and comments across the archive that link to cpan.org.
Comments · 1,172
-
text processing
guess it answers this question (Does Perl Have a Future?) posed by Brian d foy.
perl has some neat text processing capabilities (munging), regex, unicode and CPAN. I wonder if you can gain access to CPAN from your phone? -
Nokia should use TIBET(tm) 2.0
TIBET(tm) 2.0 is being released with the moral equivalent of CPAN compressed into around 200k of gzipped ECMAScript. It's pure ECMAScript -- and includs client-side tag expansion to active client pages, as well as support for web services, Jabber, etc.
-
Perl
Lego::RCX module.
-
Re:Language?
C/C++ is largely English in terms of character set and keywords used.
Which is why Perl is the way forward! -
Re:These kind of benchmarks are so 1970s
"But it usually isn't important to be anywhere near as fast as C, just to speed up the simpler, cleaner Python version by 2-20x. "
I do the same for Perl (search.cpan.org for Math::BigInt) and I really like this optimization approach: fast code development, optimize later (and get rewarded by having a lot to optimize).
Cheers,
Tels -
Re:Not hard enough!
I don't know where Perl gets this totally undeserved reputation for obfuscation.
-
Re:Free AI Minds for Better, Smarter Contest-Robot
Association for Computing Machinery on Mentifex artificial intelligence
Ben Goertzel, Ph.D., on Mentifex artificial intelligence
Comprehensive Perl Archive Network: Mentifex AI mind.txt gameplan
Free Software Donation Directory: Mentifex AI Project
Nanomagazine interviews Mentifex on independent AI scholarship
Redpaper archive of Mentifex documents on artificial intelligence
AI has been solved.
Agents Portal selling Mentifex AI4U textbook of artificial intelligence
GameDev.net selling Mentifex AI4U textbook of artificial intelligence
GreatMindsWorking selling Mentifex AI4U: Mind-1.1 Programmer's Manual
SourceForge Mentifex Donations Page -
CPAN is your friend
I highly recommend that you check out w3mir - which was found after a quick search on CPAN (The Comprehensive Perl Archive Network). I particularly like w3mir due to it's ability to compare against existing copies of your local mirror - which is more of what you're looking for. Using this in conjunction with a simple shell script (to tar and mv files, as so desired - hooked to a cron job) will create your very own, automated Internet Archive.
-
CPAN is your friend
I highly recommend that you check out w3mir - which was found after a quick search on CPAN (The Comprehensive Perl Archive Network). I particularly like w3mir due to it's ability to compare against existing copies of your local mirror - which is more of what you're looking for. Using this in conjunction with a simple shell script (to tar and mv files, as so desired - hooked to a cron job) will create your very own, automated Internet Archive.
-
Re:Will the result be the same?
Adobe has a perfect recourse: Microsoft would need to license the rights to PDF from Adobe first
No. The PDF specifications have been basically open since more or less the very beginning. While they hold patents on the specification, they give the right to use the specs royalty free.
Besides, the market in Acrobat is about a lot more than just Distiller.
True enough, but the majority of people who buy Acrobat only care about creating PDFs. I don't know about the latest versions, but v4.x absolutely *stunk* for editing an already created PDF.
This fall at a tech conference I did a session on PDFs. The last part of the session was looking at alternatives for creating PDFs. At least one group attending was planning to ditch Adobe once they found out that Open Office can create PDFs on the fly. Like most users, they just want basic PDF creation features and don't even use the other ones.
-
Re:Why has this taken so long?
I'm not sure if MS aren't talking about something different from what most of this discussion thinks they are. Rather than showing the thread of discussion of whole emails (which we're all used to in other clients) it might be they mean something more like this old discussion of what e-mail discussions should look like by Ka Ping-Yee..
In case you manage to /. that, the idea is that it shows the responses to pieces of your email - the kind where someone says "see my responses inline" and responds to each of your points piecemeal, then you do the same to their responses, and so on.
I've often thought it would be cool to write something to parse emails the KPY way, but the heuristics would have to be pretty damn clever to deal with supercite. Specifically what I wanted was something that combined KPY's ideas with text-autosummarization , and some 'author ranking' information to produce mailing list summaries from gmane which are like Kernel Traffic and Cousins, or the now-defunct Eclectic.
Oh well, I can always wait until MS put this in Outlook 2010 ;) -
so, when is Perl going to die()?
I like perl, use it every day, but any language that allows source code like this should be, like, banned by the government or something, shouldn't it??
-
WWW::Mechanize is your friendYou Perl folks who want something a bit easier than LWP for your spidering and scraping, take a look at WWW::Mechanize Besides the six hacks in the book that discuss Mech:
- #21: WWW::Mechanize 101
- #22: Scraping with WWW::Mechanize
- #36: Downloading Images from Webshots
- #44: Archiving Yahoo! Groups Messages with WWW::Yahoo::Groups (which uses Mech)
- #64: Super Author Searching
- #73: Scraping TV Listings
- WWW::Mechanize::Examples
A random bunch of examples submitted by users, included with the Mechanize distribution. - http://www.perl.com/pub/a/2003/01/22/mechanize.ht
m l
Chris Ball's article about using WWW::Mechanize for scraping TV listings. (repurposed into hack #73 above) - http://www.stonehenge.com/merlyn/LinuxMag/col47.h
t ml
Randal Schwartz's article on scraping Yahoo News for images. - http://www.perladvent.org/2002/16th/
WWW::Mechanize on the Perl Advent Calendar 2002, by Mark Fowler.
-
WWW::Mechanize is your friendYou Perl folks who want something a bit easier than LWP for your spidering and scraping, take a look at WWW::Mechanize Besides the six hacks in the book that discuss Mech:
- #21: WWW::Mechanize 101
- #22: Scraping with WWW::Mechanize
- #36: Downloading Images from Webshots
- #44: Archiving Yahoo! Groups Messages with WWW::Yahoo::Groups (which uses Mech)
- #64: Super Author Searching
- #73: Scraping TV Listings
- WWW::Mechanize::Examples
A random bunch of examples submitted by users, included with the Mechanize distribution. - http://www.perl.com/pub/a/2003/01/22/mechanize.ht
m l
Chris Ball's article about using WWW::Mechanize for scraping TV listings. (repurposed into hack #73 above) - http://www.stonehenge.com/merlyn/LinuxMag/col47.h
t ml
Randal Schwartz's article on scraping Yahoo News for images. - http://www.perladvent.org/2002/16th/
WWW::Mechanize on the Perl Advent Calendar 2002, by Mark Fowler.
-
Re:Tracking yahoo popularity.LWP runs as part of perl, so it gives you a little easier control over the variety of options (eg. user agent and such). And it's easier to get working cross-platform (it's a bitch that you have to do extra work to get around the shell parsing of arguments to subprocesses on Win32). Also, you can do fancy asynchronous stuff with LWP, so you can have interactive programs, or stuff going in parallel, etc...
In general, most people use LWP, and if you write very many programs that use the web, you're going to want to go to LWP eventually, so you might as well start learning now (and there are easier interfaces to facilitate that too).
-
Re:and like every Linux geek..I'm going to say "Where the ogg version?"
Josh
-
Re:and like every Linux geek..I'm going to say "Where the ogg version?"
Josh
-
Re:and like every Linux geek..
just search for some ogg modules on cpan, and put something together
:) -
Re:and like every Linux geek..
here you go:
An object-oriented interface to Ogg Vorbis information and comment fields, implemented entirely in Perl. -
Re:Save some timeAny reason you don't use Solars::PerlGcc to enable gcc-suppport?
SunFreeware is not funded/created by Sun.
There also isn't much of a need as there used to be to use SunFreeware when Solaris ships with a companion CD full of stuff already compiled.... never mind the
/usr/sfw stuff that's built into the main OS already. -
Re:Sweet function
-
Re:A terrible and never-ending task???> just writing a perl script to talk behind the scenes to the Outlook Web Interface
Sounds like a job for WWW::Mechanize.
-
Re:I remember an app named 'Babble' did the same..Sounds like a variation of the "dissociated press" algorithm, based on techniques similar to Markov models. Such things have been the toys of choice for text processing people for a long time now. I remember prof. Seppo Mustonen's dissociation of Kalevala in his Survo book, and I recently read MegaHAL's poetry produced with similar technique.
I think there's prior art if it attempts to put style in the equation somehow. Whether or not it should be patentable is an entirely different matter... I was wondering first if this was anything I had done personally, but my text-generation stuff has always been rather crude, the most sophisticated thing I've done is based on context-free grammars, and context-free grammars aren't good for preserving structure but not for making stuff that rhymes (unless we narrow the generation accordingly).
And on the topic, the coolest poetry experiment ever has to be the Coy module for Perl =)
-
A poem module for the Perl geeks.
I always though Coy was a fun module to get poems from. I don't think it works with Perl 5.8 though.
-
Re:Mapping engine status: Stalled
<plug>javainetlocator and IP::Country</plug> are also available.
The city data are unreliable. I've posted elsewhere (link1, link2) the reasons why, but will repeat the main points here.
- All IP geolocation techniques assume the user of an IP address lives close to the company which registered the address.
- The above assumption is mostly true if you define 'close' as 'in the same country'.
- In the USA, a lot of people live in (or close to), the same city as the company who registered their IP address. This is because lots of people live in cities and use ISPs which have a presence in that city.
- Most of the world population (and US population) doesn't live in a city with major ISP presence. Their city locations aren't going to be accurate by any IP geolocation technique.
- The commercial IP geolocation vendors (quova, digital envoy) have a business reason for making city geolocation sound accurate. If it wasn't accurate, why would anybody buy their products?
- Conducting a survey that inflates the accuracy of city geolocation is easy - just ensure your survey participants live in a major US cities and you'll achieve high accuracy. One way would be to use server logs from a US financial industry website. Once you have a survey that shows high accuracy, you can sell your product to businesses whose customers don't live in major US cities
- It's extremely difficult to measure accuracy of IP geolocation (even at the country level), so if you make bold claims, no one is going to be in a position to argue.
-
Source code for Punycode encode/decode
I found the technical section of the Switch article didn't fully explain the encoding. After thinking that I should code up the RFC to figure it out (and perhaps gain a name for myself on CPAN), I found to my mixed delight that someone had already beaten me to it: IDNA::Punycode and Encode::Punycode.
--LP -
Source code for Punycode encode/decode
I found the technical section of the Switch article didn't fully explain the encoding. After thinking that I should code up the RFC to figure it out (and perhaps gain a name for myself on CPAN), I found to my mixed delight that someone had already beaten me to it: IDNA::Punycode and Encode::Punycode.
--LP -
Re:You think 2.44 is ancient?Installing 2.60 into your home directory is painless, but you're stuffed unless you're using the client/server spamc/spamd system and you get a sudden influx of a zillion messages. You'll kill your box due to the expense of loading perl a zillion times.
pperl might help you here.
-
Re:SpamCop doesn't work..
-
PDF::API2There's a good Perl package PDF::API2, you can scan the source at:
http://search.cpan.org/~areibens/PDF-API2-0.3r77/
M ANIFESTIt's pretty readable and the basic text output and font metrics all work. It's very easy to produce output with from Perl, so you can very rapidly prototype your reports and see what the resulting PDF contains.
You can also tweak it a bit and disable the PDF stream compression feature so you can really see what'ts going on.
There are several parts of the package that aren't complete, but I haven't needed them so far.
- Barrie
-
Re:It wouldn't be interesting...
-
Re:My problem with Perl
That said, the traditional regular-expression syntax is rather arcane. The only real alternative I've seen is the S-expression syntax of cl-ppcre -- the Common Lisp PCRE implementation. This allows you to write complex regular expressions as tree structure rather than as strings of character glyphs.
Sounds like Regexp::English, only less explicit. -
Re:My problem with Perl
Have you tried Regexp::English?
-
Happy DayThe Perl Cookbook is one of the top three technical books I've purchased. If the new version improves on the old version, it is a must-have for anyone who does anything with Perl.
Regardless of the religious wars over languages, I have time and again found Perl to simply be the most practical language for the job and continue to enjoy using it for a variety of tasks and watching it mature.
There is no substitute for the CPAN or Perl's ability to get the job done quickly and easily with an active community providing free (and very effective) support. Both Perl and the Perl Cookbook are an integral parts in my toolkit and for anyone who uses it, whether from time to time or regularly, I recommend it. -
Multiple Machines in Parallel
One of the problems we have, is when you have clusters with 100+ machines, and need to push configs, or gather stats off each box. On solaris, we run a script called "shout" that does a for/next loop that ssh's into each box and runs a command for us. We also have one called "Scream" which does some root privilege ssh enabled commands.
While the serial approach of looping through machines is a huge improvement over making changes by hand, for large scale environments, you need to use a parallel approach, with 16 processes or so contacting machines in parallel. I wrote my own script, but these days the Parallel::ForkManager module for perl does the process management part for you. -
RT is God
RT is a tremendous package. Version 3 is out, but you can see version 2 in action at rt.cpan.org. All Perl bug tracking, both in modules and the core, goes in here. In fact, submissions for various O'Reilly conferences are in RT, as well. It's very flexible.
-
DBD::SQLite (was: Re:SQLite)
SQLite is tiny, fast and ACID compliant. SQLite is a public domain embedded SQL database library. It is similar to BDB, but provides a complete SQL database.
Let us not forget about DBD::SQLite [cpan.org] — a DBI [cpan.org] (Perl [perl.org] Database Interface) driver which, not being a driver per se, includes a *full* SQLite distribution, so when one needs to use SQLite in a Perl program, one is only perl -W -MCPAN '-einstall"DBD::SQLite"' away from $dbh=DBI->connect(q(dbi:SQLite:dbname=dbfile)); which is truely amazing.
-
DBD::SQLite (was: Re:SQLite)
SQLite is tiny, fast and ACID compliant. SQLite is a public domain embedded SQL database library. It is similar to BDB, but provides a complete SQL database.
Let us not forget about DBD::SQLite [cpan.org] — a DBI [cpan.org] (Perl [perl.org] Database Interface) driver which, not being a driver per se, includes a *full* SQLite distribution, so when one needs to use SQLite in a Perl program, one is only perl -W -MCPAN '-einstall"DBD::SQLite"' away from $dbh=DBI->connect(q(dbi:SQLite:dbname=dbfile)); which is truely amazing.
-
Adn to decode..
the morse code for normal humans, use Convert::Morse from search.cpan.org...
-
Perl 5.81 AI Mind Modules
Perl modules for artificial intelligence must be kept current with the new Perl release.
The Perl AI Weblog has some sample Perl AI code for the main Alife module of the Open-Source AI Mind in Perl.
Register your own Perl weblog if you are writing AI or other good code to share in Perl.
-
Libraries are king..
Really, unless the language really sucks, library availability, quality and pricing are all crucial in choosing what language to use.
Bias : Coming from Perl, I absolutely love CPAN. Readymade libraries that do a lot of the heavy hauling makes sure many hard tasks become manageable.
In the web development world, a good templating engine will gain your productivity. Make sure its caching design and capabilities are good - a proxy server will not save you in all scenarios. At work, we use Mason. I haven't tried the competition, but Mason does the job and those who've tried the competition have seemed pleased. -
Re:Finally
-
Re:You would think
Why not? It can already make itself disappear. Give Damian a few months.
-
Re:Use qmail
I forgot to mention that this code preserves the \Seen state of the messages.
If you're just migrating from one IMAP server to another, check out the migrate() method of Mail::IMAPClient. You won't regret it. If you decide to use it, make sure that you grab the latest version from CPAN.
There are plenty of code/method usage here
/pointer -
Re:Use qmail
I forgot to mention that this code preserves the \Seen state of the messages.
If you're just migrating from one IMAP server to another, check out the migrate() method of Mail::IMAPClient. You won't regret it. If you decide to use it, make sure that you grab the latest version from CPAN.
There are plenty of code/method usage here
/pointer -
Re:Use qmail
I forgot to mention that this code preserves the \Seen state of the messages.
If you're just migrating from one IMAP server to another, check out the migrate() method of Mail::IMAPClient. You won't regret it. If you decide to use it, make sure that you grab the latest version from CPAN.
There are plenty of code/method usage here
/pointer -
Re:HmmmLingua::EN::Squeeze is a Perl module which compresses English text into a tight, abbreviation-rich scheme for pagers or SMS applications. Looks like it'd take practice to get used to the abbreviations done.
- "
- For example pagers have an arbitrary text size limit, typically 200 characters, which you want to fill as much as possible."
F_xmplePaghvAbitryTxtSizLim,Tpcly200Chr,W/UWnttoF
l lAsMchAsPsble
-
perl's GD::BarcodeGD::Barcode
I've used it for a neat project where the only input device for a kiosk was a barcode scanner. (We made this funky menu system for administration(loading content and downloading logs) which involved bouncing the focus between choices and you would swipe the 'admin' barcode when you wanted to choose an option.)
-
Re:Somewhat off topic but...Applescript is an application scripting language. I sure there is YACC/LEX code floating around for the language. So in theory it wouldn't be that hard to port.
Python, Perl and to a lesser extent Javascript aren't designed for this. (I say lesser extent Javascript, since there actually is a version of Javascript that compiles to the same format Applescript uses and works in the same way)
One must also point out that there are glue modules for Perl that allow it to do most Applescripting.
Applescript Module
Applescript Glue for PerlI agree that in general one would be better off using an other scripting language. But Applescript is useful -- especially for small tasks. (I think the majority of my scripts are less than 25 lines long)
I've not tried using the more robust OSA features in Python. Ususally I just call applescripts. I plan on doing more of this during the upcoming weeks.
Unfortunately as I alluded in my other post, Applescript Studio doesn't really support OSA languages other than Applescript. (OSA is from a programmer's perspective something like the VM for Java - it lets you use other languages)
I 100% agree that having a good general scripting book for OSX would be a great idea. It would have to be very practical. Here's hoping.
-
Re:Somewhat off topic but...Applescript is an application scripting language. I sure there is YACC/LEX code floating around for the language. So in theory it wouldn't be that hard to port.
Python, Perl and to a lesser extent Javascript aren't designed for this. (I say lesser extent Javascript, since there actually is a version of Javascript that compiles to the same format Applescript uses and works in the same way)
One must also point out that there are glue modules for Perl that allow it to do most Applescripting.
Applescript Module
Applescript Glue for PerlI agree that in general one would be better off using an other scripting language. But Applescript is useful -- especially for small tasks. (I think the majority of my scripts are less than 25 lines long)
I've not tried using the more robust OSA features in Python. Ususally I just call applescripts. I plan on doing more of this during the upcoming weeks.
Unfortunately as I alluded in my other post, Applescript Studio doesn't really support OSA languages other than Applescript. (OSA is from a programmer's perspective something like the VM for Java - it lets you use other languages)
I 100% agree that having a good general scripting book for OSX would be a great idea. It would have to be very practical. Here's hoping.