Domain: r-project.org
Stories and comments across the archive that link to r-project.org.
Comments · 217
-
Re:well I use open source
-
Re:For scientific calculations, clones are useless
can't someone figure out a smart solution for this without asking the user to modify the source themselves?
If you need more than 64k of data use a app made for scientific work, like R, mupad or Mathematica. -
Tux goes to College...The University I am preparing to attend, Woodbury, has a policy where they require their students to have at least a 300MHz Pentium (Pro? 2? Celeron?) class computer, (laptop preferred, desktop in your dorm room accepted) some version of Windows, a copy of Office 2000 or Office XP, and a copy of SPSS. LA Valley College, on the other hand, has no such policy, but it also has a free Wi-Fi hotspot I'm looking forward to using in the future.
I've got the laptop in question right here, (I'm typing on it now) and yeah, I dual-boot Linux (Knoppix knx-hdinstall) and Windows 2000 SP4. I need to upgrade the hard drive to give both systems the space they need to coexist happily, but even now they both are happy together. The hard drive is 10GB, there is 228MB of RAM in here, and I have both a wired NIC and a Prism-based 802.11b card to use with it. It won't run Neverwinter Nights or Doom 3, or anything like that, but from what I understand Starcraft will probably run on this. I can certainly play KMahjongg on this until the cows come home.
However, I intend to use this machine primarily on Linux...*especially* when it is hooked up to the University network. Everyone knows just how good OpenOffice.org is as an Office alternative, and how much it needs to evolve, so I won't say much about that. However, the SPSS requirement is something that takes some thought.
After some judicious googling, I found two SPSS alternatives: The R Project and GNU/PSPP. I don't know much about either program, (nor do I know much about SPSS) but it's good to know there are at least two alternatives that leap out at you when you look for it.
Linux should be a supported alternative at all Universities and Colleges throughout the world. Actually, I think Linux should be promoted over Windows, and I am not alone in thinking this..
Linux solves a lot of problems that bedevil IT departments at Colleges and Universities. It comes with great Free/Open Source alternatives to widely-bootlegged proprietary software. It is less prone to malware, viruses and trojans. It is more secure than Windows. And if you look beyond full-figured GUIs like GNOME and KDE and use trim window managers like IceWM, BlackBox, XFCE and so on, you can run graphical Linux on modest computers. Linux + KDE is actually quite nimble on my 400MHz ThinkPad 600E, and I have seen it run OK on 233MHz Pentium systems with 128MB RAM or better. If Windows 2000 will run on a machine, Linux and KDE will also run.
All these problems the article we're discussing enumerates would be ameliorated if not completely sidestepped by encouraging alternatives to a Windows Monoculture.
-
They left out a couple
For a start, they left out the S programming language (started in 1976), for which John Chambers won the ACM Software Systems Award. This, and its Libre dialect R (thanks to Robert Gentleman and Ross Ihaka at University of Aukland), are in daily use by folks who have to write programs to use data.
-
Re:Hrmmm
The R Project does a really good job too. It's an S-Plus clone, and meant for all things statistical, but even if you just use its graph features it kicks butt.
-
Re:Lack of decent up to date software.
Just don't buy it if you're looking for an organiser, you'll be bitterly disappointed.
As the proud owner of a now obsolete Handspring Visor and having just purchased a "new" Zaurus 5500 on Ebay, I basically agree.
Everything on a Palm is instant. It may not have every feature you want, and until recently, the screens had pretty poor resolution (Sony is the exception). However, the applications worked well, and had reasonable interfaces so they could be used quickly.
I quickly abandoned the Sharp ROM for the Zaurus in favor of OpenZaurus, but I've certainly had my frustrations with bugs and missing features. Some small things, like the ability for the application buttons to turn on the device (like a Palm), and some larger things, like having it not reboot properly the second time if you haven't suspended in between (though there is an unofficial fix).
Why am I rebooting in the first place? Because of the Zaurus' greatest aspect - it's basically a fully-functional Linux system. As such, one can tweak, test, and otherwise poke around (sometimes needing a reboot if something got messed up or you're testing something). Some Zaurus applications I've found I've had no good free Palm equivalent. Zee Cookbook is a great, if somewhat slow (when editing), way to keep a database of recipes on hand. QTJournal is a great way to take notes that are categorized by date and subject. The ability to run just about any console-based Linux software (even the statistical software R) makes it very useful as a sysadmin tool. With a small, cheap wireless card from Ebay, it is often more convenient than lugging around a laptop.
Some of the things I've wanted to use my Zaurus for before I bought it work, and some don't (yet). I got xmms running and it plays OGG files well (but the included mediaplayer with openzaurus doesn't, and the Sharp ROM's media player has a horrendous interface). I can control the Zaurus remotely via ssh (VERY handy for exploring with a real keyboard) and VNC (with the framebuffer vnc package). However, the latter doesn't offer even basic security (and I haven't gotten iptables to work), so I'm reluctant to use it often, mostly out of principle.
I got the xvnc server running, but the vncviewer client to view it simply will not connect to it, or any other vnc server. I've seen a few other reports of this behavior but no fix, and most people seem to have no problem. This combo is supposed to allow the use of any X11 application on the Zaurus itself, and more importantly for me, remote X applications (so I can control xmms on my music server with a wireless connection - the ultimate remote). If anyone has a suggestion about this, I'd be happy to hear it.
My other problem is mail - mailit (included with OpenZaurus) is simplistic, but more importantly doesn't work for one of my domains (not sure why this is). I can telnet manually to port 110 and execute pop commands fine, but this mail client barks about an unknown response from the server. QTmail doesn't work either - it gives host not found or something like that.
For the price I paid, I get far more functionality than I ever did from a Visor, but the Zaurus definitely has its frustrations. The PIM apps are nothing much to speak of, they function, but are slower than their Palm equivalents (this, again is on OpenZaurus). My greatest desire - the ability to have tree-view tasks, is not implemented on either my Handspring or any version of "todo" on the Zaurus that I've used.
So it's not perfect, but you can still pry it from my cold, dead hands.
-
Re:Ever since Igor
However, making advanced graphs and plots with ROOT requires a whomping manual
Why not try R? There's not much point and click, but the command are quite ok, and as you can see from their page it generates some VERY good-looking graphs. Its GPL'ed :-) -
Re:You must have fewer needs than I do...
R. It's the GNU version of S, which is a very good statistics tool.
-
Re:You must have fewer needs than I do...
It is a play on the name "S"/"SPlus."
-
A, B, C, D, ... R!
Bah, We've allready made it all the way to R!
-
Re:A spread sheet is not...
Having used a quite large number of statistics program, I would like to recommend R in addition to the ones in the list you linked to. R originally was a clone of S-plus, but I wouldn't be surprised if it has surpassed S-plus today. One of the nice features of R is that it's crossplatform (most Linux distros include it) and free.
The bad thing, at least for people used to excel, is that R has no fancy GUI. -
Re:meh
what features is Excel's graphs missing, specifically?
Histograms. At least I never figured out how to make them :-( Excel made me misinterpret my results (initially) because they were displayed in some strange bar diagram which was the best possible approximation to a proper histogram. It's probably nice for quarterly reports, but I'd prefer R for scientific work. (I've heard nice things about Origin as well)
There was apparently some extra package one could purchase, but that was not possible. I may have overlooked something, but I searched around quite a bit, and it wasn't first time I used Excel. -
Re:Matlab, Schmatlab, I want to write some code!
My other peeve is that it is proprietary
You should try R. Free as in beer + speech, high level scripting, can link in compiled low level code (C, FORTRAN, maybe even Java), good graphics output, good matrix handling, lots of 3rd party extensions (most GPL'd). Not good for symbolic mathematics, though. Used heavily in the statistical community and actively developed by some very smart people. -
Re:Speaking of Databases....
I suspect you've got the name wrong.
The language/system called S or Splus is commonly used for such things by financial companies. There's a free gnu version called R with an enormous library of prediction software:
http://www.r-project.org/
Don't expect to predict the market with it though. Incidentally, there's an interesting book called The Predictors about a group of physicists including Stephan Wolfram who were funded by a major Wall Street firm to write market prediction software (as opposed to the more common option pricing and risk avoidance packages that assume market efficiency) -
Displaying the data
Computer benchmarking data are (usually) quite noisy. As such plain graphs, like those shown are not very useful. Simple statistical graphics like lowesss plots are much easier to interpret. Take a look at the for some ideas.
This stuff is easy to use, at least if you have a computer background, and would produce far better graphs.
-
Re:One day...I use X almost all the time, unless there is some pressing reason to shut it off (like big calculations on a small machine).
I use emacs and LaTeX for writing. Those, together with AucTeX, the RefTeX mode, flyspell and the BibTeX modes, make Word look pretty primitive, for both input and results.
For my data analysis, I use Maxima and the R programming language, again in emacs, this time using the eMaxima and ESS modes, repsectively.
Notice that all this can be done, exactly the same way, with or without X. I prefer X for web browsing, and need it to see the finished product from LaTeX (the DVI files). I run X because it's prettier, and it's easier to keep track of different desktops than to keep track of the several consoles, each with a number of emacs buffers. X is nice for the kid's games, like jumpnbump, too.
I could do most of the writing using OpenOffice. I already use OO to open Word and Excel stuff that I bring home from work. OO writer is about good enough now, if Word is good enough for you. Unfortunately, Word just isn't good enough for me, and neither is OO. They seem incredibly primitive.
The problem isn't that they require X or Windows, or that they can't run on the old 486DX I occasionally use, the problem is that I don't like that way of working. I struggle so hard, and my results look so much worse than when I let LaTeX typeset my work.
Maxima has a GUI, called XMaxima, and there is a Windows version of that. There just isn't a GUI for R, but there are several project working on putting one together. Given that we have emacs and ESS, that seems like a big step down, but to each his own.
The question here isn't CLI versus GUI, though I can see why it looks that way. The question here is are you using the tools which have been developed over the years to fit the platform you're on? On Unix, those happen to be mostly CLI, and they are incredibly sophisticated. On Windows, they are entirely GUI, and they may, someday, approach LaTeX for quality and ease of use (which is different from ease of learning). LaTeX and emacs are a pain to use on Windows (I use Windows at work, since it's my employer's time I'm wasting, rather than my own). Even with Cygwin, a lot of things just aren't quite right on Windows, if you're trying to do things the Unix way.
-
How about now?
-
Re:Not Quite...
I did an assingment this week for my comparative vertebrate morphology class. [...] The instructor showed us how to do the plots in Excel.
If you are working in the sciences, you should be using a decent, scientific data analysis and plotting package. That means something like Splus/R. SPSS and Minitab are also commonly used (I don't have first hand experience with those). Matlab/Octave and Mathematica also have lots of plotting and data analysis functions. For plotting, GNUplot is also pretty decent, and for calculations, Perl/PDL and Numerical Python are other good choices. Excel, on the other hand, is hardly ever the right thing to use for scientific work; its feature set, user community, and testing is oriented towards business applications, not the sciences.
You can get "R" for free here. -
Re:much better choices around
If you want statistically sound data analysis and graphing, look no further than the R Project. It's a complete programming environment and is extensively used by working statisticians.
I concur. I just wrote a script to do some AFLP analysis in perl (mainly for the mental excercise as a favour to someone), and aside from PDL that I don't know anything about, perl is pretty horrible for doing numerical stuff. R on the other hand is pretty neat, particularly with an emacs front end. There's lots of material out there introducing it, both in dead tree form (books about S and S+ also cover big chunks of R), and on the net. The statistical development method I use with R (and I really only use it's simplest features) is to do some work with it interactively, then take my command history and roll a script to spit out the graphs and analyses that I want. I used to use gnuplot and octave a lot, but R is really a much better solution for the kind of thing that I do which is analysis of biological data in the main. -
much better choices around
If you want statistically sound data analysis and graphing, look no further than the R Project. It's a complete programming environment and is extensively used by working statisticians.
Python, Numerical Python, various Python plot packages, and VTK also make for very powerful visualizations if you want something more do-it-yourself. -
Re:Oh come on... if you spend 2 hours to hunt the software down and install it, it costs the company about the same in wages as if they'd just bought it...
To use an example I'm familiar with, S-plus costs US$1200 and up. The libre equivalent costs US$0.00. How much do they pay you per hour?
Or were you talking about searching for cracked copies of Reader Rabbit Kindergarten?
-
Re:Plotting
Or use R . . I find it better than Gnuplot.
-
R: Open-source statistical languate
-
Re:What your lab is missing out on
GNU R an implementation of the "S" language is about as good as it gets for statistics.
For a matlab type clone there is octave, which won't run all the fancy toolkits, but then it doesn't cost $15,000 either.
If you need to do some heavy algebraic formula crunching to solve model equations, YACAS, Maxima, and Jacal are all good.
If anything, the data analysis arena is where Linux is completely taking over many labs. It's the hardware that's hard to talk with.
Oh and not to mention that Matlab, Mathematica, and S-Plus were all available under Linux last I checked. (proprietary of course) -
Re:A MacOSX version exists
The carbonised version of gnuplot does work well. However, it is based on 3.7.1 which is getting on a bit now. 3.8 (the pre 4.0 version) has some very nice extra features including mouse interaction for zooming and 3d plot rotation. And some very nice surface mapping and image handling functions.
I have had no problems building version 3.8 under OSX 10.2.4. The aqua term works fine but you only get the mouse interaction when running under X11 which also works perfectly well in my experience.
On a related point you can use gnuplot from within octave a matlab-like environment which is open source. This also works fine under OSX 10.2.4 and can make use of the VecLib BLAS and Lapack accelerate libraries under OSX - details on HPC for OSX. However, I must admit my preference is with R for data analysis outside of my own code (several others have already mentioned R) purely because of the wealth of statistical functionaility available. -
Statistics, was: age-old answer: it dependsFor statistical analysis, my advice is to go for R (GPL implementation of S, see www.r-project.org), or S+ (the commercial implementation of S). R has functionality equivalent to S+ and it's GPL. There are a lot of S language scripts to be found on the web, e.g., StatLib. Both R & S+ have very active user communities.
Especially if you're doing policy analysis (i.e., you're not data mining a 10 million record database) R is a great environment to work in. Hmm, it might be possible to process 10 million record databases with R, I just haven't tried it.
R has more or less equivalent functionality to Octave, but the programming language (S) is more interesting, and plotting functions are more sophisticated. For matrix operations, both R and Octave use BLAS.
My advice is to stay away from SAS and SPSS. These two were invented back when "everybody knows using a computer is hard". Both have GUI interfaces now, but that's just an attempt to hide the essential ugliness of their command languages. If you must use a commercial product then S+ is my recommendation.
Good luck & have fun!
-
Yes, what about libraries?
I'm very interested in what you're saying in the post about multidimensional optimizers, because I do lots of statistics work, and have been recently trying to decide whether or not to write a program in C++.
I was assuming that I'd have to write my own minimization routines, but you bring up an interesting alternative.
I've never really looked much for general-purpose open source multidimensional optimizers. They must be out there, though.
A quick thought--I believe that the R language has a multidimensional optimizer. Of course, you might not want to do whatever you do directly in R, but R is open source, and you might be able to find the code somewhere.
Not particularly helpful, I know.
Does anyone else know of any good libraries for C/C++ or Java? So far the discussion has focused mostly on Maxima/Maple/Mathematica/Matlab. -
right tool for the right job...it all depends no what you're doing. For a lot of applied math (read as: PDE, linear algebra intensive, etc.) there's a lot of optimizations that are best done in code.
At UCLA's math department, the applied people have a QUIST related research program that uses Fortran77 and shell scripts, and C++ for various parts of their code to implement the Level-Set method to simulate the growth of thin films (atom by atom construction of electronic devices). The language choice seems to be due to legacy reasons: the grad student that started it so long ago used it, and the code has continued to grow ever since(ie. legacy reasons).
Although I wouldn't call it *math* research, here at MBI, bioinformatics programs run on the cluster *seem* to be written in C or C++ for the most part. I thinks it's more of the former because that's the interface for a number of bioinformatics libraries that we have licenced. Also, these things tend to be mixed heavily with shell and perl scripts; so the language is only for ease of integration with support libraries.
For most all of my undergrad work, I saw everyone use matlab, mathimatica, and their relatives for their work. In grad school, it seems to depend more on the class and the religious leanings of the mathematician involved.
There's a class on scientific computing that uses VC++ with fortran libraries from netlib (leveraged by f2c) solve some math implementation problem (tends to vary from year to year). Prof Anderson tends to by a junkyard warrior when it comes to math code generation. But then he's the mathematician's MacGuyver. (side note: Prof Anderson is a wonderful teacher and researcher - check out his page for some handy software tools and papers. Also, look at 270B for tidbits of linear algebra optimizations).
The benifit of matlab-ish programs is that you can usually implement your math structure quickly. The down side is that if you want to use any advance optimization then it near impossible. On the other hand, if you don't have a numerical analysis background, then many of the things you try to do to optimize your code in more mundane languages are probably going to be *much* slower then matlab, et al.
All of this is assuming you'r doing numerical analysis. If you're interested in abstract algebra , then I think you're stuck with maple. good program, but I don't have a review on it since I did most of my work by pencil and paper. I did use it for one of my crypto classes and found its implementation of Z_n groups very nice... although I ended up just coding it in C++ anyway
:)Also, check out the R project as it is GNU matlab.
-
S-plus / R?
-
Re:Octave
Octave is a nice MATLAB clone, developed from chemical engineers in the beginning, but now used extensively in virtually any area that math is usefull.
Many packages have their open source counterparts: Octave for MATLAB, R-system for SPLUS (statistics algebra system), and so forth. But IMHO you raise another issue: you can use each of these packages to do whatever calculations you want, since all of them are extended in the C/Fortran end, i.e. they can use programs written in these languages. Custom code is readily integrated. And above all, the GNU Scientific Library. If you don't like or you don't trust the numerical solvers integrated in MATLAB, you can investigate the source in the GSL.
And yes, you can use all of these together. So, what is the question again?
-
Experiences with FOX, Cocoa, Swing, FLTK, etc.
First, a shameless plug: I'm working on a couple of C++ applications that are currently using FOX for the GUI (and other parts). The first one is a front end to the open source stats language R and it is called Obversive. This is the most active project, but it is still real tiny. The second project is a little chat client I'm writing called Bombyx which will work on the SILC and hopefully IRC. In addition to Bombyx, I've also developed a little GUI generator with Lua that makes FOX widgets pretty quickly. It's called GIG and you can get it from the Bombyx site mentioned previously. The intention with GIG is to make it work for any toolkit, but right now it does FOX only.
When I was working on these projects, I went through a huge number of toolkits and actually tried developing fast "mock-ups". My test was to see if I could get the first window I wanted in each project using available resources. If that worked, then I would try a couple of other windows. I also reviewed the designs of each and then went with what worked best at the time.
Here's my review of each of the toolkits I used. These are just my opinions, but they are based on actually using them in various platforms (or trying to).
FOX After all of this evaluation, I found that FOX was pretty much the best at the moment for clean, simple GUI building. It has pretty good docs (way better than wxWindows), and is fairly natural for coding. It also has some nice language bindings for various languages like Python, Ruby, and Eiffel. It doesn't have quite as many controls as wxWindows, and it doesn't work natively on as many platforms, but it is much easier to build and work with and much smaller. If you are porting a Windows application to another platform, you will probably be better off with wxWindows since they try to mimick the API design as much as possible (and of course, all it's problems too). Also, if you hate macros then you'll have a problem with FOX (and wxWindows too). A testimony to how easy FOX is comes from Obversive, where one developer had no C++ or FOX experience and has already created several complicated controls. Another developer has good C++ skills, but only worked with KDE previously. He not only developed an excellent Twin Table control, but also added lots of great features to it and in very little time.
FLTK I did the original work on Bombyx with FLTK. FLTK is a nice library, but it was just missing too many widgets to be useful. It was also rather buggy and it was a real pain to do delegation. There are, however, two really nice things about FLTK: no macros, excellent GUI builder. One of huge problems with ALL other toolkits I've ran into is that, when the developers start to make their GUI builder, the decide they need an entire IDE. I already have one thank you. I doubt some OSS project is really going to out-do my tool chain right now, especially if it's also working on the widgets. Just give me a simple builder please. FLTK got it right with FLUID--their GUI builder. It's real easy to use, super fast, and does one thing well: build interfaces. It was a little weird at times, but I checked out the latest and it worked great. Also, FLTK does not use macros for messaging which is nice for the C++ purists.
Cocoa I love Cocoa, but, Apple made a huge mistake by making it a MacOSX only beast. I think Apple could rule the world if I could spend my time writing my apps with a nice UI, great tools, and an easy language (Objective-C), and then click a button and have a Windows, Linux, HP-UX, and Solaris binary. You could do this with the original NeXT stuff, but Apple crippled Cocoa and friends. So, I reserve Cocoa for other useless programming that I only do on my MacOSX. There is GNUstep, but I couldn't get anything to work with it, and half the fun of Cocoa development is using Interface Builder (IB). BTW, IB is another GUI builder that is excellent, and also does just GUI building. Let's hope the wxWindows guys figure out this pattern and write a small GUI builder rather than their entire IDE which nobody seems to be able to compile but three dudes inside the cabal (sorry, it really burns me).
wxWindows I must say that wxWindows has a ton of nice widgets (I may steal their HTML widget) and works on a huge number of platforms. But, I kept getting burned in three areas: docs, dependencies, and bugs. Their docs are horrible. You have to hunt around the net looking for tutorials which are not very good anyway. Half the example applications in the distro don't even work so you can't rely on them, and it's just really hard to figure out what to do. I also hated having to add a billion other libraries to my build just to get wxWindows to work. I found it a problem when linking my application took longer than compiling all the files from scratch. Finally, just about everytime I went to use some widget, I found some bug or limitation that made it unusable. They have improved quite a lot from what I have heard, and people really rave about it, so I think everyone should give it a try. One thing also that annoyed me about wxWindows is that it does not support delegation very well. All the other toolkits I used either support delegation easily or practically require it. In wxWindows, I spent the better part of a week just trying to get one simple delegation design working and couldn't do it despite lots of help from people on various IRC channels. To me, things are broken if I have to subclass every time I want a widget. The wxWindows XML based resource files are supposed to fix this, but I won't hold my breath.
JFC/Swing The original version of Obversive was done in Swing with lots of other goodies. Java is my primary work language, and I really wanted to use it on the Obversive project, but we just kept running into performance problems. Frankly, Swing is just a P.I.G. and is really hard to code. None of the GUI builders available we tried proved to be that useful, and eventually we dropped it becuase it was just too huge. Ironically, we were able to prototype an almost complete GUI using FOX which runs on Linux, *BSD, Windows, and MacOSX (through the XDarwin or Apple X ports) in about 3 weeks time. This was mainly in part due to the slick design of FOX and simplicity, and partly in part because of the GIG code generator I put together. Oh, and we looked at SWT, but balked at the Linux or Windows only (and the fact that the flagship, Eclipse, ALWAYS crashes on Linux does not make me happy).
So, that's my experiences with these toolkits. I hope that helps answer the question, and I hope the reply wasn't too long. To summarize: Use FOX if you need C++ and good design. Use wxWindows if porting from MFC or if native controls are important. Use Swing if speed is not a problem, but cross platform really is. Use Cocoa if you only care about MacOSX and need to get something out quick. I actually prototype some UIs in Interface Builder on MacOSX, which says a lot.
-
coding from scratch or using tools?
There are a lot of packages out there that will do bayesian analysis for you, if you have a fairly standard type of problem (ie you'll be doing EM with exact inference via junction tree or some approximate algorithm). A good list is here. Many of these packages have source available.
In general, as has been noted by everyone else, Mathematica is great for symbolic analysis, and Matlab is great for numeric analysis. Most people I've talked to in the field use Matlab.
I've been doing some Bayesian analysis recently and I've been writing everything from scratch in R, since it's free so I can use it at home. If you will be doing the whole shebang yourself, and you have the funds, I'd advise going with Matlab, since it tends to be more stable and well supported than R.
If you are interested in R, here is a page about the project to integrate graphical models into R. -
coding from scratch or using tools?
There are a lot of packages out there that will do bayesian analysis for you, if you have a fairly standard type of problem (ie you'll be doing EM with exact inference via junction tree or some approximate algorithm). A good list is here. Many of these packages have source available.
In general, as has been noted by everyone else, Mathematica is great for symbolic analysis, and Matlab is great for numeric analysis. Most people I've talked to in the field use Matlab.
I've been doing some Bayesian analysis recently and I've been writing everything from scratch in R, since it's free so I can use it at home. If you will be doing the whole shebang yourself, and you have the funds, I'd advise going with Matlab, since it tends to be more stable and well supported than R.
If you are interested in R, here is a page about the project to integrate graphical models into R. -
Well....
Matlab is probably more useful to you than Mathematica since you'll be working with simulations and/or data it sounds like. You might also want to check out R, which is designed as a statistical analysis environment and has a large number of packages---including machine learning and whatnot. Its also Open Source and GPL if you care about that sort of thing and runs on pretty much any platform you could potentially care about.
-
Re:You Need Only Consider IIS...
-
from an academic perspective...
In my field(s) of study (bioinformatics, genetics, evolution, organismal and molecular biology), I can say that the open source movement has NOT been adopted by the majority. However, I have recently come across the R project (particularly Bioconductor.org ) which is HIGHLY respected by many MS-oriented biologists from a wide scope of fields. Many computational tasks which are limited to those who can fork out $6000 per year for a license, can be performed on R (for free). Although the learning curve is somewhat steep, I've found the documentation to be exceptional and have been able to execute my needs within a day or two. It has also come to my attention that many graduate students and professors across the country are actively learning R. Granted, many of the users of R are probably "sophisticated"- familar with SAS, C, or UNIX- but it seems to me that open-source is congruent with the peer-review process that so many academics are familiar with and value. I think, if we can introduce it to enough people, and explain to them WHY the open-source method (as opposed to proprietary software) is invaluable, it will become mainstream. R seems to be leading the way in academia at least...
-
Try R
For statistics, at least, R can't be beat.
It's not exactly like MATLAB, Maple, or Mathematica, because it's so statistically oriented. However, it does have matrix computation and pretty good graphical capabilities.
But if you're doing statistics, you almost can't go wrong. -
Re:question : OSS/free project in this space
I haven't seen this being suggested here yet, but R for statistical computing (link) (GNU 'S') is not only open-source, but also used a lot in several scientific fields, such as statistics and machine learning (books have accompanying source code in R). It has loads of packages which allows you to do all kinds of stuff.
-
Re:question : OSS/free project in this space
I haven't seen this being suggested here yet, but R for statistical computing (link) (GNU 'S') is not only open-source, but also used a lot in several scientific fields, such as statistics and machine learning (books have accompanying source code in R). It has loads of packages which allows you to do all kinds of stuff.
-
AlternativesI recently read Wolfram's book, and was most frustrated by the way that it is tied so closely to Mathematica. Mathematica is a very impressive, very important analysis tool, and is REALLY FSCKING EXPENSIVE.
Oh, by the way it was the New Kind of Science book, not the Mathematica book that I read
;)At any rate, I found some cool analysis tools that people should check out as alternatives to Mathematica for analysis and visualization of everything from battlebots to cellular automata. Without further ado:
(just to name a few)PDL is the most directly analagous to Mathematica or Matlab. R is, of course, like S/S+. PGPlot is for visualization. Grass is mostly for geostatistics/GIS. But it's cool enough to throw in the mix.
Anyhow, hope this helps someone out. Go forth and make a battlebot.
-
Re:I think we're stretching things a bit...(I don't have a
./ acct, so posting as AC) - I'm the guy the article is about, and a couple of points are worth noting:the "confirmation of results" & peer review point I was making had to do with crypto and offsite backup software more than with statistical software. When we're talking about crypto or storing someone else's data, it's super important to be transparent. Re: stats, well, one of my slides pointed out that up 'til now, I've always hacked numbers & graphs in Stata, which is proprietary (though most of the really good stuff is published freely, but that's another matter). We should use R, but for cost & "who controls the license" reasons as much as (if not more) than verifiability.
While we're on the verifiability point, human rights data organization techniques tend to be pretty complicated, and it helps to be able to use free software. While distributing the data (via XML) may or may not be useful, it is very important to open the data specifications. I think that means opening SQL scripts, too, and all the database software (in our current mix, the backend is postgres, the front end is Java). That's coming in about a month.
But human rights folks are pretty underfunded, and the "free as in beer" part of open source and free software is a big help, too.
slashdotters might be interested in the International Covenant on Economic, Social, and Cultural Rights, Article 15(b), which states that everyone has the right "To enjoy the benefits of scientific progress and its applications." This is a real, live, human right.
But the real bottom line to human rights and free software has to do with power. Our core rights -- to freedom of speech and free association -- are increasingly exercised in electronic media. Who controls the online world? Can any contractual obligation resulting from a license abridge your human rights? IMHO, these questions make software a human rights concern.
-- PB.
-
Re:This guy's a lunatic....Hehe, yeah, R is going to kill your company, whoever you are!
:-) (I'm not a statistician, but I used R for my thesis, and I loved it, but I also found some bugs, especially one in rpois was quite scary).The basic point of the article still stands, you can't verify things yourself. For any scientist, that should be bad. And for a Human Rights worker, who quite certainly has someone powerful who doesn't want him to do whatever he's doing, it is of paramount importance.
Really, I find it strange that in science, you are supposed to openly document all you're doing except the software implementation of what you did... I don't think that can last.
-
Great Technical Software : R Project
The R Project
A statistical programming language. If some of your friends get burned in Excel by having data over 64K or whatever their limit is, try this. Grandma my not "get it", but Josh College will. Try it. -
What about R + dynamic linking?
You mention something like Matlab as an alternative if it did not have poor performance. There is a very high quality free (beer, speech) program for Windows/Linux/OSX called R that is a matrix oriented language based on the S language, whose commercial analog is S-Plus. One of the really nice features is that you can write iteration or computation heavy routines in C or Fortran and dynamically load them into your R program/script.. The benefits are that you can take advantage of the high-level nature and easy scriptability of R (not to mention publication quality graphing capabilities), while taking advantage of the speed of Fortran/C.
I am always shocked at how little attention R gets considering how good it is. -
Re:About to teach a numerical class...
For data visualization you may also want to take a look at R. It has much better support for plotting than Octave.
-
Re:Goals of the company
How could you mention feng shui, brag about your belief in a western mythology and yet get modded as insightful?
Probably because those things were irrelevant, and the posting was actually insightful.
I'm using a Crusoe-based laptop right now. It weighs about 3 pounds, gets about 7 hours of battery life in real life use. I may have bought it a little early (before they get heavily discounted if the Crusoe is discontinued), but maybe I've got a collector's item here!
The fact is, it's nice to have a laptop small enough to carry with me all the time. It's about the speed of a 500 MHz PIII, and that's fast enough for just about anything I'd want to do on it. I used it to build the Windows distribution of a well-known statistical software package. If I'd had a machine that was 3 times faster, I probably wouldn't have had it with me to do that build. -
Re:How about....I don't know what you might be doing with Excel which requires ``... all the sliders and spinners and fancy stuff
...'', but if your work is valued for its content rather than its form, you might want to reconsider using Excel. Look here and here for some potential problems which you might encounter, and here for some guidance on when it is safe to use this intrinsically flawed product.I have no idea whether OpenOffice suffers similar flaws. Perhaps I'll get ambitious enough to run some of these tests on it myself, someday. In the mean time, I do my number crunching in R. I find that LaTeX provides far more professional-looking reports than any combination of MS products.
-
Re:What we need...There is actually quite a bit of SNA software available; check out the INSNA software page for some pointers. Some of this is free software, e.g., the sna package for the R statistical computing environment. Of course, most of this software is designed for research purposes (rather than visual excitement), but it is there for those who want it....
-Carter
-
Social Networks progress in the open source world
Social Networks has been pretty slow to come to open source world. One of the few pieces of software I know that uses them is the R project, which now has some social network analysis tools.
For visualization, though, I'm currently unaware of any open-source tools. Krackplot has a free web interface, and there is a simple Java program that uses spring-based algorithms for node positioning, but I know of nothing open-source that uses Krackplot's simulated annealing algorithm.
In general, social network analysis can be very useful, but it's results are often subject to misinterpretation. For example, a social isolate in a business might be isolated for a good reason (they are doing research, for example), so you wouldn't want to tell them to integrate themselves more. But in general, it's a great tool to get another look at data you would not normally find out about. -
Re:yet more Excel graphs
Also check out R. It has, IMHO, somewhat more advanced graphing stuff than gnuplot.