Web Log Analyzers?
sammy.lost-angel.com asks: "What's the best web log analyzer out there today? It's time to upgrade our horribly out of date one and I'm not sure what's good out there at this time. Our site receives about 50,000 hits a day, so things like remembering what's already been analyzed can save a lot of time." What about log analyzers that can work on more than one type of web server? An analyzer that could parse access data for, say, IIS and Apache would be a nice tool!
http://www.mrunix.net/webalizer/
I've had good luck running several million hits through Webalizer. It works pretty well.
Well the two web log analyzers I worked with at my old job were,
:)
WebTrends Professional
and WebSphere Site Analyzer.
Bottom line with WebTrends is, its junk. It costs a bundle, is more expensive for the unix version, and you need one base liscence for the first machine who's logs you want to analyze, plus one supplimental liscence for each additional machine. If your site spans four boxes, you need a base+3 additonal. PRICEY! To boot it is not very configurable, and it has a hellova time counting user sessions by custom cookies.
WebSphere SiteAnalyzer on the other hand is a behmoth of a program. It requires far to many resource to run, takes forever to properly configure, and needs a tweaked version of DB2. On the plus side its highly configurable, and comes "Free" with websphere server afaik. You can count anyting on anything if you really want to, and you don't need to get a special version to do your own querries against the data. All the data is in DB2, so you are free to probe the data all on your lonesome. With Webtrends you need a special version to get access to the database, and then the access is only with their propiretary libs. Of course the other big plus for SiteAnalyzer is that it has a client server model, and the both can run on Linux, Solaris, HPuX, windos..etc.
To be honest those are the two biggies for comercial site analysis software, and neither are that good. Check out some of the OS offerings, prehaps one of them will work for you
You are only young once, but you can stay immature indefinitely.
http://www.analog.cx/
http://www.webalizer.com/
I would definately suggest checking out Urchin (http://www.urchin.com). Besides the $$ for different types of setup, its the greatest Apache traffic analysis program out there. I used to use WebAlizer. When i switched to this program it was a god send. Check it out.. i think they have a free trial. Tell them surfshot sent you.
I'm not sure about the other package, but Webtrends leaves much to be desired.
The man who trades freedom for security does not deserve nor will he ever receive either. - Benjamin Franklin
Last I checked, both IIS and Apache generate (or can be set to generate) W3C standard format logfiles. Part of the reason for having/using that standard is so that you don't get locked into a proprietary tool.
yah. And analog is *fast*.
awstats (awstats.sourceforge.net) for the IIS logs, but it's kind of funky to set up...
and webalizer (www.mrunix.net/webalizer) for the Apache logs.
awstats is Perl, too...
I've used them both and since I have only Apache to log, I've stuck with webalizer. Plus, you can easily customize it for each user/domain with its own webalizer.conf file.
I have been running AWStats since July, and I absolutely love it. It does not provide the fine-grain detail that many people need, and which can be provided by Analog. But it does provide exactly what 90% percent of us need, in an easy to view package. It creates an easy to understand page about many aspects of your site, including, users, page hits, countries, languages, OS, browser, spiders/robots, access times; it's great! It is also a GPLed perl script! The developement team is over at Source Forge and is actively releasing new code all the time. It also has the added benefit of allowing cgi updating through a web page; simply putting the script in your /www/cgi-bin/ directory and adding appropriate permissions allows you to get up to the second information about your sight without having to dig up a terminal! Definately check this package out!
-OctaneZ
There is/was Big Brother Web Stats (a cgi written in C) but it appears to be dead to me (search BBStats on freshmeat). On my site, I like to use BBClone. BBclone doesn't use a database and is written in PHP. It works entirely on any webserver that has PHP support. (Search BBClone on Freshmeat). You can see BBClone in action on my site here.
First, as others have commented, the commercial programs suck, especially Webtrends.
Analog is over six years old, but it's still actively developed, and I think it's still the leading free log analyser. The main contender is the Webalizer. To some extent it depends what you want (why not try out both?). The Webalizer's biggest advantage is that it produces prettier pictures. Some of analog's advantages are that it is more configurable; that it runs on any OS (the Webalizer is Unix only); and that it can analyse logfiles from any web server.
Besides, analog's author reads Slashdot.
11.0010010000111111011010101000100010000101101000
In my opinion, a good logfile analysis tool should be able to recognise and analyse all commonly-used formats, and provide a means to specify custom formats. In other words, it should work with what the server has already produced, rather than force the server administrator to reconfigure the server and ignore old logfiles. My program analog does all this, but most programs don't.
11.0010010000111111011010101000100010000101101000
I agree.
I have customers who want to see something that guesses at number of unique visitors, guesses at paths through the site, etc. They don't want to study, and don't care about the details of why it's unknowable information... they're used to seeing it from other packages, and complain that it's missing.
Any kind of wild guess, with a bunch of caveats on the output, would be much more useful than the explanation of why this analysis is not done.
Our company uses Wusage and it's quite a nice package IMHO.
It doesn't generate very pretty reports by default, but it is highly customizable and provides a truck load of data.
Note: I am not affiliated with the makers of Wusage in any way.
Sawmill, by Flowerfire is pretty cool. It understand virtually every log you imagine. It'll run as a cgi, via cli or as a stand alone web server. There is a version for many different platforms. With the web interface, the Marketing group can do their own drill down and queries, so I can dosome real work. Performance is good. I think of it as the program that WebTrends wished it was. Get the eval version and take if for a spin.
It has a rich set of features, analyzes and summarizes logs across multiple servers, has a multitude of pre-defined reports and costs thousands less than Webtrends or IBM.
(Total SW price between $59 - $695)
Robert Merrill
(With a large, American passenger-train corporation)
Yes it is Einstein, however I hit the 120 char limit and didn't notice, as it gives you no indictation of that fact, I am sorry for offending you; as it obviously did.
-OZ
In the meantime, use analog or webalizer to get the full skinny on your traffic.
Bleh!
Wusage is the best stats package out there. It does everything the expensive packages do and it doesn't cost a fortune. It rotates my logs, archives them and gives me a bundle of valuable stats for marketing, sales and R&D.