

Web Log Analyzers? 31
sammy.lost-angel.com asks: "What's the best web log analyzer out there today? It's time to upgrade our horribly out of date one and I'm not sure what's good out there at this time. Our site receives about 50,000 hits a day, so things like remembering what's already been analyzed can save a lot of time." What about log analyzers that can work on more than one type of web server? An analyzer that could parse access data for, say, IIS and Apache would be a nice tool!
Webalizer (Score:1, Informative)
I've had good luck running several million hits through Webalizer. It works pretty well.
My 2 bits about 2 packages (Score:2, Insightful)
WebTrends Professional
and WebSphere Site Analyzer.
Bottom line with WebTrends is, its junk. It costs a bundle, is more expensive for the unix version, and you need one base liscence for the first machine who's logs you want to analyze, plus one supplimental liscence for each additional machine. If your site spans four boxes, you need a base+3 additonal. PRICEY! To boot it is not very configurable, and it has a hellova time counting user sessions by custom cookies.
WebSphere SiteAnalyzer on the other hand is a behmoth of a program. It requires far to many resource to run, takes forever to properly configure, and needs a tweaked version of DB2. On the plus side its highly configurable, and comes "Free" with websphere server afaik. You can count anyting on anything if you really want to, and you don't need to get a special version to do your own querries against the data. All the data is in DB2, so you are free to probe the data all on your lonesome. With Webtrends you need a special version to get access to the database, and then the access is only with their propiretary libs. Of course the other big plus for SiteAnalyzer is that it has a client server model, and the both can run on Linux, Solaris, HPuX, windos..etc.
To be honest those are the two biggies for comercial site analysis software, and neither are that good. Check out some of the OS offerings, prehaps one of them will work for you
I have to agree (Score:2)
Analog and Webalizer (Score:3)
http://www.webalizer.com/
seconded (Score:2)
Re:Analog and Webalizer (Score:1, Informative)
Re:Analog and Webalizer (Score:1)
I don't like the reports generated by anything else, including Webalizer, so I stick with Analog.
Re:Analog and Webalizer (Score:2)
Personally, I've used Analog as listed above and found it to be pretty good, once you get the configuration working (which is a once off thing). It will also work on compressed logs, IIRC, so you can even save some disk space (at the expense of more CPU time at analysis).
Re:Analog and Webalizer (Score:2)
Re:Analog and Webalizer (Score:2)
The point is well made, though; hard disks tend to be the bottleneck on today's systems with clock speeds in the gigahertz.
Re:Analog and Webalizer (Score:3, Informative)
I have done some work recently on presenting the documentation in different ways. As well as the main topic-based documentation, there's now a page with only the most basic commands for beginners; a comprehensive index; all the commands on a single page with a BNF-type grammar; and two sample configuration files with all the commands in, one in topic order and one in report order. There's also the beginnings of a collection of third-party HOWTO's (for which I need more volunteers, HINT HINT!).
I do take a lot of time and trouble over documentation, I suspect much more than most open source projects. My rule is that no change can be committed until it's fully documented. So you will never find the documentation lagging behind the reality, or options missed out of the documentation. I also spend a lot of time rephrasing the existing documentation.
What's it matter what server generates the logs? (Score:2, Informative)
Re:What's it matter what server generates the logs (Score:3, Insightful)
In my opinion, a good logfile analysis tool should be able to recognise and analyse all commonly-used formats, and provide a means to specify custom formats. In other words, it should work with what the server has already produced, rather than force the server administrator to reconfigure the server and ignore old logfiles. My program analog [analog.cx] does all this, but most programs don't.
webalizer & awstats (Score:2, Informative)
and webalizer (www.mrunix.net/webalizer) for the Apache logs.
awstats is Perl, too...
I've used them both and since I have only Apache to log, I've stuck with webalizer. Plus, you can easily customize it for each user/domain with its own webalizer.conf file.
AWstats rocks! (Score:3, Informative)
-OctaneZ
Re:AWstats rocks! (Score:2, Insightful)
If you're going to quote someone, at least give them proper credit:
"I know not with what weapons World War III will be fought, but World War IV will be fought with sticks and stones." - Albert Einstein
Re:Einstein (Score:1)
-OZ
Analog (Score:5, Informative)
First, as others have commented, the commercial programs suck [slashdot.org], especially Webtrends [slashdot.org].
Analog is over six years old, but it's still actively developed, and I think it's still the leading free log analyser. The main contender is the Webalizer. To some extent it depends what you want (why not try out both?). The Webalizer's biggest advantage is that it produces prettier pictures. Some of analog's advantages are that it is more configurable; that it runs on any OS (the Webalizer is Unix only); and that it can analyse logfiles from any web server.
Besides, analog's author reads Slashdot.
Re:Analog (Score:2)
Re:Analog (Score:4, Interesting)
My only complaint is Stephen's dogmatic insistence [analog.cx] on not performing any form of speculative analysis. For example, he refuses to even attempt visitor counting, path tracking, etc. The sort of stuff that bosses like to see, whether or not it's strictly accurate.
Stephen could put WebTrends out of business with a couple hours of coding, but he has his principles.
Re:Analog (Score:1)
I love Analog, it's pretty much setup and forget. I tried (at a customer's insistence) NetTracker and found it to be a nightmare. If it works, it's OK, however if it messes up, which it had a tendency to do, then it was terrible. It would overwrite its "database" periodically which would mean it would have to start from scratch again. This particular site gets approx 500K "hits" a day, and NetTracker would take literally 28 hours to do one day. Analog can do it in minutes. Gotta like that.
speculative analysis (Score:2)
I have customers who want to see something that guesses at number of unique visitors, guesses at paths through the site, etc. They don't want to study, and don't care about the details of why it's unknowable information... they're used to seeing it from other packages, and complain that it's missing.
Any kind of wild guess, with a bunch of caveats on the output, would be much more useful than the explanation of why this analysis is not done.
Re:Analog (Score:1)
Since I want web reports to show to VHost clients, I added ReportMagic, which builds out a great looking, graph-laden, generally beautiful HTML stats. Indeed, I think the ReportMagic graphs are considerably more readable than WebTrends'. As a bonus, RM is written is Perl and it's very hackable.
I have beautiful stats run on over 100 web sites, every single night, and archived every month. I am very happy with the results, and I keep finding cool new features in the docs to add in all the time.
However...WebTends is the de facto web report standard. Having used WebTrends, I know what a painful piece of shit it is to use. I often use it as an example of what is wrong with winduhs programming. Every feature you could ever want, none of it works right, moronic interface, breaks constantly, worthless tech support, frequent payware upgrades to keep it running. I just can't adequately express how much I hate it.
However...WebTends is the de facto web report standard. And after about 400 times explaining how fallacious reports like, "paths through site" and "visitors" and such are, and pointing clients here: http://www.analog.cx/docs/webworks.html, I had to give up. The boss is suffering through doing monthly WebTrends reports for the dozen or so customers who whine.
I have actually had this conversation:
<Digger>
<Luser> Right. So can we get you to do a WebTrends report too? Cause we base our business plan on them.
Seriously. I might as well be telling people to stop using winduhs as their desktop OS. I might as well be tilting at windmills. It's neigh hopeless. I would rather have a userbase that knows not to double-click on any old attachment they get in the mail, but since I am stuck in the real world, I have added scanning software to the mail server.
Analog stats are certainly more than sufficient for me. Adding ReportMagic should make the output sufficient for anyone else, but it doesn't. Stephen has stated quite clearly that he has no intention of adding reports that offend his statistical sensibilities. I certainly understand the position he has taken, and since he is doing all the hard work, it is his right to take it. It is just a shame that most of the world is stuck having to run WebTrends, or some similar crapola commercial product as a result.
Maybe Stephen would like to respond, or point to one of the relevant mailing list posts. What about cookieing solutions? I have custom-built a few of these, and cookie-enabled browsers have been about 99% or so. Is 1% still significant?
Wusage (Score:2, Interesting)
It doesn't generate very pretty reports by default, but it is highly customizable and provides a truck load of data.
Note: I am not affiliated with the makers of Wusage in any way.
Sawmill is great. (Score:2, Interesting)
We are replacing Webtrends with Summary pro (Score:1)
It has a rich set of features, analyzes and summarizes logs across multiple servers, has a multitude of pre-defined reports and costs thousands less than Webtrends or IBM.
(Total SW price between $59 - $695)
Robert Merrill
(With a large, American passenger-train corporation)
Pointy clicky lower end analyzer (Score:2)
In the meantime, use analog or webalizer to get the full skinny on your traffic.
log analysis packages (Score:1)