Web log analysis home

Ive got a bit of a thing about logs. I like logs. I tend to spend a bit too much time trying to work out just how much information you can get out of a log. So here's my page devoted to some of the resources that you can use to get info from logs.

This page should probably have more to put on it. When I find it I'll post it...

Links

Thats right. Just after I get you here im sending you away.
Actually, theres a fair few resources that you might want to look at if you're after some more info on your logs. Theres several log analysers out there. You might want to know about servers and things. Who knows. Stuff that isnt on my page is though...

Log analysers

Ive got a bit of a linux bent on this, since i use linux. Most of these were found on freshmeat.net.

The Webalizer - This is a pretty well-featured. And its very very fast. Very impressive analyser!

ktmatu's Log tools - Three of these: (from their page)

Relax - WWW logfile referring URL and search engine keyword analysis tool. This free Perl script recognizes many search engines and organizes popular keywords used to get to your site.
Lrdns - Log Reverse Domain Name System converts numeric IP addresses in accesss log files into textual domain names. Written in Perl.
Ffcat - Prints only the new entries in a log file. Fast forwards to the position where the last run ended, and then copies only the new lines of that file to the standard output. Written in Perl.
Ive not tried any of these though, although they look useful.

Webstats - Sample output looked pretty nice but I couldnt get it to work due to missing perl modules.

Analog - Not quite as pretty output as the rest, but it works (who needs apperance anyway) and gives you a fair bit of information. It claims to be one of the most popular.

WebLog - A set of python modules for analysing web logs. (ive not looked at this really though)

Pudica - Ahem. This is my attempt at one when there doesnt seem to be one I like. But I might abandon this due to there being better ones around (most notably webalizer). It works on any system with perl, but its slow and probably very specific to my own logs.

Resources

These are a bit web-design orientated. But still...

Browserwatch - General site about what browsers are out there.

Webmonkey browser chart - More to say what browsers have what features, but still a good reference for the log analyser.

The web robot page - has a useful list of which web robots exist, although I'll bet its not as complete as they'd like since they pop up all over the place.

Bot-Spot - News site for bots. Not really looked into it though.

user agents page - Some of my thoughts on what user agents exist, which ones its worth parsing for, and some catagorisation of what user agent is sent from what product.

Other stuff

Well, typically of my pages, theres not much real content, just some links. If anything has moved or anything is wrong or you have some new information, email me on mrsneeze@telebot.com and I'll try and fix it...

Last modified 21 Feburary 2000.


Back To Main Page