Re: clarification about file visualization

From: Lars Aronsson <lars_at_nyob>
Date: Fri, 31 Aug 2012 03:10:46 +0200
To: CODE4LIB_at_LISTSERV.ND.EDU
On 2012-08-31 00:02, Shearer, Timothy J wrote:
> What we have is a webserver with 64,665 files (html, css, js, jpg, you get
> the idea) and lots of directories with subdirectories.
>
> The goal is to be able to conveniently take all that in in a way that
> makes it pretty simple to see/navigate (say for a public services staff
> member tasked with doing a survey of the old content) so that we can get a
> handle on what's there (prior to say, moving from a php+html template
> approach to a CMS).  It's about exploring the website from under the hood.

I'd recommend starting with the web server log files.
Maybe a handful of those files are 95% of your traffic,
and the rest is odd or peculiar "long tail" information.


-- 
   Lars Aronsson (lars_at_aronsson.se)
   Aronsson Datateknik - http://aronsson.se
Received on Thu Aug 30 2012 - 21:12:08 EDT