book graphic unix and linux troubleshooting guide

My Troubleshooting E-Book will show you how to solve tough problems on Linux and Unix systems!







Running out of space for web logs

Sat Jun 5 17:36:30 GMT 2004 Running out of space for web logs

I guess I'm not surprised. This web site started out with 2GB of file space, but you add a few thousand pages, and suddenly three fourths of it is gone. That's not so bad really - a quarter is still left to grow with, but the problem is the darn web access logs. Amazingly enough, the access_log file here grows around 6 MB daily. Toward the end of the month, that starts to be a significant amount of space.


Hate these ads?

I could just zero the darn thing out regularly, but for various reasons that include calculating payments for the contributors, it's easiest to have a whole month's log available. I thought about various schemes to consolidate the data, but darn it, who knows what I might want to extract at some time or another? So I gave that idea up. I also thought about buying more disk space, but I'm just a cheapskate at heart. I'll have to bite that bullet someday, but I want to put it off (shared hosting can also be a bit of a pain to add disk space to - another reason to delay that).

Well, web access logs are easily compressed. A 30 MB access_log easily gzips down to a tenth of that. So a "trimlog" scipt running early Sunday mornng can do that:



DATE=`/bin/date +%m%d%y`
cd ~/www/logs
cat access_log | /usr/bin/gzip > $DATE.gz
> ~/www/logs/access_log 
> ~/www/logs/error_log 


That will lose a few lines, but I don't need 100% accuracy here. The zipping takes a few seconds; not too much comes in early Sunday morning anyway, so it might not even lose anything.

That only left me with the problem of how to feed both the compressed files and the current log to the programs that need them. That's easy:



cd /tmp
zcat ~/www/logs/*.gz | cat - ~/www/logs/access_log | pct.pl > topten.pl
cp topten.pl  /usr/home/pcunix/www/htdocs/topten.html


The zcat prints the logs to stdout, that "-" after cat tells it to read stdin before it reads access_log, and then the whole thing is fed to the program that does all the stuff I need. Problem solved, or at least put off for a while.









Comments
CommentsBlog904 :
"The zcat prints the logs to stdout...Problem solved, or at least put off for a while."

However, the time will come when even gzipping your logs will not be the answer. I suppose more hard drive would help -- disk storage is relatively inexpensive these days -- but the ultimate solution would be to off-load old logs to tape(s) and place the tapes in safe storage. Either that or DVD-RAM.

--BigDumbDinosaur

But it's shared hosting - I don't control the hardware (and don't want to).

--TonyLawrence

Click here to add your comments


LOD Communications, Inc.

Enter your email address for automatic notification of new posts here
(be sure to whitelist 'feedburner.com' if you use spam filtering)

Or use any RSS reader

Delivered by FeedBurner


ad

Views for this page
Today This Week This Month This Year  Overall
2355 1,952

Have you tried Searching this site?

Unix/Linux/Mac OS X support by phone, email or on-site: Support Rates

This is a Unix/Linux resource website. It contains technical articles about Unix, Linux and general computing related subjects, opinion, news, help files, how-to's, tutorials and more. We appreciate comments and article submissions.

Publishing your articles here

pavatar.jpg
More:
       - Blog
       - Code




Unix/Linux Consultants


http://www.vss3.com SCO/Caldera OpenServer, Unixware & Linux. Tarantella & Non-stop Clustering


http://www.cleverminds.net Need expert advice? Want a second opinion? CleverMinds is a one-stop-shop for a wide range of technology solutions. We support Unix, Linux, SCO as well as CMS, ecom, blogs, podcasts, search engines consulting and more. Contact us at web2.0@cleverminds.net 0r (617) 894-1282


http://echo3.net/ Unix/Linux Custom Applications, Web Hosting, C/C++ Programming Courses



Twitter
o Mmmm... fried dead stuff! 12:28 PM Jan 29th 2008








Change Congress

Publish your articles, comments, book reviews or opinions here!