[analog-help] RE: corrupt files
Aimee Mandeville
aimee at edc.uri.edu
Thu Aug 2 05:58:40 PDT 2007
Here is a sample of the error file that is being generated.
Aimee
-----Original Message-----
From: analog-help-bounces at lists.meer.net
[mailto:analog-help-bounces at lists.meer.net] On Behalf Of
analog-help-request at lists.meer.net
Sent: Wednesday, August 01, 2007 3:00 PM
To: analog-help at lists.meer.net
Subject: analog-help Digest, Vol 36, Issue 1
Today's Topics:
1. Re: corrupt files (Aengus)
----------------------------------------------------------------------
Message: 1
Date: Tue, 31 Jul 2007 18:14:33 -0400
From: "Aengus" <analog07 at eircom.net>
Subject: Re: [analog-help] corrupt files
To: "Support for analog web log analyzer" <analog-help at lists.meer.net>
Message-ID: <00f501c7d3c0$2ccaac90$0301a8c0 at xppro>
Content-Type: text/plain; format=flowed; charset=iso-8859-1;
reply-type=original
On Tuesday, July 31, 2007 7:33 AM [EDT],
Aimee Mandeville <aimee at edc.uri.edu> wrote:
> Thanks for the clarification on that. Do you have any thoughts as to
> why Analog is having difficulty parsing these lines? I've attached a
> sample of the CORRUPT lines.
>
> The log file I am analyzing has 69,989 lines and 65,634 of them are
> corrupt.
>
> I am using the following format:
>
> LOGFORMAT (#%j)
>
> LOGFORMAT (%S\t%u\t%B\t%Y-%m-%d\t%h:%n:%j\t%j\t%j\t%j\t%j\t%j\t%j\t%j\t%b\t%j\t%j\t%r\t%j\t%c\twww.usawaterquality.org\t%j)
You haven't provided any examples of the lines that Analog considers
corrupt, but at a guess, they don't have www.usawaterquality.org in
them.
If you enable debugging (DEBUG ON), Analog will generate output that
indicates where each line stops matching the LOGFORMAT Analog expected
to find.
Aengus
-------------- next part --------------
F: Opening SearchQuery.txt as configuration file
F: Closing configuration file SearchQuery.txt
F: Closing configuration file analog.cfg
F: Opening lang\uk.lng as language file
F: Closing language file lang\uk.lng
F: Opening lang\ukdom.tab as domains file
F: Closing domains file lang\ukdom.tab
F: Opening lang\ukdesc.txt as report descriptions file
F: Closing report descriptions file lang\ukdesc.txt
F: Opening w:\ISALOG_20070702_WEB_000.w3c as logfile
C: 74.6.22.228 anonymous Mozilla/5.0 (compatible; Yahoo! Slurp; http://help.yahoo.com/help/us/ysearch/slurp) 2007-07-02 00:00:24 TORCHEMADA - www.edc.uri.edu 131.128.90.11 80 78 257 182 http GET http://131.128.90.11/riatlas/town/Warwick.html Inet 304 www.edc.uri.edu - External - 0x0 Allowed
C: *
C: 74.6.73.226 anonymous Mozilla/5.0 (compatible; Yahoo! Slurp; http://help.yahoo.com/help/us/ysearch/slurp) 2007-07-02 00:00:24 TORCHEMADA - www.edc.uri.edu 131.128.90.11 80 15 267 183 http GET http://131.128.90.11/aerialse/aerial1981/images/1409.sid Inet 304 www.edc.uri.edu - External - 0x0 Allowed
C: *
C: 74.6.67.202 anonymous Mozilla/5.0 (compatible; Yahoo! Slurp; http://help.yahoo.com/help/us/ysearch/slurp) 2007-07-02 00:00:28 TORCHEMADA - www.edc.uri.edu 131.128.90.11 80 31 271 182 http GET http://131.128.90.11/aerialse/aerial1992/92Smrsid/5-1106.sid Inet 304 www.edc.uri.edu - External - 0x100 Allowed
C: *
C: 74.6.26.147 anonymous Mozilla/5.0 (compatible; Yahoo! Slurp; http://help.yahoo.com/help/us/ysearch/slurp) 2007-07-02 00:00:40 TORCHEMADA - www.edc.uri.edu 131.128.90.11 80 1 211 1465 http GET http://131.128.90.11/rigis-spf/quad/Clayville.html Inet 404 www.edc.uri.edu - External - 0x500 Allowed
C: *
C: 66.249.65.203 anonymous Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) 2007-07-02 00:01:42 TORCHEMADA - odonata.edc.uri.edu 131.128.90.26 80 625 273 42310 http GET http://131.128.90.26/cgi-bin/gforum/gforum.cgi?guest=267752&head=home Inet 200 odonata.edc.uri.edu web publishing - External - 0x400 Allowed
C: *
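
One way to check Aengus's guess against the raw log is sketched below in
Python. It tallies the value found in the tab-separated field where the
LOGFORMAT quoted above expects the literal www.usawaterquality.org. Counting
the \t-separated items in that LOGFORMAT puts the literal in the 19th field
(index 18, zero-based). In the sample lines above that position appears to
hold www.edc.uri.edu or odonata.edc.uri.edu instead. The assumptions are
that the raw logfile is tab-separated (the archive shows spaces) and that
header lines start with '#'. The function name and the "<short line>" label
are hypothetical, not part of Analog.

```python
from collections import Counter

# Zero-based index of the field where the LOGFORMAT expects the literal
# host name (assumption: derived by counting \t-separated items in it).
HOST_FIELD = 18


def tally_host_field(lines, field=HOST_FIELD):
    """Count the values seen in one tab-separated field of each log line.

    Blank lines and lines starting with '#' (W3C header lines) are skipped.
    Lines with too few fields are counted under the label '<short line>'.
    """
    counts = Counter()
    for line in lines:
        line = line.rstrip("\r\n")
        if not line or line.startswith("#"):
            continue
        parts = line.split("\t")
        value = parts[field] if len(parts) > field else "<short line>"
        counts[value] += 1
    return counts
```

To use it, pass an open file object to the function, for example
tally_host_field(open(path, encoding="latin-1")), and print the result of
.most_common(). If most lines carry a host other than
www.usawaterquality.org in that field, that would be consistent with Analog
rejecting them as corrupt under this LOGFORMAT.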