[analog-help] RE: corrupt files

Aimee Mandeville aimee at edc.uri.edu
Thu Aug 2 05:58:40 PDT 2007


Here is a sample of the error file that is being generated.

Aimee


-----Original Message-----
From: analog-help-bounces at lists.meer.net
[mailto:analog-help-bounces at lists.meer.net] On Behalf Of
analog-help-request at lists.meer.net
Sent: Wednesday, August 01, 2007 3:00 PM
To: analog-help at lists.meer.net
Subject: analog-help Digest, Vol 36, Issue 1


Today's Topics:

   1. Re: corrupt files (Aengus)


----------------------------------------------------------------------

Message: 1
Date: Tue, 31 Jul 2007 18:14:33 -0400
From: "Aengus" <analog07 at eircom.net>
Subject: Re: [analog-help] corrupt files
To: "Support for analog web log analyzer" <analog-help at lists.meer.net>
Message-ID: <00f501c7d3c0$2ccaac90$0301a8c0 at xppro>
Content-Type: text/plain; format=flowed; charset=iso-8859-1;
	reply-type=original

On Tuesday, July 31, 2007 7:33 AM [EDT],
Aimee Mandeville <aimee at edc.uri.edu> wrote:

> Thanks for the clarification on that.  Do you have any thoughts as to
> why Analog is having difficulty parsing these lines?  I've attached a
> sample of the CORRUPT lines.
>
> The log file I am analyzing has 69,989 lines and 65,634 of them are
> corrupt.
>
> I am using the following format:
>
> LOGFORMAT (#%j)
>
> LOGFORMAT (%S\t%u\t%B\t%Y-%m-%d\t%h:%n:%j\t%j\t%j\t%j\t%j\t%j\t%j\t%j\t%b\t%j\t%j\t%r\t%j\t%c\twww.usawaterquality.org\t%j)

You haven't provided any examples of the lines that Analog considers 
corrupt, but at a guess, they don't have www.usawaterquality.org in
them.

If you enable debugging (DEBUG ON), Analog will generate output that
indicates where each line stops matching the LOGFORMAT Analog expected
to find.

Aengus




------------------------------

+------------------------------------------------------------------------
|  TO UNSUBSCRIBE from this list:
|    http://lists.meer.net/mailman/listinfo/analog-help
|
|  Usenet version: news://news.gmane.org/gmane.comp.web.analog.general
|  List archives:  http://www.analog.cx/docs/mailing.html#listarchives
+------------------------------------------------------------------------


End of analog-help Digest, Vol 36, Issue 1
******************************************
-------------- next part --------------
F: Opening SearchQuery.txt as configuration file
F: Closing configuration file SearchQuery.txt
F: Closing configuration file analog.cfg
F: Opening lang\uk.lng as language file
F: Closing language file lang\uk.lng
F: Opening lang\ukdom.tab as domains file
F: Closing domains file lang\ukdom.tab
F: Opening lang\ukdesc.txt as report descriptions file
F: Closing report descriptions file lang\ukdesc.txt
F: Opening w:\ISALOG_20070702_WEB_000.w3c as logfile
C: 74.6.22.228	anonymous	Mozilla/5.0 (compatible; Yahoo! Slurp; http://help.yahoo.com/help/us/ysearch/slurp)	2007-07-02	00:00:24	TORCHEMADA	-	www.edc.uri.edu	131.128.90.11	80	78	257	182	http	GET	http://131.128.90.11/riatlas/town/Warwick.html	Inet	304	www.edc.uri.edu	-	External	-	0x0	Allowed
C:                                                                                                                                                                                                                                                             *
C: 74.6.73.226	anonymous	Mozilla/5.0 (compatible; Yahoo! Slurp; http://help.yahoo.com/help/us/ysearch/slurp)	2007-07-02	00:00:24	TORCHEMADA	-	www.edc.uri.edu	131.128.90.11	80	15	267	183	http	GET	http://131.128.90.11/aerialse/aerial1981/images/1409.sid	Inet	304	www.edc.uri.edu	-	External	-	0x0	Allowed
C:                                                                                                                                                                                                                                                                       *
C: 74.6.67.202	anonymous	Mozilla/5.0 (compatible; Yahoo! Slurp; http://help.yahoo.com/help/us/ysearch/slurp)	2007-07-02	00:00:28	TORCHEMADA	-	www.edc.uri.edu	131.128.90.11	80	31	271	182	http	GET	http://131.128.90.11/aerialse/aerial1992/92Smrsid/5-1106.sid	Inet	304	www.edc.uri.edu	-	External	-	0x100	Allowed
C:                                                                                                                                                                                                                                                                           *
C: 74.6.26.147	anonymous	Mozilla/5.0 (compatible; Yahoo! Slurp; http://help.yahoo.com/help/us/ysearch/slurp)	2007-07-02	00:00:40	TORCHEMADA	-	www.edc.uri.edu	131.128.90.11	80	1	211	1465	http	GET	http://131.128.90.11/rigis-spf/quad/Clayville.html	Inet	404	www.edc.uri.edu	-	External	-	0x500	Allowed
C:                                                                                                                                                                                                                                                                 *
C: 66.249.65.203	anonymous	Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)	2007-07-02	00:01:42	TORCHEMADA	-	odonata.edc.uri.edu	131.128.90.26	80	625	273	42310	http	GET	http://131.128.90.26/cgi-bin/gforum/gforum.cgi?guest=267752&head=home	Inet	200	odonata.edc.uri.edu web publishing	-	External	-	0x400	Allowed
C:                                                                                                                                                                                                                                                                              *
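
For anyone who wants to check Aengus's guess against the raw log rather than by eye, here is a rough Python sketch. It is not part of Analog. It assumes the raw log lines are tab-separated, as in the sample above without the "C: " prefix, and that the literal www.usawaterquality.org in the LOGFORMAT quoted earlier lines up with tab-separated field 18 (counting from 0). The names host_field, tally_hosts, and SAMPLE are made up for this illustration.

```python
from collections import Counter

# Assumption: the literal host name occupies tab-separated position 18
# (counting from 0) in the LOGFORMAT quoted earlier in this thread.
EXPECTED_HOST = "www.usawaterquality.org"
HOST_FIELD_INDEX = 18

# First sample line from the debug output above, without the "C: " prefix.
SAMPLE = "\t".join([
    "74.6.22.228", "anonymous",
    "Mozilla/5.0 (compatible; Yahoo! Slurp; "
    "http://help.yahoo.com/help/us/ysearch/slurp)",
    "2007-07-02", "00:00:24", "TORCHEMADA", "-", "www.edc.uri.edu",
    "131.128.90.11", "80", "78", "257", "182", "http", "GET",
    "http://131.128.90.11/riatlas/town/Warwick.html", "Inet", "304",
    "www.edc.uri.edu", "-", "External", "-", "0x0", "Allowed",
])


def host_field(line, index=HOST_FIELD_INDEX):
    """Return the tab-separated field at `index`, or None if the line is too short."""
    fields = line.rstrip("\r\n").split("\t")
    return fields[index] if len(fields) > index else None


def tally_hosts(lines, index=HOST_FIELD_INDEX):
    """Count the values seen in the host-name field, skipping '#' headers and blank lines."""
    counts = Counter()
    for line in lines:
        if line.startswith("#") or not line.strip():
            continue
        counts[host_field(line, index)] += 1
    return counts


# The sample line carries a different host name from the one in the LOGFORMAT.
print(host_field(SAMPLE), host_field(SAMPLE) == EXPECTED_HOST)
```

Passing an open log file to tally_hosts gives a count per host name, which can then be compared with the number of lines Analog reports as corrupt. In the sample lines attached here, that field holds www.edc.uri.edu or "odonata.edc.uri.edu web publishing", neither of which equals the literal in the LOGFORMAT.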

