argus 3.0 bloating on OpenBSD 4.3

Carter Bullard carter at qosient.com
Wed Jan 28 21:14:18 EST 2009


>
> Hey Michael,
> Yes, I am working on a work around.
> Flows with time in the past should not be a problem, but time waaay  
> in the future will cause exactly the behavior you describe.  And it  
> is very difficult to decide as a piece of software, what is correct  
> time.
>
> If a record is put in the queue with a system time stamp way out,  
> when that record comes to end of the queue, which is a FIFO queue,  
> it will simple not consume any records until that time stamp  
> expires.  We still process packets, we just don't time the status  
> records out.
>
> When you kill argus gracefully, it will flush all the records that  
> are in the cache, so what you are seeing is by design.
>
> Carter
>
> Sent from my Verizon Wireless BlackBerry
>
> -----Original Message-----
> From: Michael Sanderson <sanders at cs.ubc.ca>
>
> Date: Mon, 26 Jan 2009 23:33:09
> To: <argus-info at lists.andrew.cmu.edu>
> Subject: [ARGUS] argus 3.0 bloating on OpenBSD 4.3
>
>
> I'm seeing similar things to what Martijn van Oosterhout had with his
> Linux machine, though on OpenBSD 4.3.
>
> I'm running argus 3.0 on a 32-bit OpenBSD 4.3 machine, with an ra
> collecting data on a 32-bit OpenSuSE box.  Neither is running a
> customized kernel.  Argus does not appear to be built with threads on
> the OpenBSD box.  Upon occassion, the argus will output error lines  
> like
>
> ArgusInterface timestamps wayyy out of order: now -732332471 then
> 1233017300
> ArgusInterface timestamps wayyy out of order: now -1504018871 then
> 1233017510
>
> and shortly after that it begins to bloat, while the data collector
> begins to receive less data.  The collector's rotated files are  
> smaller
> and traffic that I would expect to see in the files may not be  
> present,
> though there is some traffic.  All expected protocols are present, but
> definitely not all of the traffic.  Intriguingly, if the argus  
> daemon is
> 'kill'ed before the box runs out of memory, it will happily send all  
> of
> the flows to the collector before exiting.  Typically there is a
> seriously wacky date on one of the flows that is sent.
>
> As an example, the two entries above took place 26 Jan 09
> 16:48:20.937511 and 26 Jan 09 16:51:50.396118 (incidently, the first
> time I have see two close together like that).  I rotate every 15
> minutes and my file sizes look like:
>
> -rw-r--r-- 1 root root 11760560 2009-01-26 16:00
> data.2009.01.26-16:00:01
> -rw-r--r-- 1 root root 10244648 2009-01-26 16:15
> data.2009.01.26-16:15:01
> -rw-r--r-- 1 root root 10110680 2009-01-26 16:30
> data.2009.01.26-16:30:01
> -rw-r--r-- 1 root root 11347580 2009-01-26 16:45
> data.2009.01.26-16:45:02
> -rw-r--r-- 1 root root  4338192 2009-01-26 17:00
> data.2009.01.26-17:00:01
> -rw-r--r-- 1 root root  2136652 2009-01-26 17:15
> data.2009.01.26-17:15:01
> -rw-r--r-- 1 root root  2043564 2009-01-26 17:30
> data.2009.01.26-17:30:01
> -rw-r--r-- 1 root root  2082432 2009-01-26 17:45
> data.2009.01.26-17:45:01
>
> The kill took place between 21:45 and 22:00:
>
> -rw-r--r-- 1 root root  2191072 2009-01-26 21:45
> data.2009.01.26-21:45:01
> -rw-r--r-- 1 root root 94159604 2009-01-26 22:00
> data.2009.01.26-22:00:01
>
> Here is a segment of the output from an ra on data.2009.01.26-22:00:01
> (sorry for line wrapping).  Note the 46/10/17 date on the fourth  
> output
> line.
>
> 09/01/26 21:01:27  e         udp      77.85.162.112.27371    <->
> 142.103.17.161.19164         2        463   CON
>
> 09/01/26 21:47:06  e         tcp       86.96.229.89.14410     ->
> 142.103.6.5.80            2        114   FIN
>
> 09/01/26 21:46:58  e         udp      137.82.37.253.32770    <->
> 198.162.63.42.161           6       2133   CON
>
> 09/01/26 20:17:50  e         tcp      142.103.9.193.37207     ->
> 75.126.170.58.443          16       3299   RST
>
> 46/10/17 14:18:49  e         tcp      142.103.17.80.59990     ->
> 212.187.172.5.23456        11        820   FIN
>
> 09/01/26 16:48:20  e         udp        142.103.6.6.51948    <->
> 204.13.250.26.53            2        266   CON
>
> 09/01/26 16:48:20  e         tcp     142.103.15.108.3004      ->
> 128.121.146.100.443          14       3660   RST
>
> 09/01/26 16:48:20  e         udp      200.63.155.21.43823    <->
> 198.162.35.1.53            2        214   CON
>
> 09/01/26 16:48:20  e         tcp       60.173.11.39.14400     ->
> 142.103.23.48.1234          1         58   ACC
>
> Another note of interest in the output is that the timestamps jump all
> over the place prior to the wacky date, with lines from as early as
> 09/01/26 16:55:43 (shortly after the error message) through to current
> time.  After the wacky date is output, the dates appear to
> chronologically increase.  It MAY be that the wacky date corresponds  
> to
> when the kill was issued.  I'll have to try watching a data stream the
> next time I do a kill.
>
> Is anyone aware of an OpenBSD 4.3 time error along the lines of the
> Linux kernel bug that affected Martijn's configuration?  Carter, there
> had been a bit of discussion on the list about mitigating these
> effects.  Is this something that you have looked at?
>
>    Michael Sanderson
>
>
>




More information about the argus mailing list