Flow Counting

Carter Bullard carter at qosient.com
Wed Apr 9 13:14:31 EDT 2008


Hmmmm, the "-M replace" must not be working.  I'll double check.
carter

On Apr 9, 2008, at 1:10 PM, Nick Diel wrote:

> Carter,
>
> There seems to be a problem with rastrip and the replace mode.  The  
> first indication is when replace is specified then you get standard  
> output (flow listings), also after it runs, a simple grep on the  
> trans column shows most have a count greater than one.
>
> rastrip -M replace -agr -r file
> ra -r file -s trans | grep -v 1 | wc -l
> >>0
>
> Writing to a separate file or stdout, everything works as expected,  
> no counts greater than 1.
>
> rastrip -M -agr -r file -w - | ra -r - -s trans | grep -v 1 | wc -l
> 0
>
> Nick
>
> On Tue, Apr 8, 2008 at 1:33 PM, Carter Bullard <carter at qosient.com>  
> wrote:
> Hey Nick,
> racluster() does this quite well.  When racluster() merges two  
> records, for whatever reason,
> it generates an aggregation statistic that keeps track of the number  
> of records that are merged.
> Use this to generate all the counts you would like.  The only trick  
> is that if you use racluster()
> to merge your data first, then this aggregation statistic already  
> has values, and it needs to be
> cleared.   You will want to remove that aggregation structure before  
> you begin.
>
> If you want to prep an entire directory of data, you can do this:
> % rastrip -R dir -M replace -agr
>
> This will remove the aggregation DSR from any record in any data  
> file in
> the entire directory struct.
>
> If you can't, for whatever reason, delete the agr dsr, you can do it  
> as a pipe.
>
> % rastrip -r data.file -M -agr -w - - ip | racluster -m matrix -s  
> saddr daddr trans
>
> This will give you the number of transactions (or flows) for all the  
> A's talking to B's in the file.
>
> For things like time intervals, you should use rabins().
>
> % rastrip -r data.file -M -agr -w - - ip | rabins -M soft time 10s - 
> m srcid -s stime dur trans
>
> This will give you the total flow every 10s, if there are any flows  
> to report.  It will
> force the startime and lasttime to coincide with the 10s time  
> boundaries.
>
> There are an infinite number of examples, so if this doesn't help,  
> send more email.
>
>
> Carter
>
>
>
> On Apr 8, 2008, at 2:23 PM, Nick Diel wrote:
>
> I am interested in counting number of flows for things such as  
> source address, IP pairs, time intervals, etc.  This is closely  
> related to Stéphane Peters email: "Counting flows by time interval  
> in argus."
>
> After using racluster to merge status flow records, you have a file  
> where a record represents a flow.  You can of course use racount  
> with a filter to tell you some of this information, but it seem  
> quite impractical to do things such as find the IP pair generating  
> the most flows.  You could also use a set of pipes or scripts (as  
> Stéphane Peters showed us), but this would require modification  
> every time you wanted something slightly different.
>
> Maybe there is a way using ra tools to do this already and I am  
> missing it.  If not maybe we can get racluster (when specified) or  
> another tool to "zero out"/set the trans count to 1 after we have  
> merged status flow records.  This way things such as racluster -r  
> megredRecords.argus -m saddr -s +trans will now list how many flows  
> for each source address.
>
> I am curious what the group has to say about this.  Is something  
> like this already possible?  Is this information useful to other  
> people than me?  Would modifying the trans column present too much  
> ambiguity, i.e. "under these circumstances the trans column can  
> represent number of flows else it represents aggregated record count"?
>
>
> Nick
>
>
>

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://pairlist1.pair.net/pipermail/argus/attachments/20080409/86c32507/attachment.html>


More information about the argus mailing list