possible radium issue
Phillip Deneault
deneault at WPI.EDU
Thu Jul 9 10:17:43 EDT 2009
Carter Bullard wrote:
> No, this doesn't seem right at all. A couple of suggestions.
> Don't use the "-M norep" for this type of aggregation (basically it
> just throws away the AGR dsr as the records are being written
> out, and in some apps this is great, but not necessary here).
Ok, I'll remove it as this issue presses forward.
> How are your rasplit()s called, I suspect there may be an issue with that.
/usr/bin/rasplit -d -S <radiumhost> -M time 10m -w
/data/argus/slices/\$srcid/argus-%m.%d.%Y-%H.%M.%S.out
/usr/bin/rasplit -d -S <radiumhost> -M time 1h -w
/data/argus/hourlies/\$srcid/argus-%m.%d.%Y-%H.%M.%S.out
/usr/bin/rasplit -d -S <radiumhost> -M time 1d -w
/data/argus/dailies/\$srcid/argus-%m.%d.%Y.out
> In most cases, you don't need the hourly and daily rasplit() processes,
> because
> you can generate both of these from your 10 min split files. All
> depends on
> whether you want the hourly and daily files updated continuously, or if you
> can get away with updating them, say every 10 minutes.
This is true. Except I don't trust either of them right now. :-) If
the issue is that there is some strange blocking process which would
require me to run only one rasplit, I would do it and hack up everything
I'm doing on the programming end. But neither CPU, memory, nor IO
suggests this to be the case.
> It looks like racluster() is faulting reading one of the files. When
> it does that,
> the pipe closes down, and your racount() reports just the records it
> receives.
> Just need to find the bad file, and then try to figure out how it got
> corrupted
> (at least that is my guess).
>
> what are the totals for each of the individual files in your example(s)
> without
> the clustering?
Here's a breakdown using a few methods. Things mostly make sense except
the numbers seem so low by comparison to the locally recorded copies,
and the difference from the collection of daily files with the hourlies.
The 'Clustering' field is with the '-M norep' off piped to an racount.
The WithoutClustering field is just the straight racount. I ran the
files individually and totaled them, then in a single command line, and
I also ran the hourly file the same way for comparison.
I got no errors when running any of these commands on any of these files.
File Clustering WithoutClustering
00 37 38
10 85 85
20 138 148
30 43 44
40 104 110
50 254 265
-----------------------------------------
SumTotal 661 690
Total with
all 6 files
in the '-r' 631 690
Hourly file 252 263
Can anyone else replicate this behavior?
Thanks,
Phil
> Carter
>
>
>
> On Jul 8, 2009, at 4:24 PM, Phillip Deneault wrote:
>
>> I'm running the beta.8 code. I have a single radium instance
>> collecting data from dozens of locations and 3 rasplit processes
>> connecting to that radium process, one for 10 minute slices, 1 for
>> hourlies, and 1 for dailies.
>>
>> It *seems* as if the data I'm recording is lower than what I should
>> have. I say this because I get drastically different counts when I
>> check locally recorded data vs. radium recorded data.
>>
>> Please yell at me if I am doing this wrong, I performed the racluster
>> in an attempt to normalize the flow counts a little.
>>
>> Locally recorded data tallies like this.(logs rotated daily, so I
>> picked a convenient hour).
>>
>> # racluster -t 14 -M norep -r /var/log/argus/argus.out -w - | racount
>> -r -
>> racount records total_pkts src_pkts dst_pkts
>> total_bytes src_bytes dst_bytes
>> sum 52385 134978 134813 165
>> 9982211 9970637 11574
>>
>> However, when I run a tally on the hourlies and the slices collected
>> by radium, I get two different flow counts, neither of which come
>> anywhere close.
>>
>> (SLICES)
>> # racluster -M norep -r argus-07.08.2009-14.50.00.out
>> argus-07.08.2009-14.40.00.out argus-07.08.2009-14.30.00.out
>> argus-07.08.2009-14.20.00.out argus-07.08.2009-14.10.00.out
>> argus-07.08.2009-14.00.00.out -w - | racount -r -
>> racount records total_pkts src_pkts dst_pkts
>> total_bytes src_bytes dst_bytes
>> sum 631 1920 1397 523
>> 507980 210286 297694
>>
>> (HOURLIES)
>> # racluster -M norep -r argus-07.08.2009-14.00.00.out -w - | racount -r -
>> racount records total_pkts src_pkts dst_pkts
>> total_bytes src_bytes dst_bytes
>> sum 252 447 348 99
>> 95012 57022 37990
>>
>> Is this a bug, or me doing something wrong?
>>
>> Thanks,
>> Phil
>>
>
>
>
>
More information about the argus
mailing list