argus and 95th percentile
Rodney McKee
rmckee at aconex.com
Wed Mar 25 21:47:11 EDT 2009
Carter,
Looks the goods.
I've just been seeing the same issue as Alexander reported when building the client package. Just have not had a chance to look at it further.
I've also been looking at the spec file for the server package and cleaned it up so I can generate my rpm's. Just need to test it on our RHEL 5 build server then I can send it through to you if you like.
----- "Carter Bullard" <carter at qosient.com> wrote:
> Hey Rodney,
I found a bug in the code for rabins() when testing out this new version
of the clients that affects the examples that I sent earlier. Please grab
the latest code before trying out these new features!!!!
>
ftp://qosient.com/dev/argus-3.0/argus-clients-3.0.2.beta.4.tar.gz
>
I added 95th percentile reporting to rahisto(), which is our frequency distribution
tool. This tool calculates mean, stddev, max, min, and the median (50th
percentile), and so it was very easy to add 95th percentile to the report.
>
You feed rahisto() the output of your 5 minute rabins() aggregations,
and it will give you a little stats report of the specific variable, and
a frequency distribution of where the data falls. If you don't know what
the range is, run it with just a small number of bins, and rahisto() will start
to show you where the data lies in its range.
>
Using the examples I used before:
>
rabins -M rmon hard time 5m -m smac -r hourly.file -w /tmp/data.out
>
Now, run rahisto() instead of rasort(), this way to generate your OutBound data
for the specific ether address ('sload') :
>
rahisto -r /tmp/data.out -H sload 10 - ether src host 0:a0:c5:e1:7a:fa
N = 31 mean = 80407.974516 stddev = 67795.873860 max = 174059.203125 min = 172.133331
median = 82742.617188 95% = 173895.046875
Class Interval Freq Rel.Freq Cum.Freq
1 0.000000e+00-1.740600e+04 12 38.7097% 38.7097%
2 1.740600e+04-3.481200e+04 0 0.0000% 38.7097%
3 3.481200e+04-5.221800e+04 3 9.6774% 48.3871%
4 5.221800e+04-6.962400e+04 1 3.2258% 51.6129%
5 6.962400e+04-8.703000e+04 1 3.2258% 54.8387%
6 8.703000e+04-1.044360e+05 0 0.0000% 54.8387%
7 1.044360e+05-1.218420e+05 1 3.2258% 58.0645%
8 1.218420e+05-1.392480e+05 3 9.6774% 67.7419%
9 1.392480e+05-1.566540e+05 4 12.9032% 80.6452%
10 1.566540e+05-1.740600e+05 6 19.3548% 100.0000%
>
And, run rahisto() this way to generate your InBound data for the specific ether address ('dload') :
>
rahisto -r /tmp/data.out -H dload 10 - ether src host 0:a0:c5:e1:7a:fa
N = 31 mean = 2520065.831098 stddev = 2286667.779977 max = 5742157.000000 min = 335.946655
median = 2935971.500000 95% = 5711441.500000
Class Interval Freq Rel.Freq Cum.Freq
1 0.000000e+00-5.742160e+05 13 41.9355% 41.9355%
2 5.742160e+05-1.148432e+06 1 3.2258% 45.1613%
3 1.148432e+06-1.722648e+06 1 3.2258% 48.3871%
4 1.722648e+06-2.296864e+06 1 3.2258% 51.6129%
5 2.296864e+06-2.871080e+06 0 0.0000% 51.6129%
6 2.871080e+06-3.445296e+06 2 6.4516% 58.0645%
7 3.445296e+06-4.019512e+06 1 3.2258% 61.2903%
8 4.019512e+06-4.593728e+06 3 9.6774% 70.9677%
9 4.593728e+06-5.167944e+06 3 9.6774% 80.6452%
10 5.167944e+06-5.742160e+06 6 19.3548% 100.0000%
>
>
The numbers are slightly different from the last time, because of the bug in rabins().
>
So you see, the data I'm using is multi-modally distributed, and while very low in samples,
it does suggest an SLA for two tiers, one above 1M bps and one below 1M bps.
You can calculate a 95th percentile for the two regions, by adjusting the range on the
histogram option field like this (just do dload for this example):
>
Traffic Below 1M bps
rahisto -r /tmp/rabins.5m.out -H dload 10:0-1M - ip and ether src host 0:a0:c5:e1:7a:fa
N = 14 mean = 220514.849217 stddev = 168862.596105 max = 612549.625000 min = 335.946655
median = 157696.734375 95% = 612549.625000
Class Interval Freq Rel.Freq Cum.Freq
1 0.000000e+00-1.000000e+05 2 14.2857% 14.2857%
2 1.000000e+05-2.000000e+05 7 50.0000% 64.2857%
3 2.000000e+05-3.000000e+05 1 7.1429% 71.4286%
4 3.000000e+05-4.000000e+05 2 14.2857% 85.7143%
5 4.000000e+05-5.000000e+05 1 7.1429% 92.8571%
6 5.000000e+05-6.000000e+05 0 0.0000% 92.8571%
7 6.000000e+05-7.000000e+05 1 7.1429% 100.0000%
8 7.000000e+05-8.000000e+05 0 0.0000% 100.0000%
9 8.000000e+05-9.000000e+05 0 0.0000% 100.0000%
10 9.000000e+05-1.000000e+06 0 0.0000% 100.0000%
>
Traffic Above 1M bps
rahisto -r /tmp/rabins.5m.out -H dload 10:1-6M - ip and ether src host 0:a0:c5:e1:7a:fa
N = 17 mean = 4413813.698529 stddev = 1253167.017151 max = 5742157.000000 min = 1717778.125000
median = 4992350.500000 95% = 5742157.000000
Class Interval Freq Rel.Freq Cum.Freq
1 1.000000e+06-1.500000e+06 0 0.0000% 0.0000%
2 1.500000e+06-2.000000e+06 2 11.7647% 11.7647%
3 2.000000e+06-2.500000e+06 0 0.0000% 11.7647%
4 2.500000e+06-3.000000e+06 1 5.8824% 17.6471%
5 3.000000e+06-3.500000e+06 1 5.8824% 23.5294%
6 3.500000e+06-4.000000e+06 1 5.8824% 29.4118%
7 4.000000e+06-4.500000e+06 1 5.8824% 35.2941%
8 4.500000e+06-5.000000e+06 4 23.5294% 58.8235%
9 5.000000e+06-5.500000e+06 3 17.6471% 76.4706%
10 5.500000e+06-6.000000e+06 4 23.5294% 100.0000%
>
Hope this is helpful,
>
Carter
>
On Mar 23, 2009, at 3:16 PM, Rodney McKee wrote:
> Carter,
>
> The value I'm after is based on 5 minute samples of user uploads (inbound traffic) that are sorted highest to lowest then the value at the 95th percent point is then used for our volume calculation. Is their a way to pull the 5 minutes samples like "ragraph -M 5m" using the text "ra" tools?
> I just see the rrdgraph is able to do it but not sure yet how to use it.
>
> More info on the billing scheme here:
> http://en.wikipedia.org/wiki/Burstable_billing
>
>
--
Rodney McKee
Linux systems administrator Aconex
The easy way to save time and money on your project
696 Bourke Street, Melbourne
Tel: +61 3 9240 0200 Fax: +61 3 9240 0299
Email: rmckee at aconex.com www.aconex.com
This email and any attachments are intended solely for the addressee. The contents may be privileged, confidential and/or subject to copyright or other applicable law.
No confidentiality or privilege is lost by an erroneous transmission. If you have received this e-mail in error, please let us know by reply e-mail and delete or destroy
this mail and all copies. If you are not the intended recipient of this message you must not disseminate, copy or take any action in reliance on it. The sender takes no
responsibility for the effect of this message upon the recipient's computer system.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://pairlist1.pair.net/pipermail/argus/attachments/20090326/cbb30c5b/attachment.html>
More information about the argus
mailing list