rabins issue, maybe order related
Mark E. Mallett
mem at mv.mv.com
Wed Jun 20 13:45:43 EDT 2012
On Wed, Jun 20, 2012 at 01:29:38PM -0400, Carter Bullard wrote:
> Hey Mark,
> Yes, all ra* programs will take "-R ." file names and sort them to figure
> out what files to open in what order. I don't think the -r filenames are
> a part of that algorithm, so you can push files in front of others by using
> -r instead of -R.
>
> Currently, rabins() would like its input to be in some time order, as it uses the
> stream to estimate what the starting bin time is, and how many bins to process.
> rabins() will discard input records that are before its notion of the "epoch" of time.
> If you use "-D 2", rabins() will print out an error message when it throws
> a record away. Something like this:
I ran it with -D all the way up to 8, but didn't see anything.
> You can solve this problem by giving rabins() a time filter that spans the
> time of your processing. This tell rabins to preallocate bins for the time
> span, and you are guaranteed that your data will be processed, regardless
> of what order it comes in. Since your doing day bins, you can probably
> get away with an entire year as your filter " -t -12M+12M ".
In fact I know the time range (or rather the scripts know it), so that's
a great solution. I'll try that.
Thanks,
-mm-
More information about the argus
mailing list