About Data Flagging Checks
Count data is subjected to various checks that flag data as needing review or as recommended to be rejected. This page provides an overview of the flagging checks in use and the criteria that determines if it flags data as needing review or as recommended to be rejected.
Overview Table
Test | Type | 'Recommended Removal' Threshold | 'Needs Review' Threshold |
---|---|---|---|
Gap | Conditional | ≥ 12 hours of null values, or &GreaterThan; 6 hours between 06:00 and 20:00 are of null values | |
Zero | Conditional | ≥ 6 consecutive days of counts equaling 0 | 3, 4, or 5 consecutive days of counts equaling 0 |
Max Day | Conditional |
|
|
Max Hour | Conditional |
|
|
3AM | Conditional | n/a | An hourly sum of ≥ 50 between 03:00 and 06:00 |
Gap
Checks for reported null values (called gaps) within each day. If there are 12 or more hours in a day with a null count that day is recommended to be removed. If there are 6 or more hours between 5am and 8pm with a null count that day is recommended to be removed.
Zero
Checks for consecutive days that report a daily volume of zero. Note: this check doesn't apply to sidewalk bike loops as those frequently have a low volume If there are 6 or more consecutive days with daily volumes of zero those days are recommended to be removed. If there are 3, 4, or 5 consecutive days with daily volumes of zero those days are marked as needing review.
Maxday
Checks for days that have a suspiciously high daily volume. If a pedestrian datastream reports a daily volume greater than 15000 or if a bicycle datastream reports a daily volume greater than 5000 then the day is recommended to be removed. If a pedestrian datastream reports a daily volume between 10000 and 15000 or if a bicycle datastream reports a daily volume between 2000 and 5000 then the day is marked as needing review.
Maxhour
Checks for days that contain a suspiciously high hourly volume. If within a day a pedestrian datastream reports an hourly volume greater than 3000 or a bicycle datastream reports an hourly volume greater than 1000 then the day is recommended to be removed. If within a day a pedestrian datastream reports an hourly volume between 1000 and 3000 or a bicycle datastream reports an hourly volume between 500 and 1000 then the day is marked as needing review.