GEH statistic
The GEH Statistic is a formula used in traffic engineering, traffic forecasting, and traffic modelling to compare two sets of traffic volumes. The GEH formula gets its name from Geoffrey E. Havers, who invented it in the 1970s while working as a transport planner in London, England. Although its mathematical form is similar to a chi-squared test, is not a true statistical test. Rather, it is an empirical formula that has proven useful for a variety of traffic analysis purposes.
Using the GEH Statistic avoids some pitfalls that occur when using simple percentages to compare two sets of volumes. This is because the traffic volumes in real-world transportation systems vary over a wide range. For example, the mainline of a freeway/motorway might carry 5000 vehicles per hour, while one of the on-ramps leading to the freeway might carry only 50 vehicles per hour. The GEH statistic reduces this problem; because the GEH statistic is non-linear, a single acceptance threshold based on GEH can be used over a fairly wide range of traffic volumes. The use of GEH as an acceptance criterion for travel demand forecasting models is recognised in the UK Highways Agency's Design Manual for Roads and Bridges the Wisconsin microsimulation modeling guidelines, the Transport for London Traffic Modelling Guidelines and other references.
For traffic modelling work in the "baseline" scenario, a GEH of less than 5.0 is considered a good match between the modelled and observed hourly volumes. According to DMRB, 85% of the volumes in a traffic model should have a GEH less than 5.0. GEHs in the range of 5.0 to 10.0 may warrant investigation. If the GEH is greater than 10.0, there is a high probability that there is a problem with either the travel demand model or the data.
Applications
The GEH formula is useful in situations such as the following:- Comparing a set of traffic volumes from manual traffic counts with a set of volumes done at the same locations using automation.
- Comparing the traffic volumes obtained from this year's traffic counts with a group of counts done at the same locations in a previous year.
- Comparing the traffic volumes obtained from a travel demand forecasting model with the real-world traffic volumes.
- Adjusting traffic volume data collected at different times to create a mathematically consistent data set that can be used as input for travel demand forecasting models or traffic simulation models.
Common criticism about GEH statistic
Deviations are evaluated differently upward or downward, so the calculation is not symmetrical.
Moreover, the GEH statistic is not without a unit, but has the unit .
The GEH statistic does not fall within a range of values between 0 and 1. Thus, the range of values can only be interpreted with sufficient experience.
Furthermore, it is criticized that the value does not have a well-founded statistical derivation.
Development of the SQV statistic
An alternative measure to the GEH statistic is the Scalable Quality Value, which solves the above-mentioned problems: It is applicable to various indicators, it is symmetric, it has no units, and it has a range of values between 0 and 1. Moreover, Friedrich et al. derive the relationship between GEH statistic and normal distribution, and thus the relationship between SQV statistic and normal distribution. The SQV statistic is calculated using an empirical formula with a scaling factor :Fields of application
By introducing a scaling factor, the SQV statistic can be used to evaluate other mobility indicators. The scaling factor is based on the typical magnitude of the mobility indicator.| Indicator | Order of magnitude | Scaling factor |
| Number of person trips per day | 100 | 1 |
| Mean trip distance in kilometers | 101 | 10 |
| Duration of all trips per person per day in minutes | 102 | 100 |
| Traffic volume per hour | 103 | 1,000 |
| Traffic volume per day | 104 | 10,000 |
According to Friedrich et al., the SQV statistic value is suitable for assessing:
- Traffic volumes.
- Person-related mobility indicators:
- * Number of trips per person,
- * mean travel times per trip in minutes,
- * mean travel distances per trip in kilometers.
- Percentage of modal split or modal shares: here there is a fixed upper limit of 100% that cannot be exceeded. Instead, the number of trips per person per mode can be used for validation with the SQV statistic.
- Travel times for paths between 2 points in the network: This indicator does not depend on the path taken by a single person, but represents a sequence of distances along a route.
Quality categories
| SQV statistic | GEH statistic | Evaluation |
| 0.90 | 3.4 to 3.6 | Very good match |
| 0.85 | 5.4 to 5.8 | Good match |
| 0.80 | 7.5 to 8.5 | Acceptable match |
Depending on the indicator under comparison, different quality categories may be required.