Stops matching
Matching Stages
| Stage | Count | % of Total | Many-to-One | Not Closest | Mean (m) | Median (m) | P95 (m) |
|---|---|---|---|---|---|---|---|
| Exact (UIC ref) | 21,691 | 34.7% | 13,733 | 5,258 | 21.04 | 8.76 | 79.03 |
| Name matching | 436 | 0.7% | 6 | 155 | 48.46 | 14.23 | 118.7 |
| Route matching (total) | 12,917 | 20.7% | 11 | 3,003 | 7.21 | 5.27 | 21.34 |
| GTFS | 12,917 | 20.7% | 11 | 3,003 | 7.21 | 5.27 | 21.34 |
| Distance (total) | 14,164 | 22.7% | 2,337 | 3,982 | - | - | - |
| Stage 0: Trio side matching | 2,298 | 3.7% | 2,298 | 779 | 8.95 | 4.46 | 23.58 |
| Stage 1: Group proximity (50m) | 10,855 | 17.4% | 29 | 2,757 | - | - | - |
| Stage 1a: UIC ref | 10,537 | 16.9% | 26 | 2,683 | 7.77 | 5.21 | 25.18 |
| Stage 1b: UIC name | 44 | 0.1% | 0 | 8 | 8.4 | 6.04 | 30.38 |
| Stage 1c: Name | 274 | 0.4% | 3 | 66 | 12.06 | 8.04 | 37.87 |
| Stage 1b: Group proximity (150m) | 500 | 0.8% | 5 | 380 | - | - | - |
| Stage 1d: UIC ref (150m) | 469 | 0.8% | 3 | 354 | 81.72 | 74.87 | 136.08 |
| Stage 1e: UIC name (150m) | 6 | 0.0% | 0 | 5 | 124.01 | 129.9 | 149.23 |
| Stage 1f: Name (150m) | 25 | 0.0% | 2 | 21 | 75.48 | 60.86 | 126.95 |
| Stage 2: Local ref | 74 | 0.1% | 4 | 25 | 15.88 | 12.29 | 39.22 |
| Stage 3a: Single candidate | 385 | 0.6% | 1 | 41 | - | - | - |
| 3a pass 1 | 347 | 0.6% | 1 | 36 | 11.93 | 7.98 | 42.74 |
| 3a pass 2 | 38 | 0.1% | 0 | 5 | 10.33 | 5.14 | 44.52 |
| Stage 3b: Ratio test | 52 | 0.1% | 0 | 0 | 2.95 | 2.45 | 6.62 |
| Post-processing: ATLAS Duplicate propagation | 332 | 0.5% | 332 | 121 | 23.4 | 8.62 | 100.28 |
| Post-processing: OSM stop partner propagation | 12,940 | 20.7% | 1,455 | 7,738 | 9.06 | 5.48 | 27.52 |
Distance Quality
Cross-Predicate Consistency
If we had matched every pair to its nearest OSM node instead:
Many-to-One Analysis
OSM platform/stop_position groups are counted as one OSM stop in this section.
Distribution: 1 ATLAS to N OSM stops
Ratio key: 1A:2O = one ATLAS platform matched to two OSM stops.
Distribution: N ATLAS to 1 OSM stop
Ratio key: 2A:1O = two ATLAS platforms matched to one OSM stop.
OSM Stops
OSM Stop Inventory
operator:wikidatanetwork:wikidataNon-node OSM Stops (Ways)
OSM Groups (Platform / Stop Position Pairs)
Groups by Type
| Group Type | Count |
|---|---|
| osm_trio | 1,150 |
| osm_pair_uic_equal_15m | 9,511 |
| osm_pair_uic | 2,534 |
| osm_pair_name | 665 |
| osm_pair_name_equal_15m | 29 |
| osm_pair_tram | 314 |
| osm_pair_tram_equal_15m | 13 |
Unmatched
Unmatched Analysis
Unmatched ATLAS (5,776)
| No OSM within 50m | 3,017 |
| Has nearby OSM | 2,759 |
Unmatched OSM (9,055)
| No ATLAS within 50m | 3,342 |
| Has nearby ATLAS | 5,713 |
| Has Attributes | Count |
|---|---|
| UIC & Local Ref & Routes | 174 |
| UIC & Local Ref | 98 |
| UIC & Routes | 2,146 |
| Local Ref & Routes | 96 |
| UIC only | 2,724 |
| Local Ref only | 81 |
| Routes only | 1,127 |
| None | 2,609 |
ATLAS
ATLAS Duplicates
| Total duplicate SLOIDs | 627 |
| Matched duplicates | 636 |
| Unmatched duplicates | 36 |
Problems
Problem Detection Statistics
| Problem Type | Count |
|---|---|
| Distance — Too far from OSM node | 1,287 |
| Unmatched — Isolated ATLAS / OSM node | 4,561 |
| Attributes — Missing or mismatched data | 984 |
| Problem Type | Count |
|---|---|
| Distance — Too far from OSM node | 4,696 |
| Unmatched — Isolated ATLAS / OSM node | 7,215 |
| Attributes — Missing or mismatched data | 325 |
| Duplicates — OSM or ATLAS side collisions | 324 |
| Problem Type | Count |
|---|---|
| Distance — Too far from OSM node | 5,731 |
| Unmatched — Isolated ATLAS / OSM node | 3,055 |
| Attributes — Missing or mismatched data | 11,356 |
| Duplicates — OSM or ATLAS side collisions | 1,438 |
Routes
Route Coverage
| ATLAS stops matched by GTFS routes | 42,583 / 54,945 | 77.5% |
| OSM stops with routes coverage | 36,863 | 75.1% |
Route-Route Matching
| Metric | Count |
|---|---|
| Route-route links (ATLAS ↔ OSM) | 1,972 |
| Linked ATLAS route IDs | 1,972 |
| Linked OSM route IDs | 1,972 |
| Coverage | Count | Percent |
|---|---|---|
| ATLAS route IDs with link | 1,972 / 3,979 | 49.6% |
| OSM route IDs with link | 1,972 / 2,848 | 69.2% |
| ATLAS itineraries with link | 1,361 / 13,790 | 9.9% |
| OSM itineraries with link | 1,361 / 8,121 | 16.8% |
| ATLAS route directions (total) | 7,498 | - |
| OSM route directions (total) | 4,823 | - |
Match Method Breakdown
| Method | Count | % of Total |
|---|---|---|
| Exact GTFS route ID | 1,471 | 74.6% |
| Normalized GTFS route ID | 501 | 25.4% |
| OSM display route ID matches ATLAS route ID | 0 | 0.0% |
Route Pipeline Shape
| Metric | Mean | Median |
|---|---|---|
| ATLAS itineraries per family | 3.47 | 3.0 |
| OSM itineraries per family | 2.85 | 2.0 |
| GTFS trips collapsed into one ATLAS itinerary | 105.03 | 15.0 |
| Matched family itinerary gap (ATLAS - OSM) | 0.88 | 0.0 |
| Matched Family Comparison | Count |
|---|---|
| Matched families where ATLAS has more itineraries | 832 |
| Matched families with equal itinerary counts | 790 |
| Matched families where OSM has more itineraries | 350 |
OSM Route Family Grouping Method
How OSM route relations are grouped into line families. Priority order: route_master → GTFS route ID → (route, ref, operator, network) → per-relation fallback.
| Grouping Method | Families | % of Families | Relations |
|---|---|---|---|
| route_master (explicit OSM grouping) | 2,636 | 92.6% | 7,860 |
| Normalized GTFS route ID | 5 | 0.2% | 7 |
| Synthetic key (route, ref, operator, network) | 207 | 7.3% | 254 |
| Per-relation fallback (no grouping key) | 0 | 0.0% | 0 |
| Total | 2,848 | 100% | 8,121 |
GTFS stop_id → ATLAS sloid Mapping
Matching Paths
① Direct SLOID match
Matches where original_stop_id directly contains a valid ATLAS SLOID.
② Strict matching
Matches on (UIC number, local ref) = (number, designation) in ATLAS.
③ Coordinate proximity
For remaining: ATLAS stops with the same UIC number within 0.5m (must be unambiguous).
④ Unique-number fallback
For remaining: if a UIC number maps to exactly one ATLAS row, use that sloid.
Cardinality
GTFS stop_id side
ATLAS sloid side
Unmatched Reasons
| Reason | Count |
|---|---|
| No ATLAS candidate for this UIC number | 349 |
| Non-unique ATLAS number (strict miss, fallback not applicable) | 11,505 |
| Coordinate proximity conflict (multiple candidates within radius) | 92 |