Confidence Layer for the Hard 20%
Potsdam & South-West Berlin | Partnered with Siemens Digital Logistics
44 million connected vehicles detect road signs weekly.
The automation pipeline handles the clear-cut cases well.
We built the confidence layer for the hard 20%.
7-gate deterministic classification with visual evidence
Graph-walk to terminating sign along road network
Permanent vs conditional with OCR time extraction
| Dataset | Size | Purpose |
|---|---|---|
| School zone signs | 326 rows | Classification input |
| Remaining speed signs | 11,607 rows | Extent end-sign detection |
| Road geometry | 27,943 links | Graph-walk topology |
| School POIs (4,780 โ 3,985 educational) | 3,985 | Proximity gate G2 |
| Mapillary images | 1,850 imgs | Visual evidence + OCR |
| Probe speed data | 48.5M records | Speed anomaly signal |
7 deterministic gates โ no LLM in the decision loop
80 excluded because they sit inside an existing Tempo-30-Zone (G1)
โ the zone already governs the speed limit, sign is advisory only
DBSCAN clustering (ฮต=300m, UTM EPSG:25833) groups poles into school zones
Map-match pole to road link, walk forward in bearing-governed direction, stop at terminating sign.
Termination types:
Method: Florence-2 detect โ crop โ Tesseract OCR โ regex
Raw Mapillary street-view image โ bounding boxes drawn around every sign
Pole 00250 โ 30 km/h sign + supplemental time plate detected with bounding boxes
Bounding box crop โ sign fills the frame
Active MondayโFriday, 06:00โ18:00
Florence-2 detects both Zeichen 136 (school crossing) and the 30 km/h plate
Full scene with bounding boxes
Cropped โ clear school crossing sign
| Model | Method | Hallucination Rate |
|---|---|---|
| Gemini 2.5 Flash | Direct image query | 89.8% |
| GPT-4o | Direct image query | 72.3% |
| Claude 3.5 Sonnet | Direct image query | 61.4% |
| Florence-2 + CoT prompt | Detect โ crop โ LLM | 17.9% |
| Florence-2 + Our Pipeline | Detect โ crop โ Tesseract | 5.4% โ |
Signs are <5% of frame. Crop first, then read. CoT prompting cut hallucination by 80%.
315 poles ยท extent LineStrings ยท DBSCAN zone clusters โ all interactive
No LLM in the decision loop ยท Multi-source triangulation ยท QA-ranked output