Toll Collect is the company responsible for Germany’s LKW-Maut system, the tolling process for trucks on the country’s motorways. Handling vast sums of daily traffic data, the company faces strict regulations that forbid personal tracking, complicating the calculation of precise tolls.

Despite the hurdles, Toll Collect’s infrastructure ensures a seamless flow of transactions, balancing user privacy with effective cost collection. Its innovative engineering solutions showcase Germany’s commitment to technologically advanced yet privacy-conscious road management.

The Toll Collect endeavor set out to build a unified software platform to calculate truck tolls across Germany, anchoring the process in compliance with strict data privacy laws. Prior to this modernization, some 150 employees handled their respective regions with bulky Excel macros, inadvertently creating silos and inconsistencies.

The new solution needed to integrate three main inputs—satellite signals, data from on-board units (OBUs), and readings from toll gates or pillars. Because German law forbids tracking personal vehicle information, any identifying data had to be purged within 15 minutes of collection. This timeframe equated to ingesting and fully processing 1-2 terabytes worth of data in 7-8 minutes, quite a challenge for conventional systems.

I was deeply involved in architecting and implementing the big data workflow using Spark and Scala, which allowed massive parallel processing to meet the seven-to-eight-minute window. Reconciling GPS coordinates from satellites with data from OBUs sometimes required advanced interpolation techniques due to inaccuracies or missing signals.

Additionally, robust logic had to be implemented to check for anomalies from toll gates, ensuring that single sensor glitch did not invalidated toll charges.

An important facet of my role was handling analytics and metadata: although the raw data had to be wiped clean on schedule, we needed derived metrics for auditing and fraud detection.

By preserving selected aggregated statistics, I implemented multiple algorithms which could identify suspicious patterns, such as repeated OBU “offline” episodes or contradictory location evidence.

These flags were instrumental in uncovering attempts to manipulate the toll system, helping enforce proper road usage fees. In the process, we effectively replaced thousands of scattered Excel macros, standardizing calculations in a single, reliable application.

Keywords:

Big Data, Spark, Geo, Fraud Detection, Efficiency, Software Engineering

Previous
Previous

Henkel