![]() Data Science Lab University of Piraeus |
Abstract
Introduction
Synopses Generator
Semantic Integrator
Data Manager
(Sub-)Trajectory Clustering Module
pivots[9][10]. The role of this module to the overall architecture of ARGO is to take as input data selected from the RDF store, apply STC, and provide the resulting representatives (i.e. cluster medoids) both to the prediction module as well as back to the RDF store.
Future Location Prediction Module
alignedwith the
closest-matchedroute in the maximum-likelihood sense.
System Architecture
In brief, the stream processing layer processes the stream of surveillance data, and performs data cleaning, noise elimination, compression and semantic data integration, in an online manner. The synopsized and enriched data stream, represented in RDF, can be consumed as it is, thus enabling the deployment of data analysis pipelines, and it is also stored in a distributed spatiotemporal RDF store for batch processing. This store supports scalable and efficient processing of SPARQL queries with spatiotemporal constraints, providing filtered, integrated, spatiotemporal data for higher level analysis tasks. Offline analysis of integrated data (e.g., for trajectory clustering) generates mined patterns, which are exploited in conjunction to the enriched data stream during the online operation of the trajectory prediction module.
References