An automated framework for explaining facts extracted from mobility datasets

Anique Tahir, Yuhan Sun, Mohamed Elsayed

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

When a data scientist analyzes mobility data (e.g., using a data visualization tool), she may find out some interesting facts in the dataset. An example of a fact can be: 'The number of Taxi trips in NYC on January 23, 2016, dropped drastically as compared to other days of the same month'. However, the data scientist may be left clueless if they cannot find a crisp explanation to such a fact. Furthermore, the tedious task of finding an explanation by manually scraping the data becomes even impossible with big data. Existing techniques are designed for non-spatial data which cannot be applied to spatial data because it does not consider the spatial proximity. In this paper, we propose an automatic framework which guides the data scientist to explain the fact discovered from mobility data. Our approach expands on the aggravation and intervention techniques while using spatial partitioning/clustering to improve explanations for spatial data. Experiments show that the proposed approach outperforms the state-of-The-Art approaches in finding the explanation for facts extracted from NYC taxi real mobility dataset.

Original languageEnglish (US)
Title of host publicationProceedings - 2019 20th International Conference on Mobile Data Management, MDM 2019
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages269-278
Number of pages10
ISBN (Electronic)9781728133638
DOIs
StatePublished - Jun 1 2019
Event20th International Conference on Mobile Data Management, MDM 2019 - Hong Kong, Hong Kong
Duration: Jun 10 2019Jun 13 2019

Publication series

NameProceedings - IEEE International Conference on Mobile Data Management
Volume2019-June
ISSN (Print)1551-6245

Conference

Conference20th International Conference on Mobile Data Management, MDM 2019
CountryHong Kong
CityHong Kong
Period6/10/196/13/19

Fingerprint

Data visualization
Experiments
Big data

Keywords

  • Database
  • Mobile spatial data
  • Spatial explanation

ASJC Scopus subject areas

  • Engineering(all)

Cite this

Tahir, A., Sun, Y., & Elsayed, M. (2019). An automated framework for explaining facts extracted from mobility datasets. In Proceedings - 2019 20th International Conference on Mobile Data Management, MDM 2019 (pp. 269-278). [8788788] (Proceedings - IEEE International Conference on Mobile Data Management; Vol. 2019-June). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/MDM.2019.00-48

An automated framework for explaining facts extracted from mobility datasets. / Tahir, Anique; Sun, Yuhan; Elsayed, Mohamed.

Proceedings - 2019 20th International Conference on Mobile Data Management, MDM 2019. Institute of Electrical and Electronics Engineers Inc., 2019. p. 269-278 8788788 (Proceedings - IEEE International Conference on Mobile Data Management; Vol. 2019-June).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Tahir, A, Sun, Y & Elsayed, M 2019, An automated framework for explaining facts extracted from mobility datasets. in Proceedings - 2019 20th International Conference on Mobile Data Management, MDM 2019., 8788788, Proceedings - IEEE International Conference on Mobile Data Management, vol. 2019-June, Institute of Electrical and Electronics Engineers Inc., pp. 269-278, 20th International Conference on Mobile Data Management, MDM 2019, Hong Kong, Hong Kong, 6/10/19. https://doi.org/10.1109/MDM.2019.00-48
Tahir A, Sun Y, Elsayed M. An automated framework for explaining facts extracted from mobility datasets. In Proceedings - 2019 20th International Conference on Mobile Data Management, MDM 2019. Institute of Electrical and Electronics Engineers Inc. 2019. p. 269-278. 8788788. (Proceedings - IEEE International Conference on Mobile Data Management). https://doi.org/10.1109/MDM.2019.00-48
Tahir, Anique ; Sun, Yuhan ; Elsayed, Mohamed. / An automated framework for explaining facts extracted from mobility datasets. Proceedings - 2019 20th International Conference on Mobile Data Management, MDM 2019. Institute of Electrical and Electronics Engineers Inc., 2019. pp. 269-278 (Proceedings - IEEE International Conference on Mobile Data Management).
@inproceedings{edfd220b8acf42ca9a8f4513c1e06461,
title = "An automated framework for explaining facts extracted from mobility datasets",
abstract = "When a data scientist analyzes mobility data (e.g., using a data visualization tool), she may find out some interesting facts in the dataset. An example of a fact can be: 'The number of Taxi trips in NYC on January 23, 2016, dropped drastically as compared to other days of the same month'. However, the data scientist may be left clueless if they cannot find a crisp explanation to such a fact. Furthermore, the tedious task of finding an explanation by manually scraping the data becomes even impossible with big data. Existing techniques are designed for non-spatial data which cannot be applied to spatial data because it does not consider the spatial proximity. In this paper, we propose an automatic framework which guides the data scientist to explain the fact discovered from mobility data. Our approach expands on the aggravation and intervention techniques while using spatial partitioning/clustering to improve explanations for spatial data. Experiments show that the proposed approach outperforms the state-of-The-Art approaches in finding the explanation for facts extracted from NYC taxi real mobility dataset.",
keywords = "Database, Mobile spatial data, Spatial explanation",
author = "Anique Tahir and Yuhan Sun and Mohamed Elsayed",
year = "2019",
month = "6",
day = "1",
doi = "10.1109/MDM.2019.00-48",
language = "English (US)",
series = "Proceedings - IEEE International Conference on Mobile Data Management",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
pages = "269--278",
booktitle = "Proceedings - 2019 20th International Conference on Mobile Data Management, MDM 2019",

}

TY - GEN

T1 - An automated framework for explaining facts extracted from mobility datasets

AU - Tahir, Anique

AU - Sun, Yuhan

AU - Elsayed, Mohamed

PY - 2019/6/1

Y1 - 2019/6/1

N2 - When a data scientist analyzes mobility data (e.g., using a data visualization tool), she may find out some interesting facts in the dataset. An example of a fact can be: 'The number of Taxi trips in NYC on January 23, 2016, dropped drastically as compared to other days of the same month'. However, the data scientist may be left clueless if they cannot find a crisp explanation to such a fact. Furthermore, the tedious task of finding an explanation by manually scraping the data becomes even impossible with big data. Existing techniques are designed for non-spatial data which cannot be applied to spatial data because it does not consider the spatial proximity. In this paper, we propose an automatic framework which guides the data scientist to explain the fact discovered from mobility data. Our approach expands on the aggravation and intervention techniques while using spatial partitioning/clustering to improve explanations for spatial data. Experiments show that the proposed approach outperforms the state-of-The-Art approaches in finding the explanation for facts extracted from NYC taxi real mobility dataset.

AB - When a data scientist analyzes mobility data (e.g., using a data visualization tool), she may find out some interesting facts in the dataset. An example of a fact can be: 'The number of Taxi trips in NYC on January 23, 2016, dropped drastically as compared to other days of the same month'. However, the data scientist may be left clueless if they cannot find a crisp explanation to such a fact. Furthermore, the tedious task of finding an explanation by manually scraping the data becomes even impossible with big data. Existing techniques are designed for non-spatial data which cannot be applied to spatial data because it does not consider the spatial proximity. In this paper, we propose an automatic framework which guides the data scientist to explain the fact discovered from mobility data. Our approach expands on the aggravation and intervention techniques while using spatial partitioning/clustering to improve explanations for spatial data. Experiments show that the proposed approach outperforms the state-of-The-Art approaches in finding the explanation for facts extracted from NYC taxi real mobility dataset.

KW - Database

KW - Mobile spatial data

KW - Spatial explanation

UR - http://www.scopus.com/inward/record.url?scp=85071001797&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85071001797&partnerID=8YFLogxK

U2 - 10.1109/MDM.2019.00-48

DO - 10.1109/MDM.2019.00-48

M3 - Conference contribution

AN - SCOPUS:85071001797

T3 - Proceedings - IEEE International Conference on Mobile Data Management

SP - 269

EP - 278

BT - Proceedings - 2019 20th International Conference on Mobile Data Management, MDM 2019

PB - Institute of Electrical and Electronics Engineers Inc.

ER -