Multi-task learning for spatio-temporal event forecasting

Liang Zhao; Qian Sun; Jieping Ye; Feng Chen; Chang Tien Lu; Naren Ramakrishnan

doi:10.1145/2783258.2783377

Multi-task learning for spatio-temporal event forecasting

Liang Zhao, Qian Sun, Jieping Ye, Feng Chen, Chang Tien Lu, Naren Ramakrishnan

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

124 Scopus citations

Abstract

Spatial event forecasting from social media is an important problem but encounters critical challenges, such as dynamic patterns of features (keywords) and geographic heterogeneity (e.g., spatial correlations, imbalanced samples, and different populations in different locations). Most existing approaches (e.g., LASSO regression, dynamic query expansion, and burst detection) are designed to address some of these challenges, but not all of them. This paper proposes a novel multi-task learning framework which aims to concurrently address all the challenges. Specifically, given a collection of locations (e.g., cities), we propose to build forecasting models for all locations simultaneously by extracting and utilizing appropriate shared information that effectively increases the sample size for each location, thus improving the forecasting performance. We combine both static features derived from a predefined vocabulary by domain experts and dynamic features generated from dynamic query expansion in a multi-task feature learning framework; we investigate different strategies to balance homogeneity and diversity between static and dynamic terms. Efficient algorithms based on Iterative Group Hard Thresholding are developed to achieve efficient and effective model training and prediction. Extensive experimental evaluations on Twitter data from four different countries in Latin America demonstrated the effectiveness of our proposed approach.

Original language	English (US)
Title of host publication	KDD 2015 - Proceedings of the 21st ACM SIGKDD Conference on Knowledge Discovery and Data Mining
Publisher	Association for Computing Machinery
Pages	1503-1512
Number of pages	10
ISBN (Electronic)	9781450336642
DOIs	https://doi.org/10.1145/2783258.2783377
State	Published - Aug 10 2015
Externally published	Yes
Event	21st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD 2015 - Sydney, Australia Duration: Aug 10 2015 → Aug 13 2015

Publication series

Name	Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
Volume	2015-August

Conference

Conference	21st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD 2015
Country/Territory	Australia
City	Sydney
Period	8/10/15 → 8/13/15

Keywords

Dynamic query expansion
Event forecasting
Hard thresholding
LASSO
Multi-task learning

ASJC Scopus subject areas

Software
Information Systems

Access to Document

10.1145/2783258.2783377

Cite this

Zhao, L., Sun, Q., Ye, J., Chen, F., Lu, C. T., & Ramakrishnan, N. (2015). Multi-task learning for spatio-temporal event forecasting. In KDD 2015 - Proceedings of the 21st ACM SIGKDD Conference on Knowledge Discovery and Data Mining (pp. 1503-1512). (Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; Vol. 2015-August). Association for Computing Machinery. https://doi.org/10.1145/2783258.2783377

Multi-task learning for spatio-temporal event forecasting. / Zhao, Liang; Sun, Qian; Ye, Jieping et al.
KDD 2015 - Proceedings of the 21st ACM SIGKDD Conference on Knowledge Discovery and Data Mining. Association for Computing Machinery, 2015. p. 1503-1512 (Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; Vol. 2015-August).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Zhao, L, Sun, Q, Ye, J, Chen, F, Lu, CT & Ramakrishnan, N 2015, Multi-task learning for spatio-temporal event forecasting. in KDD 2015 - Proceedings of the 21st ACM SIGKDD Conference on Knowledge Discovery and Data Mining. Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, vol. 2015-August, Association for Computing Machinery, pp. 1503-1512, 21st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD 2015, Sydney, Australia, 8/10/15. https://doi.org/10.1145/2783258.2783377

Zhao L, Sun Q, Ye J, Chen F, Lu CT, Ramakrishnan N. Multi-task learning for spatio-temporal event forecasting. In KDD 2015 - Proceedings of the 21st ACM SIGKDD Conference on Knowledge Discovery and Data Mining. Association for Computing Machinery. 2015. p. 1503-1512. (Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining). doi: 10.1145/2783258.2783377

@inproceedings{c0f0fc2ed1234cc1baabd226385225a4,

title = "Multi-task learning for spatio-temporal event forecasting",

abstract = "Spatial event forecasting from social media is an important problem but encounters critical challenges, such as dynamic patterns of features (keywords) and geographic heterogeneity (e.g., spatial correlations, imbalanced samples, and different populations in different locations). Most existing approaches (e.g., LASSO regression, dynamic query expansion, and burst detection) are designed to address some of these challenges, but not all of them. This paper proposes a novel multi-task learning framework which aims to concurrently address all the challenges. Specifically, given a collection of locations (e.g., cities), we propose to build forecasting models for all locations simultaneously by extracting and utilizing appropriate shared information that effectively increases the sample size for each location, thus improving the forecasting performance. We combine both static features derived from a predefined vocabulary by domain experts and dynamic features generated from dynamic query expansion in a multi-task feature learning framework; we investigate different strategies to balance homogeneity and diversity between static and dynamic terms. Efficient algorithms based on Iterative Group Hard Thresholding are developed to achieve efficient and effective model training and prediction. Extensive experimental evaluations on Twitter data from four different countries in Latin America demonstrated the effectiveness of our proposed approach.",

keywords = "Dynamic query expansion, Event forecasting, Hard thresholding, LASSO, Multi-task learning",

author = "Liang Zhao and Qian Sun and Jieping Ye and Feng Chen and Lu, {Chang Tien} and Naren Ramakrishnan",

note = "Publisher Copyright: {\textcopyright} 2015 ACM.; 21st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD 2015 ; Conference date: 10-08-2015 Through 13-08-2015",

year = "2015",

month = aug,

day = "10",

doi = "10.1145/2783258.2783377",

language = "English (US)",

series = "Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining",

publisher = "Association for Computing Machinery",

pages = "1503--1512",

booktitle = "KDD 2015 - Proceedings of the 21st ACM SIGKDD Conference on Knowledge Discovery and Data Mining",

}

TY - GEN

T1 - Multi-task learning for spatio-temporal event forecasting

AU - Zhao, Liang

AU - Sun, Qian

AU - Ye, Jieping

AU - Chen, Feng

AU - Lu, Chang Tien

AU - Ramakrishnan, Naren

PY - 2015/8/10

Y1 - 2015/8/10

N2 - Spatial event forecasting from social media is an important problem but encounters critical challenges, such as dynamic patterns of features (keywords) and geographic heterogeneity (e.g., spatial correlations, imbalanced samples, and different populations in different locations). Most existing approaches (e.g., LASSO regression, dynamic query expansion, and burst detection) are designed to address some of these challenges, but not all of them. This paper proposes a novel multi-task learning framework which aims to concurrently address all the challenges. Specifically, given a collection of locations (e.g., cities), we propose to build forecasting models for all locations simultaneously by extracting and utilizing appropriate shared information that effectively increases the sample size for each location, thus improving the forecasting performance. We combine both static features derived from a predefined vocabulary by domain experts and dynamic features generated from dynamic query expansion in a multi-task feature learning framework; we investigate different strategies to balance homogeneity and diversity between static and dynamic terms. Efficient algorithms based on Iterative Group Hard Thresholding are developed to achieve efficient and effective model training and prediction. Extensive experimental evaluations on Twitter data from four different countries in Latin America demonstrated the effectiveness of our proposed approach.

AB - Spatial event forecasting from social media is an important problem but encounters critical challenges, such as dynamic patterns of features (keywords) and geographic heterogeneity (e.g., spatial correlations, imbalanced samples, and different populations in different locations). Most existing approaches (e.g., LASSO regression, dynamic query expansion, and burst detection) are designed to address some of these challenges, but not all of them. This paper proposes a novel multi-task learning framework which aims to concurrently address all the challenges. Specifically, given a collection of locations (e.g., cities), we propose to build forecasting models for all locations simultaneously by extracting and utilizing appropriate shared information that effectively increases the sample size for each location, thus improving the forecasting performance. We combine both static features derived from a predefined vocabulary by domain experts and dynamic features generated from dynamic query expansion in a multi-task feature learning framework; we investigate different strategies to balance homogeneity and diversity between static and dynamic terms. Efficient algorithms based on Iterative Group Hard Thresholding are developed to achieve efficient and effective model training and prediction. Extensive experimental evaluations on Twitter data from four different countries in Latin America demonstrated the effectiveness of our proposed approach.

KW - Dynamic query expansion

KW - Event forecasting

KW - Hard thresholding

KW - LASSO

KW - Multi-task learning

UR - http://www.scopus.com/inward/record.url?scp=84954148765&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84954148765&partnerID=8YFLogxK

U2 - 10.1145/2783258.2783377

DO - 10.1145/2783258.2783377

M3 - Conference contribution

AN - SCOPUS:84954148765

T3 - Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

SP - 1503

EP - 1512

BT - KDD 2015 - Proceedings of the 21st ACM SIGKDD Conference on Knowledge Discovery and Data Mining

PB - Association for Computing Machinery

T2 - 21st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD 2015

Y2 - 10 August 2015 through 13 August 2015

ER -

Multi-task learning for spatio-temporal event forecasting

Abstract

Publication series

Conference

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this