XM2A: Multi-Scale Multi-Head Attention with Cross-Talk for Multi-Variate Time Series Analysis

Yash Garg; K. Selçuk Candan

doi:10.1109/MIPR51284.2021.00030

Engineering, Ira A. Fulton Schools of (IAFSE)

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

1 Scopus citation

Abstract

Advances in sensory technologies are enabling the capture of a diverse spectrum of real-world data streams. The increasing availability of such data, especially in the form of multivariate time series, opens new opportunities for applications that rely on identifying and leveraging complex temporal patterns. A particular challenge such applications face is that complex patterns consist of multiple simpler patterns of varying scales (temporal lengths). While several recent works (such as multi-head attention networks) recognize that complex patterns need to be understood in terms of multiple simpler patterns, we note that existing works lack the ability to represent the interactions across these constituent patterns. To tackle this limitation, we propose a novel Multi-scale Multi-head Attention with Cross-Talk (XM2A) framework. XM2A represents the multi-scale patterns that make up a complex pattern by configuring each attention head to learn a pattern at a particular scale, and it accounts for the co-existence of patterns at multiple scales through a cross-talking mechanism among the heads. Experiments show that XM2A outperforms state-of-the-art attention mechanisms, such as Transformer and MSMSA, on benchmark datasets, such as SADD, AUSLAN, and MOCAP.
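The abstract does not spell out how the heads are scaled or how they cross-talk, so the following is only an illustrative sketch of the general idea, not the authors' architecture. It assumes that each head sees the series smoothed by a moving average of a head-specific window (standing in for "scale"), and that cross-talk is a softmax-normalized mixing of the head outputs. The names `smooth`, `scaled_head`, `multi_scale_attention`, `scales`, and `mix` are all hypothetical.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def smooth(x, scale):
    # Assumption: "scale" is modeled as a moving average over `scale`
    # time steps. Edge padding keeps the output length equal to T.
    left = (scale - 1) // 2
    right = scale - 1 - left
    padded = np.pad(x, ((left, right), (0, 0)), mode="edge")
    csum = np.cumsum(np.vstack([np.zeros((1, x.shape[1])), padded]), axis=0)
    return (csum[scale:] - csum[:-scale]) / scale

def scaled_head(x, scale, w_q, w_k, w_v):
    # Standard scaled dot-product self-attention on the smoothed series.
    xs = smooth(x, scale)
    q, k, v = xs @ w_q, xs @ w_k, xs @ w_v
    weights = softmax(q @ k.T / np.sqrt(q.shape[-1]))
    return weights @ v  # shape (T, d_head)

def multi_scale_attention(x, scales, params, mix):
    # One head per scale, stacked to shape (H, T, d_head).
    heads = np.stack([scaled_head(x, s, *p) for s, p in zip(scales, params)])
    # Assumed cross-talk: every output head is a convex combination
    # of all heads, with weights given by the rows of softmax(mix).
    mixed = np.einsum("gh,htd->gtd", softmax(mix, axis=-1), heads)
    return np.concatenate(list(mixed), axis=-1)  # shape (T, H * d_head)

rng = np.random.default_rng(0)
T, d_in, d_head = 32, 6, 4
scales = [1, 4, 8]  # hypothetical window lengths, one per head
x = rng.normal(size=(T, d_in))  # synthetic multivariate series
params = [tuple(rng.normal(size=(d_in, d_head)) * 0.1 for _ in range(3))
          for _ in scales]
mix = np.zeros((len(scales), len(scales)))  # uniform cross-talk weights
out = multi_scale_attention(x, scales, params, mix)
print(out.shape)  # (32, 12)
```

In this sketch a large diagonal in `mix` would approximate independent heads, as in ordinary multi-head attention, while off-diagonal weight lets a pattern found at one scale inform the others. In a trained model `mix` and the projection matrices would be learned; here they are fixed only to keep the example short.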

Original language	English (US)
Title of host publication	Proceedings - 4th International Conference on Multimedia Information Processing and Retrieval, MIPR 2021
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	151-157
Number of pages	7
ISBN (Electronic)	9781665418652
DOIs	https://doi.org/10.1109/MIPR51284.2021.00030
State	Published - 2021
Event	4th IEEE International Conference on Multimedia Information Processing and Retrieval, MIPR 2021 - Virtual, Online, Japan Duration: Sep 8 2021 → Sep 10 2021

Publication series

Name	Proceedings - 4th International Conference on Multimedia Information Processing and Retrieval, MIPR 2021

Conference

Conference	4th IEEE International Conference on Multimedia Information Processing and Retrieval, MIPR 2021
Country/Territory	Japan
City	Virtual, Online
Period	9/8/21 → 9/10/21

Keywords

Information descriptors
Multi-head attention
Multi-scale features
Transformer

ASJC Scopus subject areas

Media Technology
Computer Networks and Communications
Signal Processing

Access to Document

10.1109/MIPR51284.2021.00030

Cite this

Garg, Y., & Candan, K. S. (2021). XM2A: Multi-Scale Multi-Head Attention with Cross-Talk for Multi-Variate Time Series Analysis. In Proceedings - 4th International Conference on Multimedia Information Processing and Retrieval, MIPR 2021 (pp. 151-157). (Proceedings - 4th International Conference on Multimedia Information Processing and Retrieval, MIPR 2021). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/MIPR51284.2021.00030

@inproceedings{0cf69bced1484190aeafa6244637b359,

title = "XM2A: Multi-Scale Multi-Head Attention with Cross-Talk for Multi-Variate Time Series Analysis",

abstract = "Advances in sensory technologies are enabling the capture of a diverse spectrum of real-world data streams. The increasing availability of such data, especially in the form of multivariate time series, opens new opportunities for applications that rely on identifying and leveraging complex temporal patterns. A particular challenge such applications face is that complex patterns consist of multiple simpler patterns of varying scales (temporal lengths). While several recent works (such as multi-head attention networks) recognize that complex patterns need to be understood in terms of multiple simpler patterns, we note that existing works lack the ability to represent the interactions across these constituent patterns. To tackle this limitation, we propose a novel Multi-scale Multi-head Attention with Cross-Talk (XM2A) framework. XM2A represents the multi-scale patterns that make up a complex pattern by configuring each attention head to learn a pattern at a particular scale, and it accounts for the co-existence of patterns at multiple scales through a cross-talking mechanism among the heads. Experiments show that XM2A outperforms state-of-the-art attention mechanisms, such as Transformer and MSMSA, on benchmark datasets, such as SADD, AUSLAN, and MOCAP.",

keywords = "Information descriptors, Multi-head attention, Multi-scale features, Transformer",

author = "Yash Garg and Candan, {K. Sel{\c c}uk}",

note = "Publisher Copyright: {\textcopyright} 2021 IEEE.; 4th IEEE International Conference on Multimedia Information Processing and Retrieval, MIPR 2021 ; Conference date: 08-09-2021 Through 10-09-2021",

year = "2021",

doi = "10.1109/MIPR51284.2021.00030",

language = "English (US)",

series = "Proceedings - 4th International Conference on Multimedia Information Processing and Retrieval, MIPR 2021",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "151--157",

booktitle = "Proceedings - 4th International Conference on Multimedia Information Processing and Retrieval, MIPR 2021",

}

TY - GEN

T1 - XM2A

T2 - 4th IEEE International Conference on Multimedia Information Processing and Retrieval, MIPR 2021

AU - Garg, Yash

AU - Candan, K. Selçuk

PY - 2021

Y1 - 2021

AB - Advances in sensory technologies are enabling the capture of a diverse spectrum of real-world data streams. The increasing availability of such data, especially in the form of multivariate time series, opens new opportunities for applications that rely on identifying and leveraging complex temporal patterns. A particular challenge such applications face is that complex patterns consist of multiple simpler patterns of varying scales (temporal lengths). While several recent works (such as multi-head attention networks) recognize that complex patterns need to be understood in terms of multiple simpler patterns, we note that existing works lack the ability to represent the interactions across these constituent patterns. To tackle this limitation, we propose a novel Multi-scale Multi-head Attention with Cross-Talk (XM2A) framework. XM2A represents the multi-scale patterns that make up a complex pattern by configuring each attention head to learn a pattern at a particular scale, and it accounts for the co-existence of patterns at multiple scales through a cross-talking mechanism among the heads. Experiments show that XM2A outperforms state-of-the-art attention mechanisms, such as Transformer and MSMSA, on benchmark datasets, such as SADD, AUSLAN, and MOCAP.

KW - Information descriptors

KW - Multi-head attention

KW - Multi-scale features

KW - Transformer

UR - http://www.scopus.com/inward/record.url?scp=85126211878&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85126211878&partnerID=8YFLogxK

U2 - 10.1109/MIPR51284.2021.00030

DO - 10.1109/MIPR51284.2021.00030

M3 - Conference contribution

AN - SCOPUS:85126211878

T3 - Proceedings - 4th International Conference on Multimedia Information Processing and Retrieval, MIPR 2021

SP - 151

EP - 157

BT - Proceedings - 4th International Conference on Multimedia Information Processing and Retrieval, MIPR 2021

PB - Institute of Electrical and Electronics Engineers Inc.

Y2 - 8 September 2021 through 10 September 2021

ER -
