ASP vision: Optically computing the first layer of convolutional neural networks using angle sensitive pixels

Huaijin G. Chen; Suren Jayasuriya; Jiyue Yang; Judy Stephen; Sriram Sivaramakrishnan; Ashok Veeraraghavan; Alyosha Molnar

doi:10.1109/CVPR.2016.104

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

50 Scopus citations

Abstract

Deep learning using convolutional neural networks (CNNs) is quickly becoming the state of the art for challenging computer vision applications. However, deep learning's power consumption and bandwidth requirements currently limit its application in embedded and mobile systems with tight energy budgets. In this paper, we explore the energy savings of optically computing the first layer of CNNs. To do so, we utilize bio-inspired Angle Sensitive Pixels (ASPs), custom CMOS diffractive image sensors which act similarly to Gabor filter banks in the V1 layer of the human visual cortex. ASPs replace both image sensing and the first layer of a conventional CNN by directly performing optical edge filtering, saving sensing energy, data bandwidth, and CNN FLOPs. Our experimental results (both on synthetic data and a hardware prototype) for a variety of vision tasks such as digit recognition, object recognition, and face identification demonstrate a 97% reduction in image sensor power consumption and a 90% reduction in data bandwidth from sensor to CPU, while achieving similar performance compared to traditional deep learning pipelines.
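The idea of a fixed, Gabor-like first layer can be illustrated in software. The sketch below is only an emulation under stated assumptions, not the authors' ASP response model or their network: the function names (`gabor_kernel`, `correlate_valid`, `fixed_first_layer`) and all parameter values (kernel size, wavelength, sigma, number of orientations) are hypothetical and chosen for illustration. It builds a small bank of odd-symmetric Gabor kernels at several orientations and applies them to an image as an untrained first convolutional layer would.

```python
import math


def gabor_kernel(size, wavelength, theta, sigma, phase=0.0):
    """Return a size x size real Gabor kernel as a list of rows (size is odd)."""
    half = size // 2
    kernel = []
    for y in range(-half, half + 1):
        row = []
        for x in range(-half, half + 1):
            # Rotate coordinates so the sinusoid varies along direction theta.
            xr = x * math.cos(theta) + y * math.sin(theta)
            yr = -x * math.sin(theta) + y * math.cos(theta)
            envelope = math.exp(-(xr * xr + yr * yr) / (2.0 * sigma * sigma))
            carrier = math.cos(2.0 * math.pi * xr / wavelength + phase)
            row.append(envelope * carrier)
        kernel.append(row)
    return kernel


def correlate_valid(image, kernel):
    """Slide the kernel over the image without padding (CNN-style correlation)."""
    k = len(kernel)
    out_h = len(image) - k + 1
    out_w = len(image[0]) - k + 1
    return [
        [
            sum(kernel[i][j] * image[r + i][c + j]
                for i in range(k) for j in range(k))
            for c in range(out_w)
        ]
        for r in range(out_h)
    ]


def fixed_first_layer(image, n_orientations=4, size=5, wavelength=4.0, sigma=2.0):
    """Apply a bank of fixed (untrained) Gabor filters; one feature map per orientation."""
    thetas = [n * math.pi / n_orientations for n in range(n_orientations)]
    # phase = pi/2 gives odd-symmetric kernels, which respond to edges.
    return [
        correlate_valid(image, gabor_kernel(size, wavelength, t, sigma, math.pi / 2))
        for t in thetas
    ]


# Usage: an 8x8 image with a vertical step edge between columns 3 and 4.
image = [[0.0] * 4 + [1.0] * 4 for _ in range(8)]
feature_maps = fixed_first_layer(image)
for n, fmap in enumerate(feature_maps):
    strongest = max(abs(v) for row in fmap for v in row)
    print(f"orientation {n}: strongest response {strongest:.3f}")
```

In this emulation the filter whose sinusoid varies horizontally responds to the vertical edge, while the one varying vertically does not. In the paper's setting the corresponding filtering is performed optically by the sensor, so only the filtered outputs would need to be passed to the remaining CNN layers.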

Original language	English (US)
Title of host publication	Proceedings - 29th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016
Publisher	IEEE Computer Society
Pages	903-912
Number of pages	10
ISBN (Electronic)	9781467388504
DOIs	https://doi.org/10.1109/CVPR.2016.104
State	Published - Dec 9 2016
Externally published	Yes
Event	29th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016 - Las Vegas, United States; Duration: Jun 26 2016 → Jul 1 2016

Publication series

Name	Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
Volume	2016-December
ISSN (Print)	1063-6919

Conference

Conference	29th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016
Country/Territory	United States
City	Las Vegas
Period	6/26/16 → 7/1/16

ASJC Scopus subject areas

Software
Computer Vision and Pattern Recognition

Cite this

Chen, H. G., Jayasuriya, S., Yang, J., Stephen, J., Sivaramakrishnan, S., Veeraraghavan, A., & Molnar, A. (2016). ASP vision: Optically computing the first layer of convolutional neural networks using angle sensitive pixels. In Proceedings - 29th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016 (pp. 903-912). Article 7780473 (Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition; Vol. 2016-December). IEEE Computer Society. https://doi.org/10.1109/CVPR.2016.104

@inproceedings{204663ad49064b129fc72fed0ad4b6b7,

title = "ASP vision: Optically computing the first layer of convolutional neural networks using angle sensitive pixels",

author = "Chen, {Huaijin G.} and Suren Jayasuriya and Jiyue Yang and Judy Stephen and Sriram Sivaramakrishnan and Ashok Veeraraghavan and Alyosha Molnar",

year = "2016",

month = dec,

day = "9",

doi = "10.1109/CVPR.2016.104",

language = "English (US)",

series = "Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition",

publisher = "IEEE Computer Society",

pages = "903--912",

booktitle = "Proceedings - 29th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016",

note = "29th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016 ; Conference date: 26-06-2016 Through 01-07-2016",

}

TY - GEN

T1 - ASP vision

T2 - 29th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016

AU - Chen, Huaijin G.

AU - Jayasuriya, Suren

AU - Yang, Jiyue

AU - Stephen, Judy

AU - Sivaramakrishnan, Sriram

AU - Veeraraghavan, Ashok

AU - Molnar, Alyosha

PY - 2016/12/9

Y1 - 2016/12/9

UR - http://www.scopus.com/inward/record.url?scp=84986317236&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84986317236&partnerID=8YFLogxK

U2 - 10.1109/CVPR.2016.104

DO - 10.1109/CVPR.2016.104

M3 - Conference contribution

AN - SCOPUS:84986317236

T3 - Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition

SP - 903

EP - 912

BT - Proceedings - 29th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016

PB - IEEE Computer Society

Y2 - 26 June 2016 through 1 July 2016

ER -
