TY - GEN
T1 - An ultra low power SIMD processor for wireless devices
AU - Woh, Mark
AU - Seo, Sangwon
AU - Chakrabarti, Chaitali
AU - Mahlke, Scott
AU - Mudge, Trevor
PY - 2010/12/1
Y1 - 2010/12/1
N2 - This paper presents an ultra low power programmable processor architecture for wireless devices that support 4G wireless communications and video decoding. To derive such an architecture, first we analyzed the kernel algorithms that constitute these applications. The characteristics of these algorithms helped define the wide-SIMD architecture, where the SIMD width can be configured at run time to the specifics of the algorithm being executed. For ultra low power operation, we advocate operating the processor at near threshold voltage. While a combination of near-threshold circuit techniques and parallel SIMD computations achieves excellent energy efficiency, near-threshold operations suffer from large delay variations due to increased process variability. The paper explores low overhead architectural techniques to tolerate and mitigate problems due to delay variations. The techniques include replication of SIMD functional units to replace faulty ones and use of an XRAM crossbar to efficiently set up the new error-free SIMD datapath.
AB - This paper presents an ultra low power programmable processor architecture for wireless devices that support 4G wireless communications and video decoding. To derive such an architecture, first we analyzed the kernel algorithms that constitute these applications. The characteristics of these algorithms helped define the wide-SIMD architecture, where the SIMD width can be configured at run time to the specifics of the algorithm being executed. For ultra low power operation, we advocate operating the processor at near threshold voltage. While a combination of near-threshold circuit techniques and parallel SIMD computations achieves excellent energy efficiency, near-threshold operations suffer from large delay variations due to increased process variability. The paper explores low overhead architectural techniques to tolerate and mitigate problems due to delay variations. The techniques include replication of SIMD functional units to replace faulty ones and use of an XRAM crossbar to efficiently set up the new error-free SIMD datapath.
UR - http://www.scopus.com/inward/record.url?scp=79957990949&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79957990949&partnerID=8YFLogxK
U2 - 10.1109/ACSSC.2010.5757542
DO - 10.1109/ACSSC.2010.5757542
M3 - Conference contribution
AN - SCOPUS:79957990949
SN - 9781424497218
T3 - Conference Record - Asilomar Conference on Signals, Systems and Computers
SP - 390
EP - 394
BT - Conference Record of the 44th Asilomar Conference on Signals, Systems and Computers, Asilomar 2010
T2 - 44th Asilomar Conference on Signals, Systems and Computers, Asilomar 2010
Y2 - 7 November 2010 through 10 November 2010
ER -