Robotic Knee Prosthesis Real-Time Control Using Reinforcement Learning with Human in the Loop

Yue Wen; Xiang Gao; Jennie Si; Andrea Brandt; Minhan Li; He (Helen) Huang

doi:10.1007/978-981-13-7983-3_41

Robotic Knee Prosthesis Real-Time Control Using Reinforcement Learning with Human in the Loop

Yue Wen, Xiang Gao, Jennie Si, Andrea Brandt, Minhan Li, He (Helen) Huang

Electrical, Computer, and Energy Engineering, School of (IAFSE-ECEE)

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

6 Scopus citations

Abstract

Advanced robotic prostheses are expensive considering the cost of human resources and the time spent on manually tuning the high-dimensional control parameters for individual users. To alleviate clinicians’ effort and promote the advanced robotic prosthesis, we implemented an optimal adaptive control algorithm, which fundamentally is a type of reinforcement learning method, to automatically tune the high-dimensional control parameters of a robotic knee prosthesis through interaction with a human-prosthesis system. The ‘human-in-the-loop’ term means that the learning controller tunes the control parameters based on the performance of the robotic knee prosthesis while an amputee subject walking with it. We validated the human-in-the-loop auto-tuner with one transfemoral amputee subject for 4 hour-long lab testing sessions. Our results demonstrated that this novel reinforcement learning controller was able to learn through interaction with the human-prosthesis system and discover a set of suitable control parameter for the amputee user to generate near-normative knee kinematics.

Original language	English (US)
Title of host publication	Cognitive Systems and Signal Processing - 4th International Conference, ICCSIP 2018, Revised Selected Papers
Editors	Dewen Hu, Huaping Liu, Fuchun Sun
Publisher	Springer Verlag
Pages	463-473
Number of pages	11
ISBN (Print)	9789811379826
DOIs	https://doi.org/10.1007/978-981-13-7983-3_41
State	Published - 2019
Event	4th International Conference on Cognitive Systems and Information Processing, ICCSIP 2018 - Beijing, China Duration: Nov 29 2018 → Dec 1 2018

Publication series

Name	Communications in Computer and Information Science
Volume	1005
ISSN (Print)	1865-0929
ISSN (Electronic)	1865-0937

Conference

Conference	4th International Conference on Cognitive Systems and Information Processing, ICCSIP 2018
Country/Territory	China
City	Beijing
Period	11/29/18 → 12/1/18

Keywords

Amputees
Gait symmetry
Machine learning
Prosthetic knee kinematics
Reinforcement learning
Robotic knee prosthesis

ASJC Scopus subject areas

General Computer Science
General Mathematics

Access to Document

10.1007/978-981-13-7983-3_41

Cite this

Wen, Y., Gao, X., Si, J., Brandt, A., Li, M., & Huang, H. (2019). Robotic Knee Prosthesis Real-Time Control Using Reinforcement Learning with Human in the Loop. In D. Hu, H. Liu, & F. Sun (Eds.), Cognitive Systems and Signal Processing - 4th International Conference, ICCSIP 2018, Revised Selected Papers (pp. 463-473). (Communications in Computer and Information Science; Vol. 1005). Springer Verlag. https://doi.org/10.1007/978-981-13-7983-3_41

Robotic Knee Prosthesis Real-Time Control Using Reinforcement Learning with Human in the Loop. / Wen, Yue; Gao, Xiang; Si, Jennie et al.
Cognitive Systems and Signal Processing - 4th International Conference, ICCSIP 2018, Revised Selected Papers. ed. / Dewen Hu; Huaping Liu; Fuchun Sun. Springer Verlag, 2019. p. 463-473 (Communications in Computer and Information Science; Vol. 1005).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Wen, Y, Gao, X, Si, J, Brandt, A, Li, M & Huang, H 2019, Robotic Knee Prosthesis Real-Time Control Using Reinforcement Learning with Human in the Loop. in D Hu, H Liu & F Sun (eds), Cognitive Systems and Signal Processing - 4th International Conference, ICCSIP 2018, Revised Selected Papers. Communications in Computer and Information Science, vol. 1005, Springer Verlag, pp. 463-473, 4th International Conference on Cognitive Systems and Information Processing, ICCSIP 2018, Beijing, China, 11/29/18. https://doi.org/10.1007/978-981-13-7983-3_41

Wen Y, Gao X, Si J, Brandt A, Li M, Huang H. Robotic Knee Prosthesis Real-Time Control Using Reinforcement Learning with Human in the Loop. In Hu D, Liu H, Sun F, editors, Cognitive Systems and Signal Processing - 4th International Conference, ICCSIP 2018, Revised Selected Papers. Springer Verlag. 2019. p. 463-473. (Communications in Computer and Information Science). doi: 10.1007/978-981-13-7983-3_41

Wen, Yue ; Gao, Xiang ; Si, Jennie et al. / Robotic Knee Prosthesis Real-Time Control Using Reinforcement Learning with Human in the Loop. Cognitive Systems and Signal Processing - 4th International Conference, ICCSIP 2018, Revised Selected Papers. editor / Dewen Hu ; Huaping Liu ; Fuchun Sun. Springer Verlag, 2019. pp. 463-473 (Communications in Computer and Information Science).

@inproceedings{4102443425b24292bd76f602fac4226a,

title = "Robotic Knee Prosthesis Real-Time Control Using Reinforcement Learning with Human in the Loop",

abstract = "Advanced robotic prostheses are expensive considering the cost of human resources and the time spent on manually tuning the high-dimensional control parameters for individual users. To alleviate clinicians{\textquoteright} effort and promote the advanced robotic prosthesis, we implemented an optimal adaptive control algorithm, which fundamentally is a type of reinforcement learning method, to automatically tune the high-dimensional control parameters of a robotic knee prosthesis through interaction with a human-prosthesis system. The {\textquoteleft}human-in-the-loop{\textquoteright} term means that the learning controller tunes the control parameters based on the performance of the robotic knee prosthesis while an amputee subject walking with it. We validated the human-in-the-loop auto-tuner with one transfemoral amputee subject for 4 hour-long lab testing sessions. Our results demonstrated that this novel reinforcement learning controller was able to learn through interaction with the human-prosthesis system and discover a set of suitable control parameter for the amputee user to generate near-normative knee kinematics.",

keywords = "Amputees, Gait symmetry, Machine learning, Prosthetic knee kinematics, Reinforcement learning, Robotic knee prosthesis",

author = "Yue Wen and Xiang Gao and Jennie Si and Andrea Brandt and Minhan Li and Huang, {He (Helen)}",

note = "Funding Information: This work was partly supported by National Science Foundation #1563454, #1563921, #1808752 and #1808898. Publisher Copyright: © 2019, Springer Nature Singapore Pte Ltd.; 4th International Conference on Cognitive Systems and Information Processing, ICCSIP 2018 ; Conference date: 29-11-2018 Through 01-12-2018",

year = "2019",

doi = "10.1007/978-981-13-7983-3_41",

language = "English (US)",

isbn = "9789811379826",

series = "Communications in Computer and Information Science",

publisher = "Springer Verlag",

pages = "463--473",

editor = "Dewen Hu and Huaping Liu and Fuchun Sun",

booktitle = "Cognitive Systems and Signal Processing - 4th International Conference, ICCSIP 2018, Revised Selected Papers",

}

TY - GEN

T1 - Robotic Knee Prosthesis Real-Time Control Using Reinforcement Learning with Human in the Loop

AU - Wen, Yue

AU - Gao, Xiang

AU - Si, Jennie

AU - Brandt, Andrea

AU - Li, Minhan

AU - Huang, He (Helen)

PY - 2019

Y1 - 2019

N2 - Advanced robotic prostheses are expensive considering the cost of human resources and the time spent on manually tuning the high-dimensional control parameters for individual users. To alleviate clinicians’ effort and promote the advanced robotic prosthesis, we implemented an optimal adaptive control algorithm, which fundamentally is a type of reinforcement learning method, to automatically tune the high-dimensional control parameters of a robotic knee prosthesis through interaction with a human-prosthesis system. The ‘human-in-the-loop’ term means that the learning controller tunes the control parameters based on the performance of the robotic knee prosthesis while an amputee subject walking with it. We validated the human-in-the-loop auto-tuner with one transfemoral amputee subject for 4 hour-long lab testing sessions. Our results demonstrated that this novel reinforcement learning controller was able to learn through interaction with the human-prosthesis system and discover a set of suitable control parameter for the amputee user to generate near-normative knee kinematics.

AB - Advanced robotic prostheses are expensive considering the cost of human resources and the time spent on manually tuning the high-dimensional control parameters for individual users. To alleviate clinicians’ effort and promote the advanced robotic prosthesis, we implemented an optimal adaptive control algorithm, which fundamentally is a type of reinforcement learning method, to automatically tune the high-dimensional control parameters of a robotic knee prosthesis through interaction with a human-prosthesis system. The ‘human-in-the-loop’ term means that the learning controller tunes the control parameters based on the performance of the robotic knee prosthesis while an amputee subject walking with it. We validated the human-in-the-loop auto-tuner with one transfemoral amputee subject for 4 hour-long lab testing sessions. Our results demonstrated that this novel reinforcement learning controller was able to learn through interaction with the human-prosthesis system and discover a set of suitable control parameter for the amputee user to generate near-normative knee kinematics.

KW - Amputees

KW - Gait symmetry

KW - Machine learning

KW - Prosthetic knee kinematics

KW - Reinforcement learning

KW - Robotic knee prosthesis

UR - http://www.scopus.com/inward/record.url?scp=85065759212&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85065759212&partnerID=8YFLogxK

U2 - 10.1007/978-981-13-7983-3_41

DO - 10.1007/978-981-13-7983-3_41

M3 - Conference contribution

AN - SCOPUS:85065759212

SN - 9789811379826

T3 - Communications in Computer and Information Science

SP - 463

EP - 473

BT - Cognitive Systems and Signal Processing - 4th International Conference, ICCSIP 2018, Revised Selected Papers

A2 - Hu, Dewen

A2 - Liu, Huaping

A2 - Sun, Fuchun

PB - Springer Verlag

T2 - 4th International Conference on Cognitive Systems and Information Processing, ICCSIP 2018

Y2 - 29 November 2018 through 1 December 2018

ER -

Robotic Knee Prosthesis Real-Time Control Using Reinforcement Learning with Human in the Loop

Abstract

Publication series

Conference

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this