Keyvan Majd

Safe Robot Learning in Assistive Devices through Neural Network Repair

K. Majd, G. Clark, T. Khandait, S. Zhou, S. Sankaranarayanan, G. Fainekos, H. Ben Amor

Abstract: Guaranteeing safety in human-centric applications is critical in robot learning as the learned policies may demonstrate unsafe behaviors in formerly unseen scenarios. We present a framework to locally repair an erroneous policy network to satisfy a set of formal safety constraints using Mixed Integer Quadratic Programming (MIQP). Our MIQP formulation explicitly imposes the safety constraints to the learned policy while minimizing the original loss function. We demonstrate the application of our framework to derive safe policies for a robotic lower-leg prosthesis.

Full Paper (CoRL) Full Paper (NeurIPS) Source Code Talk (NeurIPS) Poster

Publications

NeurIPS'22

Certifiably-correct Control Policies for Safe Learning and Adaptation in Assistive Robotics

(Best Paper 2nd Runner-up Award)

K. Majd, G. Clark, T. Khandait, S. Zhou, S. Sankaranarayanan, G. Fainekos, H. Ben Amor

Neural Information Processing Systems (NeurIPS) - Robot Learning Workshop, 2022.

[BibTex]

CoRL'22

Safe Robot Learning in Assistive Devices through Neural Network Repair

K. Majd, G. Clark, T. Khandait, S. Zhou, S. Sankaranarayanan, G. Fainekos, H. Ben Amor

Conference on Robot Learning (CoRL), 2022.

[BibTex]