Towards optimally decentralized multi-robot collision avoidance via deep reinforcement learning - Details

author：

Indexed by：

Scopus

Abstract：

Developing　a　safe　and　efficient　collision　avoidance　policy　for　multiple　robots　is　challenging　in　the　decentralized　scenarios　where　each　robot　generates　its　paths　without　observing　other　robots＇　states　and　intents.　While　other　distributed　multi-robot　collision　avoidance　systems　exist,　they　often　require　extracting　agent-level　features　to　plan　a　local　collision-free　action,　which　can　be　computationally　prohibitive　and　not　robust.　More　importantly,　in　practice　the　performance　of　these　methods　are　much　lower　than　their　centralized　counterparts.　We　present　a　decentralized　sensor-level　collision　avoidance　policy　for　multi-robot　systems,　which　directly　maps　raw　sensor　measurements　to　an　agent＇s　steering　commands　in　terms　of　movement　velocity.　As　a　first　step　toward　reducing　the　performance　gap　between　decentralized　and　centralized　methods,　we　present　a　multi-scenario　multi-stage　training　framework　to　learn　an　optimal　policy.　The　policy　is　trained　over　a　large　number　of　robots　on　rich,　complex　environments　simultaneously　using　a　policy　gradient　based　reinforcement　learning　algorithm.　We　validate　the　learned　sensor-level　collision　avoidance　policy　in　a　variety　of　simulated　scenarios　with　thorough　performance　evaluations　and　show　that　the　final　learned　policy　is　able　to　find　time　efficient,　collision-free　paths　for　a　large-scale　robot　system.　We　also　demonstrate　that　the　learned　policy　can　be　well　generalized　to　new　scenarios　that　do　not　appear　in　the　entire　training　period,　including　navigating　a　heterogeneous　group　of　robots　and　a　large-scale　scenario　with　100　robots.　Videos　are　available　at　https://sites.google.com/view/drlmaca.　©　2018　IEEE.

Keyword：

Community：

[ 1 ] [Long, P.]Dorabot Inc., Shenzhen, China
[ 2 ] [Fanl, T.]Dorabot Inc., Shenzhen, China
[ 3 ] [Liao, X.]Dorabot Inc., Shenzhen, China
[ 4 ] [Liu, W.]Department of Computer Science, Fuzhou University, Fuzhou, China
[ 5 ] [Zhang, H.]Dorabot Inc., Shenzhen, China
[ 6 ] [Pan, J.]Department of Mechanical and Biomedical Engineering, City University of Hong Kong, Hong Kong

Reprint 's Address：

Email：

Show more details

Related Keywords：

Source ：

Proceedings - IEEE International Conference on Robotics and Automation

ISSN： 1050-4729

Year： 2018

Page： 6252-6259

Language： English

Cited Count：

WoS CC Cited Count：

SCOPUS Cited Count： 406

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 2

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search SCOPUS

Type
Departments

All Years Choose Year From to