Distributed multi-robot collision avoidance via deep reinforcement learning for navigation in complex scenarios - Details

author：

Fan, Tingxiang (Fan, Tingxiang.) ^[1] | Long, Pinxin (Long, Pinxin.) ^[2] | Liu, Wenxi (Liu, Wenxi.) ^[3] (Scholars：刘文犀) | Pan, Jia (Pan, Jia.) ^[4]

Indexed by：

EI Scopus SCIE

Abstract：

Developing　a　safe　and　efficient　collision-avoidance　policy　for　multiple　robots　is　challenging　in　the　decentralized　scenarios　where　each　robot　generates　its　paths　with　limited　observation　of　other　robots＇　states　and　intentions.　Prior　distributed　multi-robot　collision-avoidance　systems　often　require　frequent　inter-robot　communication　or　agent-level　features　to　plan　a　local　collision-free　action,　which　is　not　robust　and　computationally　prohibitive.　In　addition,　the　performance　of　these　methods　is　not　comparable　with　their　centralized　counterparts　in　practice.　In　this　article,　we　present　a　decentralized　sensor-level　collision-avoidance　policy　for　multi-robot　systems,　which　shows　promising　results　in　practical　applications.　In　particular,　our　policy　directly　maps　raw　sensor　measurements　to　an　agent＇s　steering　commands　in　terms　of　the　movement　velocity.　As　a　first　step　toward　reducing　the　performance　gap　between　decentralized　and　centralized　methods,　we　present　a　multi-scenario　multi-stage　training　framework　to　learn　an　optimal　policy.　The　policy　is　trained　over　a　large　number　of　robots　in　rich,　complex　environments　simultaneously　using　a　policy-gradient-based　reinforcement-learning　algorithm.　The　learning　algorithm　is　also　integrated　into　a　hybrid　control　framework　to　further　improve　the　policy＇s　robustness　and　effectiveness.　We　validate　the　learned　sensor-level　collision-3avoidance　policy　in　a　variety　of　simulated　and　real-world　scenarios　with　thorough　performance　evaluations　for　large-scale　multi-robot　systems.　The　generalization　of　the　learned　policy　is　verified　in　a　set　of　unseen　scenarios　including　the　navigation　of　a　group　of　heterogeneous　robots　and　a　large-scale　scenario　with　100　robots.　Although　the　policy　is　trained　using　simulation　data　only,　we　have　successfully　deployed　it　on　physical　robots　with　shapes　and　dynamics　characteristics　that　are　different　from　the　simulated　agents,　in　order　to　demonstrate　the　controller＇s　robustness　against　the　simulation-to-real　modeling　error.　Finally,　we　show　that　the　collision-avoidance　policy　learned　from　multi-robot　navigation　tasks　provides　an　excellent　solution　for　safe　and　effective　autonomous　navigation　for　a　single　robot　working　in　a　dense　real　human　crowd.　Our　learned　policy　enables　a　robot　to　make　effective　progress　in　a　crowd　without　getting　stuck.　More　importantly,　the　policy　has　been　successfully　deployed　on　different　types　of　physical　robot　platforms　without　tedious　parameter　tuning.　Videos　are　available　at　.

Keyword：

Distributed collision avoidance hybrid control multi-robot systems multi-scenario multi-stage reinforcement learning

Community：

[ 1 ] [Fan, Tingxiang]Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[ 2 ] [Pan, Jia]Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[ 3 ] [Long, Pinxin]Baidu Inc, Baidu Res, Beijing, Peoples R China
[ 4 ] [Liu, Wenxi]Fuzhou Univ, Coll Math & Comp Sci, Fuzhou, Peoples R China

Reprint 's Address：

[Pan, Jia]Univ Hong Kong, Dept Comp Sci, Pokfulam, Chow Yei Ching Bldg,Room 410, Hong Kong, Peoples R China

Email：

jpan@cs.hku.hk

Show more details

Version：

Distributed multi-robot collision avoidance via deep reinforcement learning for navigation in complex scenarios
2020，International Journal of Robotics Research
Distributed multi-robot collision avoidance via deep reinforcement learning for navigation in complex scenarios
2020，International Journal of Robotics Research

Related Keywords：

6G Mobile Communications for Multi-Robot Smart Factory
2021，Journal of ICT Standardization
A Human Decision-Making Behavior Model for Human-Robot Interaction in Multi-Robot Systems
2020，IEEE ACCESS
A behavior control scheme for multi-robot systems under human intervention
2019，38th Chinese Control Conference, CCC 2019
An Improved Deep Reinforcement Learning for Robot Navigation
2023，3rd International Conference on Machine Learning and Computer Application, ICMLCA 2022

Source ：

INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH

ISSN： 0278-3649

Year： 2020

Issue： 7

Volume： 39

Page： 856-892

4 . 7 0 3

JCR@2020

7 . 5 0 0

JCR@2023

ESI Discipline： ENGINEERING;

ESI HC Threshold：132

JCR Journal Grade：1

Cited Count：

WoS CC Cited Count： 103

SCOPUS Cited Count： 187

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 2

Affiliated Colleges：

计算机与大数据学院、软件学院本学院/部未明确归属的数据

Get Fulltext

DOI Library Discovery Baidu Scholar Search Web of Science

Type
Departments

All Years Choose Year From to