Locomotion Control Framework for Snake-like Robot using Deep Reinforcement Learning

© 2021 by IJCTT Journal
Volume-69 Issue-7
Year of Publication : 2021
Authors : Obe Olumide O, Ayogu Thomas O
DOI : 10.14445/22312803/IJCTT-V69I7P103

How to Cite?

Obe Olumide O, Ayogu Thomas O, "Locomotion Control Framework for Snake-like Robot using Deep Reinforcement Learning," International Journal of Computer Trends and Technology, vol. 69, no. 7, pp. 24-28, 2021. Crossref, https://doi.org/10.14445/22312803/IJCTT-V69I7P103

Abstract
In many industries around the globe today, robots can be seen carrying out a wide range of tasks. These robots can lift heavy loads, move at remarkable speeds, and execute tasks with pinpoint accuracy. Yet despite this impressive repertoire, most robots find it very difficult to adapt to new and unfamiliar environments. This is because human environments are so dynamic and unpredictable that the required behavior is very difficult to program in advance; instead, it must be learned firsthand by the robot. The desire to build machines that learn behavior from the environment presented to them is one of the goals of Reinforcement Learning (RL). Reinforcement learning, an area of machine learning inspired by behavioral psychology, allows an agent, the learner and decision-maker, to automatically and autonomously discover optimal behavior through trial-and-error interactions with its environment as it attempts to solve problems. In this paper, we present a control framework for snake-like robot locomotion based on Deep Reinforcement Learning.
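To make the trial-and-error loop described above concrete, the following minimal Python sketch trains a tabular Q-learning agent to drive a single, coarsely discretized snake joint toward a target angle. The ToyJointEnv environment, its reward shaping, and all hyperparameters are illustrative assumptions chosen for exposition; they are not the simulator or the deep network of the paper's framework.

# Minimal sketch of the RL trial-and-error loop from the abstract.
# ToyJointEnv and all hyperparameters are hypothetical, for illustration only.
import random

class ToyJointEnv:
    """Hypothetical 1-D task: rotate a single joint toward a target angle."""
    ANGLES = range(-5, 6)      # discretized joint angles
    ACTIONS = (-1, 0, 1)       # rotate down, hold, rotate up
    TARGET = 3

    def reset(self):
        self.angle = 0
        return self.angle

    def step(self, action):
        self.angle = max(-5, min(5, self.angle + action))
        reward = -abs(self.angle - self.TARGET)   # closer to target = better
        done = self.angle == self.TARGET
        return self.angle, reward, done

def train(episodes=500, alpha=0.1, gamma=0.9, epsilon=0.1):
    env = ToyJointEnv()
    q = {(s, a): 0.0 for s in env.ANGLES for a in env.ACTIONS}
    for _ in range(episodes):
        state, done, steps = env.reset(), False, 0
        while not done and steps < 50:
            # Epsilon-greedy: explore occasionally, otherwise act greedily.
            if random.random() < epsilon:
                action = random.choice(env.ACTIONS)
            else:
                action = max(env.ACTIONS, key=lambda a: q[(state, a)])
            next_state, reward, done = env.step(action)
            # Q-learning update: nudge the estimate toward the observed
            # reward plus the discounted value of the best next action.
            best_next = max(q[(next_state, a)] for a in env.ACTIONS)
            q[(state, action)] += alpha * (reward + gamma * best_next - q[(state, action)])
            state, steps = next_state, steps + 1
    return q

if __name__ == "__main__":
    q = train()
    policy = {s: max(ToyJointEnv.ACTIONS, key=lambda a: q[(s, a)]) for s in ToyJointEnv.ANGLES}
    print(policy)   # learned action for each discretized joint angle

The same interaction loop carries over to the deep setting, where the Q-table is replaced by a neural network and the discrete joint state by continuous sensor readings.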

Keywords
Locomotion, Snake-like robot, Framework, Reinforcement learning, Controller.

Reference

[1] S. Hirose, Biologically Inspired Robots: Snake-like Locomotors and Manipulators, Oxford University Press, Oxford, (1993).
[2] H. Ohno and S. Hirose, Design of slim slime robot and its gait of locomotion. Proceedings 2001 IEEE/RSJ International Conference on Intelligent Robots and Systems, (2001) 707-715.
[3] S. Ma, Analysis of snake movement forms for the realization of snake-like robots. In Proc. IEEE Int. Conf. Robotics and Automation, Detroit, MI, USA, (1999) 3007-3013.
[4] A. Cully, J. Clune, D. Tarapore, J. B. Mouret, Robots that can adapt like animals. Nature 521(7553) (2015) 503-507.
[5] A. J. Ijspeert, Central pattern generators for locomotion control in animals and robots: a review. Neural Networks 21(4) (2008) 642-653.
[6] M. Tesch, J. Schneider, and H. Choset, Using response surfaces and expected improvement to optimize snake robot gait parameters. 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems, (2011) 1069-1074.
[7] N. M. Nor, S. Ma, Smooth transition for CPG-based body shape control of a snake-like robot. Bioinspiration & Biomimetics 9(1) (2013) 016003.
[8] Z. Bing, L. Cheng, G. Chen, F. Rohrbein, K. Huang, A. Knoll, Towards autonomous locomotion: CPG-based control of smooth 3D slithering gait transition of a snake-like robot. Bioinspiration & Biomimetics 12(3) (2017) 035001.
[9] S. Chernova, M. Veloso, An evolutionary approach to gait learning for four-legged robots. IEEE/RSJ International Conference on Intelligent Robots and Systems, 3 (2004) 2562-2567.
[10] M. S. Kim, W. Uther, Automatic gait optimization for quadruped robots. Australian Conference on Robotics and Automation, Citeseer, (2003) 1-3.
[11] N. Kohl, P. Stone, Machine learning for fast quadrupedal locomotion. AAAI, 4 (2004) 611-616.
[12] J. Kober, J. A. Bagnell, J. Peters, Reinforcement learning in robotics: a survey. International Journal of Robotics Research 32(11) (2013) 1238-1274.
[13] X. B. Peng, G. Berseth, K. Yin, M. van de Panne, DeepLoco: dynamic locomotion skills using hierarchical deep reinforcement learning. ACM Transactions on Graphics 36(4) (2017) 41.
[14] A. Rajeswaran, V. Kumar, A. Gupta, G. Vezzani, J. Schulman, E. Todorov, S. Levine, Learning complex dexterous manipulation with deep reinforcement learning and demonstrations. arXiv preprint arXiv:1709.10087, (2017).
[15] P. Long, T. Fan, X. Liao, W. Liu, H. Zhang, J. Pan, Towards optimally decentralized multi-robot collision avoidance via deep reinforcement learning. IEEE International Conference on Robotics and Automation (ICRA), (2018) 6252-6259.
[16] J. Schulman, F. Wolski, P. Dhariwal, A. Radford, O. Klimov, Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347, (2017).
[17] D. Wu, X. Dong, J. Shen, S. C. H. Hoi, Reducing estimation bias via triplet-average deep deterministic policy gradient. IEEE Transactions on Neural Networks and Learning Systems, (2020) 1-1.
[18] T. P. Lillicrap, J. J. Hunt, A. Pritzel, N. Heess, T. Erez, Y. Tassa, D. Silver, D. Wierstra, Continuous control with deep reinforcement learning. arXiv preprint arXiv:1509.02971, (2015).