+
X
Skip to main content

Advertisement

Springer Nature Link
Log in
Menu
Find a journal Publish with us Track your research
Search
Cart
  1. Home
  2. Artificial Intelligence in Theory and Practice II
  3. Conference paper

Batch Reinforcement Learning for Controlling a Mobile Wheeled Pendulum Robot

  • Conference paper
  • pp 151–160
  • Cite this conference paper
Artificial Intelligence in Theory and Practice II (IFIP AI 2008)
Batch Reinforcement Learning for Controlling a Mobile Wheeled Pendulum Robot
  • Andrea Bonarini2,
  • Claudio Caccia3,
  • Alessandro Lazaric2 &
  • …
  • Marcello Restelli2 

Part of the book series: IFIP – The International Federation for Information Processing ((IFIPAICT,volume 276))

Included in the following conference series:

  • IFIP International Conference on Artificial Intelligence in Theory and Practice
  • 1513 Accesses

  • 13 Citations

Abstract

In this paper we present an application of Reinforcement Learning (RL) methods in the field of robot control. The main objective is to analyze the behavior of batch RL algorithms when applied to a mobile robot of the kind called Mobile Wheeled Pendulum (MWP). In this paper we focus on the common problem in classical control theory of following a reference state (e.g., position set point) and try to solve it by RL. In this case, the state space of the robot has one more dimension, in order to represent the desired variable state, while the cost function is evaluated considering the difference between the state and the reference. Within this framework some interesting aspects arise, like the ability of the RL algorithm to generalize to reference points never considered during the training phase. The performance of the learning method has been empirically analyzed and, when possible, compared to a classic control algorithm, namely linear quadratic optimal control (LQR).

Download to read the full chapter text

Chapter PDF

Similar content being viewed by others

Trajectory control of autonomous mobile robots considering disturbance with machine learning agents

Article 22 May 2023

Two-Wheeled Self-Balancing Mobile Robot Using Kalman Filter and LQG Regulator

Chapter © 2021

Reinforcement Learning-Based Two-Wheel Robot Control

Chapter © 2019

Explore related subjects

Discover the latest articles, books and news in related subjects, suggested using machine learning.
  • Control, Robotics, Automation
  • Learning algorithms
  • Machine Learning
  • Rehabilitation Robotics
  • Robotic Engineering
  • Robotics

References

  1. Baird, L.C.: Residual algorithms: Reinforcement learning with function approximation. In: Proceedings of the 12th Intl. Conf. on Machine Learning, pp. 30-37 (1995)

    Google Scholar 

  2. Ernst, D., Geurts, P., Wehenkel, L.: Tree-based batch mode reinforcement learning. Journal of Machine Learning Research 6, 503-556 (2005)

    MathSciNet  Google Scholar 

  3. Gordon, G.J.: Approximate solutions to markov decision processes. Ph.D. thesis, Carnegie Mellon University (1999)

    Google Scholar 

  4. Landau, L., Lifshitz, E.M.: Mechanics, Course of Theoretical Physics, Volume 1. Pergamon Pres (1976)

    Google Scholar 

  5. Ogata, K.: Modern Control Engineering (4th ed.). Prentice Hall PTR, Upper Saddle River, NJ, USA (2001)

    Google Scholar 

  6. Ormoneit, D., Sen, S.: Kernel-based reinforcement learning. Machine Learning 49(2-3), 161-178 (2002)

    Google Scholar 

  7. Press, W.H., Flannery, B.P., Teukolsky, S.A., Vetterling, W.T.: Numerical Recipes: The Art of Scientific Computing. Cambridge Univ. Press, New York, 1989. (1989)

    Google Scholar 

  8. Reddy, J.: Energy Principles and Variational Methods in Applied Mechanics (2nd ed.). John Wiley and Sons, Hoboken, NJ, USA (2002)

    Google Scholar 

  9. Riedmiller, M.: Neural fitted q iteration - first experiences with a data efficient neural reinforcement learning method. In: Proceedings of European Conference on Machine Learning, pp. 317-328 (2005)

    Google Scholar 

  10. Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. MIT Press, Cambridge, MA (1998)

    Google Scholar 

Download references

Author information

Authors and Affiliations

  1. Dept. of Electronics and Information, Politecnico di Milano, Piazza Leonardo da Vinci 32, I-20133, Milan, Italy

    Andrea Bonarini, Alessandro Lazaric & Marcello Restelli

  2. Dept. of Informatics, Systems and Communication, Università degli Studi di Milano – Bicocca, Viale Sarca 336, I-20126, Milan, Italy

    Claudio Caccia

Authors
  1. Andrea Bonarini
    View author publications

    Search author on:PubMed Google Scholar

  2. Claudio Caccia
    View author publications

    Search author on:PubMed Google Scholar

  3. Alessandro Lazaric
    View author publications

    Search author on:PubMed Google Scholar

  4. Marcello Restelli
    View author publications

    Search author on:PubMed Google Scholar

Editor information

Editors and Affiliations

  1. University of Portsmouth, UK

    Max Bramer

Rights and permissions

Reprints and permissions

Copyright information

© 2008 International Federation for Information Processing

About this paper

Cite this paper

Bonarini, A., Caccia, C., Lazaric, A., Restelli, M. (2008). Batch Reinforcement Learning for Controlling a Mobile Wheeled Pendulum Robot. In: Bramer, M. (eds) Artificial Intelligence in Theory and Practice II. IFIP AI 2008. IFIP – The International Federation for Information Processing, vol 276. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-09695-7_15

Download citation

  • .RIS
  • .ENW
  • .BIB
  • DOI: https://doi.org/10.1007/978-0-387-09695-7_15

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-0-387-09694-0

  • Online ISBN: 978-0-387-09695-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Share this paper

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Keywords

  • Mobile Robot
  • Reinforcement Learn
  • Markov Decision Process
  • Real Robot
  • Reinforcement Learn Algorithm

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Publish with us

Policies and ethics

Search

Navigation

  • Find a journal
  • Publish with us
  • Track your research

Discover content

  • Journals A-Z
  • Books A-Z

Publish with us

  • Journal finder
  • Publish your research
  • Language editing
  • Open access publishing

Products and services

  • Our products
  • Librarians
  • Societies
  • Partners and advertisers

Our brands

  • Springer
  • Nature Portfolio
  • BMC
  • Palgrave Macmillan
  • Apress
  • Discover
  • Your US state privacy rights
  • Accessibility statement
  • Terms and conditions
  • Privacy policy
  • Help and support
  • Legal notice
  • Cancel contracts here

23.94.208.52

Not affiliated

Springer Nature

© 2025 Springer Nature

点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载