Skip to main content
Apress

AI for Robotics

Toward Embodied and General Intelligence in the Physical World

  • Book
  • © 2025

Overview

  • Teaches you to frame robotics as an AI problem
  • Explores cutting-edge AI techniques relevant to robotics with actionable insights into promising technical directions
  • Includes in-depth, practical analysis of key applications such as self-driving, manufacturing, and humanoids

This is a preview of subscription content, log in via an institution to check access.

Access this book

eBook USD 19.99
Price excludes VAT (Canada)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 29.99
Price excludes VAT (Canada)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access

Licence this eBook for your library

Institutional subscriptions

About this book

This book approaches robotics from a deep learning perspective. Artificial intelligence (AI) has transformed many fields, including robotics. This book shows you how to reimagine decades-old robotics problems as AI problems and is a handbook for solving problems using modern techniques in an era of large foundation models. 

The book begins with an introduction to general-purpose robotics, how robots are modeled, and how physical intelligence relates to the movement of building artificial general intelligence, while giving you an overview of the current state of the field, its challenges, and where we are headed. The first half of this book delves into defining what the problems in robotics are, how to frame them as AI problems, and the details of how to solve them using modern AI techniques. First, we look at robot perception and sensing to understand how robots perceive their environment, and discuss convolutional networks and vision transformers to solve robotics problems such as segmentation, classification, and detection in two and three dimensions. The book then details how to apply large language and multimodal models for robotics, and how to adapt them to solve reasoning and robot control. Simulation, localization, and mapping and navigation are framed as deep learning problems and discussed with recent research. Lastly, the first part of this book discusses reinforcement learning and control and how robots learn via trial and error and self-play.

The second part of this book is concerned with applications of robotics in specialized contexts. You will develop full stack knowledge by applying the techniques discussed in the first part to real-world use cases. Individual chapters discuss the details of building robots for self-driving, industrial manipulation, and humanoid robots. For each application, you will learn how to design these systems, the prevalent algorithms in research and industry, and how to assess trade-offs for performance and reliability. The book concludes with thoughts on operations, infrastructure, and safety for data-driven robotics, and outlooks for the future of robotics and machine learning.

In summary, this book offers insights into cutting-edge machine learning techniques applied in robotics, along with the challenges encountered during their implementation and practical strategies for overcoming them.

 

What You Will Learn

  • Explore ML applications in robotics, covering perception, control, localization, planning, and end-to-end learning
  • Delve into system design, and algorithmic and hardware considerations for building efficient ML-integrated robotics systems
  • Discover robotics applications in self-driving, manufacturing, and humanoids and their practical implementations
  • Understand how machine learning and robotics benefit current research and organizations

 

Who This Book Is For

Software and AI engineers eager to learn about robotics, seasoned robotics and mechanical engineers looking to stay at the cutting edge by integrating modern AI, and investors, executives or decision makers seeking insights into this dynamic field

Similar content being viewed by others

Table of contents (11 chapters)

Authors and Affiliations

  • San Francisco, USA

    Alishba Imran, Keerthana Gopalakrishnan

About the authors

Alishba Imran is a machine learning and robotics developer focusing on robot learning for manipulation and perception. She is currently conducting research in reinforcement learning and unsupervised learning with Pieter Abbeel at the Berkeley AI Research Lab. Alishba has worked on perception at Cruise, developed simulation object manipulation methods at NVIDIA, and led efforts to reduce prosthetics costs at SJSU. At Hanson Robotics, she improved manipulation in Sophia the Robot, and at Kindred.AI, she contributed to robots that have picked up over 140 million units in production.


Keerthana Gopalakrishnan is a senior research scientist at Google Deepmind, working on robot manipulation and the Gemini project. She was educated at Carnegie Mellon University and Indian Institute of Technology. Her research concerns large language models for robotic planning, scaling visual language models for low level control, and cross-embodiment robot learning. 

 

Accessibility Information

PDF accessibility summary

This PDF does not fully comply with PDF/UA standards, but does feature limited screen reader support, described non-text content (images, graphs), bookmarks for easy navigation and searchable, selectable text. Users of assistive technologies may experience difficulty navigating or interpreting content in this document. We recognize the importance of accessibility, and we welcome queries about accessibility for any of our products. If you have a question or an access need, please get in touch with us at [email protected].

EPUB accessibility summary

This ebook is designed with accessibility in mind, aiming to meet the ePub Accessibility 1.0 AA and WCAG 2.0 Level AA standards. Its features include described images and other non-text content, screenreader-friendly navigation and accessible math. Math is represented either as MathML, LaTeX or in images. If math is represented as image, Alt Text might not be present. We recognize the importance of accessibility, and we welcome queries about accessibility for any of our products. If you have a question or an access need, please get in touch with us at [email protected].

Bibliographic Information

  • Book Title: AI for Robotics

  • Book Subtitle: Toward Embodied and General Intelligence in the Physical World

  • Authors: Alishba Imran, Keerthana Gopalakrishnan

  • DOI: https://doi.org/10.1007/979-8-8688-0989-7

  • Publisher: Apress Berkeley, CA

  • eBook Packages: Professional and Applied Computing, Apress Access Books, Professional and Applied Computing (R0)

  • Copyright Information: Alishba Imran, Keerthana Gopalakrishnan 2025

  • Softcover ISBN: 979-8-8688-0988-0Published: 03 May 2025

  • eBook ISBN: 979-8-8688-0989-7Published: 02 May 2025

  • Edition Number: 1

  • Number of Pages: XXIII, 451

  • Number of Illustrations: 59 b/w illustrations, 153 illustrations in colour

  • Topics: Artificial Intelligence, Machine Learning, Python, Robotics

Keywords

Publish with us