WALL-E: Embodied Robotic WAiter Load Lifting with Large Language Model

Wang, Tianyu; Li, Yifan; Lin, Haitao; Xue, Xiangyang; Fu, Yanwei

Computer Science > Robotics

arXiv:2308.15962 (cs)

[Submitted on 30 Aug 2023 (v1), last revised 31 Aug 2023 (this version, v2)]

Title:WALL-E: Embodied Robotic WAiter Load Lifting with Large Language Model

Authors:Tianyu Wang, Yifan Li, Haitao Lin, Xiangyang Xue, Yanwei Fu

View PDF

Abstract:Enabling robots to understand language instructions and react accordingly to visual perception has been a long-standing goal in the robotics research community. Achieving this goal requires cutting-edge advances in natural language processing, computer vision, and robotics engineering. Thus, this paper mainly investigates the potential of integrating the most recent Large Language Models (LLMs) and existing visual grounding and robotic grasping system to enhance the effectiveness of the human-robot interaction. We introduce the WALL-E (Embodied Robotic WAiter load lifting with Large Language model) as an example of this integration. The system utilizes the LLM of ChatGPT to summarize the preference object of the users as a target instruction via the multi-round interactive dialogue. The target instruction is then forwarded to a visual grounding system for object pose and size estimation, following which the robot grasps the object accordingly. We deploy this LLM-empowered system on the physical robot to provide a more user-friendly interface for the instruction-guided grasping task. The further experimental results on various real-world scenarios demonstrated the feasibility and efficacy of our proposed framework. See the project website at: this https URL

Comments:	14 pages, 8 figures. See this https URL
Subjects:	Robotics (cs.RO); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2308.15962 [cs.RO]
	(or arXiv:2308.15962v2 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2308.15962

Submission history

From: Tianyu Wang [view email]
[v1] Wed, 30 Aug 2023 11:35:21 UTC (9,403 KB)
[v2] Thu, 31 Aug 2023 13:51:56 UTC (9,399 KB)

Computer Science > Robotics

Title:WALL-E: Embodied Robotic WAiter Load Lifting with Large Language Model

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:WALL-E: Embodied Robotic WAiter Load Lifting with Large Language Model

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators