[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3640771.3643717acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiscaiConference Proceedingsconference-collections
research-article

Hierarchical Large Language Models in Cloud-Edge-End Architecture for Heterogeneous Robot Cluster Control

Published: 29 March 2024 Publication History

Abstract

Despite their powerful semantic understanding and code generation capabilities, Large Language Models (LLMs) still face challenges when dealing with complex tasks. Multi-agent strategy generation and motion control are highly complex domains that inherently require experts from multiple fields to collaborate. To enhance multi-agent strategy generation and motion control, we propose an innovative architecture that employs the concept of a cloud-edge-end hierarchical structure. By leveraging multiple large language models with distinct areas of expertise, we can efficiently generate strategies and perform task decomposition. Introducing the cosine similarity approach, aligning task decomposition instructions with robot task sequences at the vector level, we can identify subtasks with incomplete task decomposition and iterate on them multiple times to ultimately generate executable machine task sequences.The robot is guided through these task sequences to complete tasks of higher complexity. With this architecture, we implement the process of natural language control of robots to perform complex tasks, and successfully address the challenge of multi-agent execution of open tasks in open scenarios and the problem of task decomposition.

References

[1]
Austin J, Odena A, Nye M, Program synthesis with large language models[J]. arXiv preprint arXiv:2108.07732, 2021
[2]
Jain N, Vaidyanath S, Iyer A, Jigsaw: Large language models meet program synthesis[C]//Proceedings of the 44th International Conference on Software Engineering. 2022: 1219-1231
[3]
Nijkamp E, Pang B, Hayashi H, Codegen: An open large language model for code with multi-turn program synthesis[J]. arXiv preprint arXiv:2203.13474, 2022
[4]
Chen X, Lin M, Schärli N, Teaching large language models to self-debug[J]. arXiv preprint arXiv:2304.05128, 2023
[5]
Huang W, Wang C, Zhang R, Voxposer: Composable 3d value maps for robotic manipulation with language models[J]. arXiv preprint arXiv:2307.05973, 2023
[6]
Wu J, Antonova R, Kan A, Tidybot: Personalized robot assistance with large language models[J]. arXiv preprint arXiv:2305.05658, 2023
[7]
Vemprala S, Bonatti R, Bucker A, Chatgpt for robotics: Design principles and model abilities[J]. Microsoft Auton. Syst. Robot. Res, 2023, 2: 20
[8]
Dong Y, Jiang X, Jin Z, Self-collaboration Code Generation via ChatGPT[J]. arXiv preprint arXiv:2304.07590, 2023
[9]
Shah D, Osiński B, Levine S. Lm-nav: Robotic navigation with large pre-trained models of language, vision, and action[C]//Conference on Robot Learning. PMLR, 2023: 492-504
[10]
Mandi Z, Jain S, Song S. Roco: Dialectic multi-robot collaboration with large language models[J]. arXiv preprint arXiv:2307.04738, 2023
[11]
Yu J, Vincent J A, Schwager M. Dinno: Distributed neural network optimization for multi-robot collaborative learning[J]. IEEE Robotics and Automation Letters, 2022, 7(2): 1896-1903
[12]
Hong Y, Zhen H, Chen P, 3d-llm: Injecting the 3d world into large language models[J]. arXiv preprint arXiv:2307.12981, 2023

Cited By

View all
  • (2024)Securing UAV Delivery Systems with Blockchain and Large Language Models: An Innovative Logistics Solution2024 11th International Conference on Machine Intelligence Theory and Applications (MiTA)10.1109/MiTA60795.2024.10751689(1-8)Online publication date: 14-Jul-2024

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
ISCAI '23: Proceedings of the 2023 2nd International Symposium on Computing and Artificial Intelligence
October 2023
120 pages
ISBN:9798400708954
DOI:10.1145/3640771
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 29 March 2024

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Cosine similarity
  2. LLMS
  3. Task decomposition

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

ISCAI 2023

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)55
  • Downloads (Last 6 weeks)12
Reflects downloads up to 12 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Securing UAV Delivery Systems with Blockchain and Large Language Models: An Innovative Logistics Solution2024 11th International Conference on Machine Intelligence Theory and Applications (MiTA)10.1109/MiTA60795.2024.10751689(1-8)Online publication date: 14-Jul-2024

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media