Skip to main navigation Skip to search Skip to main content

World Model Enhanced Embodied Intelligence for Deformable Object Manipulation of Dynamic Targets

  • Zuyan Chen
  • , Juha Roning
  • , Shuai Li*
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingsScientificpeer-review

Abstract

Deformable object manipulation, due to its widespread applications in both daily life and engineering, has been a popular research topic in robotics field. However, current methods always suffer from high computational complexity and poor interpretability. In this paper, we propose an iterative perception-gradient strategy for dynamic-target deformable object manipulation. The proposed policy refers to the idea of world model and constructs a perception environment internally for planning and decision-making, which integrates a perception dynamics network based on Transformer and Neural ODE architectures, a perception gradient estimation module, and an iterative policy to determine the robot's optimal actions. Numerous experimental results and statistical analyses demonstrate that the proposed policy maintains superior efficacy and reliability under various conditions and dynamic targets.

Original languageEnglish
Title of host publicationCINTI 2025 - IEEE 25th International Symposium on Computational Intelligence and Informatics, Proceedings
PublisherIEEE Institute of Electrical and Electronic Engineers
Pages679-684
Number of pages6
ISBN (Electronic)979-8-3315-5291-6
DOIs
Publication statusPublished - 2025
MoE publication typeA4 Article in a conference publication
Event25th International Symposium on Computational Intelligence and Informatics, CINTI 2025 - Budapest, Hungary
Duration: 18 Nov 202519 Nov 2025

Conference

Conference25th International Symposium on Computational Intelligence and Informatics, CINTI 2025
Country/TerritoryHungary
CityBudapest
Period18/11/2519/11/25

Fingerprint

Dive into the research topics of 'World Model Enhanced Embodied Intelligence for Deformable Object Manipulation of Dynamic Targets'. Together they form a unique fingerprint.

Cite this