[Embodied AI Tutorial] Overview of Embodied AI (Part1)

Overview of Embodied AI (Part1)
课程内容
资料查询

Overview of Embodied AI (Part1)

课程主页：https://ai-workshops.github.io/building-and-working-in-environments-for-embodied-ai-cvpr-2022/
slides
video
讲师：
Zhiwei Jia

课程内容

Simulators

使用仿真引擎主要考虑如下因素：

Rendering：RGB/Depth/Optical Flow/Segmentation等Sensor数据
Physics：不同类别的task需要不同级别的物理仿真，例如，视觉导航使用Partial Physics即可，而开门、倒水等Task需要Full Physics。
Speed：渲染速度
Objects types and properties：刚体、液体、可切分、可碎裂等属性
Action modeling：分为High Level(将某物放到某个地方)与Low Level(倒水)
Human interface：该仿真器与人交互的方式，例如键鼠或者 VR

Assets

object：
-- Grasping：抓各种静态物品，数据集 YCB\EGAD
-- General Manipulation Skill：倒水、开门、关抽屉等各种操作，数据集 PartNet-Mobility、DoorGym、Objects from iTHOR、Meta-World
-- Multisensory：混合多模态信息源(视觉、声音、触觉等)，数据集 ObjectFolder、ThreeDWorld
scene: 静态场景(例如家里的布局)
demonstrations：state-based trajectories，例如下面Slides

Task

Locomotion: eg. Control a robot dog to perform a series of actions
Visual Navigation:
-- Object Goal Navigation: Specify an object category and ask the agent to find it
-- Embodied Question Answering: Ask an agent to answer a question which requires it to navigate in the scene
Object Manipulation: 开抽屉、抹桌子、洗杯子然后冲咖啡
Rearrangement: bring poses of the objects to a specified configuration

资料查询

折叠Title

FromChatGPT(提示词：XXX)

posted @ 2024-09-10 19:28 fariver 阅读(26) 评论(0) 编辑收藏举报

刷新页面返回顶部