Domain Randomization for Sim2Real

less than 1 minute read

Key Concepts

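Randomize visual parameters (such as lighting and camera settings), dynamics parameters (such as mass, friction, and delay), and sensor parameters (such as noise) during training, so the policy does not overfit to a single simulator configuration.
Use a curriculum: start with narrow ranges around the nominal values and widen them gradually as training progresses.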
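
The two ideas above can be combined in a few lines. The following is a minimal sketch, not a reference implementation: the parameter names, nominal values, and ranges are hypothetical, and the linear schedule is only one assumed way to "expand ranges gradually" (a performance-based schedule would work as well).

```python
import random
from dataclasses import dataclass


@dataclass
class ParamRange:
    nominal: float    # value assumed for the real robot (illustrative)
    max_delta: float  # widest deviation allowed at full curriculum scale

    def sample(self, scale: float, rng: random.Random) -> float:
        # scale is clamped to [0, 1]: 0 keeps the nominal value, 1 uses the full range
        scale = min(max(scale, 0.0), 1.0)
        delta = self.max_delta * scale
        return rng.uniform(self.nominal - delta, self.nominal + delta)


# Hypothetical parameters, one per category named in the text.
RANGES = {
    "light_intensity": ParamRange(nominal=1.0, max_delta=0.5),    # visual
    "friction": ParamRange(nominal=0.8, max_delta=0.3),           # dynamics
    "camera_noise_std": ParamRange(nominal=0.02, max_delta=0.02), # sensor
}


def curriculum_scale(step: int, warmup_steps: int) -> float:
    """Grow linearly from 0 to 1 over warmup_steps (an assumed schedule)."""
    if warmup_steps <= 0:
        return 1.0
    return min(step / warmup_steps, 1.0)


def sample_episode_params(step: int, warmup_steps: int, rng: random.Random) -> dict:
    """Draw one set of randomized parameters for the next training episode."""
    scale = curriculum_scale(step, warmup_steps)
    return {name: r.sample(scale, rng) for name, r in RANGES.items()}


if __name__ == "__main__":
    rng = random.Random(0)
    for step in (0, 500, 1000):
        print(step, sample_episode_params(step, warmup_steps=1000, rng=rng))
```

At step 0 every parameter stays at its nominal value, and the sampled spread grows until `warmup_steps` is reached. Passing an explicit `random.Random` instance keeps the draws reproducible across runs.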

Common Pitfalls

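Ranges that are too wide can make the policy underfit: it has to cover so many conditions that it performs poorly on all of them.
Randomized assets that do not match the target robot leave the real system outside the training distribution, no matter how much variation is added.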

Practical Checklist

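Start with camera and lighting randomization.
Add dynamics randomization later (mass, friction, delay).
Track the real-world success rate separately for each randomization setting, so you can see which changes actually improve transfer.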
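
For the last checklist item, a small tally per setting is usually enough. This is a minimal sketch under assumptions: the class name `SuccessTracker` and the setting labels are hypothetical, and each real-world trial is reduced to a single success or failure flag.

```python
from collections import defaultdict


class SuccessTracker:
    """Counts real-world trial outcomes per randomization setting."""

    def __init__(self) -> None:
        # setting label -> [successes, trials]
        self._counts = defaultdict(lambda: [0, 0])

    def record(self, setting: str, success: bool) -> None:
        counts = self._counts[setting]
        counts[0] += int(success)
        counts[1] += 1

    def rates(self) -> dict:
        """Return the success rate for every setting with at least one trial."""
        return {s: succ / n for s, (succ, n) in self._counts.items()}


if __name__ == "__main__":
    tracker = SuccessTracker()
    # Hypothetical labels for two training configurations.
    tracker.record("visual_only", True)
    tracker.record("visual_only", False)
    tracker.record("visual_plus_dynamics", True)
    print(tracker.rates())
```

Keeping the counts per setting, rather than one global number, makes it possible to tell whether adding dynamics randomization helped or hurt on the real robot.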

Math Note

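Reward shaping often uses a weighted sum of a task term and a stability term: \(r_t = \alpha r_{\text{task}} + \beta r_{\text{stability}}\), where \(\alpha\) and \(\beta\) are weights that trade off task progress against stable behavior.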
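
As a minimal sketch, the weighted sum translates directly into code. The function name and the default weights below are assumptions chosen for illustration, not values from a particular system.

```python
def shaped_reward(r_task: float, r_stability: float,
                  alpha: float = 1.0, beta: float = 0.5) -> float:
    """Weighted sum r_t = alpha * r_task + beta * r_stability."""
    return alpha * r_task + beta * r_stability


if __name__ == "__main__":
    # A step with task reward 2.0 and a stability penalty of -1.0.
    print(shaped_reward(r_task=2.0, r_stability=-1.0))
```

A larger \(\beta\) pushes the policy toward stable behavior at the cost of task progress, so the two weights are usually tuned together.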

