Profile

Exploring the Cognitive Architecture
Of Embodied Agents.

Undergraduate Student @ Xi’an University of Architecture and Technology (XAUAT).
Working on Whole-body Control and World Models for Humanoid Robots.

“I view the humanoid body not merely as a mechanical assembly, but as the only viable vessel for a cognitive model to truly interact with and understand the physical laws of our world.”

My research seeks to bridge the gap between high-level reasoning and low-level torque control. I am less interested in the hardware Itself, but rather in how we can instill biological plausibility and adaptive intelligence into these iron vessels.

探索具身智能体的
认知架构

本科生 @ 西安建筑科技大学。
致力于人形机器人的 全身控制 (Whole-body Control)世界模型 (World Models) 研究。

“我认为人形躯体不仅仅是机械结构,而是认知模型真正理解物理世界、并与其交互的唯一有效载体。”

我的研究旨在弥合“高层认知推理”与“底层力矩控制”之间的鸿沟。相比于单纯的硬件本体,我更着迷于如何赋予这些钢铁躯壳以生物合理性自适应智能

#Optimal Control (MPC/WBC) #Embodied AI #Sim-to-Real #Cognitive Modeling
Email / GitHub / Scholar

Selected Research & Manuscripts 精选研究与手稿

Unified Whole-body Control and World Model Planning for Dynamic Locomotion 面向动态行走的统一全身控制与世界模型规划架构
Weihao Kong, Collaborator A, Supervisor B
In preparation for ICRA 2025 拟以此投稿 ICRA 2025 Work in Progress

Proposed a hierarchical framework where a Transformer-based world model generates latent trajectories, which are then tracked by a convex MPC controller. This approach allows the humanoid to "Imagine" physics before stepping.

提出了一种分层架构:利用基于 Transformer 的世界模型生成潜在轨迹,并通过凸优化 MPC 控制器进行跟踪。该方法使人形机器人能够在迈步前先“想象”物理后果,从而实现更加稳健的动态行走。

Reproducing ‘Learning to Walk’ via Deep Reinforcement Learning 基于深度强化学习的 “Learning to Walk” 复现与分析
Weihao Kong
Undergraduate Research Report 本科研究报告 (独立研究) Technical Report

Implemented PPO from scratch to train a humanoid agent in MuJoCo. Analyzed the impact of reward shaping on gait symmetry and energy efficiency.

从零实现了 PPO 算法并在 MuJoCo 环境中训练人形智能体。重点分析了不同奖励函数设计(Reward Shaping)对步态对称性及能量效率的具体影响。

Talks & Notes 演讲与笔记

I document my learning path and theoretical derivations. 记录我的学习路径与理论推导。

Background 背景与技能

Advanced Coursework 核心课程

  • Convex Optimization
  • MIT 6.8210:Underactuated Robotics
  • Deep Learning & Generative Models
  • CS285:Deep Reinforcement Learning
  • CMU 16-831:Introduction to Robot Learning
  • 凸优化 (Convex Optimization)
  • 最优控制理论 (Optimal Control)
  • 深度学习与生成式模型
  • CS285: Deep Reinforcement Learning (UC Berkeley)

Technical Stack 技术栈

Languages: Python, C++
Frameworks: PyTorch, JAX
Simulation: MuJoCo, Isaac Gym