ACT (Action Chunking with Transformers)

ACT (Action Chunking with Transformers) is an imitation learning algorithm built on the Transformer architecture.

📊 Data Format Conversion

Install the training and evaluation dependencies:

pip install -r policies/act/requirements/train_eval.txt -i https://pypi.tuna.tsinghua.edu.cn/simple

Convert the raw simulation data to the HDF5 format required by ACT:

python3 policies/act/data_process/raw_to_hdf5.py -md mujoco -dir data -tn <task_name> -vn <video_names>

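The converted data is saved in the discoverse/data/hdf5 folder.

The reference training configuration file is policies/act/configurations/task_configs/example_task.py; its main parameters are explained as follows:

To train a specific task, copy this configuration file and rename the copy to the task name. The task name is then used to look up the matching configuration file.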
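The copy-and-rename step above can be sketched in Python as follows. This is a minimal illustration, not part of the repository: the helper name `create_task_config` is hypothetical, and it assumes the copied file should be named `<task_name>.py`, matching the `.py` extension of example_task.py.

```python
import shutil
from pathlib import Path


def create_task_config(task_name: str, config_dir: Path,
                       template_name: str = "example_task.py") -> Path:
    """Copy the example config to <task_name>.py inside config_dir.

    Hypothetical helper: the <task_name>.py naming is an assumption
    based on the instruction to rename the copy to the task name.
    """
    template = config_dir / template_name
    if not template.is_file():
        raise FileNotFoundError(f"template config not found: {template}")

    target = config_dir / f"{task_name}.py"
    if target.exists():
        # Refuse to overwrite a config that may already have been edited.
        raise FileExistsError(f"config already exists: {target}")

    shutil.copyfile(template, target)
    return target
```

Run from the repository root, a call such as `create_task_config("block_place", Path("policies/act/configurations/task_configs"))` would create the new file next to the example, ready to be edited for the task.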

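Data collected in simulation is stored by default in the data folder at the root of the discoverse repository, whereas training looks for data in policies/act/data/hdf5 by default. We therefore recommend creating a symbolic link from the latter location to the former with the command below (adjust the paths in the command; absolute paths are required):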

ln -sf /absolute/path/to/discoverse/data /absolute/path/to/discoverse/policies/act/data
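For readers who prefer to script this step, the sketch below creates the same kind of symbolic link from Python and converts both paths to absolute form first. The function name `link_data_dir` is hypothetical; replacing an existing symlink is a design choice of this sketch, and it refuses to touch a real directory at the link location.

```python
import os
from pathlib import Path


def link_data_dir(source: Path, link: Path) -> Path:
    """Create a symlink at `link` pointing to the directory `source`.

    Hypothetical helper: both paths are made absolute, as the
    shell command in this README requires.
    """
    source = Path(source).resolve()
    if not source.is_dir():
        raise NotADirectoryError(f"data directory not found: {source}")

    # abspath() does not follow the link itself, so an existing
    # symlink at this location can still be detected and replaced.
    link = Path(os.path.abspath(link))
    if link.is_symlink():
        link.unlink()
    elif link.exists():
        raise FileExistsError(f"refusing to replace real path: {link}")

    link.parent.mkdir(parents=True, exist_ok=True)
    os.symlink(source, link, target_is_directory=True)
    return link
```

Called from the repository root as `link_data_dir(Path("data"), Path("policies/act/data"))`, this would make policies/act/data/hdf5 resolve to data/hdf5.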

Start training:

python3 policies/train.py act -tn <task_name>

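The -tn argument specifies the task name. The program uses it to look for a configuration file and a dataset of the same name under task_configs and act/data/hdf5, respectively.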
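Before launching a long training run, it can help to check that both lookups will succeed. The sketch below is an illustration under stated assumptions: the function names are hypothetical, the configuration file is assumed to be `<task_name>.py`, and the dataset is assumed to be an entry named `<task_name>` under data/hdf5. The actual lookup logic in train.py may differ.

```python
from pathlib import Path
from typing import List, Tuple


def resolve_task_paths(task_name: str, act_root: Path) -> Tuple[Path, Path]:
    """Return the (config file, dataset) paths expected for a task.

    Assumed layout, not confirmed by the repository code:
      <act_root>/configurations/task_configs/<task_name>.py
      <act_root>/data/hdf5/<task_name>
    """
    config_file = act_root / "configurations" / "task_configs" / f"{task_name}.py"
    dataset = act_root / "data" / "hdf5" / task_name
    return config_file, dataset


def missing_task_inputs(task_name: str, act_root: Path) -> List[str]:
    """List which of the expected inputs are absent (empty list means ready)."""
    config_file, dataset = resolve_task_paths(task_name, act_root)
    missing = []
    if not config_file.is_file():
        missing.append(f"config: {config_file}")
    if not dataset.exists():
        missing.append(f"dataset: {dataset}")
    return missing
```

An empty list from `missing_task_inputs("block_place", Path("policies/act"))` would suggest that the configuration file and dataset are where this sketch expects them.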

Training results are saved in the policies/act/my_ckpt directory.
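Inference later identifies a training run by a timestamp such as 20250711-091004. The sketch below finds the most recent run for a task by parsing folder names in that format. The directory layout `my_ckpt/<task_name>/<timestamp>` and the function names are assumptions made for illustration, so adjust them to the actual checkpoint structure.

```python
from datetime import datetime
from pathlib import Path
from typing import Optional

# Matches folder names such as 20250711-091004.
TIMESTAMP_FORMAT = "%Y%m%d-%H%M%S"


def parse_run_timestamp(name: str) -> Optional[datetime]:
    """Parse a folder name as a run timestamp, or return None if it is not one."""
    try:
        return datetime.strptime(name, TIMESTAMP_FORMAT)
    except ValueError:
        return None


def latest_run(ckpt_root: Path, task_name: str) -> Optional[Path]:
    """Return the newest timestamp-named folder for a task, if any.

    Assumes the layout <ckpt_root>/<task_name>/<timestamp>/ (hypothetical).
    """
    task_dir = ckpt_root / task_name
    if not task_dir.is_dir():
        return None

    runs = []
    for entry in task_dir.iterdir():
        stamp = parse_run_timestamp(entry.name)
        if entry.is_dir() and stamp is not None:
            runs.append((stamp, entry))

    if not runs:
        return None
    return max(runs, key=lambda item: item[0])[1]
```

The name of the returned folder is the value that would be passed to the -ts argument at inference time.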

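The inference configuration file can be derived from the training configuration file; its main parameters are explained as follows: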

python3 policies/infer.py act -tn <task_name> -mts <max_timesteps> -ts <ckpt> -rn discoverse/examples/<tasks_folder>/<task_script>

Example:

python3 policies/infer.py act -tn block_place -mts 100 -ts 20250711-091004 -rn discoverse/examples/tasks_airbot_play/block_place

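Where:

-tn Task name. The program uses it to look for a configuration file and a dataset of the same name under the task_configs and data directories, respectively.
-mts Total number of action steps to execute. This command-line argument overrides max_timesteps in the configuration file.
-ts Timestamp. It corresponds to the timestamp-named folder that holds the trained model files; the program uses the task name and the timestamp to locate the model files under policies/act/my_ckpt.
-rn Path to the script used for data collection, for example discoverse/examples/tasks_airbot_play/drawer_open.py. The program loads the SimNode class and cfg, an instance of AirbotPlayCfg, from this script to create the simulation environment.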
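To make the -rn behavior concrete, the sketch below shows one common way to load a Python script from a file path and pull out a SimNode class and a cfg object with the standard importlib machinery. It is an illustration only: the function names are hypothetical, appending `.py` when the suffix is missing is an assumption, and the actual loading code in infer.py may work differently.

```python
import importlib.util
from pathlib import Path
from types import ModuleType


def load_task_script(script_path: str) -> ModuleType:
    """Import a task script from a file path and return it as a module."""
    path = Path(script_path)
    if path.suffix != ".py":
        # Assumption: a path given without extension refers to a .py file.
        path = path.with_name(path.name + ".py")
    if not path.is_file():
        raise FileNotFoundError(f"task script not found: {path}")

    spec = importlib.util.spec_from_file_location(path.stem, path)
    if spec is None or spec.loader is None:
        raise ImportError(f"cannot import task script: {path}")

    module = importlib.util.module_from_spec(spec)
    # Runs the script's top-level code; code guarded by
    # `if __name__ == "__main__":` is skipped.
    spec.loader.exec_module(module)
    return module


def get_sim_components(module: ModuleType):
    """Return (SimNode, cfg) from a loaded task script, checking both exist."""
    missing = [name for name in ("SimNode", "cfg") if not hasattr(module, name)]
    if missing:
        raise AttributeError(f"task script lacks: {', '.join(missing)}")
    return module.SimNode, module.cfg
```

With such a loader, the simulation environment could then be built from the returned pair, for example by passing cfg to the SimNode constructor, if that is how the task script defines it.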