In inference time, you can retarget a humanoid hand to match the predicted human hand.
We have tested out the WUJI and Inspire hands and it works quite well for those. Our industry partners have reported successes on their hardware as well.
To model these grasps, we use both RGB and Depth as input and predict the MANO hand pose of the robot.
We needed multiple iterations to nail down the predictions in high-DOF hand space, and hope this architecture is helpful to others trying to predict full humanoid hand pose.