In this paper, a hypothesis is posited that these neurons contribute to generate the critic output (state evaluation in a given task) and are obtained through reinforcement learning. Thereby, tool use is considered to be the change of link length for simplicity; a layered neural network learns hand reaching by a manipulator based on reinforcement learning. Inputs of the network are visual sensory signals and the state of the manipulator. Outputs are the critic and joint torques as the actor. After learning, the manipulator came to move its hand toward the target on the visual sensor when the target was located within the hand's reach. Both types of neurons observed in experiments of Iriki et al. were found in the hidden layer of the neural network.
- Force Load -
It has been known that when a human moves its hand to a target,
the trajectory becomes almost a straight line from the start point
to the target.
When a viscosity force field is loaded to the hand unexpectedly,
it is pulled toward the force direction once and then
goes back to the target.
However, after the learning in the force field,
the trajectory becomes a straight line again,
and when the force field is removed, it is pulled toward the opposite
direction of the force that was loaded to the hand\cite{Shadmehr}.
This is called after-effect.
In this paper, a neural network, whose inputs are
visual sensory signals and state of manipulator,
and whose outputs are joint torques, was trained by reinforcement learning.
The effect of the first force field exposure and after-effect
could be observed.
This means that the system obtains inverse dynamics of its hand
and environment in the neural network through reinforcement learning.
Further, when the neural network learned with a random force at every trial,
it became to control its hand based on feedback control rather than
feedforward control.
6. 柴田克成:
強化学習によって獲得された手先のリーチング運動における軌跡と速度履歴,
第21回計測自動制御学会九州支部学術講演会予稿集, pp. 241--244, 2002.12
(in Japanese)
5. Katsunari Shibata \& Koji Ito:
Effect of Force Load in Hand Reaching Movement Acquired by Reinforcement Learning,
Proc. of Int'l Conf. on Neural Information Processing Systems (ICONIP '02), Vol. 3, pp. 1444-1448, 2002.11
pdf File (5 pages, 200kB)
4. 柴田克成, 杉坂政典, 伊藤宏司:
強化学習によるリーチング運動の獲得,
電子情報通信学会技術研究報告, NC2000-170, pp. 107-114, 2001. 3.
(in Japanese)
pdf File (8 pages, 318kB)
3. Katsunari Shibata, Masanori Sugisaka and Koji Ito:
Hand Reaching Movement Acquired through Reinforcement Learning,
Proc. of 2000 KACC (Korea Automatic Control Conf.), 90rd (CD-ROM), 2000. 10
PDF File (186kB)
2. 柴田 克成, 伊藤 宏司:
Direct-Vision-Based 強化学習に基づく Hand-Eye Coordination の形成,
第12回自律分散システム・シンポジウム資料, pp. 217 - 222, 2000.1.
(in Japanese)
pdf File (6 pages, 470kB)
1. K. Shibata and K. Ito:
Hand-Eye Coordination in Robot Arm Reaching Task by Reinforcement Learning Using Neural Networks,
Proc. of IEEE SMC (Int'l Conf. on Systems, Man, and Cybernetics) '99, Vol.V, pp. V-458 - 463, 1999.10