Task Planning based on Intelligence Algorithm under Uncertainty

Yu-min JIAO, Ting XU, Qiang WANG, Jing SU

Abstract


To address the non-convergence of value-function estimation and the difficulty of determining the reliability distribution in reinforcement learning, GASA-Q-learning is presented for solving Markov process problems under uncertainty. It combines a genetic algorithm, a simulated annealing algorithm, and heuristic rules. The iterative process of value-function estimation is transformed into a continuous evolution of the reliability state space, and the Q-value function and a distance function are then used to adjust the fitness value and the energy function, respectively. Finally, these functions are combined with the probability distribution and the reliability space to help find the optimal policy. Pushing experiments show that the proposed method is robust and converges quickly.
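
The abstract does not state the update rules, so the following is only a minimal illustrative sketch of one ingredient: tabular Q-learning whose action choice uses a simulated-annealing (Metropolis) acceptance test with the Q-value gap as the energy difference. It is not the authors' GASA-Q-learning; the genetic algorithm, the distance function, and the reliability state space are omitted. The toy chain environment and all names and parameters (`metropolis_action`, `q_update`, `train`, `t0`, `cooling`) are assumptions made for illustration.

```python
import math
import random


def metropolis_action(q_row, temperature, rng):
    """Pick an action from one row of Q-values with a Metropolis test (assumed scheme)."""
    greedy = max(range(len(q_row)), key=q_row.__getitem__)
    candidate = rng.randrange(len(q_row))
    # Energy gap: how much worse the random candidate is than the greedy action.
    delta = q_row[greedy] - q_row[candidate]
    if delta <= 0 or rng.random() < math.exp(-delta / temperature):
        return candidate
    return greedy


def q_update(q, state, action, reward, next_state, alpha=0.1, gamma=0.9):
    """Standard one-step Q-learning update, applied in place."""
    target = reward + gamma * max(q[next_state])
    q[state][action] += alpha * (target - q[state][action])


def train(n_states=5, episodes=200, t0=1.0, cooling=0.98, seed=0):
    """Hypothetical toy chain: action 1 moves right, action 0 moves left.

    Reaching the last state gives reward 1 and ends the episode.
    """
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]
    temperature = t0
    for _ in range(episodes):
        state = 0
        for _ in range(100):  # cap on steps per episode
            action = metropolis_action(q[state], temperature, rng)
            if action == 1:
                next_state = min(state + 1, n_states - 1)
            else:
                next_state = max(state - 1, 0)
            reward = 1.0 if next_state == n_states - 1 else 0.0
            q_update(q, state, action, reward, next_state)
            state = next_state
            if state == n_states - 1:
                break
        # Geometric cooling: exploration shrinks as learning proceeds.
        temperature = max(temperature * cooling, 1e-3)
    return q
```

A high initial temperature accepts almost any action, while a low temperature makes the choice nearly greedy, so the cooling schedule controls the trade-off between exploration and exploitation in this sketch.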


Keywords


Q-learning; hybrid genetic algorithm; task planning under uncertainty; reinforcement learning; belief state


This work is licensed under a Creative Commons Attribution 3.0 License.

AITM©: World Science Publisher United States