RL之SARSA:利用强化学习之SARSA实现走迷宫—训练智能体走到迷宫(复杂陷阱迷宫)的宝藏位置
生活随笔
收集整理的這篇文章主要介紹了
RL之SARSA:利用强化学习之SARSA实现走迷宫—训练智能体走到迷宫(复杂陷阱迷宫)的宝藏位置
小編覺得挺不錯的,現在分享給大家,幫大家做個參考.
RL之SARSA:利用強化學習之SARSA實現走迷宮—訓練智能體走到迷宮(復雜陷阱迷宮)的寶藏位置
?
?
目錄
輸出結果
設計思路
實現代碼
測試記錄全過程
?
?
?
輸出結果
?
?
設計思路
?
?
實現代碼
后期更新……
?
?
?
?
測試記錄全過程
……......... . . . Ao . . . ......... ......... . . . Ao . . . ......... ......... . . . Ao . . . ......... ......... . . . A . . . ......... ......... . . . A . . . ......... ......... . . . A . . . ......... Episode:11 Total Step:44, Total Reward:100 . . . A . . . ......... Episode:11 Total Step:44, Total Reward:100 . A . . . ......... Episode:11 Total Step:44, Total Reward:100 . . ......... Episode:11 Total Step:44, Total Reward:100 ......... Episode:11 Total Step:44, Total Reward:100 ......... ......... . A . ......... . A . . o . ......... . A . . o . . . ......... . A . . o . . . ......... ……......... . . . A . . . ......... ......... . . . A . . . ......... ......... . . . A . . . ......... Episode:31 Total Step:6, Total Reward:100 . . . A . . . ......... Episode:31 Total Step:6, Total Reward:100 . A . . . ......... Episode:31 Total Step:6, Total Reward:100 . . ......... Episode:31 Total Step:6, Total Reward:100 ......... Episode:31 Total Step:6, Total Reward:100 ......... ......... . . ......... . . .A o . ......... . . .A o . . . ......... . . .A o . . . ......... ......... Episode:42 Total Step:6, Total Reward:100 . . . A . . . ......... Episode:42 Total Step:6, Total Reward:100 . A . . . ......... Episode:42 Total Step:6, Total Reward:100 . . ......... Episode:42 Total Step:6, Total Reward:100 ......... Episode:42 Total Step:6, Total Reward:100 ......... ......... . . ......... . . .A o . ......... . . .A o . . . ......... . . .A o . . . ......... ......... ……......... . . . A o . . . ......... ......... . . . A o . . . ......... ......... . . . A o . . . ......... ......... . . . A o . . . ......... ......... . . . A o . . . ......... ......... . . . A o . . . ......... ......... . . . A o . . . ......... ......... . . . A o . . . ......... ......... . . . A o . . . ......... ......... . . . A o . . . ......... ......... . . . A o . . . ......... ......... . . . Ao . . . ......... ......... . . . Ao . . . ......... ......... . . . Ao . . . ......... ......... . . . Ao . . . ......... ......... . . . Ao . . . ......... ......... . . . A . . . ......... ......... . . . A . . . ......... ......... . . . A . . . ......... Episode:99 Total Step:8, Total Reward:100 . . . A . . . ......... Episode:99 Total Step:8, Total Reward:100 . A . . . ......... Episode:99 Total Step:8, Total Reward:100 . . ......... Episode:99 Total Step:8, Total Reward:100 ......... Episode:99 Total Step:8, Total Reward:100 Episode:99 Total Step:8, Total Reward:100?
?
?
?
?
?
?
?
《新程序員》:云原生和全面數字化實踐50位技術專家共同創作,文字、視頻、音頻交互閱讀總結
以上是生活随笔為你收集整理的RL之SARSA:利用强化学习之SARSA实现走迷宫—训练智能体走到迷宫(复杂陷阱迷宫)的宝藏位置的全部內容,希望文章能夠幫你解決所遇到的問題。
- 上一篇: RL之Q Learning:利用强化学习
- 下一篇: RL之DQN:基于TF训练DQN模型玩“