强化学习算法,reinforcement learning
1)reinforcement learning强化学习算法
1.Based on simulated annealing andreinforcement learning algorithm,a hybrid intelligent controller was proposed to ship steering.本文基于模拟退火-强化学习算法提出了一种混合智能控制器,应用于船舶运动航向控制中。
英文短句/例句
1.Study of Multi-agent Learning Problem Based on Reinforcement Learning;基于强化学习算法的多智能体学习问题的研究
2.Improvement and Applications for Q-learning Reinforcement Learning AlgorithmsQ-learning强化学习算法改进及其应用研究
3.Genetic Reinforcement Learning Algorithm for Job-shop Scheduling ProblemJob-shop排序问题的遗传强化学习算法
4.The Research of Elevator Dynamic Scheduling Policy Based on Reinforcement Learning Algorithm;基于强化学习算法的电梯动态调度策略的研究
5.Reinforcement Learning Algorithm for Dynamic Policy Under Mixed Multi-agent Domains混合多Agent环境下动态策略强化学习算法
6.On Dynamic Scheduling Method Based on Averaged Reinforcement Learning Algorithm;基于平均型强化学习算法的动态调度方法的研究
7.Simulation experiments prove that this algorithm introduced adapt any complex environment and own good self- learning abilities.仿真实验证明:该强化学习算法不仅能够适应复杂的环境,而且具有较强的自学习能力。
8.The Research and Implementation of Large Space Reinforcement Learning Based on Model Knowledge;基于模型知识的大空间强化学习算法的研究与实现
9.A Neuro-Fuzzy Controller Based on Improved Reinforcement Learning;基于改进强化学习算法的神经模糊控制器的设计与实现
10.Unified Algorithms for Semi-Markov Decision Processes with Discounted and Average Criteria Based on Performance Potentials by Reinforcement Learning;折扣和平均准则下SMDP基于性能势的统一强化学习算法
11.Application of Intensive Learning Algorithm to Inventory Control in Supply Chains;强化学习算法在供应链环境下的库存控制中的应用
12.A Novel Dynamic Spectrum Allocation Algorithm Based on POMDP Reinforcement Learning基于POMDP强化学习的动态频谱分配算法
13.A Flight Path Planning Algorithm Based on Multi-Agent Reinforcement Learning Method多智能体强化学习飞行路径规划算法
14.On the Reinforcement Learning Based Task Allocation of Multi-robot;基于强化学习的多机器人任务分配算法研究
15.An enhanced artificial immune network with elitist-learning capability for optimization problems一类具有精英学习能力的增强型人工免疫网络优化算法
16.Research of Auto-generating Test Paper Based on Reinforcement Learning;基于加强学习的自动组卷算法的研究
17.A Study of Reinforcement Learning Based on Factor Representation基于因素化表示的强化学习方法研究
18.The Study of Multi-Agent Reinforcement Learning Methods for Cooperative Team;多Agent协作团队的强化学习方法研究
相关短句/例句
Q-Reinforcement LearningQ-强化学习算法
3)self-adaptive strenthen learning algorithm自适应强化学习算法
4)multi-agent reinforcement learning algorithm多Agent强化学习算法
5)Sarsa reinforcement learning algorithmSarsa增强学习算法
6)reinforcement learning algorithm增强学习算法
1.The Research of PID parameters adjusting method Based onreinforcement learning algorithm;基于增强学习算法的PID参数调整方法研究
延伸阅读
逆推学习算法分子式:CAS号:性质:又称逆推学习算法,简称BP算法,是1986年鲁梅哈特(D. E. Rumelhart)和麦克莱朗德(J. L. McClelland)提出来的。用样本数据训练人工神经网络(一种模仿人脑的信息处理系统),它自动地将实际输出值和期望值进行比较,得到误差信号,再根据误差信号从后(输出层)向前(输入层)逐层反传,调节各神经层神经元之间的连接权重,直至误差减至满足要求为止。反向传播算法的主要特征是中间层能对输出层反传过来的误差进行学习。这种算法不能保证训练期间实现全局误差最小,但可以实现局部误差最小。BP算法在图像处理、语音处理、优化等领域得到应用。