обучаться по алгоритму обучения с подкреплением

be trained using reinforcement learning (which means AI agents are given a reward as they get closer and closer to their goal. This reward is used as a signal that they're doing the right thing, and that whatever they're doing is something to learn for the future Alex_Odeychuk)

服务器正进行维修，网站进入唯读模式。请稍后再来检查。">增加 | 服务器正进行维修，网站进入唯读模式。请稍后再来检查。">报告错误 | 获取短网址 | 语言选择诀窍