| |||
be trained using reinforcement learning (that is, AI agents are given a reward as they get closer to their goal; the reward signals that they are doing the right thing and that their current behavior is worth learning for the future. Alex_Odeychuk) |