@33550336

33550336@lemmy.world · 21 days ago

Yes – this game has some fixed, relative small set of rules so the RL could learn to play by playing millions of games at random but following the rules of the game. Confront this with dounting (infinite) number of situations which may approach one in a daily life.

33550336@lemmy.world · 22 days ago

OK, RL exists end results like the protein design or Go are impressive, but does exist a RL solving the benchmark problem?

33550336@lemmy.world · 22 days ago

Yeah RL exist since 80s (or in some form earlier) but have it solve the benchmark?

33550336@lemmy.world · 22 days ago

if only it would exist