

22 days ago


OK, RL exists, and end results like protein design or Go are impressive, but is there an RL system that actually solves the benchmark problem?


Yeah, RL has existed since the 80s (or in some form earlier), but has it solved the benchmark?
Yes. This game has a fixed, relatively small set of rules, so RL could learn to play it by playing millions of games with random but legal moves. Contrast this with the daunting (effectively infinite) number of situations one may encounter in daily life.
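
To make the "random but legal play" point concrete, here is a minimal sketch on a toy game. It is not the game discussed above: single-pile Nim (take 1 to 3 stones, whoever takes the last stone wins) is only a stand-in I picked because its rule set is tiny. The tabular Q-learning update and the names `train`, `best_move` and `legal_moves` are my own assumptions for illustration.

```python
import random
from collections import defaultdict

MAX_TAKE = 3  # assumed toy rule: a player removes 1, 2 or 3 stones per turn


def legal_moves(stones):
    """All moves the rules allow with `stones` left in the pile."""
    return range(1, min(MAX_TAKE, stones) + 1)


def train(start_stones=10, games=20000, alpha=0.5, seed=0):
    """Play random legal games and learn a value for every (pile, move) pair.

    Values are from the point of view of the player about to move:
    +1 means a forced win, -1 a forced loss.
    """
    rng = random.Random(seed)
    q = defaultdict(float)
    for _ in range(games):
        stones = start_stones
        while stones > 0:
            # Behaviour is purely random, but always within the rules.
            move = rng.choice(list(legal_moves(stones)))
            remaining = stones - move
            if remaining == 0:
                target = 1.0  # taking the last stone wins
            else:
                # The opponent moves next, so their best value is our loss.
                target = -max(q[(remaining, m)] for m in legal_moves(remaining))
            q[(stones, move)] += alpha * (target - q[(stones, move)])
            stones = remaining
    return q


def best_move(q, stones):
    """Greedy move according to the learned table."""
    return max(legal_moves(stones), key=lambda m: q[(stones, m)])


if __name__ == "__main__":
    q = train()
    for stones in (5, 6, 7, 9, 10):
        print(stones, "stones -> take", best_move(q, stones))
```

With so few reachable positions, random play visits every one of them many times, and the greedy policy ends up leaving the opponent a multiple of four stones, which is the known winning strategy for this Nim variant. That exhaustive coverage is exactly what does not carry over to open-ended daily-life situations.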