מה אני DRL

לחץ כאן לכל השאלות

b) When training an agent to play a two-player game against other agents trained by other people (e.g., chess, rock-paper-scissors), it might be useful to use a stochastic policy both at train and test time

1
by
מיין לפי

* השאלה נוספה בתאריך: 28-02-2025