Discuss, Learn and be Happy דיון בשאלות

help brightness_4 brightness_7 format_textdirection_r_to_l format_textdirection_l_to_r

Which of the following statements is correct regarding DAgger with coaching

1
done
by
מיין לפי

Auto-encoders can be used in model-based learning to predict the next state given a state and an action

1
sentiment_very_satisfied
by
מיין לפי

One of the main advantages of evolutionary algorithms is the fact that they do not need to use backpropagation

1
by
מיין לפי

1. Which of the following statements is correct with regard to model-based RL:

1
done
by
מיין לפי

a. One of the main difficulties in learning across multiple datasets is the need to determine the similarity of the various datasets (and their states) to the one currently being analyzed

1
by
מיין לפי

b. Adaptation for changing dynamics is an important element in applying meta-learning to real-world problems

1
by
מיין לפי

Which of the following scenarios may encourage us to use a stochastic policy (multiple answers may apply):

1
done
done
by
מיין לפי

a. For a good unbiased estimator, we could use the average of the value-function for all the states in our state space

1
by
מיין לפי

Which of the following reasons might make us to prioritize using imitation learning techniques over other types of RL techniques (multiple answers may apply):

1
done
done
done
by
מיין לפי

If we have the optimal value function V* then we are always capable of calculating the optimal Q-function Q* -

1
by
מיין לפי