Discuss, Learn and be Happy דיון בשאלות

help brightness_4 brightness_7 format_textdirection_r_to_l format_textdirection_l_to_r

When applying DAgger with coaching, we (multiple answers may apply)

1
done
done
done
by
מיין לפי

Which of the following statements is correct regarding model-based learning (multiple answers may apply)

1
done
done
by
מיין לפי

Which of the following statements are correct regarding policy gradients algorithms (multiple answers may apply)

1
done
done
by
מיין לפי

The problem of model bias stems from the fact that a given sampling of the dynamics may be represented by multiple functions, with us being unable to know whether we overfit.

1
by
מיין לפי

Which of the following statements regarding actor-critic methods is correct

1
done
by
מיין לפי

When using Experience replay in DQN (multiple answers may apply):

1
done
done
by
מיין לפי

Distillation produces a more efficient model after the training is complete, but requires additional steps during training

1
by
מיין לפי

Which of the following statements is true regarding Actor-Mimic Networks (AMN). (multiple answers may apply):

1
done
done
by
מיין לפי

The update of the REINFORCE algorithm is carried out using the formula (2020 A, 13)

1
done
done
by
מיין לפי

When applying pre-trained networks to a new dataset, attention models can be used to determine how much weight to assign to the input of each network

1
by
מיין לפי