Hey,
Thanks for the remark. Indeed, the gradient is calculated each time at a single sample so it is SGD that is used here. The update formula is therefore, each time, with respect to an i-th observation in the dataset.
Best,
Firas
Hey,
Thanks for the remark. Indeed, the gradient is calculated each time at a single sample so it is SGD that is used here. The update formula is therefore, each time, with respect to an i-th observation in the dataset.
Best,
Firas
Follow the pulses of EPFL on social networks
© 2023 EPFL, all rights reserved