Implicit Regularization of SGD in High-Dimensional Linear Regression. Research seminar, fall 2025

Name: Implicit Regularization of SGD in High-Dimensional Linear Regression. Research seminar, fall 2025
Uploaded: 2026-06-23T15:15:15+03:00
Duration: 1 h 6 min 1 s
Channel: BRAIn Lab: научные семинары (research seminars)
Description: Implicit Regularization of SGD in High-Dimensional Linear Regression. Research seminar, fall 2025

Speaker: Cong Fang, Researcher at Peking University

What will the talk cover?

Stochastic Gradient Descent (SGD) is one of the most widely used algorithms in modern machine learning. In high-dimensional learning problems, the number of SGD iterations is often smaller than the number of model parameters, and the implicit regularization induced by the algorithm plays a key role in ensuring strong generalization performance.

In this seminar, we will:
- Analyze the generalization behavior of SGD across different learning scenarios;
- Compare learning efficiency across various scales, depending on data size and dimensionality;
- Discuss the effects of covariate shift;
- Present theoretical insights that inspire memory-efficient training algorithms for large language models (e.g., GPT-2).

