Taking a Big Step: Large Learning Rates in Denoising Score Matching Prevent Memorization

Wu, Yu-Han; Marion, Pierre; Biau, Gérard; Boyer, Claire

Statistics > Machine Learning

arXiv:2502.03435 (stat)

[Submitted on 5 Feb 2025 (v1), last revised 6 May 2025 (this version, v2)]

Title:Taking a Big Step: Large Learning Rates in Denoising Score Matching Prevent Memorization

Authors:Yu-Han Wu, Pierre Marion, Gérard Biau, Claire Boyer

View PDF

Abstract:Denoising score matching plays a pivotal role in the performance of diffusion-based generative models. However, the empirical optimal score--the exact solution to the denoising score matching--leads to memorization, where generated samples replicate the training data. Yet, in practice, only a moderate degree of memorization is observed, even without explicit regularization. In this paper, we investigate this phenomenon by uncovering an implicit regularization mechanism driven by large learning rates. Specifically, we show that in the small-noise regime, the empirical optimal score exhibits high irregularity. We then prove that, when trained by stochastic gradient descent with a large enough learning rate, neural networks cannot stably converge to a local minimum with arbitrarily small excess risk. Consequently, the learned score cannot be arbitrarily close to the empirical optimal score, thereby mitigating memorization. To make the analysis tractable, we consider one-dimensional data and two-layer neural networks. Experiments validate the crucial role of the learning rate in preventing memorization, even beyond the one-dimensional setting.

Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:2502.03435 [stat.ML]
	(or arXiv:2502.03435v2 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2502.03435

Submission history

From: Yu-Han Wu [view email]
[v1] Wed, 5 Feb 2025 18:29:35 UTC (189 KB)
[v2] Tue, 6 May 2025 13:17:30 UTC (197 KB)

Statistics > Machine Learning

Title:Taking a Big Step: Large Learning Rates in Denoising Score Matching Prevent Memorization

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Taking a Big Step: Large Learning Rates in Denoising Score Matching Prevent Memorization

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators