A Theoretical Perspective: How to Prevent Model Collapse in Self-consuming Training Loops

Fu, Shi; Wang, Yingjie; Chen, Yuzhu; Tian, Xinmei; Tao, Dacheng

Computer Science > Machine Learning

arXiv:2502.18865 (cs)

[Submitted on 26 Feb 2025]

Title:A Theoretical Perspective: How to Prevent Model Collapse in Self-consuming Training Loops

Authors:Shi Fu, Yingjie Wang, Yuzhu Chen, Xinmei Tian, Dacheng Tao

View PDF HTML (experimental)

Abstract:High-quality data is essential for training large generative models, yet the vast reservoir of real data available online has become nearly depleted. Consequently, models increasingly generate their own data for further training, forming Self-consuming Training Loops (STLs). However, the empirical results have been strikingly inconsistent: some models degrade or even collapse, while others successfully avoid these failures, leaving a significant gap in theoretical understanding to explain this discrepancy. This paper introduces the intriguing notion of recursive stability and presents the first theoretical generalization analysis, revealing how both model architecture and the proportion between real and synthetic data influence the success of STLs. We further extend this analysis to transformers in in-context learning, showing that even a constant-sized proportion of real data ensures convergence, while also providing insights into optimal synthetic data sizing.

Comments:	Accepted at ICLR 2025
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2502.18865 [cs.LG]
	(or arXiv:2502.18865v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2502.18865

Submission history

From: Shi Fu [view email]
[v1] Wed, 26 Feb 2025 06:18:13 UTC (314 KB)

Computer Science > Machine Learning

Title:A Theoretical Perspective: How to Prevent Model Collapse in Self-consuming Training Loops

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:A Theoretical Perspective: How to Prevent Model Collapse in Self-consuming Training Loops

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators