Loading...

Model collapse

The degenerative process where a model trained on synthetic (AI-generated) data eventually loses quality, diversity, and connection to reality. As the internet fills with AI-generated content, "organic" human-generated training data becomes a premium asset; contracts may need to specify the ratio of synthetic versus organic data to support data quality warranties.

See: Data provenance; Synthetic data; Training data