BeClaude
Research, 2026-05-06

Iterative Finetuning is Mostly Idempotent

Source: Arxiv CS.AI

arXiv:2605.01130v1
Announce Type: new

Abstract: If a model has some behavioral tendency, such as sycophancy or misalignment, and it is trained on its own outputs, will the tendency be amplified in the next generation of models? We study this question by training a series of models where each model...
