BeClaude
Research2026-05-08

Are Flat Minima an Illusion?

Source: Arxiv CS.AI

arXiv:2605.05209v1 Announce Type: cross Abstract: Neural networks that land in flat regions of the loss landscape tend to generalise better than those in sharp regions. Sharpness-Aware Minimisation exploits this to improve generalisation. But function-preserving reparameterisation can inflate the...

arxivpapers