BeClaude
Research2026-05-12

Do not copy and paste! Rewriting strategies for code retrieval

Source: Arxiv CS.AI

arXiv:2605.08299v1 Announce Type: cross Abstract: Embedding-based code retrieval often suffers when encoders overfit to surface syntax. Prior work mitigates this by using LLMs to rephrase queries and corpora into a normalized style, but leaves two questions open: how much representational shift...

arxivpapers