Research2026-05-07
Understanding and Mitigating Bias Inheritance in LLM-based Data Augmentation on Downstream Tasks
Source: Arxiv CS.AI
arXiv:2502.04419v3 Announce Type: replace-cross Abstract: Generating synthetic datasets via large language models (LLMs) has emerged as a promising approach to improve LLM performance. However, LLMs inherently reflect biases in their training data, leading to a critical challenge: when models are...
arxivpapers