Research2026-05-12
When Tables Leak: Attacking String Memorization in LLM-Based Tabular Data Generation
Source: Arxiv CS.AI
arXiv:2512.08875v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) have recently demonstrated remarkable performance in generating high-quality tabular synthetic data. In practice, two primary approaches have emerged for adapting LLMs to tabular data generation: (i) fine-tuning...
arxivpapers