Research2026-05-12

When Tables Leak: Attacking String Memorization in LLM-Based Tabular Data Generation

arXiv:2512.08875v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) have recently demonstrated remarkable performance in generating high-quality tabular synthetic data. In practice, two primary approaches have emerged for adapting LLMs to tabular data generation: (i) fine-tuning...

Read Original Article on Arxiv CS.AI

arxivpapers