Rapid FinFET Modelling Using an Autoencoder
arXiv:2606.24046v1 Announce Type: cross Abstract: This work presents a machine learning framework that leverages an autoencoder (AE) for the efficient modeling of FinFET. We first calibrated a BSIM-CMG model to generate a dataset of current-voltage (ID-VG) characteristics. This data was used to...
This research from Arxiv presents a practical application of autoencoders to solve a bottleneck in semiconductor device modeling. The authors trained a machine learning framework to replicate the electrical behavior of FinFET transistors—the foundational building blocks of modern processors and memory chips.
What Happened
The team began by calibrating a standard physics-based model (BSIM-CMG) to generate a large dataset of current-voltage (ID-VG) curves. Instead of using this data to train a conventional neural network for direct prediction, they employed an autoencoder architecture. An autoencoder learns a compressed, latent representation of the input data and then reconstructs it. In this context, the autoencoder learned the underlying patterns of FinFET electrical behavior from the simulated data, creating a surrogate model that can rapidly generate accurate ID-VG characteristics without running the full physics simulation.
Why It Matters
The semiconductor industry faces a growing computational burden. Traditional TCAD (Technology Computer-Aided Design) simulations for advanced transistors like FinFETs can take hours or days per structure. This becomes a critical bottleneck in design space exploration, process optimization, and statistical variability analysis—tasks that require evaluating thousands or millions of device variations.
The autoencoder approach offers a compelling trade-off: a one-time training cost to create a model that can then generate predictions nearly instantaneously. This is not merely an academic curiosity. As chip designs push toward 3nm and 2nm nodes, the complexity of device physics increases exponentially. Surrogate models that preserve accuracy while slashing simulation time could enable designers to explore more design points, optimize for power and performance more aggressively, and reduce the time-to-market for new process technologies.
Furthermore, using an autoencoder rather than a simpler regression model suggests an attempt to capture higher-order correlations and non-linearities in the device behavior. The latent space of the autoencoder may also prove useful for other tasks, such as anomaly detection (identifying outlier devices) or transfer learning between different process nodes.
Implications for AI Practitioners
For AI engineers working in hardware or electronic design automation (EDA), this work reinforces a key lesson: domain-specific data compression techniques can outperform generic supervised learning for physical modeling. The autoencoder’s ability to learn a compact representation of complex physical phenomena is directly applicable to other areas of computational physics, including electromagnetics, fluid dynamics, and thermal modeling.
Practitioners should note the reliance on simulated data for training. This is both a strength and a limitation. Simulated data is clean and abundant, but it may not capture all real-world manufacturing variations. Deploying such a model in a fab environment would require careful validation against measured silicon data.
The approach also highlights the importance of calibration. The BSIM-CMG model was calibrated first, meaning the autoencoder’s accuracy is bounded by the quality of that initial physics model. AI practitioners should view this as a pipeline problem: the machine learning component is only as good as the data generation step that precedes it.
Key Takeaways
- Autoencoders can serve as efficient surrogate models for complex semiconductor physics, enabling near-instantaneous prediction of FinFET electrical characteristics after a one-time training cost.
- This approach addresses a critical industry bottleneck: the growing computational expense of TCAD simulations at advanced process nodes (3nm and below).
- The reliance on calibrated simulation data means model accuracy is fundamentally limited by the underlying physics model, not just the neural network architecture.
- For AI practitioners, this demonstrates a template for applying unsupervised representation learning to physical modeling problems beyond semiconductors, such as electromagnetics or fluid dynamics.