Equivariant transformer is all you need
Oct 19, 2023
7 pages
Published in:
- PoS LATTICE2023 (2024) 001
Contribution to:
- Lattice 2023
- Published: Dec 27, 2023
e-Print:
- 2310.13222 [hep-lat]
DOI:
Abstract: (SISSA)
Machine learning, and deep learning in particular, has been accelerating computational physics, where it has been used to simulate systems on a lattice. Equivariance is essential for simulating a physical system because it imposes a strong inductive bias on the probability distribution described by a machine learning model. This reduces the risk of erroneous extrapolation that deviates from data symmetries and physical laws. However, imposing symmetry on the model sometimes results in a poor acceptance rate in self-learning Monte Carlo (SLMC). On the other hand, attention, as used in Transformers such as GPT, realizes a large model capacity. We introduce symmetry-equivariant attention to SLMC. To evaluate our proposed architecture, we apply it to a spin-fermion model on a two-dimensional lattice. We find that it overcomes the poor acceptance rates of linear models, and we observe a scaling law for the acceptance rate, as seen in large language models built on Transformers.
Note:
- 7 pages, 4 figures, contribution for the 40th International Symposium on Lattice Field Theory (Lattice 2023), July 31st - August 4th, 2023, Fermi National Accelerator Laboratory
Keywords:
- model: linear
- dimension: 2
- acceptance
- machine learning
- lattice
- Monte Carlo
- numerical calculations
- scaling
- induction