< 목차 > Introduction References Introduction References Transformers learn in-context by gradient descent What learning algorithm is in-context learning? Investigations with linear models Transformers Learn Higher-Order Optimization Methods for In-Context Learning: A Study with Linear Models