ICLR2025

Task Descriptors Help Transformers Learn Linear Models In-Context

Ruomin Huang, Rong Ge

Abstract

Large language models (LLM) exhibit strong in-context learning (ICL) ability, which allows the model to make predictions on new examples based on the given prompt. Recently, a line of research (