EMNLP2025
AIR: Complex Instruction Generation via Automatic Iterative Refinement
Wei Liu, Yancheng He, Yu Li, Hui Huang, Chengwei Hu, Jiaheng Liu, Shilong Li, Wenbo Su, Bo Zheng
摘要
With the development of large language models, their ability to follow simple instructions has significantly improved.However, adhering to complex instructions remains a major challenge.Current approaches to generating complex instructions are often irrelevant to the current instruction requirements or suffer from limited scalability and diversity.Moreover, methods such as back-translation, while effective for simple instruction generation, fail to leverage the rich knowledge and formatting in human written documents.In this paper, we propose a novel Automatic Iterative Refinement (AIR) framework to generate complex instructions with constraints, which not only better reflects the requirements of real scenarios but also significantly enhances LLMs' ability to follow complex instructions.The AIR framework consists of two stages: 1) Generate an initial instruction from a document; 2) Iteratively refine instructions with LLM-as-judge guidance by comparing the model's output with the document to incorporate valuable constraints.Finally, we construct the AIR-10K dataset with 10K complex instructions and demonstrate that instructions generated with our approach significantly improve the model's ability to follow complex instructions, outperforming existing methods for instruction generation 1 .1 Codes and data are available at https://github.com/ WeiLiuAH/AIR-Automatic-Iterative-Refinement.Help me to write an advertisement line for laptop. Initial InstructionPower up your productivity and unleash creativity with our cutting-edge laptop-where performance meets portability!make the line with around 10 words.C1 Unleash your potential with speed, style, and innovation.Savior: Unleash the power of innovation in your hands.Savior: Unleash epic gaming performance with cutting-edge power and immersive visuals refer to the name of the laptop as savior.C2 C3 emphasize its gaming performance.C3