EMNLP2023

Revisiting Instruction Fine-tuned Model Evaluation to Guide Industrial Applications

Manuel Faysse, Gautier Viaud, Céline Hudelot, Pierre Colombo

被引用 2 次

摘要

Instruction Fine-Tuning (IFT) is a powerful paradigm that strengthens the zero-shot capabilities of Large Language Models (LLMs), but in doing so induces new evaluation metric requirements. We show LLM-based metrics to be well adapted to these requirements, and leverage them to conduct an investigation of taskspecialization strategies, quantifying the tradeoffs that emerge in practical industrial settings. Our findings offer practitioners actionable insights for real-world IFT model deployment.