ICLR2025

Adapters for Altering LLM Vocabularies: What Languages Benefit the Most?

HyoJung Han, Akiko Eriguchi, Haoran Xu, Hieu Hoang, Marine Carpuat, Huda Khayrallah

摘要

Limitations of Existing Vocabulary Adaptation Approaches • Heuristics based initialization for new embeddings from existing ones → Lack adaptability, not fully integrated, requires additional training • Dependency on external embeddings or dictionaries → Increase complexity and limit scalability • Language-specific approach or restrictions on the number of languages