ICLR 2025
Adapters for Altering LLM Vocabularies: What Languages Benefit the Most?
HyoJung Han, Akiko Eriguchi, Haoran Xu, Hieu Hoang, Marine Carpuat, Huda Khayrallah
Abstract
Limitations of Existing Vocabulary Adaptation Approaches
• Heuristic-based initialization of new embeddings from existing ones → Lacks adaptability, is not fully integrated, and requires additional training
• Dependency on external embeddings or dictionaries → Increases complexity and limits scalability
• Language-specific approaches or restrictions on the number of languages
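For readers unfamiliar with heuristic initialization, the sketch below shows one generic variant: a new token's embedding is set to the mean of the embeddings of the pieces the old tokenizer splits it into. It is an illustration only, not the method of this work or of any specific prior approach; the function names, the toy vocabulary, and the greedy toy tokenizer are all hypothetical.

```python
from typing import Callable, Dict, List

Vector = List[float]


def mean_subword_init(
    new_token: str,
    old_embeddings: Dict[str, Vector],
    old_tokenize: Callable[[str], List[str]],
) -> Vector:
    """Return the mean of the old embeddings of the pieces of `new_token`.

    Hypothetical helper: falls back to a zero vector when the old
    tokenizer yields no known piece.
    """
    pieces = [p for p in old_tokenize(new_token) if p in old_embeddings]
    dim = len(next(iter(old_embeddings.values())))
    if not pieces:
        return [0.0] * dim
    return [
        sum(old_embeddings[p][i] for p in pieces) / len(pieces)
        for i in range(dim)
    ]


# Toy "old" vocabulary with 2-dimensional embeddings (made-up values).
old_embeddings: Dict[str, Vector] = {
    "hel": [1.0, 0.0],
    "lo": [0.0, 1.0],
}


def toy_old_tokenize(text: str) -> List[str]:
    """Greedy longest-match segmentation over the toy vocabulary."""
    pieces: List[str] = []
    i = 0
    while i < len(text):
        for j in range(len(text), i, -1):
            if text[i:j] in old_embeddings:
                pieces.append(text[i:j])
                i = j
                break
        else:
            i += 1  # skip a character the toy vocabulary cannot cover
    return pieces


# "hello" is split into "hel" + "lo", so the new vector is their average.
new_vector = mean_subword_init("hello", old_embeddings, toy_old_tokenize)
print(new_vector)
```

The rule is fixed in advance and computed once from the old embedding table, which is the sense in which the first bullet describes such heuristics as lacking adaptability and needing further training afterwards.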