ACL 2025, Li: TokAlign (Token Alignment)
Method to adapt tokenization across models. Notable methods: (1) use pairwise cosine similarity between token embeddings to create an alignment grid; (2) initialize the new adapted embedding for each id from its most similar tokens; (3) tune.
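
A minimal sketch of the alignment and initialization steps, not the paper's implementation. It assumes both vocabularies already have token vectors of the same dimension in a comparable space, and it copies only the single most similar token's embedding (top-1). The function names `cosine_alignment` and `init_adapted_embeddings` and the toy arrays are hypothetical, and the tuning step is not shown.

```python
import numpy as np

def cosine_alignment(src_vecs, tgt_vecs):
    """Return an (n_tgt, n_src) grid of pairwise cosine similarities."""
    # Normalize rows to unit length so the dot product equals cosine similarity.
    src = src_vecs / np.linalg.norm(src_vecs, axis=1, keepdims=True)
    tgt = tgt_vecs / np.linalg.norm(tgt_vecs, axis=1, keepdims=True)
    return tgt @ src.T

def init_adapted_embeddings(sim, src_model_emb):
    """For each target id, copy the model embedding of its most similar source id."""
    best_src = sim.argmax(axis=1)          # top-1 source id per target id
    return src_model_emb[best_src].copy(), best_src

# Toy data (assumed): 3 source tokens, 2 target tokens, 2-d alignment vectors.
src_vecs = np.array([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])
tgt_vecs = np.array([[0.1, 2.0], [3.0, 0.2]])

# The source model's embedding table (3 tokens, hidden size 3), also toy values.
src_model_emb = np.array([[10.0, 11.0, 12.0],
                          [20.0, 21.0, 22.0],
                          [30.0, 31.0, 32.0]])

sim = cosine_alignment(src_vecs, tgt_vecs)
new_emb, best_src = init_adapted_embeddings(sim, src_model_emb)
print(best_src)   # source id chosen for each target id
print(new_emb)    # initial embedding table for the new vocabulary
```

Copying the top-1 match keeps the example short; averaging the top-k most similar tokens would be a natural variant, and either way the resulting table is only a starting point for tuning.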