When characters are added to the tokenizer with the init_unk: true setting, the first 2 characters are not initialized with the <unk> embeddings

When new characters are added to the NLLB tokenizer and the init_unk configuration setting is enabled, the first 2 new characters are not initialized with the embeddings of the <unk> character.