tokenizer_multiling_emoji Some scripts for tokenization of multilingual text containing "composed emojis" introduced with Unicode 10.0 in 2017.