It will be exciting times when we start collaborating on standardization around every part of the training tech stack starting from the tokenizer.
It’s not that clear why open models use slightly different tokenizers. That would be a good accelerant.








