@yoavartzi I am soon moving to the new lab, so there's a lot of thinking (and a big branch should be pretraining), but it is also already active. I think the most concrete pretraining challenge is With context
"It is time to separate language from language models" The revelation keeps bugging me, and while making the talk "multilingual?" I just gave. Thought I'd briefly share the contents of the talk: