Rohan Anil, formerly a Distinguished Engineer at Google DeepMind, describes immediate compute access at Ant without permissions, meetings or future commitments that supported uninterrupted research, contrasting it with conditions at Google DeepMind
The exchange highlighted how differences in resource allocation and approval processes affect research velocity across AI organizations.
Saying this in the most helpful way: since I am not cherrypicking anything here, Interesting bit having worked at Ant and GDM. At Ant, I needed no permissions or meetings to get access to them. No trading compute, no future promises. No stress. Just pure research, and it helped a lot in landing new innovative ideas to make model better and it was one of the most beautiful time of my life for pushing fundamental innovation, my mind was fully refreshed and became the best IC I have ever been, even better than when I was at Brain. Only Jt could nerdsnipe me into something new because I think its possible to do that soon.
It’s age of research, but we need to advance research on faster timelines. Org chart and status driven tastes on research is not going to work. Its needs to be cradled like a baby, need to pay attention, need to do diaper changes and feed that research.
Saying this in the most helpful way: since I am not cherrypicking anything here, Interesting bit having worked at Ant and GDM. At Ant, I needed no permissions or meetings to get access to them. No trading compute, no future promises. No stress. Just pure research, and it helped a lot in landing new innovative ideas to make model better and it was one of the most beautiful time of my life for pushing fundamental innovation, my mind was fully refreshed and became the best IC I have ever been, even better than when I was at Brain. Only Jt could nerdsnipe me into something new because I think its possible to do that soon.
Cortisol spike on all or nothing compute fights is something anti thesis to fundamental innovation.
It’s age of research, but we need to advance research on faster timelines. Org chart and status driven tastes on research is not going to work. Its needs to be cradled like a baby, need to pay attention, need to do diaper changes and feed that research.
@_arohan_ My solution to this is to work on stuff no one cares about so no one has tried to scale it yet, so I can still do novel work with like 5 GPUs.
Saying this in the most helpful way: since I am not cherrypicking anything here, Interesting bit having worked at Ant and GDM. At Ant, I needed no permissions or meetings to get access to them. No trading compute, no future promises. No stress. Just pure research, and it helped a lot in landing new innovative ideas to make model better and it was one of the most beautiful time of my life for pushing fundamental innovation, my mind was fully refreshed and became the best IC I have ever been, even better than when I was at Brain. Only Jt could nerdsnipe me into something new because I think its possible to do that soon.
@_arohan_ It's a quiet life but it has its simple joys.
@_arohan_ My solution to this is to work on stuff no one cares about so no one has tried to scale it yet, so I can still do novel work with like 5 GPUs.
@LucaAmb @_arohan_ Oh no no, I'm talking even fewer people caring.
@pfau @_arohan_ Join the language diffusion field then :D
@_arohan_ A lot of foundational research doesn't need that much compute..!
It’s age of research, but we need to advance research on faster timelines. Org chart and status driven tastes on research is not going to work. Its needs to be cradled like a baby, need to pay attention, need to do diaper changes and feed that research.
@pfau @_arohan_ Join the language diffusion field then :D
@_arohan_ My solution to this is to work on stuff no one cares about so no one has tried to scale it yet, so I can still do novel work with like 5 GPUs.