These days, getting visibility for a paper feels really hard, especially for us unknowns. Happy to see the community building more ways to help. Just learned about @HuggingPapers and submitted a short paper there. Many thanks to @researchpodapp for noticing it and turning it into a brilliant podcast with a sleek UI design, and to @NielsRogge for reminding me to release pretrained checkpoints for my previous two papers and for helping directly with improving the model cards.
Attaching this short paper here to see whether anyone is interested. Main takeaway: instead of focusing only on improving sparse inference methods, we can also design upstream architectures that are inherently better at handling sparse inference.



