Sorry, wrong first author. @realshivamsingh the call for help is still on
Call for help: there's a lot more data that needs cleaning, if we do we can train on a scale larger of models and create much much better models. Ones that are likely to be the best in tons of languages for years. Let's help them do that. Reply for suggestions or questions