6h agoOpen-source developer xlr8harder extends Talkie-1930-13B to 32k context, arguing RULER benchmark unfairly tests modern conceptsIts RULER score dropped from 80.78 to 61.83.