Sentiment
Users express excitement about the LoRA8 model's benchmark outperformance because of the substantial headroom remaining in joint length reduction and accuracy preservation techniques.
Pos
100.0%
Neg
0.0%
1 comments with sentiment.
Cluster Engagement
Digg Deeper
No Digg Deeper questions have been answered for this story yet.
Posts from X
Most Activity
Most Activity
VIEWS69LIKES1REPLIES1
kalomaze@kalomaze
if i set max tokens to 12k instead of 4k per turn for the ones with nontrivial truncation...
kalomaze@kalomaze
you LOVE to see it
13mViews 69Likes 1Bookmarks 0
kalomaze@kalomaze
there's so much headroom in joint length reduction + accuracy preservation reward shaping, it's pure insanity tbqh
kalomaze@kalomaze
if i set max tokens to 12k instead of 4k per turn for the ones with nontrivial truncation...
11mViews 38Likes 0Bookmarks 0