Edgebench Proposed As Leading ASI Benchmark For Correctness And Performance · Digg

Original post

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞) @teortaxesTex

Proud to be early to @tikgiau. This is actual paradigmatic science of applied intelligence, folks. The fit may be falsified, the specific mathematics explaining the log-sigmoid cast into doubt, but that's what a Hypothesis looks like. We don't have these very often.

Hanchi Sun @ ACL @sun_hanchi

I had been dreaming about what ASI benchmarks should be like, but edgebench still exceeded my imagination.

In essence, those tasks have goals for both correctness (verified by humans) and performance (where it may outperform humans).

Anyway, I foresee it being the most important benchmark in the next few years.

2:54 PM · Jul 4, 2026 · 6.1K Views

Posts from X

Views 1.2K · Bookmarks 2 · Likes 8 · Replies 1

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞) @teortaxesTex

now to get @arcticinstincts to read the paper and give his opinion on whether it's Kumon grindcelism or frolicmaxxing…
