ProgramBench evaluates language models on reconstructing complete codebases solely from compiled binaries and documentation, as John Yang calls for v2 task suggestions
John Yang seeks input on CLI tools, executables, and apps.
——0——
John Yang seeks input on CLI tools, executables, and apps.