@xeophon I will relax them to make it more interesting 1) No more than 10M trainable weights in the solver 2) Can't peek into the test set
@DimitrisPapail don’t tempt me
what are the rules?
The update bans access to the evaluation test set
Many users praised the ML solver challenge as a cute project and a fun approach worth trying under its outlined rules.
@alexjc 1) No more than 10M trainable weights in the solver 2) Can't use frozen models/api calls 3) Can't peek into the test set
@DimitrisPapail If you're willing to include things like units as hard-coded hints you can get more than 15%... what were your rules?
@xeophon also 3) can't use frozen models (i thought about a variant of that but it's not very symbolic) where you take a frozen model, and finetune linear probes at the last layer. But that's a different project altogether :p
@alexjc i think above 20% is extremely hard!
@DimitrisPapail OK, that's not what I imagined as a pure Python program! With trainable weights it's a good approach, but with those rules and a narrow focus I think 50-60% (or more) should be the target? Maybe I should dig out my prototypes to try to add more parameters...

@DimitrisPapail 0-shot and pass@1? I will see if I can make some time, sounds fun to dig in again...

@xeophon it's a cute project!

@alexjc yes :)