VIEWS40

Kyle Kastner@kastnerkyle
Additionally "neural thickets" give some interesting empirical evidence that this search for behavior is often nearby in weight space https://arxiv.org/abs/2603.12228
19hViews 40
The framework maps REINFORCE directly to score function estimators

Additionally "neural thickets" give some interesting empirical evidence that this search for behavior is often nearby in weight space https://arxiv.org/abs/2603.12228