How can we accurately evaluate whether an AI model has generated a genuinely novel concept?
Is there a widely accepted benchmark for measuring machine creativity or invention?
Tuhin Chakrabarty says testing novelty requires exact training data
How can we accurately evaluate whether an AI model has generated a genuinely novel concept?
Is there a widely accepted benchmark for measuring machine creativity or invention?
For that you need to to know whats “genuinely” in the training data
How can we accurately evaluate whether an AI model has generated a genuinely novel concept?
Is there a widely accepted benchmark for measuring machine creativity or invention?
@TuhinChakr if you model an intractably large combinatorial space with a neural network, you generally get some kind of ability to map unseen points you can reliably get base models to say novel things the problem is that "novelty" in isolation =/= "novel & true/useful/culturally resonant"
For that you need to to know whats “genuinely” in the training data
@TuhinChakr one mans novelty is another mans ood output
@TuhinChakr if you model an intractably large combinatorial space with a neural network, you generally get some kind of ability to map unseen points you can reliably get base models to say novel things the problem is that "novelty" in isolation =/= "novel & true/useful/culturally resonant"
Tuhin Chakrabarty says testing novelty requires exact training data
How can we accurately evaluate whether an AI model has generated a genuinely novel concept?
Is there a widely accepted benchmark for measuring machine creativity or invention?
For that you need to to know whats “genuinely” in the training data
How can we accurately evaluate whether an AI model has generated a genuinely novel concept?
Is there a widely accepted benchmark for measuring machine creativity or invention?
@TuhinChakr if you model an intractably large combinatorial space with a neural network, you generally get some kind of ability to map unseen points you can reliably get base models to say novel things the problem is that "novelty" in isolation =/= "novel & true/useful/culturally resonant"
For that you need to to know whats “genuinely” in the training data
@TuhinChakr one mans novelty is another mans ood output
@TuhinChakr if you model an intractably large combinatorial space with a neural network, you generally get some kind of ability to map unseen points you can reliably get base models to say novel things the problem is that "novelty" in isolation =/= "novel & true/useful/culturally resonant"