Prime Intellect's kalomaze shows transformers can condition on audio by projecting raw waveform samples directly as patches, bypassing STFT · Digg