9h agoSimple prompt tests like the 'carwash' scenario expose situational logic and premise-rejection failures in LLMsChatGPT suggested walking to the wash and driving back.SentimentSentimentPos60%Neg40%Positive users like the car wash distance query test as a useful estimator of LLMs' real capabilities, while negative users dismiss such viral tests as unhelpful for fixing actual model flaws.5 comments with sentiment. View comments.