Simple prompt tests like the 'carwash' scenario expose situational logic and premise-rejection failures in LLMs · Digg