Revolutionary One-Shot RLVR: Train LLMs to Reason with a Single Example - Searchlysis Developer