Pioneering RL Frontier for LLM