Step Reasoner mini (Step R-mini for short) is the first reasoning model launched by Leap Star. It uses a unique "slow thinking" and repeatedly verified logic mechanism to provide accurate and reliable responses, and can effectively solve complex problems such as logical reasoning, coding, mathematics, etc., while taking into account general fields such as literary creation, showing a powerful " Ability to study both arts and sciences.
Core features:
Strong reasoning ability: Good at active planning, trial and reflection, and solving complex problems through logical reasoning, including mathematics problems (even Mathematical Olympiad problems), geometry problems (can actively draw sketches), logical reasoning problems and LeetCode "Hard" level programming problems .
A combination of arts and sciences: Unlike many inference models that are only good at a single field, Step R-mini has been trained through a large amount of reinforcement learning to perform well in tasks such as literary creation, daily chatting and translation, and can understand user intentions and perform creative tasks. Express.
Excellent benchmark performance: In mathematical benchmarks such as AIME and Math, Step R-mini performs better than o1-preview and is comparable to OpenAI's o1-mini; it also performs better than o1-preview in LiveCodeBench programming tasks.
Reinforcement learning training: Use the On-Policy reinforcement learning algorithm for training to improve the comprehensive capabilities of the model.
Future visual reasoning capabilities: Step Star is developing a visual reasoning model to extend reasoning capabilities to the visual field and achieve "Spatial-Slow-Thinking".
Application scenarios:
Math problem solving: Ability to construct chains of reasoning, enumerate solutions, and draw sketches.
Logical reasoning: Ability to independently explore problem-solving ideas and self-questioning.
Programming: Able to understand user needs and build code logic to solve complex development needs.
Content Creation: Able to understand users’ needs and express creatively.
Translation: Ability to translate accurately and with rich connotations.
How to experience:
Users can log in to the Yuewen web page