MathGPT, China's self-developed 100-billion-level mathematical model, is now online and open for public testing

Hayo News
August 24th, 2023

TAL CTO Tian Mi announced that MathGPT, a 100-billion-level large model for the mathematics field developed by TAL, has officially launched and is open for public testing. Starting now, users can register an account on the official website (www.mathgpt.com) to apply for a free trial.

In May of this year, TAL announced that it was developing its own large mathematical model, named MathGPT. MathGPT is a large model for the vertical domain of mathematics, built around problem-solving and explanation algorithms and aimed at mathematics enthusiasts and research institutions around the world. It is also described as the first large model in China built specifically for mathematics.

Users can submit math problems to MathGPT as text or images and receive answers in a conversational format. They can also use the "random question" button to have the system generate a math problem at random and provide its answer. MathGPT currently supports both PC and mobile, in Chinese and English versions.

According to the official MathGPT website, the model's mathematical capabilities cover problems at the elementary, junior high, and high school levels. Q&A interaction.

The MathGPT technical report shows that across six public mathematics evaluation sets (CEval-Math, AGIEval-Math, APE5K, CMMLU-Math, Gaokao Mathematics, and Math401), TAL's MathGPT achieved the highest score on several of them. MathGPT also performed well on the junior and senior high school subjects of the general-purpose C-Eval benchmark.

MathGPT's C-Eval list of junior and senior high school subjects

For stable problem solving and approachable explanations, MathGPT is trained on a large body of problem-solving data from well-known teachers, so the steps it produces are professional and clear. Taking a sequence problem as an example, the answer given by MathGPT has three parts, "analysis", "detailed explanation", and "finishing", which is more thorough than the rough explanations typical of general-purpose large models. "Analysis" lays out the approach and way of thinking behind the problem to help users understand it better. Key points are also highlighted to help users review, reflect on the intent of the question, and generalize from one example to others.

For users, studying a mathematical problem is not only about getting the answer, but also about understanding the principles and reasoning behind it. Compared with general-purpose large models, MathGPT solves problems with higher accuracy and explains its answers more clearly, which better meets the core need of users who turn to AI products for help with mathematics.

Alongside the release of MathGPT, TAL also published an updated, representative, and challenging evaluation set of math tasks on its official website for artificial intelligence experts and math enthusiasts around the world to try out and evaluate.

According to Tian Mi, CTO of TAL, the company hopes MathGPT will play a greater role in mathematics education. TAL is willing to share with the industry its experience and methods for developing 100-billion-level large models based on large-scale, high-quality content, and to make progress together with the industry. As the public beta proceeds, MathGPT's problem-solving ability will continue to improve. Product-level applications based on MathGPT are also in accelerated development and will be released in the near future.

Reprinted from 新智元 好困 桃子
