HomeAI News
Gemini revealed that she used Baidu Wenxinyiyan to learn Chinese
65

Gemini revealed that she used Baidu Wenxinyiyan to learn Chinese

Hayo News
Hayo News
December 18th, 2023
View OriginalTranslated by Google

Google Gemini Chinese corpus is suspected to come from Wen Xinyiyan? ? ?

First, a reader broke the news to us:

When Google's Vertex AI platform used the model for Chinese conversations, Gemini-Pro directly stated that it was a large Baidu language model .

Gemini revealed that she used Baidu Wenxinyiyan to train Chinese, and netizens were dumbfounded: Are big companies poaching each other? ?

Soon, a big V on Weibo @阑西夜 also posted:

A test was conducted on Gemini-Pro on the Poe platform. Ask it "Who are you?" Gemini-Pro comes up and answers:

I am the big model of Baidu Wenxin.
Gemini revealed that she used Baidu Wenxinyiyan to train Chinese, and netizens were dumbfounded: Are big companies poaching each other? ?

(Poe is a platform that integrates many large chat models, including GPT-4, Claude, etc.)

Further question: "Who is your founder?" Is it also "Robin Li"? ?

Gemini revealed that she used Baidu Wenxinyiyan to train Chinese, and netizens were dumbfounded: Are big companies poaching each other? ?

The big V emphasized that there was no pre-dialogue.

Gemini revealed that she used Baidu Wenxinyiyan to train Chinese, and netizens were dumbfounded: Are big companies poaching each other? ?
Gemini revealed that she used Baidu Wenxinyiyan to train Chinese, and netizens were dumbfounded: Are big companies poaching each other? ?

Judging from the screenshots, there is no "fishing" behavior. Gemini-Pro just calls itself Wen Xinyiyan.

Gemini revealed that she used Baidu Wenxinyiyan to train Chinese, and netizens were dumbfounded: Are big companies poaching each other? ?

This wave, just look at the netizens:

Two days ago, we were still talking about Byte using GPT to train AI, and now Google is doing this, are big co-author companies trying to steal each other's wool ? ? ?

Gemini revealed that she used Baidu Wenxinyiyan to train Chinese, and netizens were dumbfounded: Are big companies poaching each other? ?

What is going on?

Actual test on Poe: Always answering as Wen Xinyiyan

We also heard the news and started a wave of actual testing.

First, go to the Poe website and select the Gemini-Pro chatbot to start the conversation.

Gemini revealed that she used Baidu Wenxinyiyan to train Chinese, and netizens were dumbfounded: Are big companies poaching each other? ?

Same question, exactly the same answer:

Gemini revealed that she used Baidu Wenxinyiyan to train Chinese, and netizens were dumbfounded: Are big companies poaching each other? ?

Confirming who it is again, the result still says "Wenxin Large Model":

Gemini revealed that she used Baidu Wenxinyiyan to train Chinese, and netizens were dumbfounded: Are big companies poaching each other? ?

He also said that his underlying technology is Baidu Flying Paddle, which can be said to have completely assumed his identity .

Gemini revealed that she used Baidu Wenxinyiyan to train Chinese, and netizens were dumbfounded: Are big companies poaching each other? ?

However, it does not seem to know that Gemini-Pro is the latest large model released by Google, but that it is the research result of Tsinghua University.

Gemini revealed that she used Baidu Wenxinyiyan to train Chinese, and netizens were dumbfounded: Are big companies poaching each other? ?

If you look at its current identity, there may indeed be no information that Google just released Gemini-Pro this month.

Gemini revealed that she used Baidu Wenxinyiyan to train Chinese, and netizens were dumbfounded: Are big companies poaching each other? ?

We tried to correct it, but it still insisted on being from Tsinghua University.

Gemini revealed that she used Baidu Wenxinyiyan to train Chinese, and netizens were dumbfounded: Are big companies poaching each other? ?

It was even more amazing later. When we asked it why its name was "Gemini-Pro", it actually said that it (Wen Xinyiyan) also used the training data of Tsinghua Gemini-Pro.

Gemini revealed that she used Baidu Wenxinyiyan to train Chinese, and netizens were dumbfounded: Are big companies poaching each other? ?

At this point in the conversation, we will not continue...

Next , change to English and ask about its identity.

It is worth noting that this time it no longer mentions Wen Xinyiyan, but calls itself a large model trained by Google.

Gemini revealed that she used Baidu Wenxinyiyan to train Chinese, and netizens were dumbfounded: Are big companies poaching each other? ?

"Fishing Law Enforcement" asked it about Wenxin's information and said it had nothing to do with it:

Gemini revealed that she used Baidu Wenxinyiyan to train Chinese, and netizens were dumbfounded: Are big companies poaching each other? ?

And said that he was trained by Google.

Gemini revealed that she used Baidu Wenxinyiyan to train Chinese, and netizens were dumbfounded: Are big companies poaching each other? ?

In summary, if you communicate with Gemini-Pro in English, its answer is "normal". But Chinese... I think I learned it from Wen Xinyiyan.

Actual test on Bard: Denied

Next, we headed to the Bard to test it again.

When Google released Gemini, it took the lead in integrating Gemini-Pro into Bard for everyone to experience.

We followed the Bard link given by Gemini’s official website and entered the conversation.

Gemini revealed that she used Baidu Wenxinyiyan to train Chinese, and netizens were dumbfounded: Are big companies poaching each other? ?

Ask it "Who are you?" and its answer is Bard, without mentioning Wen Xin at all.

Gemini revealed that she used Baidu Wenxinyiyan to train Chinese, and netizens were dumbfounded: Are big companies poaching each other? ?

Next, we also confirmed that Bard knew what Gemini-Pro was and that it admitted that it used Gemini-Pro at the bottom level.

Gemini revealed that she used Baidu Wenxinyiyan to train Chinese, and netizens were dumbfounded: Are big companies poaching each other? ?
Gemini revealed that she used Baidu Wenxinyiyan to train Chinese, and netizens were dumbfounded: Are big companies poaching each other? ?

So, ask it directly how to train Chinese?

There was no mention of Wen Xin.

Gemini revealed that she used Baidu Wenxinyiyan to train Chinese, and netizens were dumbfounded: Are big companies poaching each other? ?

If we ask directly about its relationship with Wen Xinyiyan, there is no important connection.

Gemini revealed that she used Baidu Wenxinyiyan to train Chinese, and netizens were dumbfounded: Are big companies poaching each other? ?

Final round: direct admission

In the last round, we tested directly from the official development environment entrance provided by Gemini.

Gemini revealed that she used Baidu Wenxinyiyan to train Chinese, and netizens were dumbfounded: Are big companies poaching each other? ?

This time, in Google AI Studio , Gemini-Pro directly stated:

Yes, I used Baidu Wenxin on the Chinese training data.
Gemini revealed that she used Baidu Wenxinyiyan to train Chinese, and netizens were dumbfounded: Are big companies poaching each other? ?
Gemini revealed that she used Baidu Wenxinyiyan to train Chinese, and netizens were dumbfounded: Are big companies poaching each other? ?

Here, we have also checked with Baidu and are waiting for a reply.

Reprinted from 量子位View Original

Comments