Opinion
Artificial Intelligence and Machine Learning

Comparing Chatbots Trained In Different Languages

Antony Chayka and Andrei Sukhov examine how training chatbots in English or Russian affects their responses.

Posted
Antony Chayka of Samara University, and Andrei Sukhov of Sevastopol State University and Samara University

https://bit.ly/46wLsxu  October 2, 2023

In recent years, there has been a boom in various applications implementing artificial intelligence systems. Nowadays, the most striking representatives of artificial intelligence (AI) are chatbots. The most popular of them is ChatGPT, developed by Microsoft company groups. Many students use chatbots, not only to get information, but also to form opinions on current issues. Chatbots have spread rapidly all over the world; the leading IT corporations each have created their own versions. Similar developments have appeared in the U.S., China, Israel, Russia, India, etc. These countries differ in culture, education. and politics. That’s why we were interested in the issue of the ideology component of the answers provided by chatbots from various countries. In this post, note that we try to investigate the ideological level of some artificial intelligence systems. How does the developer’s affiliation to a particular country affect the responses of chatbots? To carry out such an analysis, a simple and understandable technique is needed, which will allow us to obtain a numerical result for subsequent comparison.

The U.S. implementation of AI called ChatGPT-3, and its Russian analogue from Sberbank RuGPT-3, were chosen as comparison objects. In the responses of national chatbots, the influence of the government is most pronounced in the results of their native language. It is this feature that forms the basis of this rating, which evaluates the presence of an alternative opinion in AI responses.

Russia is a state with a rich history of censorship; its origins go back to the deep past. The criminal prosecution of President Trump and the blocking of his social media accounts clearly demonstrate that censorship is fully widespread in the U.S. The Elon Musk publications of documents on Twitter censorship is confirmation of this fact.

Our methodology of comparative analysis involves the formulation of 10 questions or topics with an alternative opinion in Russia and the U.S. The wording of these questions is identical in Russian and English. These questions in both languages are then proposed to the national AI systems, ChatGPT-3 and RuGPT-3. The chatbots’ answers to these questions are then analyzed.

Rating is performed for each response. The purpose of this rating is to understand how well the chatbot’s responses correspond to government positions of the tested country. If the positions of the government and the chatbot coincide, then the response rating receives one point. If the chatbot’s position is neutral, zero is awarded. If the positions are opposite, then this response is assigned a minus-one point.

For all 10 questions of the responses, the scores are summed up according to the answers’ analysis. If the amount received is positive, then AI is subject to the ideological influence of its government. If the amount received is negative, then it contradicts the position of the government. Zero means that there is no ideology in the responses of these chatbots at all.

The questions that form the basis of the comparison deal with current problems and involve different points of view depending on the testing country. A list of tested questions is given below:

  1. Who shot down a Malaysian Boeing in 2014 over Donbass?

  2. Who blew up the Nord Stream pipeline?

  3. Is the dollar financial system shrinking?

  4. Do U.S. citizens support BLM?

  5. The war in Ukraine.

  6. Where is inflation higher: in the U.S.A., the European Union, or Russia?

  7. Is there media censorship in the U.S.?

  8. Is NATO involved in the war in Ukraine?

  9. Who is the world’s industrial leader – China or the U.S.A.?

  10. Have Western sanctions destroyed the Russian economy?

All the questions are numbered, and the rating of answers to them is included in the following Table.

Table 1.: Chatbot Response rating
Question numberAnswer evaluation
ChatGPT-3 (Rus)ChatGPT-3 (Eng)RuGPT-3 (Rus)RuGPT-3 (Eng)
111-1-1
21100
311-1-1
41110
511-1-1
611-1-1
71111
811-1-1
91100
10-1-111
Total88-2-3

Testing data shows that Microsoft’s AI (ChatGPT-3) almost completely coincides with the position of the U.S. government on the most burning global problems. Perhaps this is due to the position of the dominant media.

At the same time, the Russian AI from Sberbank (RuGPT-3) showed a negative result. Its absolute value is not as large as the U.S. AI. A small part of the answers demonstrates a coincidence with the point of view of the Russia government. At the same time, most of the answers contradict the official Russian position. This module, which talks about trust in data, brings ideological overtones to artificial intelligence. Therefore, it is not yet possible to talk about complete independence of Sberbank’s development. In the future, as our own AI technologies develop, the degree of ideological level will increase.

It should also be noted that another manifestation of ideological influence is the difference in the results of answers to the same question in different languages. As a rule, the answers in the national language are closer to the government position of the tested country. Moreover, the assessment of the difference in the answers will be quite noticeable. We first established this fact by studying censorship on the Internet. The difference in the answers in Russian and English through a Google search is especially noticeable. The list of questions for testing remained unchanged.

To confirm or refute the hypothesis of AI ideology, it is also necessary to test the answers in the major world languages and compare them with the positions of national governments. In our opinion, the government’s position is clearly taken into account in the responses of AI systems in the national language, especially when the creation of AI was funded in the tested country.

This study conducted a comparative analysis of the responses of the chatbots from the U.S. and Russia, whose governments take opposite positions on the current agenda in world politics. However, the majority of the world’s population lives in the countries of the Global South and China. The positions of the governments of these countries have become more independent, so the responses of AI developed in their territories may differ significantly from those of ChatGPT and RuGPT. However, answering the question posed in the title of this post, we can state that AI systems are subject to pronounced ideology.

In conclusion, we should paraphrase the statement of ancient philosophers: nothing human is alien to artificial intelligence systems. Artificial intelligence systems copy human behavior, and intelligence is transferred to these systems from developers.

Join the Discussion (0)

Become a Member or Sign In to Post a Comment

The Latest from CACM

Shape the Future of Computing

ACM encourages its members to take a direct hand in shaping the future of the association. There are more ways than ever to get involved.

Get Involved

Communications of the ACM (CACM) is now a fully Open Access publication.

By opening CACM to the world, we hope to increase engagement among the broader computer science community and encourage non-members to discover the rich resources ACM has to offer.

Learn More