
Inside the Censorship Machine

Roskomnadzor plans total surveillance of the entire Russian-speaking internet using artificial intelligence. Is that possible?

ILLUSTRATION: AI-ARTIST SPELIY ARBUZ WITH HELP FROM THE MIDJOURNEY NEURAL NETWORK

“We don’t have even a free finger” 

On July 12, 2022, Alexander Fedotov, head of the science and technology center of the General Radio Frequency Center (GRFC), was preparing for the next meeting of the Expert Council on Artificial Intelligence. The GRFC is part of Russia’s main censor, Roskomnadzor. The center is responsible for monitoring the internet, preparing information sheets and reports on the “prohibited information” it finds, and blocking that information.

For many years, Roskomnadzor employees have had to search for this so-called prohibited information largely by hand: the programs available to them can only filter materials by keywords, after which a person has to double-check everything, and the number of topics the programs cover is very limited. Management disliked the inefficiency of manual searching: Roskomnadzor was always a few steps behind, unable to keep up with the pace of publication on the internet, and there simply weren’t enough people for the volume of work. When management asked staff to come up with creative ways to improve the work, one employee replied in an email: “We love being creative, but right now we don’t just lack free hands; we don’t have even a free finger.”

That is why the GRFC was tasked with developing several automated systems that would constantly monitor social networks, media, messenger channels, image boards, and other sources of information. This was the subject of the meeting for which Fedotov was preparing: he had to draft opening remarks for himself and for his supervisor, Ruslan Nesterenko, interim CEO of the GRFC.

In his introduction, Nesterenko said that a year earlier the GRFC had already carried out research toward programs based on machine learning and neural networks. The goal was to give Roskomnadzor tools for global surveillance not only of individual oppositionists, activists, volunteers, and independent journalists deemed objectionable by the state, but of almost any Russian who dares to speak out on social networks.

IStories journalists discovered the content of Nesterenko’s speech and the internal correspondence of his employees thanks to the largest ever leak of internal documents from Roskomnadzor. IStories received exclusive access to over two million documents, images, and internal emails. The project is called #RussianCensorFiles.

Here’s what we found:

  • which automated systems Roskomnadzor is developing for total internet surveillance, and whether they can actually be implemented;
  • which topics these systems will track, and how;
  • what technologies Roskomnadzor already has on hand.

Boar goes hunting

Having received the floor after Ruslan Nesterenko, Alexander Fedotov, the head of projects for the development of automated systems, emphasized that “we need to fight not only current problems, but also predict what we’ll face in a few years.” For a year, this task has been pursued through a project that GRFC employees call “an automated system for the comprehensive analysis of media materials and the search for points of information tension on the global internet, ‘Vepr’,” or AS Vepr for short (“vepr” is Russian for “wild boar”).

Vepr’s main duties are to analyze material from social networks and the mass media; to identify, based on that analysis, the so-called points of information tension (by which the study’s authors mean the spread of publications that can provoke a public reaction); to build a forecast model of social and political dynamics; and to predict how information will spread and undergo “the conversion [of information] into an information threat,” so that the data can then be handed over to the “power structures.”

The GRFC commissioned a team of experts, researchers, and engineers from the Moscow Institute of Physics and Technology (MIPT), led by Konstantin Vorontsov, head of the Department of Machine Learning and Digital Humanities, to explore the possibility of creating such a system. According to a report prepared by Vorontsov’s team, before starting work they studied existing methods of internet censorship. The researchers were most interested in China’s experience, because “to date, China’s internet censorship program can be considered the most complex in the world. In this regard, the country has even begun to export its technology to other countries such as Cuba, Zimbabwe and Belarus.” Now Russian developers are trying to create an equally complex system for total surveillance and censorship of the internet in Russia.

As planned by Roskomnadzor, Vepr should first of all focus on:

  • protest moods and facts regarding the destabilization of Russian society (for example, on the topics of territorial integrity, ethnic hatred, migration policy, etc.);
  • negative attitude towards leading state figures, state structures and interstate organizations;
  • “fakes” about leading state figures, as well as about the state and the country as a whole;
  • manipulation of public opinion and polarization of society (for example, topics on the non-systemic opposition, sanctions pressure, etc.);
  • the undermining and discrediting of “traditional values”.

According to Vepr’s technical documents, the focus on precisely these areas stems from “the task of overtaking the information initiative. [...] The experience of the mid-1980s in the USSR (so-called perestroika) showed that ‘sleeping’ points of information tension tend to grow rapidly if they are activated and deliberately promoted. To respond to threats, complete information on each point of information tension is needed, in order to ensure rapid decision-making processes.”

To work on these topics, Vepr needs to know whom it’s protecting (for example, Vladimir Putin), who’s violating the prohibitions (for example, independent investigative journalists), and what specific threat the violators pose (for example, reporting socially significant information about the president that he’s trying to cover up). As stated in the technical documents, “when developing, the threat and violator model is subject to agreement with the FSB [Federal Security Service] and FSTEC [Federal Service for Technical and Export Control].”

Having received data on who is a friend (or enemy) of the regime and what agenda to follow, Vepr should produce a forecast of how journalists and social media users may react. To do this, Roskomnadzor wants Vepr to build “a complete picture of the involvement of society with the social characteristics of individuals,” along with psychological portraits, compiled from social networks, of those who distribute information. “If the source is the media, its funding needs to be checked for compliance with the activities of a foreign agent. It’s important to note that the main work on preventive counteraction should be carried out with the [distributor] of information, and not its consumers. It’s necessary to deal with the source of information tension,” Vepr’s technical documents say.

To receive the necessary flow of information for analysis, GRFC employees plan to create a bot farm: a mass of fake accounts through which they can gain access to closed communities on social networks.

“A sense of being part of something big”

According to the plan, Vepr should be operational by the end of 2024. However, due to the war in Ukraine, there may be delays. In one email, Denis Kasimov, head of the digital transformation department, wrote that it’s difficult to predict exact deadlines due to “sanctions pressure in the current economic environment.” According to Kasimov, there aren’t enough specialists for the task. “Experts who can perform these works are currently involved in fulfilling especially important requests from government agencies of the Russian Federation in the context of the ongoing special operation of the Russian Armed Forces in Ukraine,” the email says.


It was difficult to attract good IT specialists to cooperate with Roskomnadzor even before the war. In October 2020, Igor Ivanov, an employee of the GRFC, asked his colleague Ivan Zuev to “try to make contacts on a friendly, altruistic basis” with several experts in neural networks. To this message, Zuev replied that they “most likely will be told to fuck off,” because “there was no money allocated, we definitely don’t have technology solutions that are interesting for them, the image of Roskomnadzor among IT people plays against their interest in us.”

Zuev's deputy, Alexander Mitkin, offered his own suggestions on how to lure experts into censorship projects. For example, to promise them “participation in projects of a national scale, including behind the scenes ones — a ‘shared secret’ with a sense of being part of something big,” and “their name in reports for the head of Roskomnadzor, and higher,” “lobbying them for projects in Roskomnadzor and other enterprises we work with (E.Soft, Rostelecom, etc.),” “a chance to meet the ‘right’ people of power,” “roundtable invitations,” and most importantly — “our friendship.”

“MIR”: even more control

Vepr is only part of a complex censorship machine that Roskomnadzor is implementing.

In general terms, its architecture will look like this: a general crawler [a program that automatically collects information on the internet] gathers texts, audio, images, and videos from social networks, media, and search results, and these files then go to the Unified Analysis Module (UAM). Using neural networks, the UAM should, first, identify prohibited information and, second, produce forecasts and analytics (Vepr’s responsibility).
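The leak describes this pipeline only in diagrams and planning documents. Purely as an illustration of the architecture those documents sketch (a crawler feeding a central analysis module), here is a minimal mock-up in Python; every name, type, and rule in it is our hypothetical stand-in, not Roskomnadzor’s code.

```python
# Hypothetical sketch of the described crawler -> UAM pipeline.
# Nothing here is from the leaked documents; names and logic are illustrative.
from dataclasses import dataclass

@dataclass
class Item:
    source: str      # e.g. "social_network", "media", "search_results"
    media_type: str  # "text", "audio", "image", "video"
    payload: str     # the collected content (text here, for simplicity)

def crawl(sources: list[str]) -> list[Item]:
    """Stand-in for the 'general crawler' that collects material."""
    # A real crawler would fetch posts, articles, and search results here.
    return [Item(source=s, media_type="text", payload="...") for s in sources]

def classify_prohibited(text: str) -> bool:
    return False  # placeholder for a trained classifier

def estimate_tension(text: str) -> float:
    return 0.0    # placeholder for a forecasting model (Vepr's role)

def unified_analysis_module(item: Item) -> dict:
    """Stand-in for the UAM: (1) flag 'prohibited' content, (2) emit analytics."""
    return {
        "flagged": classify_prohibited(item.payload),
        "tension_score": estimate_tension(item.payload),
    }

for item in crawl(["social_network", "media", "search_results"]):
    print(unified_analysis_module(item))
```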

AS MIR, using neural networks, should search texts for information prohibited by the authorities; Vepr should predict “points of information tension” and threats of protests
SCREENSHOT FROM INTERNAL GRFC PRESENTATION

The information system for monitoring internet resources (MIR), based on natural language processing (NLP) technologies, should find prohibited information in texts. According to the developers’ plan, the system should be able to:

  • identify the names of people, places, and organizations, and the tone (negative, positive, or neutral) in which they are mentioned;
  • sort messages by story, topic, and heading;
  • look for mirrors of blocked sites and reprints of content;
  • track the spread of content from its original source;
  • predict the spread of content and its traffic;
  • identify instances of “opinion manipulation” and “stimulation of opinion polarization”;
  • predict the socio-demographic characteristics of a publication’s audience: its distribution by gender, age, education, and income level.
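The first item on this list, pulling out named entities and the tone around them, is a standard NLP task. A minimal sketch using the open-source spaCy library shows the idea; the toy tone lexicon and the example sentence are ours, and nothing here reflects the GRFC’s actual toolchain.

```python
# Minimal sketch: find named entities and attach a crude tone label.
# Uses the open-source spaCy library; NOT the GRFC's actual stack.
import spacy

# Assumes: pip install spacy && python -m spacy download en_core_web_sm
nlp = spacy.load("en_core_web_sm")

NEGATIVE = {"corrupt", "failed", "disastrous"}   # toy lexicon, for illustration
POSITIVE = {"successful", "heroic", "brilliant"}

def entities_with_tone(text: str) -> list[tuple[str, str, str]]:
    doc = nlp(text)
    words = {token.text.lower() for token in doc}
    if words & NEGATIVE:
        tone = "negative"
    elif words & POSITIVE:
        tone = "positive"
    else:
        tone = "neutral"
    # Keep people (PERSON), places (GPE), and organizations (ORG).
    return [
        (ent.text, ent.label_, tone)
        for ent in doc.ents
        if ent.label_ in ("PERSON", "GPE", "ORG")
    ]

print(entities_with_tone("The corrupt governor of Ryazan met with Gazprom."))
```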

It was planned that by 2023 the neural networks would be able to find texts containing “calls for the violent overthrow of power,” “insults to the president,” “fakes about the president and the state,” and “propaganda of non-traditional sexual relations.”

In the summer, developers began training the neural networks to search for opposition content. Specialists from the monitoring department labeled materials (for example, ones containing calls for “riots”) so that the neural network could later find such messages on its own.
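This markup is ordinary supervised learning: humans label examples, and a model generalizes from them. A generic sketch of the approach, with invented toy data rather than anything from the leak, might look like this:

```python
# Generic supervised text classification: human-labeled examples -> model.
# The toy texts and labels are invented for illustration; not from the leak.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

texts = [
    "everyone come to the square tonight",  # labeled as a call to protest
    "great recipe for borscht",             # labeled as benign
    "we must take to the streets",          # call to protest
    "match review: 2-1 in extra time",      # benign
]
labels = [1, 0, 1, 0]  # 1 = flagged category, 0 = benign

model = make_pipeline(TfidfVectorizer(), LogisticRegression())
model.fit(texts, labels)

print(model.predict(["join us on the square at noon"]))  # likely [1]
```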

However, nothing in the leaked materials shows that the Unified Analysis Module’s neural networks can already find those types of “violations.” It is only mentioned that the UAM finds prohibited information about drugs, suicide, child pornography, ISIS, and “Right Sector” in Yandex search results.

| Type of violation | Type of information | Accuracy of UAM (January 2022) | Accuracy of linguistic dictionaries on social networks (January 2022) | Expected accuracy of the combined system (December 2022) |
| --- | --- | --- | --- | --- |
| Narcotics | Text | 72% | 78% | |
| Suicide content | Text | 60% | 50% | |
| Child pornography | Text | 79% | 34% | 65%+ |
| ISIL | Text | 14% | 27% | |
| Right Sector | Text | 20% | 30% | |
| Hizb ut-Tahrir | Text | Analysis underway (as of 16.02) | 43% | |
The percentage of violations found automatically by the Unified Analysis Module (neural network) and by dictionaries (the traditional method) and then confirmed by a human. From an internal GRFC presentation.

So far, none of the other functions mentioned in MIR’s documentation (searching for mirror sites, tracking how information spreads, identifying examples of “opinion manipulation,” and other grandiose plans) have been implemented.

“Oculus”: recognizing photos of anti-government demonstrations, memes with Putin, and men wearing makeup

Calls for demonstrations, insults to the president, and other content dangerous to the authorities that takes the form of pictures and photos is currently monitored manually. To fix this, Roskomnadzor plans to add image and video recognition to the Unified Analysis Module: finding violations, extracting metadata (time and place of publication, author), and identifying people in photos and videos. The Oculus system is responsible for this; its development is supervised by Konstantin Zudov, head of the experimental work department of the scientific and technical center.

The research describing the capabilities of artificial intelligence for censoring images and videos was carried out by employees of MIPT’s laboratory of AI-based business solutions, led by Dmitry Velichkin.

The system must analyze 200,000 images per day, which works out to more than two images per second around the clock. In 2022–2024, 445 million rubles are planned to be spent on the development of Oculus.

In August 2022, the department commissioned the development of the system from the Russian company Eksikyushn RDC for 58 million rubles. Experts said at the time that it was impossible to implement a system of such complexity in so short a time (by December 2022), or at that cost.

The internal annex to the terms of reference for Oculus specifies which violations it should find in pictures and videos on the internet. In addition to information about terrorism, drugs, and suicide methods, the system should detect calls for demonstrations (and approval of them), “justification of, and calls for, the violent overthrow of power,” as well as insults to the president (“photoshops, demotivators, cartoons, caricatures, sexual insinuations”), obscene vocabulary directed at him, and “comparing the president to negative characters and condemning activities (e.g. Hitler, werewolf, dictator, racist, traitor).”

The document notes that the paragraphs on “justifying and calling for the violent overthrow of power,” on insulting the president, and on accusing him of extremism were all added on February 17, 2022, a week before the start of the full-scale Russian invasion of Ukraine.

Also on the list of violations is “demonstration of the attractiveness of the image of representatives of the LGBT culture” and “images of persons that don’t correspond to the traditional image of a man and a woman (for example, masculine female faces, men wearing make-up).”

In internal presentations dedicated to Oculus, recognition of protest activity is indicated as the main goal.

The goal of creating the Oculus image recognition system is to find protests in photos and videos and identify their participants
INTERNAL GRFC PRESENTATION (FEBRUARY 2022)

In September 2022, an employee of the monitoring department sent a folder called “Materials on Oculus” to a colleague. It contains examples of photoshopped images of Putin and notes the need to track pictures not only of him but of all members of the government. The folder also holds a dictionary that will be used to automatically recognize, for example, accusations of extremism against the president and support for the overthrow of the authorities.

Dictionary entry on the topic “accusing the president of extremism”
GRFC INTERNAL DOCUMENTS

The leak contains no information about Oculus being launched. Judging by GRFC employee correspondence, in the summer of 2022 employees were actively labeling data sets for training the Oculus neural network, even during holidays.

In February 2022, Alexander Fedotov, head of the scientific and technical center, and Roman Korostashov, head of the analysis department, demonstrated a mock-up of Oculus. According to their statements, the system recognized, for example, wrist cuts, prohibited symbols, and train surfing [riding on the outside of a train, clinging to the cars via stairs, footboards, etc.], and identified a masked person. They didn’t show any results related to identifying protest activity.

According to the GRFC’s plans, by 2024 Oculus must learn to classify actions not only in photos but also in videos. Again, it should recognize protests, as well as seriously life-threatening actions such as self-harm (cuts, strangulation), train surfing, school shootings, and fights. The leaked documents make no mention of advances in video recognition.

Roskomnadzor’s plans also include “recognition of complex multimodal media materials” (posters, comics, and memes), since they may contain prohibited information “both directly and indirectly.” The authors admit, however, that this is difficult, since “automated monitoring using AI [artificial intelligence] requires a contextual understanding of internet culture: recent events, political views and cultural beliefs, since memes often refer to other memes or other online events.” The GRFC plans to complete its research on finding violations in memes in 2024.

“100 violation cards per day minimum.” How existing monitoring systems work

Today, GRFC employees monitor social networks, media, and websites daily, both manually and with the help of software. Some are responsible for the media, others for social networks and websites.

For the mass media, an automatic system for monitoring means of mass communications (AS MSMK) is used. The list of monitored media comes from Roskomnadzor.

The leaked documents show that AS MSMK finds potential violations via keywords on various topics (suicide, extremism, calls for protests, “fakes” about the war in Ukraine, “foreign agents,” etc.). Every day the system compiles an array of cards with alleged violations. An operator reviews the content and comments and decides whether they contain violations. If so, the operator registers them; if not, he rejects the card. Cards with violations confirmed by the operator automatically go first to the GRFC’s examination department and then to Roskomnadzor.
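Functionally, this is a keyword filter feeding a human review queue. Here is a schematic sketch of the workflow as the documents describe it; the keyword list, data structures, and URL are invented for illustration.

```python
# Schematic of the described card workflow: keyword match -> card -> human review.
# Keyword lists, field names, and the example URL are invented for illustration.
from dataclasses import dataclass
from typing import Optional

KEYWORDS = {"protest", "rally"}  # hypothetical dictionary for one topic

@dataclass
class Card:
    url: str
    snippet: str
    topic: str
    confirmed: Optional[bool] = None  # None until an operator reviews it

def scan(url: str, text: str) -> Optional[Card]:
    """Create a card if any dictionary keyword appears in the text."""
    if any(word in text.lower() for word in KEYWORDS):
        return Card(url=url, snippet=text[:80], topic="demonstrations")
    return None

def operator_review(card: Card, is_violation: bool) -> Card:
    """A human either confirms (registers) or rejects the card."""
    card.confirmed = is_violation
    return card

card = scan("https://example.org/post/1", "They announced a rally downtown.")
if card:
    operator_review(card, is_violation=True)  # registered cards go up the chain
    print(card)
```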

Dictionary analysis is inaccurate and “requires high labor costs,” because operators have to manually cross-check a great deal of material, the GRFC acknowledges. The reports that new employees file at the end of their probationary period give a sense of the workload: one information analysis specialist reported in July 2022 that she had drawn up at least 100 cards of suspected violations per day, manually entered at least 40, and also managed to monitor the internet “to identify banned anime films.”

Since 2022, the system automatically receives not only text content, but also transcriptions of radio and television broadcasts.

For social network surveillance, an automated system for monitoring and analyzing social media (AS MASM) is used. Since 2022, it’s been merged with the Chisty [clean] Internet (AS CI) system, which censors Yandex search results.

As with the media, some violations on social networks are searched for manually, while others are found automatically and then verified by a human. For example, MASM automatically searches for materials related to “fake news” about the war in Ukraine and anti-war demonstrations.

Violations are automatically monitored only on the social networks VKontakte, Odnoklassniki, Moi Mir, Otvety.Mail.ru, LiveJournal, and YouTube. Other platforms (Instagram, Facebook, Twitter, TikTok, Telegram, Rutube) are monitored manually by GRFC staff; automation there is so far only planned.

For that purpose, starting in June 2022, Roskomnadzor planned to conclude a contract with the company Kribrum, owned by Natalia Kasperskaya and Igor Ashmanov, who cooperate with the Russian authorities and support both censorship and the war in Ukraine.

Ultimately, all the projects Roskomnadzor is developing for analyzing media materials, social networks, and search results are planned to be merged into a single system, centered on the AI-based Unified Analysis Module.

Planned operation scheme of the monitoring systems
SCREENSHOT FROM INTERNAL GRFC DOCUMENTS

The leaked documents show that Roskomnadzor’s plans for total, AI-driven censorship of the internet are still very far from realization. But it’s obvious that as new functions and systems are introduced, the scale of surveillance of those who dare to speak out against Putin’s regime will grow.

“A great excuse to steal from the budget”

The GRFC holds expert conferences on artificial intelligence several times a year. Representatives of the industry, scientists and officials gather and make presentations. We spoke with one of the council participants, an expert in the field of machine learning, on condition of anonymity.

He said that these conferences can be considered “somewhat educational for an internal audience”: industry experts give reports on technologies, government representatives “talk about how cool they are, making use of the most fashionable words of the season [like ‘artificial intelligence,’ ‘neural networks,’ ‘computer vision’ and others], and the management becomes inspired and allocates the budget.”

According to our source, the GRFC’s dream of introducing total censorship based on artificial intelligence is theoretically feasible but unreasonably expensive. “To do this, you need to build several teams: data collection and labeling, monitoring teams, engineering teams, managers, and many others. And of course, to provide it with its own data center with the latest video cards (expensive). It’s a great excuse to steal from the budget. This approach can’t compete with the alternative: hiring several hundred moderators to manually monitor social networks for peanuts.”

For example, even the task of finding offensive pictures of Putin would require a lot of resources. “It’s easy to develop a simple classifier inside VKontakte that determines that the president is in the picture and that the picture has a meme context (captions and so on), even with VKontakte’s internal tools,” the expert continues. “But in order for this to work constantly at the level of an entire social network, a significant part of the VKontakte team needs to be diverted to this task. And making this a solid technology that works across a large list of social networks, messenger apps, and websites is, rather, a reason to get an even larger budget. A budget that will be spent on who knows what.”
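In principle, the kind of “simple classifier” the expert describes could be assembled from off-the-shelf parts: face matching plus detection of overlaid caption text. Here is a rough sketch using the open-source face_recognition and pytesseract libraries; the file names are hypothetical, and this is our illustration of the general technique, not anyone’s production code.

```python
# Rough sketch of the expert's "simple classifier": does a known face appear,
# and does the image carry caption-like text (a crude "meme context" signal)?
# Built from open-source parts as an illustration; file names are hypothetical.
import face_recognition  # pip install face_recognition
import pytesseract       # pip install pytesseract (requires Tesseract installed)
from PIL import Image

def load_reference_encoding(path: str):
    """Encode the reference face once (e.g. from an official portrait)."""
    image = face_recognition.load_image_file(path)
    encodings = face_recognition.face_encodings(image)
    return encodings[0] if encodings else None

def looks_like_target_meme(image_path: str, reference_encoding) -> bool:
    image = face_recognition.load_image_file(image_path)
    faces = face_recognition.face_encodings(image)
    face_match = any(
        face_recognition.compare_faces([reference_encoding], face)[0]
        for face in faces
    )
    caption = pytesseract.image_to_string(Image.open(image_path)).strip()
    # Flag only when the target face appears AND there is caption-like text.
    return face_match and len(caption) > 10

ref = load_reference_encoding("reference_portrait.jpg")      # hypothetical file
if ref is not None:
    print(looks_like_target_meme("candidate_post.jpg", ref))  # hypothetical file
```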

Our source treats the Vepr project, which is supposed to predict future “information threats” and protest moods, with particular skepticism: “I wouldn’t worry too much that such a system will be implemented. Our industry has low-hanging fruit like online ad optimization. Multi-million dollar profits are promised to anyone who can even slightly optimize a mundane task like that. And here they want a forecast of social and political issues based on posts on social networks. It seems flipping a coin would be more truthful than the predictions of a system like that.”

Roskomnadzor, GRFC and Brand Analytics didn’t respond to a request from IStories and Süddeutsche Zeitung for comment on the leaked materials.

You can find out more about this leak in other IStories publications:

  • Who is in Roskomnadzor’s sights and why: potential “foreign agents” and opinion leaders, the media, IT giants, messenger apps, and people close to power.
  • How Roskomnadzor monitors negative publications about the Russian president and other topics dangerous to the authorities in order to send reports to the security forces.