A large number of scientists, programmers, teachers and students work on the Corpus. All of them can be divided into several working groups, each of them performs a specific task. Below we have listed the main working groups and described the current "vacancies" for each of them. In parentheses, you will find an indication of which professional field is most likely to be engaged in this; however, these are not strict requirements and we are discussing any of your suggestions. So be sure to write!

Site development and administration

  1. Web design of the search page (programmers)
  2. Development of the mobile version of the Corpus (programmers)

Preparing the corpus data

  1. alignment of parallel texts (Sinologists)
  2. search for new parallel texts in Goyu (國語, Taiwan), Huayu (华语, Singapore) (Sinologists)

The corpus of translations from the Wenyan language (文言)

  1. Search for new parallel texts in Wenyan (Sinologists)
  2. Search and development of related technologies (Sinologists)

Educational environment based on the Corpus

  1. Creation of an algorithm for automatic simplification of Russian and Chinese texts for a particular level of language knowledge (programmers, linguists, Sinologists, Russian as a foreign language)
  2. Creating additional features in the corpus - audio and video examples for quotations from books contained in the corpus (programmers, Sinologists, Russian as a foreign language)
  3. Creation of methodological manuals on Russian as a foreign language and Chinese as a foreign language (linguists, Sinologists, Russian as a foreign language)

Development of a new layout for the Corpus

  1. Creating an algorithm that would highlight the most likely translation of a particular word (programmers, linguists, Sinologists)
  2. Checking the existing annotation for texts in traditional and variant Chinese characters (Sinologists, programmers)

Research tasks for the future

  1. Creating an algorithm that will automatically generate texts of different styles
  2. Research on translation studies, Chinese stylistics and philology

SMM and popularization

  1. SMM for TikTok
  2. SMM for WeChat (native Chinese speaker)

If you have any questions about working together, write to the project coordinator Kirill Semenov (