HSE Researchers Teach Neural Network to Distinguish Origins from Genetically Similar Populations

Researchers from the AI and Digital Science Institute, HSE Faculty of Computer Science, have proposed a new approach based on advanced machine learning techniques to determine a person’s genetic origin with high accuracy. This method uses graph neural networks, which make it possible to distinguish even very closely related populations.
Over the past 10–15 years, genetic analysis has become increasingly popular not only as a tool for medical diagnostics, but also as a means of ancestry research. DNA testing allows people to learn more about their ethnic background, identify the places where their ancestors lived, and determine the number of Neanderthal mutations in a person’s genome.
This has become possible thanks to the development of modern technologies—such as genotyping, data storage and processing systems, and machine learning—and the significant reduction in their cost. However, current testing methods are unable to differentiate between genetically similar populations that have lived in adjacent regions for extended periods.
Researchers from the AI and Digital Science Institute have developed a method for distinguishing between individuals from closely related populations. At the heart of this technology are graph neural networks, which do not rely on DNA sequences but instead use graphs to represent genetic links between individuals with shared genome segments. These shared segments indicate the degree of kinship between people, revealing how many generations back their common ancestors lived. The more overlaps there are, the closer their ancestral connection is. In the model, each person is represented by a vertex in the graph, and the strength of the connection between them is indicated by the edges in the graph.
The method was tested on data from various regions. The results were particularly insightful for the population of the East European Plain, as a large dataset had already been compiled there. The graph neural network was able to accurately determine the population affiliation of individuals from genetically similar ethnic groups.
Aleksei Shmelev
‘Existing methods of genetic analysis address a different task: they identify affiliation with large, isolated groups, such as determining whether someone has French, German, or English ancestry. Our method enables the analysis of closely related populations, which is particularly relevant for Russia, a country with a diverse ethnic background,’ said Aleksei Shmelev, one of the study's authors and Research Assistant at the HSE International Laboratory of Statistical and Computational Genomics, AI and Digital Science Institute.
In their future work, the researchers aim to train the neural network to predict the proportion of different populations within a genome.
They have named their development AncestryGNN, which stands for 'Neural Network-Based Prediction of Population Affiliation via Shared Genome Segments.’
Vladimir Shchur
As noted by Vladimir Shchur, Head of the International Laboratory of Statistical and Computational Genomics at the AI and Digital Science Institute, HSE University, the proposed method holds great potential for more accurate understanding of human history and can be applied in genealogy and anthropology research.
This research was supported by a grant from the Government of the Russian Federation as part of the federal program ‘Artificial Intelligence.’
See also:
HSE and Yandex Propose Method to Speed Up Neural Networks for Image Generation
A team of scientists at HSE FCS and Yandex Research has proposed a method that reduces computational costs and accelerates text-to-image generation in diffusion models without compromising quality. These models currently set the standard for text-to-image generation, but their use is limited by high computational loads, the company said in a statement.
HSE Scientists Identify Effective Models for Training Research Personnel for Industry
Experts from the HSE Institute for Statistical Studies and Economics of Knowledge have examined industrial PhD programmes across 19 countries worldwide. The analysis shows that the key components of an effective model include co-funding by universities, industry, and government; dual academic supervision; and flexible intellectual property arrangements. The findings have been published in Foresight and STI Governance.
HSE Biologists Identify Factors That Accelerate Breast Cancer Recurrence
Scientists at HSE University have identified a molecular mechanism underlying aggressive breast cancer. They found that the signals supporting tumour growth originate not from the tumour itself but from its microenvironment. The researchers also demonstrated that reduced levels of the IGFBP6 protein in the tumour microenvironment lead to the accumulation of macrophages—immune cells associated with a higher risk of cancer recurrence. These findings already make it possible to assess patient risk more accurately and may, in the future, enable the development of drugs that target cells of the tumour microenvironment. The study has been published in Current Drug Therapy.
HSE University and Moscow DIT Partner to Advance 5G and 6G Networks
The Moscow Department of Information Technology and HSE University have signed a cooperation agreement in the field of innovative development of the capital’s IT infrastructure. The parties agreed on joint research into modern and promising communication technologies, including 5G and 6G, as well as AI, the Internet of Things, and other smart city technologies.
HSE University Presents Research Results at AI Conference in Oman
In April 2026, the International Conference on Intelligent Systems and Artificial Intelligence Applications (ISAA 2026) was held at the University of Nizwa in the Sultanate of Oman. The event was co-organised by HSE University, the University of Nizwa, and the University of Technology and Applied Sciences–Ibri. Researchers from HSE University were among the key speakers at the conference.
Russian Scientists Propose Method to Speed Up Microwave Filter Design
Researchers at HSE MIEM, in collaboration with colleagues from the Moscow Technical University of Communications and Informatics (MTUCI), have implemented a novel approach to designing microwave filters—generative synthesis using machine learning tools. The proposed method reduces the filter development cycle from several days to just a few minutes and in the future could be applied to the design of other microwave electronic devices. The results were presented at the IEEE International Conference '2026 Systems of Signals Generating and Processing in the Field of on Board Communications.'
Scientists Find That Only Technological Innovations Consistently Advance Environmental Sustainability
Renewable energy and labour productivity do not always contribute to environmental sustainability. Technological innovation is the only factor that consistently has a positive effect. This is the conclusion reached by an international team of researchers, including Natalia Veselitskaya, Leading Research Fellow at the HSE ISSEK Foresight Centre. The study has been published in Sustainable Development.
HSE’s CardioLife Test Among Winners of Data Fusion Awards 2026
The CardioLife genetic test—a development by the Centre for Biomedical Research and Technologies of the AI and Digital Science Institute at HSE University’s Faculty of Computer Science—has won the All-Russian cross-industry Data Fusion Awards, which recognise achievements in data and AI technologies. The project took first place in the Science–Business Partnership category, demonstrating a successful model for transferring technology from university research into the real healthcare sector.
HSE Researchers Train Neural Network to Predict Protein–Protein Interactions More Accurately
Scientists at the AI and Digital Science Institute of the HSE Faculty of Computer Science have developed a model capable of predicting protein–protein interactions with 95% accuracy. GSMFormer-PPI integrates three types of protein data (including information about protein surface properties) to analyse relationships between proteins, rather than simply combining datasets as in previous models. The solution could accelerate the discovery of disease molecular mechanisms, biomarkers, and potential therapeutic targets. The paper has been published in Scientific Reports.
HSE University Installs Geoscan Station at IIT Bombay
A Russian ground station for receiving SONIKS satellite data has been installed on the campus of the Indian Institute of Technology Bombay (IIT Bombay). Developed by Geoscan, the system will become part of a mirror laboratory project run jointly by HSE University and one of India’s leading universities.


