Epidemiology of Rare Hereditary Diseases in the European Part of Russia: Point and Cumulative Prevalence

The issue of point prevalence, cumulative prevalence (CP), and burden of rare hereditary diseases (RHD), comprising 72–80% of the group of rare diseases, is discussed in many reports and is an urgent problem, which is associated with the rapid progress of genetic technology, the identification of thousands of genes, and the resulting problems in society. This work provides an epidemiological analysis of the groups of the most common RHDs (autosomal dominant, autosomal recessive, and X-linked) and their point prevalence (PP) and describes the structure of RHD diversity by medical areas in 14 spatially remote populations of the European part of Russia. The total size of the examined population is about 4 million. A total of 554 clinical forms of RHDs in 10,265 patients were diagnosed. The CP for all RHDs per sample examined was 277.21/100,000 (1:361 people). It is worth noting that now is the time for characterizing the accumulated data on the point prevalence of RHDs, which will help to systematize our knowledge and allow us to develop a strategy of care for patients with RHDs. However, it is necessary to address the issues of changing current medical classifications and coding systems for nosological forms of RHDs, which have not kept pace with genetic advances.


INTRODUCTION
The problem of rare diseases (RDs) and their number, birth, point, and cumulative prevalence are actively discussed by many researchers, and this is important for public health and society. Criteria for the definition of "rare diseases" differ from country to country depending on legislation. A review by Richter et al. (2015) provides data on 296 definitions of RDs from 1,109 organizations (Richter et al., 2015) and confirms their quantitative differences in different countries. European legislation defines a prevalence threshold of 1 per 2,000 persons. The United States in 1983 defined the threshold for RD as <200,000 affected people in the country (currently 1 in 1,800 people). Japan considers any disease affecting less than 50,000 people in the country as rare, which is equivalent to less than 1 in 2,500 people. In Russia, one patient per 10,000 people in the population is a sufficient measure for a disease being rare (European Union (EU), 2000; Donnart et al., 2013;Richter et al., 2015;Ferreira, 2019;Wakap et al., 2020). RDs are diagnosed in all fields of medicine and occur in all demographic groups (Pariser and Gahl, 2014).
There is a variation and steady increase in the reported RDs according to the main available sources. The Online Mendelian Inheritance in Man (OMIM) database contains 6,806 phenotypes with known genetic nature 1 . According to the Orphanet portal of RDs, about 10,500 RDs are currently registered 2 . A very detailed analysis of the known number, point, and cumulative prevalences of RDs has been performed by several teams (Ferreira, 2019;Wakap et al., 2020). Reviews on RD number analysis and point prevalence estimation cite data from the "Epidemiology section of Orphanet 3 ". It has been shown that 84.5% of the analyzed diseases from the Orphanet database have a point prevalence less than 1 per 1,000,000. However, 77.3-80.7% of the burden of RDs in the population falls on a limited number of diseases, representing only about 4.2% of all identified diseases, with a point prevalence of one to five per 10,000 of the population (Wakap et al., 2020). Ferreira estimates the burden of RDs with manifestation at different periods of life as 6.2% of the total population (Ferreira, 2019). A more conservative estimate by Wakap et al. (2020) demonstrates the cumulative prevalence of RDs in the population to be 3.5-5.9%.
According to various researchers, 72-80% of RDs have a genetic cause, some of which have already been confirmed (Richter et al., 2015;Wakap et al., 2020). The widespread introduction of technologies of whole genome and/or wholeexome analysis into practical healthcare has significantly increased the number of genetically determined diseases in the structure of human morbidity. While by 2012 the molecular nature was identified for 3,650 nosological forms, there has been a greater increase in genetically determined diseases over the past 5 years. According to annual observations of OMIM statistics in 2016, molecular nature was confirmed for 5,888 diseases, in 2017-6,087 (+199), in 2018-6340 (+253), in 2019-6572 (+232), and in 2020-6,800 (+228), i.e., genetic nature is established for an additional 200-250 diseases each year (see text footnote 1). Most of the newly reported forms are rare and found in single families.
Since genetic diseases constitute a high percentage in the RD group, it is advisable to assess the point prevalence of monogenic hereditary diseases in the modern population based on actual data from a specific population survey. This article provides an epidemiological analysis of the point prevalence of rare monogenic hereditary diseases (RHDs) in geographically remote populations of the European part of Russia.

RESULTS
The population of 96 rural areas, 86 small towns, and urban-type settlements were surveyed during this study. Data on patients with presumptive RHDs were obtained using a questionnaire (Zinchenko et al., 2020b) from medical workers from 125 medical clinics and 2,056 rural ambulant clinics. More than 45,000 patients with various presumably hereditary conditions, including patients with structural chromosomal changes, multiple congenital malformations, and isolated anomalies, were examined in total. A total of 554 clinical forms of RHDs were verified in 10,265 patients (including 4,270 children patients). Table 1 shows the number of identified patients with RHDs in the regions and the variation of the cumulative prevalence for administrative districts within particular regions.
The cumulative prevalence for all RHDs in the sample examined was 277.21 per 100,000 (Zinchenko et al., 2001a(Zinchenko et al., ,b, 2007, 558.71 per 100,000 children (Zinchenko et al., 2019(Zinchenko et al., , 2020a. The differentiation in the values of cumulative prevalence by region was explained by the peculiarities of the genetic structure of various populations. In the sample under consideration, the main factor of microevolution is genetic drift, migration processes with little influence of natural selection. These results were demonstrated in studies on the role of the genetic structure in the formation of cumulative prevalence in every population (Zinchenko et al., 2000(Zinchenko et al., , 2020b. The number of familial cases averaged 57.82%. There were no statistically significant differences between the cumulative prevalence of AD and AR pathology in the groups of men and women (p > 0.05).
After that, we have analyzed the point prevalence values and the number of diseases by groups in the surveyed populations ( Table 2). The highest number of patients was found in point prevalence class 1 (1:50,000 and more) (59.46%) with the autosomal dominant (AD) type of inheritance-the autosomal recessive (AR) type of inheritance was observed in 34.66% of patients, the X-linked (XL) type of inheritance in 7.88%. The  same ratio is observed in the analysis of the number of detected diseases: with the AD type of inheritance-48.01%; with AR-41.34%; and with XL-10.65%. Analysis of the distribution of the number of patients according to the point prevalence values showed that 33 diseases accounted for most of the patients (59.86%), representing only 5.96% of the total number of diseases (Figure 1). The class of RHDs with a point prevalence of "1:500,001 and less" represents 350 diseases (63.18% of all detected RHDs) with 844 patients (8.22% of the patients): 6.85% with the AD type of inheritance, 10.40% with the AR type, and 8.65% with the XL type. Next, we have analyzed reported diseases and the number of patients with a point prevalence of 1:100,000 or more ( Table 3). Table 3 also presents disease prevalence values (adjusted for gender differences) and point prevalence in the child population. It is worth noting that for congenital diseases and those with manifestation in childhood, the prevalence in the child population is much higher than in the general population. Most of the diseases are from heterogeneous groups but having a similar clinical picture. For example, about 100 genetic forms with similar clinical manifestations of the disease are known for retinitis pigmentosa. In our studies, we also determined a wide heterogeneity for this disease based on NGS studies-we diagnosed 35 clinical genetic variants, and they are combined under the clinical diagnosis "retinitis pigmentosa" in Table 3. Some forms were listed separately, i.e., Stargardt disease, because it has characteristic features and is well diagnosed clinically. The same pattern is observed for Charcot-Marie-Tooth disease, the verification of specific nosological forms in most cases being determined by a molecular genetic study. RHDs vary in prevalence, locus, and allelic heterogeneity depending on the geographic region, which is associated with the specific genetic structure of the population. More than 20 rare genetic variants for Charcot-Marie-Tooth disease associated with different loci and represented by single families have been identified in different regions of Russia (Schagina et al., 2007;Khidiyatova et al., 2013;Dadali et al., 2016;Shchagina et al., 2018Shchagina et al., , 2020Murtazina et al., 2020).
Differential diagnosis of "undifferentiated intellectual disability" is particularly difficult, and work is being done in this direction identifying new genes and genetic variants, most of which we identify in single families (Levchenko et al., 2019). In our sample, the majority of cases were familial.
In Russia, we have identified allelic heterogeneity, and regional and ethnic peculiarities, for most diseases included in newborn screening (Petrova et al., 2016(Petrova et al., , 2019aGundorova et al., 2018). Our studies identified the highest prevalence of phenylketonuria around the world (birth prevalence 1:850 newborns) in one region of the Russian Federation, and it is associated with the spread of a specific mutation (p.R261 * in the PAH gene) associated with the founder effect (Gundorova et al., 2018). The study of allelic heterogeneity is necessary because the varying significance of different genetic variants leads to different clinical courses of diseases and treatment correction (Petrova et al., 2016(Petrova et al., , 2019aGundorova et al., 2018). For some diseases, the analysis of genotype-phenotype correlations and the mutation spectrum has identified peculiarities of the clinical course of the disease (Andreeva et al., 2016;Marakhonov et al., 2019;Petrova et al., 2019b;Vasilyeva et al., 2021).
In addition to RHDs that are frequent for all populations of the Russian Federation and Europe, regionally specific diseases have been identified. Our study identified diseases endemic to specific regions of the Russian Federation-several types of hypotrichosis (Kazantseva et al., 2006;Zernov et al., 2016), osteopetrosis (Bliznetz et al., 2009), and a number of RDs identified in the Russian Federation and worldwide in single cases-primary microcephaly (Marakhonov et al., 2018), gnathodiaphyseal dysplasia (Andreeva et al., 2016), metatropic dysplasia , etc.
The assessment of the cumulative prevalence and burden of RHDs according to the common medical classification of diseases is important for public health and public policy in Russia and in European countries.
We analyzed the diversity and point prevalence of RHDs according to the conventional classification of the disease by types of affected organs and systems according to their main clinical manifestations-neurological syndrome, ophthalmic syndrome, genodermatoses, skeletal syndrome, hereditary syndrome, and other pathology (hereditary diseases of metabolism, blood, hearing, etc.). Table 3 presents the number of patients, the number of diseases, and the point prevalence of RHDs distributed according to the medical classification of diseases.
Analysis of Table 4 shows that the maximum number of patients (23.56%) was found to have neurological and psychiatric pathology, the cumulative point prevalence of which is 65.30/100,000 or 1:1530 people. The next most numerous groups were hereditary syndromes-18.04% (cumulative point prevalence 50.01/100,000 or 1:2,000) and other pathology (hereditary diseases of metabolism, blood, hearing, etc.)-17.11% (47.42/100,000 or 1:2,109 people). For the first two groups of diseases, the maximum incidence was 18.38 and 36.94%, respectively.
Analysis of the age category of patients with RHDs showed that the majority of patients (about 45.3%) belonged to the category of childhood (from newborn to 17 full years), despite the fact that the share of the child population in the regions is only 20.64%; for the reproductive and post-reproductive age, 54.7% of all patients. The results obtained demonstrate the need to develop preventive programs specifically among children.

DISCUSSION
The issue of point prevalence, cumulative prevalence, and burden of RHDs is discussed in many articles and is an urgent problem that is associated with the rapid progress of genetic technology, the identification of thousands of genes, and the problems in society, public health, and social structures of states. According to OMIM statistics, genetic nature has been determined for 6,806 phenotypes so far, and the genetic nature is established for another 200-250 diseases each year (see text footnote 1). Current medical classifications have not kept pace with genetic progress; nosological forms of RHDs are underrepresented in   coding systems (e.g., International Classification of Disease, ICD-11). Data collection from different researchers lacks a unified methodological approach, which makes comparative analysis difficult. Insufficient organization of the process, lack of knowledge, lack of diagnostic expertise, lack of information on point prevalence, distribution of RHDs by medical areas, and cumulative burden of RHDs prevent the full development of a public health strategy. In addition to public health questions, it is a priority to provide medical care for patients with specific RHDs and to identify the most common RHDs for the necessary prioritization and development of regional, national, and global health programs. This study presents an epidemiological analysis of the point prevalence of RHDs based on actual material based on a total survey of several regions of the European part of Russia. Russia is a multinational country with a population of 146,238,185, which makes it difficult to choose a unified strategy in public health. Fourteen regions (Northern, Central, and Southern Russia) were chosen for the survey and subsequent analysis, both populations with different ethnic extractions, and regions of a single ethnic origin. Selection of different ethnically diverse territories in a multinational country was necessary to be able to identify groups of the most common RHDs, determine point prevalence, and describe the structure of RHDs diversity by medical areas. The study was performed by a single team, and the collection and processing of material remained unchanged throughout the study.
Despite the listed limitations of the method (see section "Features and Limitations of the Method") in our study, every 351 people have a hereditary disease. Remarkably higher values of cumulative prevalence were obtained in childrenthe proportion of children out of the total number of patients with RHDs was 43.3%. This number is remarkably higher than the proportion of children in the general population (20.64%). The main reason for this age distribution is that up to 70% of RHDs manifest in childhood according to the Orphanet database. However, the distribution according to the inheritance type was uneven: 39.17% with AD pathology, 45.24% with AR and 52.28% X-linked. Moreover, the proportion of child patients (of pre-productive age) varied from 38 to 51% in different surveyed populations (Zinchenko et al., 2019(Zinchenko et al., , 2020a. The proportion of patients in the reproductive age (18-45 years) out of the total number of patients with RHDs was 37.2% (from 31.91 to 40% by population); for the post-reproductive age (46 and older), only 17.5% (variation 17-22%) out of all patients (Elchinova et al., 2017). Analysis of the sample showed that some diseases do not occur in older age groups because they have high mortality in childhood and middle age (the rate is not constant and varies depending on the population, the causing gene, and the mutation). The lower point prevalence in the older age group is mostly due to a milder and stable course of a limited number of RHDs, with fitting approaching 1, which reduces their relevance and referral to medical facilities.
The number of familial cases and individual cases in family (including sporadic cases for AD pathology) varied depending on the type of inheritance and the average size of the family in a particular region. Among families with the AD type of inheritance, the familial case rate ranged from 70 to 80% in different populations. In families with the AR type, familial cases were 26-34%; for those X-linked, 15-20%. The distribution of men and women did not differ for autosomal pathology. We assume that due to the limitations detailed in the materials and methods, the data obtained may be an underestimate. However, it should be noted that there was a variation by region and by location. The cumulative prevalence and diversity of RHDs was determined for each region, and the most common RHDs were identified. A general disease registry was then compiled. The analysis showed that a single class of the most common RHDs was identified for all populations, with insignificant variation by region for most diseases. The most significant differences in prevalence were found among the child population for congenital and hereditary diseases with early onset. The largest number of patients was detected in the point prevalence class "1:50,000 and more"-33 clinical forms of RHDs (5.96% of the total number of detected diseases) accounting for 59.86% of the patients. Most of the diseases are from heterogeneous groups of RHDs, and there is locus and allelic heterogeneity by region. The smallest number of patients, 8.22%, was identified in the class of RHDs with a point prevalence of "1:500,001 or less"-350 diseases (63.18% of all identified RHDs). These RHDs were mostly detected in single families. Similar data were obtained by Wakap et al. (2020) when analyzing the Orphanet database epidemiological data (see text footnote 3) for RDs, not all of which are of monogenic nature, but only some of them (70-80%) (Wakap et al., 2020). Their study showed that out of 5,304 diseases-84.5% have a point prevalence <1/1,000,000, and 77.3-80.7% of the burden of RDs in the population accounts for 4.2% (n = 149) of the diseases. The analysis performed in our study on a real contemporary population demonstrates the rarity of most RHDs affecting single families and highlights the difficulty of detection and diagnosis by physicians. Wakap et al. (2020) also highlighted that it would be a benefit to perform an analysis of distribution of diseases in real populations according to the medical classification for organization of real medical care. An analysis of the pattern of diversity of RHDs by medical specialties, which has been performed in the current study, showed a predominance of patients (41.60% of patients) with neurological, psychiatric, and hereditary syndromes with a point prevalence of 115/100,000 people. Most diseases of neurological, psychiatric, and hereditary syndromes; hereditary metabolism diseases; and hereditary blood diseases are characterized by reduced fitness, multisystemic lesions, disability, and reduced life expectancy and quality because of the lack of effective treatment. Most hereditary skin, eye, ear, skeletal, and treatable metabolic diseases could affect quality of life (including possible disability) but have adaptation in society and average life expectancy. The findings require a comprehensive public health approach.

CONCLUSION
In conclusion, the present time is characterized by the accumulation of data on the point prevalence of RHDs, and this will help to systematize our knowledge and allow us to develop a strategy of healthcare for patients with RHDs.

MATERIALS AND METHODS
We analyzed long-term studies (1985-2020) on the epidemiology of rare monogenic hereditary diseases in the European part of Russia. This approach allows us to identify disease peculiarities in a given region and provides insight into the diversity of rare hereditary pathologies and their point prevalence.

Surveyed Population
Russia is a multinational state with more than 190 ethnic groups, with Russians constituting about 80%. The study covered various remote areas of European Russia-Central, Northern, Southern, and the North Caucasus. The relative location of the studied regions is shown in Figure 2. The selection of survey areas in each region was focused on the indigenous population (the history and migration flows of population formation were studied). Table 5 shows the populations surveyed, the size of child populations, the ethnic composition of the populations, number of medical organizations (hospital, medical ambulance, paramedic, and obstetric centers), and the number medical workers (physicians, nurses, paramedics) participated in the study. The total size of the surveyed population was 3,703,018 people from 96 rural districts, including 764,260 children (from newborns to 17 full years of age). The sex distribution of the total surveyed population was 1,716,298 (46.35%) men/1,987,087 (53.66%) women 4 . There was an uneven distribution in age: up to 35 years 1:1 (M/W), 36-50 years 0.8:1 (M/W), and the postreproductive period 0.6:1 (M/W). In total, from the surveyed regions, the study involved 2,181 medical organization and 6,370 medical workers. Each of the studied districts of the particular region has one central district hospital, which includes inpatient and medical ambulance, and several rural ambulances and paramedic-obstetric centers in the villages. The number of rural medical organizations depends on the number of villages in the region and is determined by the Department of Medical Statistics of the hospital. The medical workers permanently residing in the area has full medical information about the attached population.

Survey and Protocol
The survey was carried out by a single team of the Research Centre for Medical Genetics (RCMG) in accordance with the protocol of medical and genetic examination of small populations described previously in detail (Zinchenko et al., 2020b). A survey of the investigated populations was conducted regardless of ethnicity, age, and gender structure. The protocol includes the study of populations through different genetic systems simultaneously: (1) population survey, study of the point and cumulative prevalence of hereditary diseases in the particular population; (2) study of the genetic structure using standard methods of population statistics (analysis of the marriages and migration structure, demographic data, analysis of the frequency of surnames to obtain indicators of random inbreeding through isonymy); and (3) study of the genetic structure through the neutral DNA loci of the nuclear genome.
The survey was conducted in three consecutive steps in accordance with the protocol. At the first step, a lecture course is conducted for all medical workers of each surveyed districts of regions (physicians of different specialties, nurses, paramedics) to explain the goals and objectives of the study. During the lecture, the medical personnel are given an information card-questionnaire ( Figure 3A) with request for filling in the information on patients (Figure 3B), including those at the initial stage of disease and with minimal clinical manifestations 5 . The questionnaire contains easily detectable clinical symptoms of RHDs, almost each of which is characteristic for a group of diseases (isolated, syndromic forms). This protocol allowed the detection of the maximum possible number of nosological forms of RHDs known to date. The card-questionnaire with the listed symptoms allows to reveal practically all "portrait" syndromes. Registration of families presumably with RHDs was done through affected persons in the family. The "Multiple registration" method was used (Cavalli-Sforza and Bodmer, 1999). In addition to the questionnaire card, other sources of information are involved: (i) data on disabled persons (from childhood to adults) in the district provided by the hospital; (ii) information from special schools for the blind and visually impaired, deaf, and hard 5 Transfer of patients' data was performed in accordance with Article 13 of Federal Law No. 323-FL "On the basics of health protection of citizens of the Russian Federation" which allowed the submission of information about patients in the following cases: the exchange of information by medical organizations, including those deposited in medical information systems, for the purpose of providing medical care, taking into account the requirements of the legislation of the Russian Federation (paragraph 8), as well as for the purpose of accounting and control within the mandatory medical insurance system (paragraph 9) [http://docs.cntd. ru/document/902312609]. of hearing and schools for children with intellectual disability; and (iii) data from the genetic counseling unit. Thus, registration of the same patient was possible from several sources of information, i.e., being multiple, but recorded in a single database as one case for further examination. A one-time examination of congenital and hereditary pathology in each region is carried out. In the aggregate, the detection rate of patients from all sources of registration reaches 80%. At the second step, the patients were examined by clinical geneticists from the RCMG. The patient with the presumed RHDs is given a medical card, which contains personal information of the proband and his family members, a brief medical history, genealogical data, and detailed phenotyping data. In the process of data collection, pedigrees and the possibility of consanguinity were analyzed. Consanguinity was revealed in rare cases. If necessary, a cytogenetic study is conducted to rule out chromosomal abnormalities. In complicated cases, additional examinations were prescribed for patients to be able to verify the diagnoses (biochemical, radiological, electromyographic, and other methods). As a result of this step, a significant proportion of patients (usually, more than half) from different sources is excluded from the sample because of the external cause of the disease (injuries, infections, isolated congenital pathology, etc.). The remainder families represent a list of families presumably with RHDs for further research.
At the third step, clinical investigations were performed by specialists from leading federal research institutes (geneticist, neurologist, ophthalmologist, dermatologist, pediatrician, otolaryngologist, and orthopedist), which ensured unification of diagnostic criteria. In some cases, blood is collected from patients for molecular genetic diagnosis. Written informed consent was obtained from all identified and examined families for voluntary participation in the study. From 1 to 5% of patients refuse to be

Statistical Methods
Given the heterogeneity of many diseases, the type of inheritance was also confirmed by segregation analysis used in multiple-family registration (Cavalli-Sforza and Bodmer, 1999). Segregation frequency is calculated by Weinberg's proband method (Morton, 1959). Using this method, segregation frequencies are calculated (separately for families with AD and AR pathology) by the ratio of probands; for actually detected patients in families after examination, probability of registration. In our case, the calculated segregation frequencies corresponded to the expected ones-0.25 for the AR type of inheritance and 0.5 for AD. However, it should be noted that the probability of registration differed from 100% and was 83% for the group of families with AR pathology and 72% for AD. The results suggest that irrespective of the total screening performed, a certain number of patients might not have been registered by us.
Point prevalence was calculated according to the procedural document on epidemiology of RDs in Orphanet (2019) as the number of reported cases in the population at a given time point per 100,000 people (all age categories were considered) (European Union (EU), 2009). The prevalence of X-linked pathology was calculated for the male population in the surveyed regions. The average male population in the regions was 46.44% (variation 45.6-46.9%).
Based on clinical manifestations, we additionally analyzed point prevalence according to the generally accepted classification of diseases: neurological syndrome, ophthalmic syndrome, genodermatoses, skeletal syndrome, hereditary syndrome, and other pathology (hereditary diseases of metabolism, blood, hearing, etc.).
The methods for collection and processing of medical genetic material remained unchanged throughout all the studies, which allows comparison of newly obtained data with results from the previously surveyed populations of the country.

Molecular Genetic Analysis
Confirmatory DNA diagnostics was carried out in the laboratories of the RCMG: Laboratory of Genetic Epidemiology (head-R.A. Zinchenko), Laboratory of Epigenetics (head-Sci. V.V. Strelnikov), and Laboratory of DNA Diagnostics (head-A.V. Polyakov). A variety of methods were used for DNA diagnosis-Sanger sequencing, MLPA, RFLP, AFLP, and whole-exome sequencing-depending on the studied nosology according to the protocols published elsewhere by the authors of the current manuscript (Zinchenko et al., 2020b).

Features and Limitations of the Method
Monogenic hereditary diseases listed in the OMIM and Orphanet databases with AD, AR, and X-linked (XL) types of inheritance were included in the analysis. Patients with mitochondrial disorders and chromosomal rearrangements were excluded from the analysis after cytogenetic and molecular genetic studies.
Some patients may not be identified or missed because of the following reasons: (i) patients with subclinical forms of the disease; (ii) patients in the initial stage of the disease with late onset; (iii) patients who refused from examination (from 1 to 5%) for various reasons: observation and treatment by a specific physician, unwillingness to disclose their diagnosis, and others; (iv) patients who are not registered in a medical organization of the region; (v) patients who have not passed this examination and do not live in the region (even in familial cases of the disease); (vi) patients who died by the time of the survey; (vii) child patients under 1 year of age with severe hereditary metabolic diseases unable to pass the survey; and (viii) we also assume that due to the increase in the number of confirmed phenotypes at the genetic level in recent years, the recognition of these diseases by physicians of various specialties is not yet possible. There is a lack of knowledge and diagnostic experience, which must inevitably lead to the omission of sporadic cases of rare hereditary diseases. Doctors in all countries face these problems.
The resulting values of the probability of registration are 83% for the group of families with AR pathology and 72% for AD.
We were also unable to estimate the annual incidence (number of newly diagnosed cases in a population within 1 year) due to the lack of this information in medical organizations. In consequence of the above, certain corrections should be made taking into account the limited number of nosological forms envisaged by our study.

DATA AVAILABILITY STATEMENT
The data analyzed in this study is subject to the following licenses/restrictions: Data are available upon request.
Requests to access these datasets should be directed to RZ, renazinchenko@mail.ru.

ETHICS STATEMENT
The studies involving human participants were reviewed and approved by The Ethical Committee of the Research Centre for Medical Genetics. Written informed consent to participate in this study was provided by the participants' legal guardian/next of kin.

AUTHOR CONTRIBUTIONS
RZ and EG: conceptualization, methodology, visualization, and funding acquisition. RZ, TV, and OA: validation. RZ, NP, VK, AP, and AM: formal analysis. RZ: resources, data curation, and writing-original draft preparation. AM: writing-review and editing. EG and SK: supervision. RZ, EG, and SK: project administration. All authors have read and agreed to the published version of the manuscript.

FUNDING
This study was supported by the Russian Science Foundation, grant no. 17-15-01051.