Science: Using genetic information to map millions of people
Release date: 2018-03-28 Genealogy has a wide range of applications in genetics, anthropology, economics and many other fields. However, the collection of extended genealogy is cumbersome and often relies on a limited geographic range and complex data resources with usage rights. Recently, a research report published in the journal Science reported that the online data published on the genealogy website collected the genealogy of a population of 10 million. Cienpies Design/Alamy The researchers first obtained 86 million records (of which 43 million had family information) from Geni.com, one of the world's largest collaborative genealogy sites, where users shared their genealogy on Geni. It uses an automated pipeline to remove any invalid topology (for example, an individual with more than two parents). This automated approach has more than 90% consistency with the genealogist's decision on invalid topologies. The largest family tree in the data set includes 13 million people, spanning an average of 11 generations between founders and their descendants. The true quality of the genealogy was also confirmed by evaluating the genetic separation patterns of single-line markers such as mitochondrial DNA (maternal transmission) and short tandem repeats of the Y chromosome (parental transmission). Next, the author extracts demographic data (including birth and death dates and individual geographic locations) from the online archive. By correlating relevant historical events (for example, rising mortality in important wars), the life expectancy is closely matched with historical data, and the birth and death periods of the population are analyzed and confirmed. It is further concluded that the first historical appearance of individuals in major cities is related to their establishment date. Consistent, thus determining the accuracy of the location data. The authors also created a model to examine the effects of genetic variation (including additivity, superiority, epistasis, etc.) on longevity, which demonstrates complex genetics that are difficult to dissect using genome-wide association studies. Analysis of the lifespan of 3 million relatives revealed a significant additive (about 16%) of the genetic component, a low rate of dominant contribution (2 to 4%), and no epistasis was detected (indicating that the effect of the gene on longevity may be greater than Scientists imagine small). The additive genetic factors in these new data are lower than previous estimates (about 25%), suggesting that it is more difficult than ever to attempt to detect genetic variants of disease-causing genes from genomic data to seek long-lived genes. In addition, when the authors used their datasets to analyze human migration patterns, they found that there were more female immigrants in Western society than men, albeit short-distance migration. Therefore, the authors speculate that large-scale transnational immigration activities may involve more men than women. By analyzing the radius of marriage (distance between the birthplaces of spouses) - as it increases, it is predicted that the genetic separation of the couple will be reduced according to the "isolation distance" theory. In the main social and political events of the time, the average marriage radius also changed as expected, such as the increase in the radius of marriage after the industrial revolution in 1750 and 1870. Interestingly, after 1800, there was a 50-year delay between the increase in the radius of marriage and the decline in the genetic association of couples, during which time individuals continued to marry relatives despite the distance. The author attributes this phenomenon to changes in cultural norms (rather than the emergence of long-distance traffic in the early 19th century). This study demonstrates the effectiveness of collaboration between researchers and the public (through an online database). After genomic sequencing data is covered onto these lineages, the large family established by this method will facilitate quantitative assessment of genetics and public health. Reference materials: Human genetics: Population-scale family trees from publicly available data Source: Bio-Exploration Edible Mushroom Extract,Hericium Erinaceus Extract,Boletus Extract Fufeng Sinuote Biotechnology Co.,Ltd. , https://www.sntbiology.com