BACKGROUND:Copy number variation is an important class of genomic variation that has been reported in 75% of the human genome. However, it is underreported in African populations. Copy number variants (CNVs) could have important impacts on disease susceptibility and environmental adaptation. To describe CNVs and their possible impacts in Africans, we sequenced genomes of 232 individuals from three major African ethno-linguistic groups: (1) Niger Congo A from Guinea and Côte d'Ivoire, (2) Niger Congo B from Uganda and the Democratic Republic of Congo and (3) Nilo-Saharans from Uganda. We used GenomeSTRiP and cn.MOPS to identify copy number variant regions (CNVRs). RESULTS:We detected 7608 CNVRs, of which 2172 were only deletions, 2384 were only insertions and 3052 had both. We detected 224 previously un-described CNVRs. The majority of novel CNVRs were present at low frequency and were not shared between populations. We tested for evidence of selection associated with CNVs and also for population structure. Signatures of selection identified previously, using SNPs from the same populations, were overrepresented in CNVRs. When CNVs were tagged with SNP haplotypes to identify SNPs that could predict the presence of CNVs, we identified haplotypes tagging 3096 CNVRs, 372 CNVRs had SNPs with evidence of selection (iHS > 3) and 222 CNVRs had both. This was more than expected (p < 0.0001) and included loci where CNVs have previously been associated with HIV, Rhesus D and preeclampsia. When integrated with 1000 Genomes CNV data, we replicated their observation of population stratification by continent but no clustering by populations within Africa, despite inclusion of Nilo-Saharans and Niger-Congo populations within our dataset. CONCLUSIONS:Novel CNVRs in the current study increase representation of African diversity in the database of genomic variants. Over-representation of CNVRs in SNP signatures of selection and an excess of SNPs that both tag CNVs and are subject to selection show that CNVs may be the actual targets of selection at some loci. However, unlike SNPs, CNVs alone do not resolve African ethno-linguistic groups. Tag haplotypes for CNVs identified may be useful in predicting African CNVs in future studies where only SNP data is available.
In: Twin research and human genetics: the official journal of the International Society for Twin Studies (ISTS) and the Human Genetics Society of Australasia, Band 17, Heft 2, S. 108-120
We have evaluated copy number variants (CNVs) in six monozygotic twin pairs discordant for schizophrenia. The data from Affymetrix® Human SNP 6.0 arrays™ were analyzed using Affymetrix® Genotyping Console™, Partek® Genomics Suite™, PennCNV, and Golden Helix SVS™. This yielded both program-specific and overlapping results. Only CNVs called by Affymetrix Genotyping Console, Partek Genomics Suite, and PennCNV were used in further analysis. This analysis included an assessment of calls in each of the six twin pairs towards identification of unique CNVs in affected and unaffected co-twins. Real time polymerase chain reaction (PCR) experiments confirmed one CNV loss at 7q11.21 that was found in the affected patient but not in the unaffected twin. The results identified CNVs and genes that were previously implicated in mental abnormalities in four of the six twin pairs. It includedPYY(twin pairs 1 and 5),EPHA3(twin pair 3),KIAA1211L(twin pair 4), andGPR139(twin pair 5). They represent likely candidate genes and CNVs for the discordance of four of the six monozygotic twin pairs for this heterogeneous neurodevelopmental disorder. An explanation for these differences is ontogenetic de novo events that differentiate in the monozygotic twins during development.
Cardiomyopathy (CMP) is a heritable disorder. Over 50% of cases are gene-elusive on clinical gene panel testing. The contribution of variants in non-coding DNA elements that result in cryptic splicing and regulate gene expression has not been explored. We analyzed whole-genome sequencing (WGS) data in a discovery cohort of 209 pediatric CMP patients and 1953 independent replication genomes and exomes. We searched for protein-coding variants, and non-coding variants predicted to affect the function or expression of genes. Thirty-nine percent of cases harbored pathogenic coding variants in known CMP genes, and 5% harbored high-risk loss-of-function (LoF) variants in additional candidate CMP genes. Fifteen percent harbored high-risk regulatory variants in promoters and enhancers of CMP genes (odds ratio 2.25, p = 6.70 × 10−7 versus controls). Genes involved in α-dystroglycan glycosylation (FKTN, DTNA) and desmosomal signaling (DSC2, DSG2) were most highly enriched for regulatory variants (odds ratio 6.7–58.1). Functional effects were confirmed in patient myocardium and reporter assays in human cardiomyocytes, and in zebrafish CRISPR knockouts. We provide strong evidence for the genomic contribution of functionally active variants in new genes and in regulatory elements of known CMP genes to early onset CMP. ; This project was supported by the Ted Rogers Centre for Heart Research (SM, JE), the Canadian Institutes of Health Research (PJT 175034) (SM, JE) and by the Canadian Institutes of Health Research (ENP 161429), under the frame of ERA PerMed (SM). SM holds the Heart and Stroke Foundation of Canada & Robert M Freedom Chair in Cardiovascular Science. SWS holds the GlaxoSmithKline Endowed Chair in Genome Sciences at the Hospital for Sick Children and the University of Toronto. PGM holds a Canada Research Chair Tier 2 in Non-coding Disease Mechanisms. PGM acknowledges the support of the Government of Canada's New Frontiers in Research Fund (NFRF), [NFRFE-2018-01305]. EO holds the Bitove Family Professorship of Adult Congenital Heart Disease. MM holds a Ramon y Cajal grant from the Spanish Ministry of Science and Innovation (RYC-2017-22249). WO is supported by funding from Fundació La Marató (321/C/2019). JB is funded by a Frans Van de Werf fellowship for clinical cardiovascular research, and by a senior clinical investigator fellowship of the FWO Flanders. KM was a National Science Foundation Graduate Research Fellow under grant no. DGE1144152 during the majority of the project. CS is the recipient of a National Health and Medical Research Council (NHMRC) Practitioner Fellowship (1154992). JI is the recipient of an NHMRC Career Development Fellowship (1162929). RDB is the recipient of a New South Wales Health Cardiovascular Disease Senior Scientist Grant. PSD is supported by the DBT/Wellcome Trust- Indian Alliance. We acknowledge the Labatt Family Heart Centre Biobank at the Hospital for Sick Children for access to DNA samples, and The Centre for Applied Genomics at the Hospital for Sick Children for performing WGS. We thank Xiucheng Cui and Emanuela Pannia for performing the zebrafish experiments at the SickKids Zebrafish Genetics and Disease Models Core (CRISPR-Cas9 and gRNA syntheses, zebrafish embryo microinjections, gRNA PCR validation, qRT-PCR, cardiac imaging). This research was made possible through access to the data and findings generated by the 100,000 Genomes Project. The 100,000 Genomes Project is managed by Genomics England Limited (a wholly owned company of the Department of Health and Social Care). The 100,000 Genomes Project is funded by the National Institute for Health Research and NHS England. The Wellcome Trust, Cancer Research UK and the Medical Research Council have also funded research infrastructure. The 100,000 Genomes Project uses data provided by patients and collected by the National Health Service as part of their care and support. We thank members of the ICGC/PCAWG working groups for generating the variant calls used in our case-control burden analyses. ; Peer Reviewed ; "Article signat per 38 autors/es: Robert Lesurf, Abdelrahman Said, Oyediran Akinrinade, Jeroen Breckpot, Kathleen Delfosse, Ting Liu, Roderick Yao, Gabrielle Persad, Fintan McKenna, Ramil R. Noche, Winona Oliveros, Kaia Mattioli, Shreya Shah, Anastasia Miron, Qian Yang, Guoliang Meng, Michelle Chan Seng Yue, Wilson W. L. Sung, Bhooma Thiruvahindrapuram, Jane Lougheed, Erwin Oechslin, Tapas Mondal, Lynn Bergin, John Smythe, Shashank Jayappa, Vinay J. Rao, Jayaprakash Shenthar, Perundurai S. Dhandapany, Christopher Semsarian, Robert G. Weintraub, Richard D. Bagnall, Jodie Ingles, Genomics England Research Consortium, Marta Melé, Philipp G. Maass, James Ellis, Stephen W. Scherer & Seema Mital" ; Postprint (published version)
In: Twin research and human genetics: the official journal of the International Society for Twin Studies (ISTS) and the Human Genetics Society of Australasia, Band 26, Heft 4-5, S. 280-289
AbstractSchizophrenia (SZ) is a severe, complex, and common mental disorder with high heritability (80%), an adult age of onset, and high discordance (∼50%) in monozygotic twins (MZ). Extensive studies on familial and non-familial cases have implicated a number of segregating mutations and de novo changes in SZ that may include changes to the mitochondrial genome. Yet, no single universally causal variant has been identified, highlighting its extensive genetic heterogeneity. This report specifically focuses on the assessment of changes in the mitochondrial genome in a unique set of monozygotic twins discordant (MZD) for SZ using blood. Genomic DNA from six pairs of MZD twins and two sets of parents (N = 16) was hybridized to the Affymetrix Human SNP Array 6.0 to assess mitochondrial DNA copy number (mtDNA-CN). Whole genome sequencing (WGS) and quantitative polymerase chain reaction (qPCR) was performed for a subset of MZD pairs and their parents and was also used to derive mtDNA-CN estimates. The WGS data were further analyzed to generate heteroplasmy (HP) estimates. Our results show that mtDNA-CN estimates for within-pair and mother-child differences were smaller than comparisons involving unrelated individuals, as expected. MZD twins showed discordance in mtDNA-CN estimates and displayed concordance in directionality of differences for mtDNA-CN across all technologies. Further, qPCR performed better than Affymetrix in estimating mtDNA-CN based on relatedness. No reliable differences in HP were detected between MZD twins. The within-MZD differences in mtDNA-CN observed represent postzygotic somatic changes that may contribute to discordance of MZ twins for diseases, including SZ.
This work was supported by grants from Ministerio de Ciencia e Innovación-Spain, Fundación M. Botín, European Union (FP7 MicroEnviMet), Instituto de Salud Carlos III (ISCIII) and Red Temática de Investigación del Cáncer (RTICC) del ISCIII
To access publisher full text version of this article. Please click on the hyperlink in Additional Links field. ; A trio of genome-wide association studies recently reported sequence variants at three loci to be significantly associated with schizophrenia. No sequence polymorphism had been unequivocally (P<5 × 10(-8)) associated with schizophrenia earlier. However, one variant, rs1344706[T], had come very close. This polymorphism, located in an intron of ZNF804A, was reported to associate with schizophrenia with a P-value of 1.6 × 10(-7), and with psychosis (schizophrenia plus bipolar disorder) with a P-value of 1.0 × 10(-8). In this study, using 5164 schizophrenia cases and 20,709 controls, we replicated the association with schizophrenia (odds ratio OR = 1.08, P = 0.0029) and, by adding bipolar disorder patients, we also confirmed the association with psychosis (added N = 609, OR = 1.09, P = 0.00065). Furthermore, as it has been proposed that variants such as rs1344706[T]-common and with low relative risk-may also serve to identify regions harboring less common, higher-risk susceptibility alleles, we searched ZNF804A for large copy number variants (CNVs) in 4235 psychosis patients, 1173 patients with other psychiatric disorders and 39,481 controls. We identified two CNVs including at least part of ZNF804A in psychosis patients and no ZNF804A CNVs in controls (P = 0.013 for association with psychosis). In addition, we found a ZNF804A CNV in an anxiety patient (P = 0.0016 for association with the larger set of psychiatric disorders). ; European Union LSHM-CT-2006-037761 PIAP-GA-2008-218251 HEALTH-F2-2009-223423 National Genomic Network (NGFN-2) of the German Federal Ministry of Education and Research (BMBF) National Institute of Mental Health R01 MH078075 Center of Excellence for Complex Disease Genetics of the Academy of Finland 213506 129680 Biocentrum Helsinki Foundation and Faculty of Medicine, University of Helsink info:eu-repo/grantAgreement/EC/FP7/218251
Joint grant from the United Kingdom Medical Research Council and GlaxoSmithKline (Grant No. G0701420) and the National Institute for Health Research (NIHR) Biomedical Research Centre for Mental Health at South London and Maudsley National Health Service (NHS) Foundation Trust and Institute of Psychiatry, King's College London. This work presents independent research in part funded by the NIHR. Also supported by the Wellcome Trust Grant No. 086635 (JJHR); NIHR Specialist Biomedical Research Centre for Mental Health at the South London and Maudsley NHS Foundation Trust and the Institute of Psychiatry, King's College London (SCW); a Marie Curie Intra-European Fellowship within the 7th European Community Framework Programme (MRiv); European Commission Grant Agreement No. 115008 (RU, PM); and Canada Research Chairs program (http://www. chairs-chaires.gc.ca/) (RU). Dr. Aitchison holds an Alberta Centennial Addiction and Mental Health Research Chair, funded by the government of Alberta, Canada. The Genome Based Therapeutic Drugs for Depression study was funded by a European Commission Framework 6 grant, European Commission Contract Reference LSHB-CT-2003-503428, and GlaxoSmithKline. Genotyping was performed at the Centre Nationale De Genotypage, Evry, Paris. We acknowledge the contribution of phase 2 of the Wellcome Trust Case Control Consortium in providing access to control datasets from the 1958 British birth cohort and the National Blood Service cohort.
In: Twin research and human genetics: the official journal of the International Society for Twin Studies (ISTS) and the Human Genetics Society of Australasia, Band 21, Heft 1, S. 1-11
Hundreds of penetrant risk loci have been identified across different neurodevelopmental disorders (NDDs), and these often involve rare (<1% frequency) copy number variations (CNVs), which can involve one or more genes. Monozygotic (MZ) twin pairs are long thought to share 100% of their genomic information. However, genetic differences in the form of postzygotic somatic variants have been reported recently both in typically developing (TD) and in clinically discordant MZ pairs. We sought to investigate the contribution of rare CNVs in 100 twin pairs enriched for NDD phenotypes with a particular focus on postzygotic CNVs in MZ pairs discordant for autism spectrum disorder (ASD) using the Illumina Infinium PsychArray. In our sample, no postzygotic de novo CNVs were found in 55 MZ twin pairs, including the 13 pairs discordant for ASD. We did detect a higher rate of CNVs overlapping genes involved in disorders of the nervous system, such as a rare deletion affecting HNRNPU, in MZ pairs discordant and concordant for ASD in comparison with TD pairs (p = .02). Our results are in concordance with earlier findings that postzygotic de novo CNV events are typically rare in genomic DNA derived from saliva or blood, and suggests that the discordance of NDDs in our sample of twins is not explained by discordant CNVs. Still, studies investigating postzygotic variation in MZ discordant twins using DNA from different tissues and single cells and higher resolution genomics are needed in the future.
Extreme phenotypic diversity, a history of artificial selection, and socioeconomic value make domestic dog breeds a compelling subject for genomic research. Copy number variation (CNV) is known to account for a significant part of inter-individual genomic diversity in other systems. However, a comprehensive genome-wide study of structural variation as it relates to breed-specific phenotypes is lacking. We have generated whole genome CNV maps for more than 300 canids. Our data set extends the canine structural variation landscape to more than 100 dog breeds, including novel variants that cannot be assessed using microarray technologies. We have taken advantage of this data set to perform the first CNV-based genome-wide association study (GWAS) in canids. We identify 96 loci that display copy number differences across breeds, which are statistically associated with a previously compiled set of breed-specific morphometrics and disease susceptibilities. Among these, we highlight the discovery of a long-range interaction involving a CNV near MED13L and TBX3, which could influence breed standard height. Integration of the CNVs with chromatin interactions, long noncoding RNA expression, and single nucleotide variation highlights a subset of specific loci and genes with potential functional relevance and the prospect to explain trait variation between dog breeds. ; J.P. and E.A.O. were funded by the Intramural Program of the National Human Genome Research Institute of the National Institutes of Health. T.M.-B. was funded by European Research Council ERC-CON-2019-864203, BFU2017-86471-P (MINECO/FEDER, UE), "Unidad de Excelencia María de Maeztu,"funded by the Agencia Estatal de Investigación (CEX2018-000792-M), Howard Hughes International Early Career, Obra Social "La Caixa"and Secretaria d'Universitats i Recerca and CERCA Programme del Departament d'Economia i Coneixement de la Generalitat de Catalunya (GRC 2017 SGR 880). ; With funding from the Spanish government through the "Severo Ochoa Centre of Excellence" accreditation (CEX2018-000792-M). ; Peer reviewed
Transgene copy number has a great impact on the expression level and stability of exogenous gene in transgenic plants. Proper selection of endogenous reference genes is necessary for detection of genetic components in genetically modification (GM) crops by quantitative real-time PCR (qPCR) or by qualitative PCR approach, especially in sugarcane with polyploid and aneuploid genomic structure. qPCR technique has been widely accepted as an accurate, time-saving method on determination of copy numbers in transgenic plants and on detection of genetically modified plants to meet the regulatory and legislative requirement. In this study, to find a suitable endogenous reference gene and its real-time PCR assay for sugarcane (Saccharum spp. hybrids) DNA content quantification, we evaluated a set of potential "single copy" genes including P4H, APRT, ENOL, CYC, TST and PRR, through qualitative PCR and absolute quantitative PCR. Based on copy number comparisons among different sugarcane genotypes, including five S. officinarum, one S. spontaneum and two S. spp. hybrids, these endogenous genes fell into three groups: ENOL-3—high copy number group, TST-1 and PRR-1—medium copy number group, P4H-1, APRT-2 and CYC-2—low copy number group. Among these tested genes, P4H, APRT and CYC were the most stable, while ENOL and TST were the least stable across different sugarcane genotypes. Therefore, three primer pairs of P4H-3, APRT-2 and CYC-2 were then selected as the suitable reference gene primer pairs for sugarcane. The test of multi-target reference genes revealed that the APRT gene was a specific amplicon, suggesting this gene is the most suitable to be used as an endogenous reference target for sugarcane DNA content quantification. These results should be helpful for establishing accurate and reliable qualitative and quantitative PCR analysis of GM sugarcane.
Transgene copy number has a great impact on the expression level and stability of exogenous gene in transgenic plants. Proper selection of endogenous reference genes is necessary for detection of genetic components in genetically modification (GM) crops by quantitative real-time PCR (qPCR) or by qualitative PCR approach, especially in sugarcane with polyploid and aneuploid genomic structure. qPCR technique has been widely accepted as an accurate, time-saving method on determination of copy numbers in transgenic plants and on detection of genetically modified plants to meet the regulatory and legislative requirement. In this study, to find a suitable endogenous reference gene and its real-time PCR assay for sugarcane (Saccharum spp. hybrids) DNA content quantification, we evaluated a set of potential "single copy" genes including P4H, APRT, ENOL, CYC, TST and PRR, through qualitative PCR and absolute quantitative PCR. Based on copy number comparisons among different sugarcane genotypes, including five S. officinarum, one S. spontaneum and two S. spp. hybrids, these endogenous genes fell into three groups: ENOL-3—high copy number group, TST-1 and PRR-1—medium copy number group, P4H-1, APRT-2 and CYC-2—low copy number group. Among these tested genes, P4H, APRT and CYC were the most stable, while ENOL and TST were the least stable across different sugarcane genotypes. Therefore, three primer pairs of P4H-3, APRT-2 and CYC-2 were then selected as the suitable reference gene primer pairs for sugarcane. The test of multi-target reference genes revealed that the APRT gene was a specific amplicon, suggesting this gene is the most suitable to be used as an endogenous reference target for sugarcane DNA content quantification. These results should be helpful for establishing accurate and reliable qualitative and quantitative PCR analysis of GM sugarcane.
In: Twin research and human genetics: the official journal of the International Society for Twin Studies (ISTS) and the Human Genetics Society of Australasia, Band 18, Heft 1, S. 13-18
Monozygotic (MZ) twins form an important system for the study of biological plasticity in humans. While MZ twins are generally considered to be genetically identical, a number of studies have emerged that have demonstrated copy-number differences within a twin pair, particularly in those discordant for disease. The rate of autosomal copy-number variation (CNV) discordance within MZ twin pairs was investigated using a population sample of 376 twin pairs genotyped on Illumina Human610-Quad arrays. After CNV calling using both QuantiSNP and PennCNV followed by manual annotation, only a single CNV difference was observed within the MZ twin pairs, being a 130 KB duplication of chromosome 5. Five other potential discordant CNV were called by the software, but excluded based on manual annotation of the regions. It is concluded that large CNV discordance is rare within MZ twin pairs, indicating that any CNV difference found within phenotypically discordant MZ twin pairs has a high probability of containing the causal gene(s) involved.
I will answer the question "What's in a copy?" by considering three sets of related issues: the importance of copies in academia; in cultural life; and in the economic world. In academia the current capability of making copies is challenging pedagogical practices and the trust of its members, plagiarism being the most immediate problem. The notion of authorship is also undergoing changes provoked by a proliferation of authors and new possibilities opened up by cyberspace. In cultural life, imitation and mimesis have long been fundamental engines of socialization. Our enhanced capacity of copying problematizes, with new intensity, the relationships between homogeneity and heterogeneity, between the genuine and the spurious. In the economic world, the digital era is threatening some of the fundamental tenets of capitalism, especially of its variant called the "knowledge society", regarding the control of intellectual property rights. The gap between normativity and social practices is widening. The many dilemmas and tensions identified in the text are understood as symptoms of two major characteristics of the current times: hyperfetishism and hyperanimism.
In: Twin research and human genetics: the official journal of the International Society for Twin Studies (ISTS) and the Human Genetics Society of Australasia, Band 18, Heft 2, S. 117-125
Variation in human cognitive ability is of consequence to a large number of health and social outcomes and is substantially heritable. Genetic linkage, genome-wide association, and copy number variant studies have investigated the contribution of genetic variation to individual differences in normal cognitive ability, but little research has considered the role of rare genetic variants. Exome sequencing studies have already met with success in discovering novel trait-gene associations for other complex traits. Here, we use exome sequencing to investigate the effects of rare variants on general cognitive ability. Unrelated Scottish individuals were selected for high scores on a general component of intelligence (g). The frequency of rare genetic variants (in n = 146) was compared with those from Scottish controls (total n = 486) who scored in the lower to middle range of the g distribution or on a proxy measure of g. Biological pathway analysis highlighted enrichment of the mitochondrial inner membrane component and apical part of cell gene ontology terms. Global burden analysis showed a greater total number of rare variants carried by high g cases versus controls, which is inconsistent with a mutation load hypothesis whereby mutations negatively affect g. The general finding of greater non-synonymous (vs. synonymous) variant effects is in line with evolutionary hypotheses for g. Given that this first sequencing study of high g was small, promising results were found, suggesting that the study of rare variants in larger samples would be worthwhile.