Evolutionary Relationships of Microbial Aromatic Prenyltransferases

Written by on November 30, 2011 – 10:00 pm -

by Tobias Bonitz, Vikram Alva, Orwah Saleh, Andrei N. Lupas, Lutz Heide

The linkage of isoprenoid and aromatic moieties, catalyzed by aromatic prenyltransferases (PTases), leads to an impressive diversity of primary and secondary metabolites, including important pharmaceuticals and toxins. A few years ago, a hydroxynaphthalene PTase, NphB, featuring a novel ten-stranded β-barrel fold was identified in Streptomyces sp. strain CL190. This fold, termed the PT-barrel, is formed of five tandem ααββ structural repeats and remained exclusive to the NphB family until its recent discovery in the DMATS family of indole PTases. Members of these two families exist only in fungi and bacteria, and all of them appear to catalyze the prenylation of aromatic substrates involved in secondary metabolism. Sequence comparisons using PSI-BLAST do not yield matches between these two families, suggesting that they may have converged upon the same fold independently. However, we now provide evidence for a common ancestry for the NphB and DMATS families of PTases. We also identify sequence repeats that coincide with the structural repeats in proteins belonging to these two families. Therefore we propose that the PT-barrel arose by amplification of an ancestral ααββ module. In view of their homology and their similarities in structure and function, we propose to group the NphB and DMATS families together into a single superfamily, the PT-barrel superfamily.

Tags: , ,
Posted in я Computatioanl biology | Comments Off

Simultaneous Mutations in Multi-Viral Proteins Are Required for Soybean mosaic virus to Gain Virulence on Soybean Genotypes Carrying Different R Genes

Written by on November 30, 2011 – 10:00 pm -

by R. V. Chowda-Reddy, Haiyue Sun, John H. Hill, Vaino Poysa, Aiming Wang

Background

Genetic resistance is the most effective and sustainable approach to the control of plant pathogens that are a major constraint to agriculture worldwide. In soybean, three dominant R genes, i.e., Rsv1, Rsv3 and Rsv4, have been identified and deployed against Soybean mosaic virus (SMV) with strain-specificities. Molecular identification of virulent determinants of SMV on these resistance genes will provide essential information for the proper utilization of these resistance genes to protect soybean against SMV, and advance knowledge of virus-host interactions in general.

Methodology/Principal Findings

To study the gain and loss of SMV virulence on all the three resistance loci, SMV strains G7 and two G2 isolates L and LRB were used as parental viruses. SMV chimeras and mutants were created by partial genome swapping and point mutagenesis and then assessed for virulence on soybean cultivars PI96983 (Rsv1), L-29 (Rsv3), V94-5152 (Rsv4) and Williams 82 (rsv). It was found that P3 played an essential role in virulence determination on all three resistance loci and CI was required for virulence on Rsv1- and Rsv3-genotype soybeans. In addition, essential mutations in HC-Pro were also required for the gain of virulence on Rsv1-genotype soybean. To our best knowledge, this is the first report that CI and P3 are involved in virulence on Rsv1- and Rsv3-mediated resistance, respectively.

Conclusions/Significance

Multiple viral proteins, i.e., HC-Pro, P3 and CI, are involved in virulence on the three resistance loci and simultaneous mutations at essential positions of different viral proteins are required for an avirulent SMV strain to gain virulence on all three resistance loci. The likelihood of such mutations occurring naturally and concurrently on multiple viral proteins is low. Thus, incorporation of all three resistance genes in a soybean cultivar through gene pyramiding may provide durable resistance to SMV.


Tags: , ,
Posted in я Computatioanl biology | Comments Off

Comparative Analysis of Human Protein-Coding and Noncoding RNAs between Brain and 10 Mixed Cell Lines by RNA-Seq

Written by on November 30, 2011 – 10:00 pm -

by Geng Chen, Kangping Yin, Leming Shi, Yuanzhang Fang, Ya Qi, Peng Li, Jian Luo, Bing He, Mingyao Liu, Tieliu Shi

In their expression process, different genes can generate diverse functional products, including various protein-coding or noncoding RNAs. Here, we investigated the protein-coding capacities and the expression levels of their isoforms for human known genes, the conservation and disease association of long noncoding RNAs (ncRNAs) with two transcriptome sequencing datasets from human brain tissues and 10 mixed cell lines. Comparative analysis revealed that about two-thirds of the genes expressed between brain and cell lines are the same, but less than one-third of their isoforms are identical. Besides those genes specially expressed in brain and cell lines, about 66% of genes expressed in common encoded different isoforms. Moreover, most genes dominantly expressed one isoform and some genes only generated protein-coding (or noncoding) RNAs in one sample but not in another. We found 282 human genes could encode both protein-coding and noncoding RNAs through alternative splicing in the two samples. We also identified more than 1,000 long ncRNAs, and most of those long ncRNAs contain conserved elements across either 46 vertebrates or 33 placental mammals or 10 primates. Further analysis showed that some long ncRNAs differentially expressed in human breast cancer or lung cancer, several of those differentially expressed long ncRNAs were validated by RT-PCR. In addition, those validated differentially expressed long ncRNAs were found significantly correlated with certain breast cancer or lung cancer related genes, indicating the important biological relevance between long ncRNAs and human cancers. Our findings reveal that the differences of gene expression profile between samples mainly result from the expressed gene isoforms, and highlight the importance of studying genes at the isoform level for completely illustrating the intricate transcriptome.

Tags: , ,
Posted in я Computatioanl biology | Comments Off

Systematic Comparison of Three Methods for Fragmentation of Long-Range PCR Products for Next Generation Sequencing

Written by on November 30, 2011 – 10:00 pm -

by Ellen Knierim, Barbara Lucke, Jana Marie Schwarz, Markus Schuelke, Dominik Seelow

Next Generation Sequencing (NGS) technologies are gaining importance in the routine clinical diagnostic setting. It is thus desirable to simplify the workflow for high-throughput diagnostics. Fragmentation of DNA is a crucial step for preparation of template libraries and various methods are currently known. Here we evaluated the performance of nebulization, sonication and random enzymatic digestion of long-range PCR products on the results of NGS. All three methods produced high-quality sequencing libraries for the 454 platform. However, if long-range PCR products of different length were pooled equimolarly, sequence coverage drastically dropped for fragments below 3,000 bp. All three methods performed equally well with regard to overall sequence quality (PHRED) and read length. Enzymatic fragmentation showed highest consistency between three library preparations but performed slightly worse than sonication and nebulization with regard to insertions/deletions in the raw sequence reads. After filtering for homopolymer errors, enzymatic fragmentation performed best if compared to the results of classic Sanger sequencing. As the overall performance of all three methods was equal with only minor differences, a fragmentation method can be chosen solely according to lab facilities, feasibility and experimental design.

Tags: , ,
Posted in я Computatioanl biology | Comments Off

A Systematic Analysis on DNA Methylation and the Expression of Both mRNA and microRNA in Bladder Cancer

Written by on November 30, 2011 – 10:00 pm -

by Jialou Zhu, Zhimao Jiang, Fei Gao, Xueda Hu, Liang Zhou, Jiahao Chen, Huijuan Luo, Jihua Sun, Song Wu, Yonghua Han, Guangliang Yin, Maoshan Chen, Zujing Han, Xianxin Li, Yi Huang, Weixing Zhang, Fangjian Zhou, Tong Chen, Pingping Fa, Yong Wang, Liang Sun, Huimin Leng, Fenghao Sun, Yuchen Liu, Mingzhi Ye, Huanming Yang, Zhiming Cai, Yaoting Gui, Xiuqing Zhang

Background

DNA methylation aberration and microRNA (miRNA) deregulation have been observed in many types of cancers. A systematic study of methylome and transcriptome in bladder urothelial carcinoma has never been reported.

Methodology/Principal Findings

The DNA methylation was profiled by modified methylation-specific digital karyotyping (MMSDK) and the expression of mRNAs and miRNAs was analyzed by digital gene expression (DGE) sequencing in tumors and matched normal adjacent tissues obtained from 9 bladder urothelial carcinoma patients. We found that a set of significantly enriched pathways disrupted in bladder urothelial carcinoma primarily related to “neurogenesis” and “cell differentiation” by integrated analysis of -omics data. Furthermore, we identified an intriguing collection of cancer-related genes that were deregulated at the levels of DNA methylation and mRNA expression, and we validated several of these genes (HIC1, SLIT2, RASAL1, and KRT17) by Bisulfite Sequencing PCR and Reverse Transcription qPCR in a panel of 33 bladder cancer samples.

Conclusions/Significance

We characterized the profiles between methylome and transcriptome in bladder urothelial carcinoma, identified a set of significantly enriched key pathways, and screened four aberrantly methylated and expressed genes. Conclusively, our findings shed light on a new avenue for basic bladder cancer research.


Tags: , ,
Posted in я Computatioanl biology | Comments Off

Engineered Single-Domain Antibodies with High Protease Resistance and Thermal Stability

Written by on November 30, 2011 – 10:00 pm -

by Greg Hussack, Tomoko Hirama, Wen Ding, Roger MacKenzie, Jamshid Tanha

The extreme pH and protease-rich environment of the upper gastrointestinal tract is a major obstacle facing orally-administered protein therapeutics, including antibodies. Through protein engineering, several Clostridium difficile toxin A-specific heavy chain antibody variable domains (VHHs) were expressed with an additional disulfide bond by introducing Ala/Gly54Cys and Ile78Cys mutations. Mutant antibodies were compared to their wild-type counterparts with respect to expression yield, non-aggregation status, affinity for toxin A, circular dichroism (CD) structural signatures, thermal stability, protease resistance, and toxin A-neutralizing capacity. The mutant VHHs were found to be well expressed, although with lower yields compared to wild-type counterparts, were non-aggregating monomers, retained low nM affinity for toxin A, albeit the majority showed somewhat reduced affinity compared to wild-type counterparts, and were capable of in vitro toxin A neutralization in cell-based assays. Far-UV and near-UV CD spectroscopy consistently showed shifts in peak intensity and selective peak minima for wild-type and mutant VHH pairs; however, the overall CD profile remained very similar. A significant increase in the thermal unfolding midpoint temperature was observed for all mutants at both neutral and acidic pH. Digestion of the VHHs with the major gastrointestinal proteases, at biologically relevant concentrations, revealed a significant increase in pepsin resistance for all mutants and an increase in chymotrypsin resistance for the majority of mutants. Mutant VHH trypsin resistance was similar to that of wild-type VHHs, although the trypsin resistance of one VHH mutant was significantly reduced. Therefore, the introduction of a second disulfide bond in the hydrophobic core not only increases VHH thermal stability at neutral pH, as previously shown, but also represents a generic strategy to increase VHH stability at low pH and impart protease resistance, with only minor perturbations in target binding affinities. These are all desirable characteristics for the design of protein-based oral therapeutics.

Tags: , ,
Posted in я Computatioanl biology | Comments Off

Heterozygous Yeast Deletion Collection Screens Reveal Essential Targets of Hsp90

Written by on November 30, 2011 – 10:00 pm -

by Eric A. Franzosa, Véronique Albanèse, Judith Frydman, Yu Xia, Amie J. McClellan

Hsp90 is an essential eukaryotic chaperone with a role in folding specific “client” proteins such as kinases and hormone receptors. Previously performed homozygous diploid yeast deletion collection screens uncovered broad requirements for Hsp90 in cellular transport and cell cycle progression. These screens also revealed that the requisite cellular functions of Hsp90 change with growth temperature. We present here for the first time the results of heterozygous deletion collection screens conducted at the hypothermic stress temperature of 15°C. Extensive bioinformatic analyses were performed on the resulting data in combination with data from homozygous and heterozygous screens previously conducted at normal (30°C) and hyperthermic stress (37°C) growth temperatures. Our resulting meta-analysis uncovered extensive connections between Hsp90 and (1) general transcription, (2) ribosome biogenesis and (3) GTP binding proteins. Predictions from bioinformatic analyses were tested experimentally, supporting a role for Hsp90 in ribosome stability. Importantly, the integrated analysis of the 15°C heterozygous deletion pool screen with previously conducted 30°C and 37°C screens allows for essential genetic targets of Hsp90 to emerge. Altogether, these novel contributions enable a more complete picture of essential Hsp90 functions.

Tags: , ,
Posted in я Computatioanl biology | Comments Off

ERG Deregulation Induces PIM1 Over-Expression and Aneuploidy in Prostate Epithelial Cells

Written by on November 30, 2011 – 10:00 pm -

by Vera Magistroni, Luca Mologni, Stefano Sanselicio, James Frances Reid, Sara Redaelli, Rocco Piazza, Michela Viltadi, Giorgio Bovo, Guido Strada, Marco Grasso, Manuela Gariboldi, Carlo Gambacorti-Passerini

The ERG gene belongs to the ETS family of transcription factors and has been found to be involved in atypical chromosomal rearrangements in several cancers. To gain insight into the oncogenic activity of ERG, we compared the gene expression profile of NIH-3T3 cells stably expressing the coding regions of the three main ERG oncogenic fusions: TMPRSS2/ERG (tERG), EWS/ERG and FUS/ERG. We found that all three ERG fusions significantly up-regulate PIM1 expression in the NIH-3T3 cell line. PIM1 is a serine/threonine kinase frequently over-expressed in cancers of haematological and epithelial origin. We show here that tERG expression induces PIM1 in the non-malignant prostate cell line RWPE-1, strengthening the relation between tERG and PIM1 up-regulation in the initial stages of prostate carcinogenesis. Silencing of tERG reversed PIM1 induction. A significant association between ERG and PIM1 expression in clinical prostate carcinoma specimens was found, suggesting that such a mechanism may be relevant in vivo. Chromatin Immunoprecipitation experiments showed that tERG directly binds to PIM1 promoter in the RWPE-1 prostate cell line, suggesting that tERG could be a direct regulator of PIM1 expression. The up-regulation of PIM1 induced by tERG over-expression significantly modified Cyclin B1 levels and increased the percentage of aneuploid cells in the RWPE-1 cell line after taxane-based treatment. Here we provide the first evidence for an ERG-mediated PIM1 up-regulation in prostate cells in vitro and in vivo, suggesting a direct effect of ERG transcriptional activity in the alteration of genetic stability.

Tags: , ,
Posted in я Computatioanl biology | Comments Off

Novel Inhibitor Design for Hemagglutinin against H1N1 Influenza Virus by Core Hopping Method

Written by on November 30, 2011 – 10:00 pm -

by Xiao-Bo Li, Shu-Qing Wang, Wei-Ren Xu, Run-Ling Wang, Kuo-Chen Chou

The worldwide spread of H1N1 avian influenza and the increasing reports about its resistance to the current drugs have made a high priority for developing new anti-influenza drugs. Owing to its unique function in assisting viruses to bind the cellular surface, a key step for them to subsequently penetrate into the infected cell, hemagglutinin (HA) has become one of the main targets for drug design against influenza virus. To develop potent HA inhibitors, the ZINC fragment database was searched for finding the optimal compound with the core hopping technique. As a result, the Neo6 compound was obtained. It has been shown through the subsequent molecular docking studies and molecular dynamic simulations that Neo6 not only assumes more favorable conformation at the binding pocket of HA but also has stronger binding interaction with its receptor. Accordingly, Neo6 may become a promising candidate for developing new and more powerful drugs for treating influenza. Or at the very least, the findings reported here may provide useful insights to stimulate new strategy in this area.

Tags: , ,
Posted in я Computatioanl biology | Comments Off

On the Evolutionary Origin of Eukaryotic DNA Methyltransferases and Dnmt2

Written by on November 30, 2011 – 10:00 pm -

by Tomasz P. Jurkowski, Albert Jeltsch

The Dnmt2 enzymes show strong amino acid sequence similarity with eukaryotic and prokaryotic DNA-(cytosine C5)-methyltransferases. Yet, Dnmt2 enzymes from several species were shown to methylate tRNA-Asp and had been proposed that eukaryotic DNA methyltransferases evolved from a Dnmt2-like tRNA methyltransferase ancestor [Goll et al., 2006, Science, 311, 395-8]. It was the aim of this study to investigate if this hypothesis could be supported by evidence from sequence alignments. We present phylogenetic analyses based on sequence alignments of the methyltransferase catalytic domains of more than 2300 eukaryotic and prokaryotic DNA-(cytosine C5)-methyltransferases and analyzed the distribution of DNA methyltransferases in eukaryotic species. The Dnmt2 homologues were reliably identified by an additional conserved CFT motif next to motif IX. All DNA methyltransferases and Dnmt2 enzymes were clearly separated from other RNA-(cytosine-C5)-methyltransferases. Our sequence alignments and phylogenetic analyses indicate that the last universal eukaryotic ancestor contained at least one member of the Dnmt1, Dnmt2 and Dnmt3 families of enzymes and additional RNA methyltransferases. The similarity of Dnmt2 enzymes with DNA methyltransferases and absence of similarity with RNA methyltransferases combined with their strong RNA methylation activity suggest that the ancestor of Dnmt2 was a DNA methyltransferase and an early Dnmt2 enzyme changed its substrate preference to tRNA. There is no phylogenetic evidence that Dnmt2 was the precursor of eukaryotic Dnmts. Most likely, the eukaryotic Dnmt1 and Dnmt3 families of DNA methyltransferases had an independent origin in the prokaryotic DNA methyltransferase sequence space.

Tags: , ,
Posted in я Computatioanl biology | Comments Off

Computational Analysis of the ESX-1 Region of Mycobacterium tuberculosis: Insights into the Mechanism of Type VII Secretion System

Written by on November 30, 2011 – 10:00 pm -

by Chandrani Das, Tarini Shankar Ghosh, Sharmila S. Mande

Type VII secretion system (T7SS) is a recent discovery in bacterial secretion systems. First identified in Mycobacterium tuberculosis, this secretion system has later been reported in organisms belonging to the Actinomycetales order and even to distant phyla like Firmicutes. The genome of M. tuberculosis H37Rv contains five gene clusters that have evolved through gene duplication events and include components of the T7SS secretion machinery. These clusters are called ESAT-6 secretion system (ESX) 1 through 5. Out of these, ESX-1 has been the most widely studied region because of its pathological importance. In spite of this, the overall mechanism of protein translocation through ESX-1 secretion machinery is not clearly understood. Specifically, the structural components contributing to the translocation through the mycomembrane have not been characterized yet. In this study, we have carried out a comprehensive in silico analysis of the genes known to be involved in ESX-1 secretion pathway and identified putative proteins having high probability to be associated with this particular pathway. Our study includes analysis of phylogenetic profiles, identification of domains, transmembrane helices, 3D folds, signal peptides and prediction of protein-protein associations. Based on our analysis, we could assign probable novel functions to a few of the ESX-1 components. Additionally, we have identified a few proteins with probable role in the initial activation and formation of mycomembrane translocon of ESX-1 secretion machinery. We also propose a probable working model of T7SS involving ESX-1 secretion pathway.

Tags: , ,
Posted in я Computatioanl biology | Comments Off

FTO Is Expressed in Neurones throughout the Brain and Its Expression Is Unaltered by Fasting

Written by on November 30, 2011 – 10:00 pm -

by James S. McTaggart, Sheena Lee, Michaela Iberl, Chris Church, Roger D. Cox, Frances M. Ashcroft

Single-nucleotide polymorphisms in the first intron of the ubiquitously expressed FTO gene are associated with obesity. Although the physiological functions of FTO remain unclear, food intake is often altered when Fto expression levels are manipulated. Furthermore, deletion of FTO from neurones alone has a similar effect on food intake to deletion of FTO in all tissues. These results indicate that FTO expression in the brain is particularly important. Considerable focus has been placed on the dynamic regulation of Fto mRNA expression in the hypothalamus after short-term (16–48 hour) fasting, but results have been controversial. There are no studies that quantify FTO protein levels across the brain, and assess its alteration following short-term fasting. Using immunohistochemistry, we found that FTO protein is widely expressed in mouse brain, and present in the majority of neurones. Using quantitative Western blotting and RT-qPCR we show that FTO protein and mRNA levels in the hypothalamus, cerebellum and rostral brain are relatively uniform, and levels in the brain are higher than in skeletal muscles of the lower limbs. Fasting for 18 hours does not alter the expression pattern, or levels, of FTO protein and mRNA. We further show that the majority of POMC neurones, which are critically involved in food intake regulation, also express FTO, but that the percentage of FTO-positive POMC neurones is not altered by fasting. In summary, we find no evidence that Fto/FTO expression is regulated by short-term (18-hour) fasting. Thus, it is unlikely that the hunger and increased post-fasting food intake caused by such food deprivation is driven by alterations in Fto/FTO expression. The widespread expression of FTO in neurones also suggests that physiological studies of this protein should not be limited to the hypothalamus.

Tags: , ,
Posted in я Computatioanl biology | Comments Off

Replication and Virus-Induced Transcriptome of HAdV-5 in Normal Host Cells versus Cancer Cells - Differences of Relevance for Adenoviral Oncolysis

Written by on November 30, 2011 – 10:00 pm -

by Dominik E. Dorer, Frank Holtrup, Kurt Fellenberg, Johanna K. Kaufmann, Sarah Engelhardt, Jörg D. Hoheisel, Dirk M. Nettelbeck

Adenoviruses (Ads), especially HAdV-5, have been genetically equipped with tumor-restricted replication potential to enable applications in oncolytic cancer therapy. Such oncolytic adenoviruses have been well tolerated in cancer patients, but their anti-tumor efficacy needs to be enhanced. In this regard, it should be considered that cancer cells, dependent on their tissue of origin, can differ substantially from the normal host cells to which Ads are adapted by complex virus-host interactions. Consequently, viral replication efficiency, a key determinant of oncolytic activity, might be suboptimal in cancer cells. Therefore, we have analyzed both the replication kinetics of HAdV-5 and the virus-induced transcriptome in human bronchial epithelial cells (HBEC) in comparison to cancer cells. This is the first report on genome-wide expression profiling of Ads in their native host cells. We found that E1A expression and onset of viral genome replication are most rapid in HBEC and considerably delayed in melanoma cells. In squamous cell lung carcinoma cells, we observed intermediate HAdV-5 replication kinetics. Infectious particle production, viral spread and lytic activity of HAdV-5 were attenuated in melanoma cells versus HBEC. Expression profiling at the onset of viral genome replication revealed that HAdV-5 induced the strongest changes in the cellular transcriptome in HBEC, followed by lung cancer and melanoma cells. We identified prominent regulation of genes involved in cell cycle and DNA metabolism, replication and packaging in HBEC, which is in accord with the necessity to induce S phase for viral replication. Strikingly, in melanoma cells HAdV-5 triggered opposing regulation of said genes and, in contrast to lung cancer cells, no weak S phase induction was detected when using the E2F promoter as reporter. Our results provide a rationale for improving oncolytic adenoviruses either by adaptation of viral infection to target tumor cells or by modulating tumor cell functions to better support viral replication.

Tags: , ,
Posted in я Computatioanl biology | Comments Off

SNPs and Other Features as They Predispose to Complex Disease: Genome-Wide Predictive Analysis of a Quantitative Phenotype for Hypertension

Written by on November 30, 2011 – 10:00 pm -

by Joong-Ho Won, Georg Ehret, Aravinda Chakravarti, Richard A. Olshen

Though recently they have fallen into some disrepute, genome-wide association studies (GWAS) have been formulated and applied to understanding essential hypertension. The principal goal here is to use data gathered in a GWAS to gauge the extent to which SNPs and their interactions with other features can be combined to predict mean arterial blood pressure (MAP) in 3138 pre-menopausal and naturally post-menopausal white women. More precisely, we quantify the extent to which data as described permit prediction of MAP beyond what is possible from traditional risk factors such as blood cholesterol levels and glucose levels. Of course, these traditional risk factors are genetic, though typically not explicitly so. In all, there were 44 such risk factors/clinical variables measured and 377,790 single nucleotide polymorphisms (SNPs) genotyped. Data for women we studied are from first visit measurements taken as part of the Atherosclerotic Risk in Communities (ARIC) study. We begin by assessing non-SNP features in their abilities to predict MAP, employing a novel regression technique with two stages, first the discovery of main effects and next discovery of their interactions. The long list of SNPs genotyped is reduced to a manageable list for combining with non-SNP features in prediction. We adapted Efron's local false discovery rate to produce this reduced list. Selected non-SNP and SNP features and their interactions are used to predict MAP using adaptive linear regression. We quantify quality of prediction by an estimated coefficient of determination (R2). We compare the accuracy of prediction with and without information from SNPs.

Tags: , ,
Posted in я Computatioanl biology | Comments Off

Genome-Wide Mapping of Copy Number Variation in Humans: Comparative Analysis of High Resolution Array Platforms

Written by on November 30, 2011 – 10:00 pm -

by Rajini R. Haraksingh, Alexej Abyzov, Mark Gerstein, Alexander E. Urban, Michael Snyder

Accurate and efficient genome-wide detection of copy number variants (CNVs) is essential for understanding human genomic variation, genome-wide CNV association type studies, cytogenetics research and diagnostics, and independent validation of CNVs identified from sequencing based technologies. Numerous, array-based platforms for CNV detection exist utilizing array Comparative Genome Hybridization (aCGH), Single Nucleotide Polymorphism (SNP) genotyping or both. We have quantitatively assessed the abilities of twelve leading genome-wide CNV detection platforms to accurately detect Gold Standard sets of CNVs in the genome of HapMap CEU sample NA12878, and found significant differences in performance. The technologies analyzed were the NimbleGen 4.2 M, 2.1 M and 3×720 K Whole Genome and CNV focused arrays, the Agilent 1×1 M CGH and High Resolution and 2×400 K CNV and SNP+CGH arrays, the Illumina Human Omni1Quad array and the Affymetrix SNP 6.0 array. The Gold Standards used were a 1000 Genomes Project sequencing-based set of 3997 validated CNVs and an ultra high-resolution aCGH-based set of 756 validated CNVs. We found that sensitivity, total number, size range and breakpoint resolution of CNV calls were highest for CNV focused arrays. Our results are important for cost effective CNV detection and validation for both basic and clinical applications.

Tags: , ,
Posted in я Computatioanl biology | Comments Off
RSS