File(s) under permanent embargo
Reason: release upon publication
Sequences and draft annotations of computationally predicted proteins from Balamuthia mandrillaris
This file contains the sequences and draft annotations of computationally predicted proteins from Balamuthia mandrillaris. The sequences were reconstructed from RNA sequencing of logarithmic-phase trophozoites, the infective form of the amoeba. Reads were quality-filtered with Trimmomatic and, after clipping of the adaptor sequences, assembled de novo with Trinity v2.8 (k-mer = 25) and SPAdes v3.13 (k-mer = 29 and 33). In addition, the quality-filtered reads were aligned to the published B. mandrillaris genome LFUI01 with STAR v2.6 and assembled with Trinity. The three assemblies thus obtained were combined with EvidentialGene v19jan01 (EviGene), using BUSCO homology scores as input for the classifier. This dataset consists of the EviGene 'main' proteins. FASTA headers are derived from the functional descriptions and Gene Ontology (GO) annotations predicted with PANNZER2.
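Because the file is a protein FASTA, a quick summary of record count and sequence lengths is a sensible first check once it is released. The following is a minimal sketch in Python using only the standard library. The record names and sequences in EXAMPLE_FASTA are hypothetical placeholders, not entries from this dataset, and the exact header layout of the released file is not assumed here. The sketch also assumes that a trailing `*` (a stop-codon marker that some protein predictors append) should not count toward sequence length.

```python
from io import StringIO
from typing import Dict, Iterable, Iterator, TextIO, Tuple

# Hypothetical records for illustration only; not taken from the dataset.
EXAMPLE_FASTA = (
    ">hypothetical_protein_1 example description\n"
    "MKTAYIAKQR\n"
    "QISFVKSH\n"
    ">hypothetical_protein_2 another example\n"
    "MSLLTEVETP*\n"
)


def read_fasta(handle: TextIO) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs from a FASTA-formatted text stream."""
    header = None
    chunks = []
    for line in handle:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header = line[1:]  # drop the leading '>'
            chunks = []
        elif header is not None:
            chunks.append(line)  # sequences may span several lines
    if header is not None:
        yield header, "".join(chunks)


def summarize(records: Iterable[Tuple[str, str]]) -> Dict[str, float]:
    """Return count, min, max, and mean protein length for the records."""
    # Assumption: a trailing '*' marks a stop codon and is not a residue.
    lengths = [len(seq.rstrip("*")) for _, seq in records]
    if not lengths:
        return {"count": 0, "min": 0, "max": 0, "mean": 0.0}
    return {
        "count": len(lengths),
        "min": min(lengths),
        "max": max(lengths),
        "mean": sum(lengths) / len(lengths),
    }


if __name__ == "__main__":
    print(summarize(read_fasta(StringIO(EXAMPLE_FASTA))))
```

To run the same check on a real file, pass an open file handle to read_fasta in place of the StringIO object.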
This dataset is a byproduct of the study described in: The transcriptome of Balamuthia mandrillaris trophozoites for structure-based drug design.
Balamuthia mandrillaris, a pathogenic free-living amoeba (FLA), causes cutaneous lesions as well as the brain-eating disease Balamuthia granulomatous amoebic encephalitis (GAE). These diseases, and those caused by other pathogenic FLA such as Naegleria fowleri and Acanthamoeba species, are minimally studied. Chemotherapies for central nervous system (CNS) disease caused by B. mandrillaris require vast improvement. Current therapeutics are limited to a small number of drugs that were discovered in the last century through in vitro testing or identified after use in the small pool of reported survivors.
Using our recently published methodology to identify potentially useful therapeutics, we screened a collection of 85 compounds previously reported to have antiparasitic activity. We identified 59 compounds that impacted growth at concentrations below 220 µM. Because no fully annotated genome or proteome is available, we used RNA-Seq to determine the gene products of the specific genes potentially targeted by the compounds in B. mandrillaris trophozoites. We identified the sequences of 17 of these target genes and obtained expression clones for 15 that we validated by direct sequencing.
COBRE: Eukaryotic Pathogens Innovation Center (EPIC)
National Institute of General Medical Sciences
Georgia Research Alliance
NIH Institute or Center (IC): National Institute of Allergy and Infectious Diseases (NIAID)