Genome sequence of the fish pathogen Renibacterium salmoninarum suggests reductive evolution away from an environmental Arthrobacter ancestor.

Wiens GD, Rockey DD, Wu Z, Chang J, Levy R, Crane S, Chen DS, Capri GR, Burnett JR, Sudheesh PS, Schipma MJ, Burd H, Bhattacharyya A, Rhodes LD, Kaul R, Strom MS.

Renibacterium salmoninarum is the causative agent of bacterial kidney disease and a significant threat to healthy and sustainable production of salmonid fish worldwide. This pathogen is difficult to culture in vitro, genetic manipulation is challenging, and current therapies and preventative strategies are only marginally effective in preventing disease. The complete genome of R. salmoninarum ATCC 33209 was sequenced and shown to be a 3,155,250-bp circular chromosome that is predicted to contain 3,507 open-reading frames (ORFs). A total of 80 copies of three different insertion sequence elements are interspersed throughout the genome. Approximately 21% of the predicted ORFs have been inactivated via frameshifts, point mutations, insertion sequences, and putative deletions. The R. salmoninarum genome has extended regions of synteny to the Arthrobacter sp. strain FB24 and Arthrobacter aurescens TC1 genomes, but it is approximately 1.9 Mb smaller than both Arthrobacter genomes and has a lower G+C content, suggesting that significant genome reduction has occurred since divergence from the last common ancestor. A limited set of putative virulence factors appear to have been acquired via horizontal transmission after divergence of the species; these factors include capsular polysaccharides, heme sequestration molecules, and the major secreted cell surface antigen p57 (also known as major soluble antigen). Examination of the genome revealed a number of ORFs homologous to antibiotic resistance genes, including genes encoding beta-lactamases, efflux proteins, macrolide glycosyltransferases, and rRNA methyltransferases. The genome sequence provides new insights into R. salmoninarum evolution and may facilitate identification of chemotherapeutic targets and vaccine candidates that can be used for prevention and treatment of infections in cultured salmonids.

J Bacteriol. 2008 Nov;190(21):6970-82. 
doi: 10.1128/JB.00721-08. Epub 2008 Aug 22.

The genome of Syntrophus aciditrophicus: Life at the thermodynamic limit of microbial growth

Michael J. McInerney, Lars Rohlin, Housna Mouttaki, UnMi Kim, Rebecca S. Krupp, Luis Rios-Hernandez, Jessica Sieber, Christopher G. Struchtemeyer, Anamitra Bhattacharyya, John W. Campbell, and Robert P. Gunsalus

Biochemically, the syntrophic bacteria constitute the missing link in our understanding of anaerobic flow of carbon in the biosphere. The completed genome sequence of Syntrophus aciditrophicus SB, a model fatty acid- and aromatic acid-degrading syntrophic bacterium, provides a glimpse of the composition and architecture of the electron transfer and energy-transducing systems needed to exist on marginal energy economies of a syntrophic lifestyle. The genome contains 3,179,300 base pairs and 3,169 genes where 1,618 genes were assigned putative functions. Metabolic reconstruction of the gene inventory revealed that most biosynthetic pathways of a typical Gram-negative microbe were present. A distinctive feature of syntrophic metabolism is the need for reverse electron transport; the presence of a unique Rnf-type ion-translocating electron transfer complex, menaquinone, and membrane-bound Fe-S proteins with associated heterodisulfide reductase domains suggests mechanisms to accomplish this task. Previously undescribed approaches to degrade fatty and aromatic acids, including multiple AMP-forming CoA ligases and acyl-CoA synthetases seem to be present as ways to form and dissipate ion gradients by using a sodium-based energy strategy. Thus, S. aciditrophicus, although nutritionally self-sufficient, seems to be a syntrophic specialist with limited fermentative and respiratory metabolism. Genomic analysis confirms the S. aciditrophicus metabolic and regulatory commitment to a nonconventional mode of life compared with our prevailing understanding of microbiology.

Published online 2007 Apr 18. doi:  10.1073/pnas.0610456104

The cyanobacterial genome core and the origin of photosynthesis.

Armen Y. Mulkidjanian, Eugene V. Koonin, Kira S. Makarova, Sergey L. Mekhedov, Alexander Sorokin, Yuri I. Wolf, Alexis Dufresne, Frédéric Partensky, Henry Burd, Denis Kaznadzey, Robert Haselkorn, and Michael Y. Galperin

Comparative analysis of 15 complete cyanobacterial genome sequences, including “near minimal” genomes of five strains of Prochlorococcus spp., revealed 1,054 protein families [core cyanobacterial clusters of orthologous groups of proteins (core CyOGs)] encoded in at least 14 of them. The majority of the core CyOGs are involved in central cellular functions that are shared with other bacteria; 50 core CyOGs are specific for cyanobacteria, whereas 84 are exclusively shared by cyanobacteria and plants and/or other plastid-carrying eukaryotes, such as diatoms or apicomplexans. The latter group includes 35 families of uncharacterized proteins, which could also be involved in photosynthesis. Only a few components of cyanobacterial photosynthetic machinery are represented in the genomes of the anoxygenic phototrophic bacteria Chlorobium tepidum, Rhodopseudomonas palustris, Chloroflexus aurantiacus, or Heliobacillus mobilis. These observations, coupled with recent geological data on the properties of the ancient phototrophs, suggest that photosynthesis originated in the cyanobacterial lineage under the selective pressures of UV light and depletion of electron donors. We propose that the first phototrophs were anaerobic ancestors of cyanobacteria (“procyanobacteria”) that conducted anoxygenic photosynthesis using a photosystem I-like reaction center, somewhat similar to the heterocysts of modern filamentous cyanobacteria. From procyanobacteria, photosynthesis spread to other phyla by way of lateral gene transfer.

Proc Natl Acad Sci U S A. 2006 Aug 29; 103(35): 13126–13131.
Published online 2006 Aug 21. doi:  10.1073/pnas.0605709103
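
The core-set criterion described in the abstract above (families encoded in at least 14 of the 15 genomes) can be illustrated with a minimal sketch. The function name `core_families`, the family labels, and the toy data are hypothetical and are not taken from the paper; only the "present in at least N genomes" rule comes from the abstract.

```python
from collections import Counter

def core_families(genome_families, min_genomes):
    """Return the families found in at least `min_genomes` genomes.

    `genome_families` maps a genome name to the set of protein
    families it encodes (hypothetical input format).
    """
    counts = Counter()
    for families in genome_families.values():
        counts.update(set(families))  # count each family once per genome
    return {fam for fam, n in counts.items() if n >= min_genomes}

# Toy data (illustrative only): 4 genomes, threshold of 3.
toy = {
    "genome_A": {"fam1", "fam2", "fam3"},
    "genome_B": {"fam1", "fam2"},
    "genome_C": {"fam1", "fam3", "fam4"},
    "genome_D": {"fam1", "fam2", "fam3"},
}

core = core_families(toy, min_genomes=3)
print(sorted(core))
```

With the paper's setting, the same call would use 15 genome entries and `min_genomes=14`; allowing one genome to miss a family tolerates a single lineage-specific loss or annotation gap.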

Growth of Escherichia coli MG1655 on LB medium: determining metabolic strategy with transcriptional microarrays.

Baev MV, Baev D, Radek AJ, Campbell JW.

Expression profiles of genes related to stress responses, substrate assimilation, acetate metabolism, and biosynthesis were obtained by monitoring growth of Escherichia coli MG1655 in Luria-Bertani (LB) medium with transcriptional microarrays. Superimposing gene expression profiles on a plot of specific growth rate demonstrates that the cells pass through four distinct physiological states during fermentation before entering stationary phase. Each of these states can be characterized by specific patterns of substrate utilization and cellular biosynthesis corresponding to the nutrient status of the medium. These data allow the growth phases of the classical microbial growth curve to be redefined in terms of the physiological states and environmental changes commonly occurring during bacterial growth in batch culture on LB medium.

Appl Microbiol Biotechnol. 2006 Jul;71(3):323-8. Epub 2006 Apr 28

Growth of Escherichia coli MG1655 on LB medium: monitoring utilization of sugars, alcohols, and organic acids with transcriptional microarrays.

Baev MV, Baev D, Radek AJ, Campbell JW.

Microorganisms respond to environmental changes by reprogramming their metabolism primarily through altered patterns of gene expression. DNA microarrays provide a tool for exploiting microorganisms as living sensors of their environment. The potential of DNA microarrays to reflect availability of nutrient components during fermentations on complex media was examined by monitoring global gene expression throughout batch cultivation of Escherichia coli MG1655 on Luria-Bertani (LB) medium. Gene expression profiles group into pathways that clearly demonstrate the metabolic changes occurring in the course of fermentation. Functional analysis of the gene expression related to metabolism of sugars, alcohols, and organic acids revealed that E. coli growing on LB medium switches from a sequential mode of substrate utilization to the simultaneous one in the course of the growth. Maltose and maltodextrins are the first of these substrates to support growth. Utilization of these nutrients associated with the highest growth rate of the culture was followed by simultaneous induction of enzymes involved in assimilation of a large group of other carbon sources including D-mannose, melibiose, D-galactose, L-fucose, L-rhamnose, D-mannitol, amino sugars, trehalose, L-arabinose, glycerol, and lactate. Availability of these nutrients to the cells was monitored by induction of corresponding transport and/or catabolic systems specific for each of the compounds.

Appl Microbiol Biotechnol. 2006 Jul;71(3):310-6. Epub 2006 Apr 21.

Identification of open reading frames unique to a select agent: Ralstonia solanacearum race 3 biovar 2.

Gabriel DW, Allen C, Schell M, Denny TP, Greenberg JT, Duan YP, Flores-Cruz Z, Huang Q, Clifford JM, Presting G, González ET, Reddy J, Elphinstone J, Swanson J, Yao J, Mulholland V, Liu L, Farmerie W, Patnaikuni M, Balogh B, Norman D, Alvarez A, Castillo JA, Jones J, Saddler G, Walunas T, Zhukov A, Mikhailova N.

An 8x draft genome was obtained and annotated for Ralstonia solanacearum race 3 biovar 2 (R3B2) strain UW551, a United States Department of Agriculture Select Agent isolated from geranium. The draft UW551 genome consisted of 80,169 reads resulting in 582 contigs containing 5,925,491 base pairs, with an average 64.5% GC content. Annotation revealed a predicted 4,454 protein coding open reading frames (ORFs), 43 tRNAs, and 5 rRNAs; 2,793 (or 62%) of the ORFs had a functional assignment. The UW551 genome was compared with the published genome of R. solanacearum race 1 biovar 3 tropical tomato strain GMI1000. The two phylogenetically distinct strains were at least 71% syntenic in gene organization. Most genes encoding known pathogenicity determinants, including predicted type III secreted effectors, appeared to be common to both strains. A total of 402 unique UW551 ORFs were identified, none of which had a best hit or >45% amino acid sequence identity with any R. solanacearum predicted protein; 16 had strong (E < 10^-13) best hits to ORFs found in other bacterial plant pathogens. Many of the 402 unique genes were clustered, including 5 found in the hrp region and 38 contiguous, potential prophage genes. Conservation of some UW551 unique genes among R3B2 strains was examined by polymerase chain reaction among a group of 58 strains from different races and biovars, resulting in the identification of genes that may be potentially useful for diagnostic detection and identification of R3B2 strains. One 22-kb region that appears to be present in GMI1000 as a result of horizontal gene transfer is absent from UW551 and encodes enzymes that likely are essential for utilization of the three sugar alcohols that distinguish biovars 3 and 4 from biovars 1 and 2.

Mol Plant Microbe Interact. 2006 Jan;19(1):69-79.
http://dx.doi.org/10.1094/MPMI-19-0069

Comparative genome analysis of Bacillus cereus group genomes with Bacillus subtilis.

Anderson I, Sorokin A, Kapatral V, Reznik G, Bhattacharya A, Mikhailova N, Burd H, Joukov V, Kaznadzey D, Walunas T, D'Souza M, Larsen N, Pusch G, Liolios K, Grechkin Y, Lapidus A, Goltsman E, Chu L, Fonstein M, Ehrlich SD, Overbeek R, Kyrpides N, Ivanova N.

Genome features of the Bacillus cereus group genomes (representative strains of Bacillus cereus, Bacillus anthracis and Bacillus thuringiensis subsp. israelensis) were analyzed and compared with the Bacillus subtilis genome. A core set of 1381 protein families among the four Bacillus genomes, with an additional set of 933 families common to the B. cereus group, was identified. Differences in signal transduction pathways, membrane transporters, cell surface structures, cell wall, and S-layer proteins were identified, suggesting differences in phenotype. The B. cereus group has signal transduction systems including a tyrosine kinase related to two-component system histidine kinases from B. subtilis. A model for regulation of the stress-responsive sigma factor sigmaB in the B. cereus group, different from the well-studied regulation in B. subtilis, has been proposed. Despite a high degree of chromosomal synteny among these genomes, significant differences in cell wall and spore coat proteins that contribute to survival and adaptation in specific hosts have been identified.

FEMS Microbiol Lett. 2005 Sep 15;250(2):175-84.
DOI: http://dx.doi.org/10.1016/j.femsle.2005.07.008

The Wolbachia genome of Brugia malayi: endosymbiont evolution within a human pathogenic nematode.

Foster J, Ganatra M, Kamal I, Ware J, Makarova K, Ivanova N, Bhattacharyya A, Kapatral V, Kumar S, Posfai J, Vincze T, Ingram J, Moran L, Lapidus A, Omelchenko M, Kyrpides N, Ghedin E, Wang S, Goltsman E, Joukov V, Ostrovskaya O, Tsukerman K, Mazur M, Comb D, Koonin E, Slatko B.

Complete genome DNA sequence and analysis is presented for Wolbachia, the obligate alpha-proteobacterial endosymbiont required for fertility and survival of the human filarial parasitic nematode Brugia malayi. Although, quantitatively, the genome is even more degraded than those of closely related Rickettsia species, Wolbachia has retained more intact metabolic pathways. The ability to provide riboflavin, flavin adenine dinucleotide, heme, and nucleotides is likely to be Wolbachia's principal contribution to the mutualistic relationship, whereas the host nematode likely supplies amino acids required for Wolbachia growth. Genome comparison of the Wolbachia endosymbiont of B. malayi (wBm) with the Wolbachia endosymbiont of Drosophila melanogaster (wMel) shows that they share similar metabolic trends, although their genomes show a high degree of genome shuffling. In contrast to wMel, wBm contains no prophage and has a reduced level of repeated DNA. Both Wolbachia have lost a considerable number of membrane biogenesis genes that apparently make them unable to synthesize lipid A, the usual component of proteobacterial membranes. However, differences in their peptidoglycan structures may reflect the mutualistic lifestyle of wBm in contrast to the parasitic lifestyle of wMel. The smaller genome size of wBm, relative to wMel, may reflect the loss of genes required for infecting host cells and avoiding host defense systems. Analysis of this first sequenced endosymbiont genome from a filarial nematode provides insight into endosymbiont evolution and additionally provides new potential targets for elimination of cutaneous and lymphatic human filarial disease.

PLoS Biol. 2005 Apr; 3(4): e121.
Published online 2005 Mar 29. doi:  10.1371/journal.pbio.0030121

Gene array analysis of Yersinia enterocolitica FlhD and FlhC: regulation of enzymes affecting synthesis and degradation of carbamoylphosphate.

Kapatral V, Campbell JW, Minnich SA, Thomson NR, Matsumura P, Prüss BM.

This paper focuses on global gene regulation by FlhD/FlhC in enteric bacteria. Even though Yersinia enterocolitica FlhD/FlhC can complement an Escherichia coli flhDC mutant for motility, it is not known if the Y. enterocolitica FlhD/FlhC complex has an effect on metabolism similar to that in E. coli. To study metabolic gene regulation, a partial Yersinia enterocolitica 8081c microarray was constructed and the expression patterns of wild-type cells were compared to an flhDC mutant strain at 25 and 37 degrees C. The overlap between the E. coli and Y. enterocolitica FlhD/FlhC regulated genes was 25%. Genes that were regulated at least fivefold by FlhD/FlhC in Y. enterocolitica are genes encoding urocanate hydratase (hutU), imidazolone propionase (hutI), carbamoylphosphate synthetase (carAB) and aspartate carbamoyltransferase (pyrBI). These enzymes are part of a pathway that is involved in the degradation of L-histidine to L-glutamate and eventually leads into purine/pyrimidine biosynthesis via carbamoylphosphate and carbamoylaspartate. A number of other genes were regulated at a lower rate. In two additional experiments, the expression of wild-type cells grown at 4 or 25 degrees C was compared to the same strain grown at 37 degrees C. The expression of the flagella master operon flhD was not affected by temperature, whereas the flagella-specific sigma factor fliA was highly expressed at 25 degrees C and reduced at 4 and 37 degrees C. Several other flagella genes, all of which are under the control of FliA, exhibited a similar temperature profile. These data are consistent with the hypothesis that temperature regulation of flagella genes might be mediated by the flagella-specific sigma factor FliA and not the flagella master regulator FlhD/FlhC.

Microbiology. 2004 Jul;150(Pt 7):2289-300.

Genome of Methanocaldococcus (Methanococcus) jannaschii.

Graham DE, Kyrpides N, Anderson IJ, Overbeek R, Whitman WB.

Methanocaldococcus (Methanococcus) jannaschii strain JAL-1 is a hyperthermophilic methanogenic archaeon that was isolated from surface material collected at a “white smoker” chimney at a depth of 2600 m in the East Pacific Rise near the western coast of Mexico. Cells are irregular cocci possessing polar bundles of flagella. The cell envelope is composed of a cytoplasmic membrane and a protein surface layer. Similar isolates have been obtained from hydrothermally active sediments in the Guaymas Basin and the Mid-Atlantic Ridge, and related species have been found at other marine hydrothermal vents. Because these hyperthermophilic species are very different from the mesophilic methanococci, they have been reclassified into a new family, Methanocaldococcaceae, and two new genera, Methanocaldococcus and Methanotorris. The characteristics of the source material for these isolates suggest that they possess adaptations for growth at high temperature and pressure as well as moderate salinity.

Methods Enzymol. 2001;330:40-123. doi:10.1016/S0076-6879(01)30370-1

Aerobic tryptophan degradation pathway in bacteria: novel kynurenine formamidase.

Kurnasov O, Jablonski L, Polanuyer B, Dorrestein P, Begley T, Osterman A.

While a variety of chemical transformations related to the aerobic degradation of L-tryptophan (kynurenine pathway), and most of the genes and corresponding enzymes involved therein, have been predominantly characterized in eukaryotes, relatively little was known about this pathway in bacteria. Using genome comparative analysis techniques, we have predicted the existence of the three-step pathway of aerobic L-tryptophan degradation to anthranilate (anthranilate pathway) in several bacteria. Based on the chromosomal gene clustering analysis, we have identified a previously unknown gene encoding kynurenine formamidase (EC 3.5.1.19), which is involved in the second step of the anthranilate pathway. This functional prediction was experimentally verified by cloning, expression and enzymatic characterization of recombinant kynurenine formamidase orthologs from Bacillus cereus, Pseudomonas aeruginosa and Ralstonia metallidurans. Experimental verification of the inferred anthranilate pathway was achieved by functional expression in Escherichia coli of the R. metallidurans putative kynBAU operon encoding three required enzymes: tryptophan 2,3-dioxygenase (gene kynA), kynurenine formamidase (gene kynB), and kynureninase (gene kynU). Our data provide the first experimental evidence of the connection between these genes (only one of which, kynU, was previously characterized) and the L-tryptophan aerobic degradation pathway in bacteria.

FEMS Microbiol Lett. 2003 Oct 24;227(2):219-27.

Experimental determination and system level analysis of essential genes in Escherichia coli MG1655.

Gerdes SY, Scholle MD, Campbell JW, Balázsi G, Ravasz E, Daugherty MD, Somera AL, Kyrpides NC, Anderson I, Gelfand MS, Bhattacharya A, Kapatral V, D'Souza M, Baev MV, Grechkin Y, Mseeh F, Fonstein MY, Overbeek R, Barabási AL, Oltvai ZN, Osterman AL.

Defining the gene products that play an essential role in an organism's functional repertoire is vital to understanding the system level organization of living cells. We used a genetic footprinting technique for a genome-wide assessment of genes required for robust aerobic growth of Escherichia coli in rich media. We identified 620 genes as essential and 3,126 genes as dispensable for growth under these conditions. Functional context analysis of these data allows individual functional assignments to be refined. Evolutionary context analysis demonstrates a significant tendency of essential E. coli genes to be preserved throughout the bacterial kingdom. Projection of these data over metabolic subsystems reveals topologic modules with essential and evolutionarily preserved enzymes with reduced capacity for error tolerance.

J Bacteriol. 2003 Oct;185(19):5673-84.

Missing genes in metabolic pathways: a comparative genomics approach.

Osterman A, Overbeek R.

The new techniques of genome context analysis--chromosomal gene clustering, protein fusions, occurrence profiles and shared regulatory sites--infer functional coupling between genes. In combination with metabolic reconstructions, these techniques can dramatically accelerate the pace of gene discovery.

Curr Opin Chem Biol. 2003 Apr;7(2):238-51.
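
One of the genome context techniques named in the abstract above, occurrence profiles, can be sketched in a few lines. This is an illustrative sketch only: the Jaccard score, the threshold, the gene names, and the toy profiles are assumptions made for the example and are not the method or data of the review.

```python
def profile_similarity(a, b):
    """Jaccard similarity between two presence/absence profiles.

    Each profile is a sequence of 0/1 flags, one per genome, in the
    same genome order (hypothetical input format).
    """
    both = sum(1 for x, y in zip(a, b) if x and y)
    either = sum(1 for x, y in zip(a, b) if x or y)
    return both / either if either else 0.0

def coupled_pairs(profiles, threshold):
    """List gene pairs whose occurrence profiles score >= threshold."""
    names = sorted(profiles)
    return [
        (g, h)
        for i, g in enumerate(names)
        for h in names[i + 1:]
        if profile_similarity(profiles[g], profiles[h]) >= threshold
    ]

# Toy profiles over six genomes (illustrative only).
profiles = {
    "geneX": (1, 1, 0, 1, 0, 1),
    "geneY": (1, 1, 0, 1, 0, 1),
    "geneZ": (0, 0, 1, 0, 1, 0),
}

print(coupled_pairs(profiles, threshold=0.8))
```

Genes that are gained and lost together across genomes score close to 1, which is the signal used to suggest functional coupling; a real analysis would also need to correct for phylogenetic relatedness among the genomes.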

The ERGO genome analysis and discovery system.

Overbeek R, Larsen N, Walunas T, D'Souza M, Pusch G, Selkov E Jr, Liolios K, Joukov V, Kaznadzey D, Anderson I, Bhattacharyya A, Burd H, Gardner W, Hanke P, Kapatral V, Mikhailova N, Vasieva O, Osterman A, Vonstein V, Fonstein M, Ivanova N, Kyrpides N.

The ERGO (http://ergo.integratedgenomics.com/ERGO/) genome analysis and discovery suite is an integration of biological data from genomics, biochemistry, high-throughput expression profiling, genetics and peer-reviewed journals to achieve a comprehensive analysis of genes and genomes. Far beyond any conventional systems that facilitate functional assignments, ERGO combines pattern-based analysis with comparative genomics by visualizing genes within the context of regulation, expression profiling, phylogenetic clusters, fusion events, networked cellular pathways and chromosomal neighborhoods of other functionally related genes. The result of this multifaceted approach is to provide an extensively curated database of the largest available integration of genomes, with a vast collection of reconstructed cellular pathways spanning all domains of life. Although access to ERGO is provided only under subscription, it is already widely used by the academic community. The current version of the system integrates 500 genomes from all domains of life in various levels of completion, 403 of which are available for subscription.

Nucleic Acids Res. 2003 Jan 1;31(1):164-71.

FlhD/FlhC is a regulator of anaerobic respiration and the Entner-Doudoroff pathway through induction of the methyl-accepting chemotaxis protein Aer.

Prüss BM, Campbell JW, Van Dyk TK, Zhu C, Kogan Y, Matsumura P.

The regulation by two transcriptional activators of flagellar expression (FlhD and FlhC) and the chemotaxis methyl-accepting protein Aer was studied with glass slide DNA microarrays. An flhD::Kan insertion and an aer deletion were independently introduced into two Escherichia coli K-12 strains, and the effects upon gene regulation were investigated. Altogether, the flhD::Kan insertion altered the expression of 29 operons of known function. Among them was Aer, which in turn regulated a subset of these operons, namely, the ones involved in anaerobic respiration and the Entner-Doudoroff pathway. In addition, FlhD/FlhC repressed enzymes involved in aerobic respiration and regulated many other metabolic enzymes and transporters in an Aer-independent manner. Expression of 12 genes of uncharacterized function was also affected. FlhD increased gltBD, gcvTHP, and ompT expression. The regulation of half of these genes was subsequently confirmed with reporter gene fusions, enzyme assays, and real-time PCR. Growth phenotypes of flhD and flhC mutants were determined with Phenotype MicroArrays and correlated with gene expression.

J Bacteriol. 2003 Jan; 185(2): 534–543.
doi:  10.1128/JB.185.2.534-543.2003

Ribosylnicotinamide kinase domain of NadR protein: identification and implications in NAD biosynthesis.

Kurnasov OV, Polanuyer BM, Ananta S, Sloutsky R, Tam A, Gerdes SY, Osterman AL.

NAD is an indispensable redox cofactor in all organisms. Most of the genes required for NAD biosynthesis in various species are known. Ribosylnicotinamide kinase (RNK) was among the few unknown (missing) genes involved with NAD salvage and recycling pathways. Using a comparative genome analysis involving reconstruction of NAD metabolism from genomic data, we predicted and experimentally verified that bacterial RNK is encoded within the 3' region of the nadR gene. Based on these results and previous data, the full-size multifunctional NadR protein (as in Escherichia coli) is composed of (i) an N-terminal DNA-binding domain involved in the transcriptional regulation of NAD biosynthesis, (ii) a central nicotinamide mononucleotide adenylyltransferase (NMNAT) domain, and (iii) a C-terminal RNK domain. The RNK and NMNAT enzymatic activities of recombinant NadR proteins from Salmonella enterica serovar Typhimurium and Haemophilus influenzae were quantitatively characterized. We propose a model for the complete salvage pathway from exogenous N-ribosylnicotinamide to NAD which involves the concerted action of the PnuC transporter and RNK, followed by the NMNAT activity of the NadR protein. Both the pnuC and nadR genes were proven to be essential for the growth and survival of H. influenzae, thus implicating them as potential narrow-spectrum drug targets.

J Bacteriol. 2002 Dec;184(24):6906-17.

Bioinformatics classification and functional analysis of PhoH homologs.

Kazakov AE, Vassieva O, Gelfand MS, Osterman A, Overbeek R.

PhoH protein is a putative ATPase belonging to the phosphate regulon in Escherichia coli. EC-PhoH homologs are present in different organisms, but it is not clear whether they are functionally related, and nothing is known about their regulation. To distinguish true functional orthologs of EC-PhoH in different classes of bacteria and to identify their functional role in the bacterial metabolic network, we performed a phylogenetic analysis of these proteins and a comparative study of the position and regulation of the related genes. Three groups of proteins were identified. Proteins of the first group (BS-PhoH orthologs) are present in most bacteria and are proposed to be functionally linked to phospholipid metabolism and RNA modification. Proteins of the second group (BS-YlaK orthologs) are present in most aerobes, and actinobacterial YlaK orthologs are shown to be members of fatty acid beta-oxidation regulons. EC-PhoH orthologs are classified in a third group, specific to Enterobacteria. The functional role of PhoH homologs in lipid and RNA metabolism and the proposed interrelation of PhoH paralogs in one organism are discussed.

In Silico Biol. 2003;3(1-2):3-15. Epub 2002 Dec 30

Genes for the cytoskeletal protein tubulin in the bacterial genus Prosthecobacter.

Jenkins C, Samudrala R, Anderson I, Hedlund BP, Petroni G, Michailova N, Pinel N, Overbeek R, Rosati G, Staley JT.

Tubulins, the protein constituents of the microtubule cytoskeleton, are present in all known eukaryotes but have never been found in the Bacteria or Archaea. Here we report the presence of two tubulin-like genes [bacterial tubulin a (btuba) and bacterial tubulin b (btubb)] in bacteria of the genus Prosthecobacter (Division Verrucomicrobia). In this study, we investigated the organization and expression of these genes and conducted a comparative analysis of the bacterial and eukaryotic protein sequences, focusing on their phylogeny and 3D structures. The btuba and btubb genes are arranged as adjacent loci within the genome along with a kinesin light chain gene homolog. RT-PCR experiments indicate that these three genes are cotranscribed, and a probable promoter was identified upstream of btuba. On the basis of comparative modeling data, we predict that the Prosthecobacter tubulins are monomeric (unlike eukaryotic alpha and beta tubulins, which form dimers) and are therefore unlikely to form microtubule-like structures. Phylogenetic analyses indicate that the Prosthecobacter tubulins are quite divergent and do not support recent horizontal transfer of the genes from a eukaryote. The discovery of genes for tubulin in a bacterial genus may offer new insights into the evolution of the cytoskeleton.

Proc Natl Acad Sci U S A. 2002 Dec 24; 99(26): 17049–17054.
Published online 2002 Dec 16. doi:  10.1073/pnas.012516899

Draft Sequencing and Comparative Genomics of Xylella fastidiosa Strains Reveal Novel Biological Insights.

Anamitra Bhattacharyya, Stephanie Stilwagen, Gary Reznik, Helene Feil, William S. Feil, Iain Anderson, Axel Bernal, Mark D'Souza, Natalia Ivanova, Vinayak Kapatral, Niels Larsen, Tamara Los, Athanasios Lykidis, Eugene Selkov, Jr., Theresa L. Walunas, Alexander Purcell, Rob A. Edwards, Trevor Hawkins, Robert Haselkorn, Ross Overbeek, Nikos C. Kyrpides, and Paul F. Predki

Draft sequencing is a rapid and efficient method for determining the near-complete sequence of microbial genomes. Here we report a comparative analysis of one complete and two draft genome sequences of the phytopathogenic bacterium, Xylella fastidiosa, which causes serious disease in plants, including citrus, almond, and oleander. We present highlights of an in silico analysis based on a comparison of reconstructions of core biological subsystems. Cellular pathway reconstructions have been used to identify a small number of genes, which are likely to reside within the draft genomes but are not captured in the draft assembly. These represented only a small fraction of all genes and were predominantly large and small ribosomal subunit protein components. By using this approach, some of the inherent limitations of draft sequence can be significantly reduced. Despite the incomplete nature of the draft genomes, it is possible to identify several phage-related genes, which appear to be absent from the draft genomes and not the result of insufficient sequence sampling. This region may therefore identify potential host-specific functions. Based on this first functional reconstruction of a phytopathogenic microbe, we spotlight an unusual respiration machinery as a potential target for biological control. We also predicted and developed a new defined growth medium for Xylella.

Genome Res. 2002 Oct; 12(10): 1556–1563.
doi:  10.1101/gr.370702

The genome of Brucella melitensis.

DelVecchio VG, Kapatral V, Elzer P, Patra G, Mujer CV.

The genome of Brucella melitensis strain 16M was sequenced and contained 3,294,931 bp distributed over two circular chromosomes. Chromosome I was composed of 2,117,144 bp and chromosome II of 1,177,787 bp. A total of 3,198 ORFs were predicted. The origins of replication of the chromosomes are similar to each other and to those of other alpha-proteobacteria. Housekeeping genes such as those involved in DNA replication, protein synthesis, core metabolism, and cell-wall biosynthesis were found on both chromosomes. Genes encoding adhesins, invasins, and hemolysins were also identified.

Vet Microbiol. 2002 Dec 20;90(1-4):587-92.
doi:10.1016/S0378-1135(02)00238-9
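
The genome figures quoted in the abstract above can be cross-checked with simple arithmetic. The sketch below uses only the numbers stated in the abstract; the variable names and the derived quantity (mean base pairs per predicted ORF) are illustrative additions, not values reported by the authors.

```python
# Figures as stated in the abstract (base pairs and ORF count).
chromosome_sizes_bp = {"chromosome I": 2_117_144, "chromosome II": 1_177_787}
reported_genome_bp = 3_294_931
predicted_orfs = 3_198

# The two chromosome sizes should add up to the reported genome size.
total_bp = sum(chromosome_sizes_bp.values())
sizes_consistent = total_bp == reported_genome_bp

# Derived, illustrative quantity: average genome span per predicted ORF.
mean_bp_per_orf = total_bp / predicted_orfs

print(f"sum of chromosomes: {total_bp:,} bp (consistent: {sizes_consistent})")
print(f"mean span per predicted ORF: {mean_bp_per_orf:.0f} bp")
```

The same pattern (sum the parts, compare with the reported total) is a quick sanity check for any multi-replicon genome summary.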