Cookies on this website

We use cookies to ensure that we give you the best experience on our website. If you click 'Accept all cookies' we'll assume that you are happy to receive all cookies and you won't see this message again. If you click 'Reject all non-essential cookies' only necessary cookies providing core functionality such as security, network management, and accessibility will be enabled. Click 'Find out more' for information on how to change your cookie settings.

The 4,639,221-base pair sequence of Escherichia coli K-12 is presented. Of 4288 protein-coding genes annotated, 38 percent have no attributed function. Comparison with five other sequenced microbes reveals ubiquitous as well as narrowly distributed gene families; many families of similar genes within E. coli are also evident. The largest family of paralogous proteins contains 80 ABC transporters. The genome as a whole is strikingly organized with respect to the local direction of replication; guanines, oligonucleotides possibly related to replication and recombination, and most genes are so oriented. The genome also contains insertion sequence (IS) elements, phage remnants, and many other patches of unusual composition indicating genome plasticity through horizontal transfer.

More information Original publication

DOI

10.1126/science.277.5331.1453

Type

Journal article

Publication Date

1997-09-05T00:00:00+00:00

Volume

277

Pages

1453 - 1462

Total pages

9

Keywords

Bacterial Proteins, Bacteriophage lambda, Base Composition, Binding Sites, Chromosome Mapping, DNA Replication, DNA Transposable Elements, DNA, Bacterial, Escherichia coli, Genes, Bacterial, Genome, Bacterial, Molecular Sequence Data, Mutation, Operon, RNA, Bacterial, RNA, Transfer, Recombination, Genetic, Regulatory Sequences, Nucleic Acid, Repetitive Sequences, Nucleic Acid, Sequence Analysis, DNA, Sequence Homology, Amino Acid