Jim's Discussion Notes - Classification

(Campbell, Ch.25 (subset) and Johnson/Raven, Ch.20)

 

*See also notes on Evolutionary History of Life on Earth.

 

Introduction

·        Classification and Taxonomy are parts of a process for organizing and understanding the relationships of living things.

·        Systematics is the study of biological diversity in an evolutionary context.

·        Part of the scope of systematics is the development of phylogeny, the evolutionary history of a species or group of related species.

 

 

 

1. Taxonomy employs a hierarchical system of classification

·        The Linnean system, first formally proposed by Linneaus in Systema naturae in the 18th century, has two main characteristics.

·        Each species has a two-part name.

·        Species are organized hierarchically into broader and broader groups of organisms.

·        Under the binomial system, each species is assigned a two-part latinized name, a binomial.

·        The first part, the genus, is the closest group to which a species belongs.

·        The second part, the specific epithet, refers to one species within each genus.

·        The first letter of the genus is capitalized and both names are italicized and latinized.

·        For example, Linnaeus assigned to humans the scientific name Homo sapiens, which means “wise man,” perhaps in a show of optimism.

·        A hierachical classification groups species into broader taxonomic categories.

·        Species that appear to be closely related are grouped into the same genus.

·        For example, the leopard, Panthera pardus, belongs to a genus that includes the African lion (Panthera leo) and the tiger (Panthera tigris).

·        Biology’s taxonomic scheme formalizes our tendency to group related objects.

·        Genera are grouped into progressively broader categories: family, order, class, phylum, kingdom and domain.

·        Each taxonomic level is more comprehensive than the previous one.

·        As an example, all species of cats are mammals, but not all mammals are cats.

·        The named taxonomic unit at any level is called a taxon.

·        Example: Pinus is a taxon at the genus level, the generic name for various species of pine trees.

·        Mammalia, a taxon at the class level, includes all the many orders of mammals.

·        Phylogenetic trees reflect the hierarchical classification of taxonomic groups nested within more inclusive groups.

 

2. Modern phylogenetic systematics is based on cladistic analysis

·        A phylogeny is determined by a variety of evidence including fossils, molecular data, anatomy, and other features.

·        Most systematists use cladistic analysis, developed by a German  entomologist Willi Hennig to analyze the data

·        A phylogenetic diagram or cladogram is constructed from a series of dichotomies.

·        These dichotomous branching diagrams can include more taxa.

·        The sequence of branching symbolizes historical chronology.

·        The last ancestor common to both the cat and dog families lived longer ago than the last common ancestor shared by leopards and domestic cats.

·        Each branch or clade can be nested within larger clades.

·        A clade consists of an ancestral species and all its descendents, a monophyletic group.

·        Groups that do not fit this definition are unacceptable in cladistics.

·        Determining which similarities between species are relevant to grouping the species in a clade is a challenge.

·        It is especially important to distinguish similarities that are based on shared ancestry or homology from those that are based on convergent evolution or analogy.

·        These two desert plants are not closely related but owe their resemblance to analogous adaptations.

·        As a general rule, the more homologous parts that two species share, the more closely related they are.

·        Adaptation can obscure homology and convergence can create misleading analogies.

·        Also, the more complex two structures are, the less likely that they evolved independently.

·        For example, the skulls of a human and chimpanzee are composed not of a single bone, but a fusion of multiple bones that match almost perfectly.

·        It is highly improbable that such complex structures matching in so many details could have separate origins.

·        For example, the forelimbs of bats and birds are analogous adaptations for flight because the fossil record shows that both evolved independently from the walking forelimbs of different ancestors.

·        Their common specializations for flight are convergent, not indications of recent common ancestry.

·        The presence of forelimbs in both birds and bats is homologous, though at a higher level of the cladogram, at the level of tetrapods.

·        The question of homology versus analogy often depends on the level of the clade that is being examined.

·        Systematists must sort through homologous features or characters to separate shared derived characters from shared primitive characters.

·        A shared derived character is unique to a particular clade.

·        A shared primitive character is found not only in the clade being analyzed, but older clades too.

·        Shared derived characters are useful in establishing a phylogeny, but shared primitive characters are not.

·        For example, the presence of hair is a good character to distinguish the clade of mammals from other tetrapods.

·        It is a shared derived character that uniquely identifies mammals.

·        However, the presence of a backbone can qualify as a shared derived character, but at a deeper branch point that distinguishes all vertebrates from other mammals.

·        Among vertebrates, the backbone is a shared primitive character because it evolved in the ancestor common to all vertebrates.

·        Shared derived characters are useful in establishing a phylogeny, but shared primitive characters are not.

·        The status of a character as analogous versus homologous or shared versus primitive may depend on the level at which the analysis is being performed.

 

·        A key step in cladistic analysis is outgroup comparison which is used to differentiate shared primitive characters from shared derived ones.

·        To do this we need to identify an outgroup:

·        a species or group of species that is closely related to the species that we are studying,

·        but known to be less closely related than any study-group members are to each other.

·        To study the relationships among five vertebrates (the ingroup)—a leopard, a turtle, a salamander, a tuna, and a lamprey—on a cladogram, then an animal called the lancet would be a good choice.

·        The lancet is closely related to the most primitive vertebrates based on other evidence and other lines of analysis.

·        These other analyses also show that the lancet is not more closely related to any of the ingroup taxa.

·        In an outgroup analysis, the assumption is that any homologies shared by the ingroup and outgroup must be primitive characters already present in the ancestor common to both groups.

·        Homologies present in some or all of the ingroup taxa must have evolved after the divergence of the ingroup and outgroup taxa.

·        In our example, a notochord, present in lancets and in the embryos of the ingroup, would be a shared primitive character and not useful.

·        The presence of a vertebral column, shared by all members of the ingroup but not the outgroup, is a useful character for the whole ingroup.

·        Similarly, the presence of jaws, absent in lampreys and present in the other ingroup taxa, helps to identify the earliest branch in the vertebrate cladogram.

·        Analyzing the taxonomic distribution of homologies enables us to identify the sequence in which derived characters evolved during vertebrate phylogeny.

·        A cladogram presents the chronological sequence of branching during the evolutionary history of a set of organisms.

·        However, this chronology does not indicate the time of origin of the species that we are comparing, only the groups to which they belong.

·        For example, a particular species in an old group may have evolved more recently than a second species that belongs to a newer group.

·        Systematists can use cladograms to place species in the taxonomic hierarchy.

·        For example, using turtles as the outgroup, we can assign increasingly exclusive clades to finer levels of the hierarchy of taxa.

·        However, some systematists argue that the hierarchical system is antiquated because such a classification must be rearranged when a cladogram is revised based on new evidence.

·        These systematists propose replacing the Linneaen system with a strictly cladistic classification called phylocode that drops the hierarchical tags, such as class, order, and family.

·        So far, biologists still prefer a hierachical system of taxonomic levels as a more useful way of organizing the diversity of life.

 

3. Systematists can infer phylogeny from molecular data

·        The application of molecular methods and data for comparing species and tracing phylogenies has accelerated revision of taxonomic trees.

·        If homology reflects common ancestry, then comparing genes and proteins among organisms should provide insights into their evolutionary relationships.

·        The more recently two species have branched from a common ancestor, the more similar their DNA and amino acid sequences should be.

·        These data for many species are available via the internet.

·        Molecular systematics makes it possible to assess phylogenetic relationships that cannot be measured by comparative anatomy and other non-molecular methods.

·        This includes groups that are too closely related to have accumulated much morphological divergence.

·        At the other extreme, some groups (e.g., fungi, animals, and plants) have diverged so much that little morphological homology remains.

 

·        Most molecular systematics is based on a comparison of nucleotide sequences in DNA or RNA.

·        Each nucleotide position along a stretch of DNA represents an inherited character as one of the four DNA bases: A (adenine), G (guanine), C (cytosine), and T (thymine).

·        Systematists may compare hundreds or thousands of adjacent nucleotide positions from several DNA regions to assess the relationship between two species.

·        This DNA sequence analysis provides a quantitative tool for constructing cladograms with branch points defined by mutations in DNA sequence.

·        The rates of change in DNA sequences vary from one part of the genome to another.

·        Some regions (e.g., rRNA) that change relatively slowly are useful in investigating relationships between taxa that diverged hundreds of millions of years ago.

·        Other regions (e.g., mtDNA) evolve relatively rapidly and can be employed to assess the phylogeny of species that are closely related or even populations of the same species.

·        The first step in DNA comparisons is to align homologous DNA sequences for the species we are comparing.

·        Two closely related species may differ only in which base is present at a few sites.

·        Less closely related species may not only differ in bases at many sites, but there may be insertions and deletions that alter the length of genes

·        This creates problems for establishing homology.

 

4. The principle of parsimony helps systematists reconstruct phylogeny

·        The process of converting data into phylogenetic trees can be a daunting problem.

·        If we wish to determine the relationships among four species or taxa, we would need to choose among several potential trees.

·        As we consider more and more taxa, the number of possible trees increases dramatically.

·        There are about 3 x 1076 possible phylogenetic trees for a group of 50 species.

·        Even computer analyses of these data sets can take too long to search for the tree that best fits the DNA data.

·        Systematists use the principle of parsimony to choose among the many possible trees to find the tree that best fits the data.

·        The principle of parsimony (“Occam’s Razor”) states that a theory about nature should be the simplest explanation that is consistent with the facts.

·        This minimalist approach to problem solving has been attributed to William of Occam, a 14th century English philosopher.

·        In phylogenetic analysis, parsimony is used to justify the choice of a tree that represents the smallest number of evolutionary changes.

·        As an example, if we wanted to use the DNA sequences from seven sites to determine the most parsimonious arrangement of four species, we would begin by tabulating the sequence data.

·        Then, we would draw all possible phylogenies for the four species, including the three shown here.

·        We would trace the number of events (mutations) necessary on each tree to produce the data in our DNA table.

·        After all the DNA sites have been added to each tree we add up the total events for each tree and determine which tree required the fewest changes, the most  parsimonious tree.

 

5. Phylogenetic trees are hypotheses

·        The rationale for using parsimony as a guide to our choice among many possible trees is that for any species’ characters, hereditary fidelity is more common than change.

·        At the molecular level, point mutations do occasionally change a base within a DNA sequence, but exact transmission from generation to generation is thousands of times more common than change.

·        Similarly, one could construct a primitive phylogeny that places humans and apes as distant clades but this would assume an unnecessarily complicated scenario.

·        A cladogram that is not the most parsimonious would assume an unnecessarily complicated scenario, rather than the simplest explanation.

·        Given a choice of possible trees we can draw for a set of species or higher taxa, the best hypothesis is the one that is the best fit for all the available data.

·        In the absence of conflicting information, the most parsimonious tree is the logical choice among alternative hypotheses.

·        A limited character set may lead to acceptance of a tree that is most parsimonious, but that is also wrong.

·        Therefore, it is always important to remember that any phylogenetic diagram is a hypothesis, subject to rejection or revision as more character data are available.

·        For example, based on the number of heart chambers alone, birds and mammals, both with four chambers, appear to be more closely related to each other than lizards with three chambers.

·        But abundant evidence indicated that birds and mammals evolved from different reptilian ancestors.

·        The four-chambered hearts are analogous, not homologous, leading to a misleading cladogram.

·        Regardless of the source of data (DNA sequence, morphology, etc.), the most reliable trees are based on the largest data base.

·        Occasionally misjudging an analogous similarity in morphology or gene sequence as a shared derived homology is less likely to distort a phylogenetic tree if each clade in the tree is defined by several derived characters.

·        The strongest phylogenetic hypotheses are supported by both the morphological and molecular evidence.

 

6. Molecular clocks may keep track of evolutionary time

·        The timing of evolutionary events has rested primarily on the fossil record.

·        Recently, molecular clocks have been applied to place the origin of taxonomic groups in time.

·        Molecular clocks are based on the observation that some regions of genomes evolve at constant rates.

·        For these regions, the number of nucleotide and amino acid substitutions between two lineages is proportional to the time that has elapsed since they branched.

·        For example, the homologous proteins of bats and dolphins are much more alike than are those of sharks and tuna.

·        This is consistent with the fossil evidence that sharks and tuna have been on separate evolutionary paths much longer than bats and dolphins.

·        In this case, molecular divergence has kept better track of time than have changes in morphology.

·        Proportional differences in DNA sequences can be applied to access the relative chronology of branching in phylogeny, but adjustments for absolute time must be viewed with some caution.

·        No genes mark time with a precise tick-tock accuracy in the rate of base changes.

·        Genes that make good molecular clocks have fairly smooth average rates of change.

·        Over time there may be chance deviations above and below the average rate.

·        Each molecular clock must be calibrated in actual time.

·        Typically, one graphs the number of amino acid or nucleotide differences against the times for a series of evolutionary events known from the fossil record.

·        The slope of the best line through these points represents the evolution rate of that molecular clock.

·        This rate can be used to estimate the absolute date of evolutionary events that have no fossil record.

 

·        The molecular clock approach assumes that much of the change in DNA sequences is due to genetic drift and is selectively neutral.

·        If certain DNA changes were favored by natural selection, then the rate would probably be too irregular to mark time accurately.

·        Also, some biologists are skeptical of conclusions derived from molecular clocks that have been extrapolated to time spans beyond the calibration in the fossil record.

·        The molecular clock approach has been used to date the jump of the HIV virus from related SIV viruses that infect chimpanzees and other primates to humans.

·        Investigators calibrated their molecular clock by comparing DNA sequences in a specific HIV gene from patients sampled at different times.

·        From their analysis, they project that the HIV-1M strain invaded humans in the 1930s.

 

7. Modern systematics is flourishing with lively debate

·        Systematics is thriving at the interface of modern evolutionary biology and taxonomic theory.

·        The development of cladistics provides a more objective method for comparing morphology and developing phylogenetic hypotheses.

·        Cladistic analysis of morphological and molecular characters, complemented by a revival in paleontology and comparative biology, has brought us closer to an understanding of the history of life on Earth.

·        For example, the fossil record, comparative anatomy, and molecular comparisons all concur that crocodiles are more closely related to birds than to lizards and snakes.

·        In other cases, molecular data present a different picture than other approaches.

·        For example, fossil evidence dates the origin of the orders of mammals at about 60 million years ago, but molecular clock analyses place their origin to 100 million years ago.

·        In one camp are those who place more weight in the fossil evidence and express doubts about the reliability of the molecular clocks.

·        In the other camp are those who argue that paleontologists have not yet documented an earlier origin for most mammalian orders because the fossil record is incomplete.

·        Between these two extremes is a phylogenetic fuse hypothesis.

·        This hypothesis proposes that the modern mammalian orders originated about 100 million years ago.