BAC libraries and comparative genomics of aquatic chordate species

BAC libraries and comparative genomics of aquatic chordate species

Comparative Biochemistry and Physiology, Part C 138 (2004) 233 – 244 www.elsevier.com/locate/cbpc Review BAC libraries and comparative genomics of a...

661KB Sizes 24 Downloads 148 Views

Comparative Biochemistry and Physiology, Part C 138 (2004) 233 – 244 www.elsevier.com/locate/cbpc

Review

BAC libraries and comparative genomics of aquatic chordate species

$

Tsutomu Miyake*, Chris T. Amemiya Molecular Genetics Department, Benaroya Research Institute at Virginia Mason, 1201 Ninth Avenue, Seattle, WA 98101, United States Received 25 March 2004; received in revised form 9 July 2004; accepted 14 July 2004

Abstract The bacterial artificial chromosome (BAC) system is useful for creating a representation of the genomes of target species. The system is advantageous in that it can accommodate exogenous inserts that are very large (N100 kilobases, kb), thereby allowing entire eukaryotic genes (including flanking regulatory regions) to be encompassed in a single clone. The interest in BACs has recently been spawned by vast improvements in high throughput genomic sequencing such that comparisons of orthologous regions from different genomes (comparative genomics) are being routinely investigated, and comprise a significant component, of all major sequencing centers. In this review, we discuss the general principles of BAC cloning, the resources that are currently available, and some of the applications of the technology. It is not intended to be an exhaustive treatise; rather our goal is to provide a primer of the BAC technology in order to make readers aware of these resources and how they may utilize them in their own research programs. D 2004 Elsevier Inc. All rights reserved. Keywords: BAC libraries; Comparative genomics; Aquatic chordate species

Contents 1. What are BAC libraries? . . . . . . . . . . . . . . 2. Who sponsors BAC library construction? . . . . . 3. What BAC libraries are currently available? . . . . 4. Utility of BAC libraries and comparative genomics 5. Summary . . . . . . . . . . . . . . . . . . . . . . Acknowledgements . . . . . . . . . . . . . . . . . . . References . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

1. What are BAC libraries? Bacterial artificial chromosomes are based on plasmid vectors that are essentially composed of an F-factor origin of replication with a chloramphenicol resistance gene (Fig. 1) (Shizuya et al., 1992; Osoegawa et al., 1998; Amemiya et

. . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

233 235 236 236 241 241 241

al., 1999). The F-factor replicon allows propagation of the bacterial plasmid as a single copy entity in Escherichia coli, thus permitting stable propagation of cloned inserts greater than 100 kilobase pairs (kb).1 The ability to accommodate such large inserts is advantageous for many applications in genome biology, including positional cloning, targeted genomic sequencing, and as vehicles for generating trans-

B

This paper is based on a presentation given at the conference: Aquatic Animal Models of Human Disease hosted by the American Type Culture Collection and the University of Miami in Manassas, Virginia, USA, September 29–October 2, 2003. * Corresponding author. Tel.: +1 206 583 6093; fax: +1 206 583 2297. E-mail address: [email protected] (T. Miyake). 1532-0456/$ - see front matter D 2004 Elsevier Inc. All rights reserved. doi:10.1016/j.cca.2004.07.001

1 Note, a similar vector, P1-derived artificial chromosome (PAC) (Ioannou et al., 1994), uses a P1 bacteriophage origin of replication instead of the F-factor replicon (as used in BACs). However, for all intents and purposes, the vectors are very similar with respect to cloning of large exogenous fragments and will both be called BACs in this review.

234

T. Miyake, C.T. Amemiya / Comparative Biochemistry and Physiology, Part C 138 (2004) 233–244

Fig. 1. Diagram of a typical BAC vector. This vector is a compilation of the original pBAC108L (Shizuya et al., 1992), pBACe3.6 (Frengen et al., 1999) and pCC1BACk (Epicentre Technologies). The camR gene encodes chloramphenicol antibiotic resistance. The repE, parA, parB, parC and redF genes are essential components of the single-copy F-factor replicon. In order to prepare large quantities of vector, the oriV replicon is used for multi-copy replication. Note the oriV replicon must be used in a host E. coli strain (EPI300) that allows induction of the multi-copy origin of replication via growth of the clone in arabinose-containing media.

genic animals. A flow chart for BAC cloning is given in Fig. 2. The procedure is conceptually simple although the actual generation of a library is both technically challenging and laborious. Briefly, high molecular weight genomic DNA from the target species is prepared in low melting point agarose; subsequent manipulations are done in situ in the agarose to prevent shearing of the DNA. The genomic DNA is partially digested with a restriction enzyme, extensively size-selected using a series of pulsed-field gel electrophoresis runs, and ligated to dephosphorylated BAC vector. Ligation products are electrotransformed into E. coli using a strain that has a particular cell wall defect for increasing transformation of large DNA constructs (DH10B, Invitrogen). Subsequent colonies are plated on a chloramphenicol plate and barrayedQ with a robotic picker into individual 384-well microtiter dishes, incubated overnight and ultimately stored at 80 8C in glycerol containing media. The library thus is encompassed in numerous 384-well microtiter dishes, the number of which is dependent on the genomic bcoverageQ of the library. The size range of cloned

genomic DNA in a BAC library typically varies from 100 to 200 kb and, in general, the larger the average insert size the more labor, time and expense have been afforded in generating the library. The genomic coverage of each BAC library depends on several factors, including the purpose of the library and the genome size of the target species. The most high quality BAC libraries are in the range of 10 genomic coverage with 150 kb inserts and contain fewer than 5% ghost clones.2 Insofar as the feasibility of constructing BAC libraries, most metazoan DNAs appear tractable to BAC cloning; however, certain genomes do tend to be problematic as a result of skewed base composition, high repeat frequencies, symbiont DNA contamination and excessively large genomes. The former two are due to the inability of the E. coli host system to accommodate such DNA,3 and may have a pronounced effect on both the representation of the library and whether constructing a library from that organism is even possible. The latter two biological factors will also ultimately affect the quality and representation of the library. BAC libraries containing symbiont DNA contamination will contain a lower proportion of the desired clones, a substantial problem if that library is being used for a genome sequencing project. This is less of a problem if the library is to be used only to isolate genomic regions rather than as a sequencing reagent. A large genome is a more practical issue that is difficult to deal with effectively since the number of 384-well microtiter dishes will become unmanageable for a library of usable representation. Thus certain organisms with enormous genomes (e.g., lungfishes) will never be candidates for BAC libraries. In addition to the aforementioned problems, two other issues that have been shown to influence the BAC cloning process are high endogenous nuclease activities of the tissue chosen for DNA extraction as well as presence of cloning inhibitors in the extracted DNA. Success in overcoming these problems has been achieved in some cases but not others (P. De Jong, R. Wing and C. Amemiya, unpublished). Once BAC libraries are generated, high-density colony filters (macro-arrays) are produced using a robotics workstation.4 These filters are used for hybridization screening of the libraries, and identification of positive clones is done in a very straightforward manner (Amemiya et al., 1999). Every clone in the library has a unique address, thus in one step the investigator can retrieve the clones of interest for subsequent verification and characterization. Moreover, clever bovergo-oligoQ strategies have been developed that

2 Ghost clones are, by definition, non-recombinant clones that do not have inserts. These include vector bartifactsQ from rearrangement/deletion during the BAC cloning procedure. 3 These DNAs are less clonable due to their toxicity to the E. coli host. 4 In some cases, bsmartQ DNA pools are produced for the library, allowing screening via PCR. However, this method is comparatively more tedious than hybridization-based screening and is not readily amenable to multiplexing.

T. Miyake, C.T. Amemiya / Comparative Biochemistry and Physiology, Part C 138 (2004) 233–244

235

Fig. 2. Flow chart for the preparation of a large-insert BAC genomic library from vertebrates. This figure is meant to serve only as a basic overview. (1) Cells (or nuclei) are isolated, washed, and counted using a hemacytometer. (2) The cells (or nuclei) are embedded at a fixed density (roughly 20–30 Ag DNA per 80 Al plug mold) in low melting point agarose blocks. The blocks are then treated with proteolytic agents to lyse the cells (nuclei). The embedding in agarose prevents shearing of the high molecular weight (HMW) DNA during handling. (3) The blocks are extensively equilibrated with TE buffer to remove salts and other small molecules and are partially digested in situ with suitable restriction enzyme for BAC cloning. The partial digestion reactions are carried out using either a time-course experiment or methylase competition. (4) A preparative pulsed field gel is then run on the selected digests, and the region of interest is excised without staining of the DNA with ethidium bromide or exposure to UV irradiation. The region of interest is then excised in order to extract different sized DNA fractions. (5) A small piece of gel from each fraction is run on an analytical pulsed field gel to validate the size range of each fraction of DNA isolated. (6) The DNA is extracted from gel pieces via electroelution and then ligated to dephosphorylated BAC vector using conditions that foster formation of large circles. (7) Ligation products are electro-transformed into electro-competent DH10B T1 phage-resistant E. coli cells and transformants are plated on agarbased selective medium (chloramphenicol for BAC vectors) and allowed to grow. (8) Recombinant colonies are robotically picked (arrayed) into individual wells of 384-well microtiter dishes that contain media supplemented with 10% filter-sterilized glycerol, incubated at 37 8C overnight and then the plates are stored at 80 8C. Subsequent manipulations (e.g., production of filter macroarrays) utilize replicas of the library. Readers requiring more detail on the specifics of BAC library construction should consult (Amemiya et al., 1996) and (Osoegawa et al., 1999).

allow simultaneous hybridization with complex probe mixtures (multiplexing) in order to isolate large contiguous genomic regions in a single probing (Ross et al., 1999; Thomas et al., 2002; Romanov et al., 2003). With regard to time and ease of screening, BAC libraries are considerably easier to screen than lambda genomic libraries or un-arrayed cosmid libraries. In terms of clone handling, since BACs are simply large plasmid-based clones, the methods used for routine isolation and handling of plasmid DNAs (with slight modifications) can be used, albeit with lower yields than conventional high-copy plasmid clones.

from federal granting agencies (predominantly NIH and NSF). In this way, the libraries are assured to be of high quality and will be disseminated to the scientific community without sensitive intellectual property issues. Both the National Human Genome Research Institute (NHGRI) at NIH as well as the Directorate for Biological Sciences at NSF have recognized the utility of BAC resources and have been major driving forces for BAC library construction. Both have requested grant applications for BAC library construction; however, only the NHGRI program is still actively soliciting proposals for BAC libraries from a variety of organisms. Species for which NSF awarded grants can be found at: http://www.nsf.gov/bio/pubs/awards/

2. Who sponsors BAC library construction? Due to the high cost and considerable expertise required in making BAC libraries, most libraries are generated in specialty laboratories5 through contract-type mechanisms

5 It is generally not recommended that individual investigators attempt to construct BAC libraries on their own if they have not previously worked with very high molecular weight DNA and do not possess the equipment and infrastructure for carrying out this work.

236

T. Miyake, C.T. Amemiya / Comparative Biochemistry and Physiology, Part C 138 (2004) 233–244

bachome.htm. These represent taxa thought to be of the highest immediate importance to a variety of biological problems and run the gamut from metazoans to plants. Although NSF no longer is specifically funding grants for generating BAC libraries, in certain programs (e.g., Tree of Life, Biological Research Collections) BAC library construction, if sufficiently justified, can be written into collaborative or investigator-initiated proposals. The NHGRI has established the NIH BAC Resource Network for constructing BAC libraries of many metazoan taxa (http://www.genome.gov/10001844). In contrast to the NSF BAC program, three specialized Centers for BAC library production were established in order to fulfill the needs of the biomedical community for BAC library construction: Benaroya Research Institute at Virginia Mason (http://www.benaroyaresearch.org, Chris Amemiya, Director), BACPAC Resources at Children’s Hospital Oakland (http://bacpac.chori.org/home.htm, Pieter de Jong, Director) and Arizona Genomics Institute at University of Arizona (http://www.genome.arizona.edu, Rod Wing, Director). The committee at the NHGRI determines which of BAC libraries from the choice of taxa were made based on bwhite paperQ submissions by the community, and assignments are then made to the respective BAC Library Production Centers. The white paper submissions are done electronically; instructions and selection criteria are specified on the NHGRI website (http://www.genome.gov/ 10001845). A BAC library so produced is of very high quality and must meet strict quality assurance standards prior to being distributed to the community. Thus, this mechanism creates a win-win situation: the NHGRI pays for a high-quality library to be generated and the proposer and community benefit greatly by gaining access to a valuable genome resource that can be utilized for addressing important biological questions. Lastly, it should be mentioned that there are a small number of biotech companies that will construct BAC libraries on a contractual basis.

3. What BAC libraries are currently available? BAC libraries have now been generated from several metazoan species, including both protostomes and deuterostomes. Table 1 lists BAC libraries from aquatic chordate species, many of these libraries being constructed in our laboratory. While it is difficult to keep an accurate accounting of all the libraries that are now in existence, BAC libraries made or being made as part of the NSF BAC program are listed at http://www.nsf.gov/bio/pubs/awards/ bachome.htm; libraries made or being made as part of the NHGRI BAC Resource Network are listed at http:// www.genome.gov/page.cfm?pageID=10001852. In addition, NHGRI provides a relatively up-to-date listing of all metazoan libraries that are available to the public at http:// www.genome.gov/11008350.

4. Utility of BAC libraries and comparative genomics BAC libraries are very useful for many applications in modern biology: isolation of intact genes or gene clusters from regions of interest (Kim et al., 2000; Chiu et al., 2002, 2004; Powers and Amemiya, 2004), whole genome physical mapping (Chen et al., 2004), elucidating gene organization (Amores et al., 1998; Strong et al., 1999), positional cloning (Brownlie et al., 1998; Donovan et al., 2000), long range DNA sequencing and anchoring (Mahairas et al., 1999; Lander et al., 2001; Ness et al., 2002; Krzywinski et al., 2004), and identification of putative cis-regulatory elements (Sumiyama et al., 2002, Sumiyama and Ruddle, 2003). In addition, because BAC clones can harbor large stretches of flanking DNA, they are exceptionally good substrates for inserting reporter genes into coding sequences for elucidating gene expression patterns in transgenic systems (Jessen et al., 1999; Sumiyama and Ruddle, 2003; Gong et al., 2003). While there are several other applications of BAC clones as described above, we will now briefly discuss what has become a major focus of the genomics community, namely to elucidate the logic of non-coding, cis-regulatory DNA (Collins et al., 2003). The recent explosion in DNA sequencing throughput is resulting in a paradigm shift in our approach to biological investigation. We no longer need to rely on bthought experimentsQ for inferring how the genome has diverged over evolutionary time and space; we are able to directly observe these changes and more clearly understand how the genome is organized and regulated, and what kinds of changes result in biological novelty. It is, after all, the genome that evolves; the phenotype must necessarily follow suit. Thus we can use btargeted comparative sequencingQ (sequencing of BACs from orthologous genomic regions between taxa) as a first approximation as to how non-coding cis-regulatory regions contribute to genome functioning (Pennacchio and Rubin, 2001, 2003; Thomas et al., 2003; Force et al., in press). Fig. 3 gives an overview of how targeted comparative sequencing is carried out; the ultimate goal is to obtain sequence information from orthologous regions of different genomes for comparison and scrutiny. We assume that embedded in the genomes of organisms are snippets of sequences (cis-regulatory elements) that, when such comparisons are made, will begin to reveal how genes and associated networks are intertwined and regulated. Fig. 4 shows an example of such long-range comparisons (graphical output of a global alignment between divergent taxa). Empirical studies still must be carried out to validate inferences made from these comparative genomic analyses as diagrammed in Fig. 3 (DiLeone et al., 2000; Mortlock et al., 2003; Juan and Ruddle, 2003). As we examine the diversity of life, it becomes evident that there are biological novelties at every major phylogenetic node, and that several of these characteristics are reminiscent of those possessed by bhigherQ vertebrates, namely humans. Table 2 lists several major chordate taxa (in

T. Miyake, C.T. Amemiya / Comparative Biochemistry and Physiology, Part C 138 (2004) 233–244

237

Table 1 A list of BAC libraries generated from selected aquatic chordate speciesa Organism

Average insert size (kb)

Coverage

Site of construction and reference

Alligator mississipiensis (American alligator) Chrysemys picta (painted turtle) Silurana (Xenopus) tropicalis

Latimeria menadoensis (Indonesian coelacanth)

145 140 75 170 108 85 130 170

10 10 6.8 6.6 2 3 2 N10

Spheroides nephelus (pufferfish)

140

6–7

Takifugu rubripes (pufferfish) Chaenocephalus aceratus (Antarctic icefish) Notothenia coriiceps (Antarctic rock cod) Dissostichus mawsoni (Antarctic toothfish) Haplochromis chilotes (Lake Victoria cichlid)

80

9

130 128

7 10

150

12

65–194 190

6 – 65 10X

145 190 188

5 – 6 19.3 18.8

154 –177 115 40 (fosmid)

6.2 – 13.1 5 10

165 131 175 161

10.4 2.4 10 10.6

130 130 100 110 60 80 145 46

5–6 5 3 – 4 N20 1–2 10 13.7

80 142 130

10 17 13

VMRC, JGI (VMRC8) VMRC, JGI ISB-1 (CHORI) CHORI (CHORI-216) Research Genetics VMRC (RZPD-710) VMRC VMRC (VMRC4) (Danke et al., 2004) VMRC (RZPD-715) (Amemiya et al., 2001) HGMP-RC VMRC, in progress VMRC, in progress VMRC (VMRC13) Okada Laboratory (Watanabe et al., 2003) VMRC (VMRC14) HCGS, in progress (Katagiri et al., 2001) Volff laboratory (Froschauer et al., 2002) VMRC (RZPD-756) CHORI (CHORI-213) CHORI (CHORI-214) Reviewed in (Thorgaard et al., 2002) CHORI (CHORI-217) VMRC (RZPD-706) VMRC, in progress (sheared DNA from one animal) CHORI (CHORI-211, RZPD-736) CHORI (RZPD-728) Keygene (RZPD-735) CHORI (CHORI-212) HCGS, in progress VMRC VMRC9 VMRC VMRC VMRC CHORI, in progress VMRC (VMRC3) CHORI (CHORI-301, sheared library) CHORI, in progress VMRC (VMRC2) CHORI (CHORI-302) CUGI

Xenopus laevis Xenopus laevis-gilli

Astatotilapia burtoni (African cichlid) Metriaclima zebra (Lake Malawi zebra cichlid) Oreochromis niloticus (Nile tilapia), 4 libraries Xiphophorus maculatus (platyfish) Oryzias latipes (medaka) Gasterosteus aculatus (threespine stickleback) Salmo salar (Atlantic salmon) Oncorhynchus mykiss (rainbow trout), at least 4 libraries Oncorhynchus tshawytscha (chinook salmon) Danio rerio (zebrafish)

Ictalurus punctatus (channel catfish) Astyanax mexicanus (blind cavefish) Polypterus senegalis (bichir) Raja eglanteria (skate) Heterodontus francisci (horn shark) Petromyzon marinus (lamprey) Ciona intestinalis (tunicate) Ciona savignyi (tunicate) Oikopleura dioica (larvacean) Branchiostoma floridae (Amphioxus) Saccoglossus kowalevskii (acorn worm)

CHORI: The BACPAC Resources Center, Children’s Hospital Oakland Research Institute, Oakland, CA. www.chori.org/bacpac/home.htm. CUGI: Clemson University Genomics Institute, Clemson University, Clemson, SC. www.genome.clemson.edu. HCGS: Hubbard Center for Genomic Studies, University of New Hampshire, Durham, NH. http://hcgs.unh.edu/index.php. ISB: Institute for Systems Biology, Seattle, WA. www.systemsbiology.org. JGI: Joint Genome Institute, Walnut Creek, CA. http://www.jgi.doe.gov/programs/comparative/top_level/BAC.html. MRC HGMP-RC: UK Human Genome Mapping Project Resource Center, Cambridge, UK. www.hgmp.mrc.ac.uk. Research Genetics: Invitrogen Corporation, Carlsbad, CA.www.invitrogen.com. RZPD: Resource Center and Primary Database, Deutsches Ressourcenzentrum fur Genomforschung, Berlin, Germany. www.rzpd.de. Okada Laboratory: Department of Biological Sciences, Tokyo Institute of Technology, Tokyo, Japan. http://www.evolution.bio.titech.ac.jp/index_e.html. VMRC: Benaroya Research Institute at Virginia Mason, Seattle, WA. http://benaroyaresearch.org/bri_investigators/amemiya/default.htm. Volff Laboratory: Lehrstuhl Physiologische Chemie I, Biozentrum, University of Wurzburg, Am Hubland, Germany. http://www.biozentrum.uni-wuerzburg.de/ pc1/volff/index. a Note that this table is not an exhaustive listing of all BAC libraries and primarily lists those from the Amemiya laboratory and others that are publicly available. A compilation of all metazoan BAC libraries available to the scientific community that is updated regularly is available at http://www.genome.gov/ 11008350.

238

T. Miyake, C.T. Amemiya / Comparative Biochemistry and Physiology, Part C 138 (2004) 233–244

Fig. 3. Flowchart for targeted comparative sequencing approach. In targeted comparative sequencing (Thomas et al., 2003), BAC libraries from different species are utilized in order to isolate specific regions that are orthologous to one another (1, 2). Sequences are obtained from respective BAC clones (3) and aligned and compared to the same region from other taxa (4). The results obtained in (4) allow testable inferences to be made so that functional experiments can be carried out in vivo and in vitro (5), particularly with regard to the regulation of genes and gene families. These experiments involve generation of transgenic reporter constructs and knock-in/knockout strategies for assessing functionality of genomic regions (Shashikant et al., 1998; DiLeone et al., 2000; Sumiyama et al., 2002; Mortlock et al., 2003; Juan and Ruddle, 2003).

a broad phylogenetic context) along with emergent phenotypic and/or developmental characters at each node. Can comparative genomics help to unravel evolution of morphological structures and innovations? And are we able to use comparative genomics of bancestralQ organisms in order to better understand form and function in more complex organisms such as humans? These questions lie at the heart of the newly revitalized field of evolution-developmental biology (evo-devo) and functional comparative genomics. One of the most powerful tools for evolution-developmental research is to screen and characterize gene clusters from a BAC library, including flanking regions of coding sequences that are known to contain conserved non-coding

sequences and to function as cis-regulatory elements. As shown in Fig. 3, once gene clusters are screened from a BAC library, genome-wide comparisons enable us to characterize gene clusters, to identify conserved non-coding regions and to allow functional studies of coding and noncoding sequences in animal models (Brown et al., 2002; Martinez and Amemiya, 2002; Miyake et al., 2002). There are a number of examples in which BAC libraries have been utilized to carry out a multitude of studies as described above: Hox gene and their clusters in fishes (Amores et al., 1998; Kim et al., 2000; Chiu et al., 2004; Amores et al., 2004), Dlx gene family in mammals (Miyake et al., 2002), novel immune-type receptor family in teleost fish (NITRs)

T. Miyake, C.T. Amemiya / Comparative Biochemistry and Physiology, Part C 138 (2004) 233–244

239

Fig. 4. Comparative sequence analysis of an extended region in the zebrafish genome. The complete sequence of a BAC clone [BUSM 179B16 (Amemiya and Zon, 1999), 107286 bp] was obtained from GenBank (AL954668). This sequence was generated by the Sanger Centre as part of the zebrafish genome initiative. The sequence was used in a GenomeVista analysis in order to determine its syntenic relationship to other genomic sequences (Lawrence Berkeley National Laboratory; http://www-gsd.lbl.gov/vista/index.shtml), namely human, mouse and rat. The multiple VISTA plot shows a portion of the extended zebrafish region that was globally aligned with the human orthologous region on chromosome 17 (plot number 1). Two other plots are shown: (2) human vs. mouse; and (3) human vs. rat. The numbers below the plot indicate the relative position on human chromosome 17. There are two genes whose alignments are clearly shown: MYH1 and MYH2, myosin heavy chain 1 and myosin heavy chain 2, respectively. The blue boxes in each represent the exon sequences whereas the arrowheads denote transcriptional orientation. Peaks within the plots represent levels of nucleotide identity between the species being compared; blue peaks represent sequence identity within exons whereas pink peaks represent sequence identity in noncoding regions that potentially represent cisregulatory elements. As can be seen, there are considerably higher levels of noncoding sequence identity between human and mouse (2) and between human and rat (3), and that outside of the exons, no identity exists between zebrafish and human. This is not always the case and is somewhat dependent on the particular gene being examined. In some instances such as the HOX clusters, there is considerable conservation of noncoding sequences between organisms as phylogenetically disparate as sharks and tetrapods (Kim et al., 2000; Chiu et al., 2002). The functionality of the putative cis-regulatory regions must be assessed via in vitro and in vivo experiments.

(Strong et al., 1999; Yoder et al., 2001), VLR (variable lymphocyte receptor) adaptive immune system in lampreys (Pancer et al., 2004). Among the examples given above, BAC libraries have played a major role in characterizing Hox clusters from different chordate animals (Martinez and Amemiya, 2002). A discovery of seven to eight clusters of Hox genes in bony fishes has sparked intensive efforts to investigate the evolution and functional roles of genome-wide duplication of Hox genes in lower chordate species (Amores et al., 1998; Chiu et al., 2002; Force et al., in press). It has been shown that all duplicated Hox genes are not necessarily preserved—some have been lost and others been preserved as a result of the partitioning of gene subfunctions between gene duplicates (Force et al., 1999; Force et al., in press). In order to examine the genomic structure of Hox clusters in a group of phylogenetically advanced teleost fishes, the pufferfishes, characterization of BAC contigs (Amemiya et al., 2001) as well as data mining were used to reveal structural changes as a result of teleost-specific genome duplication (Amores et al., 2004). The comparative analysis of cluster organization among vertebrates revealed that a disproportionate number of Hox genes were lost in pufferfishes (and other teleosts) but that the expression patterns of

retained duplicates often exhibited notable differences between duplicates. Recently, hitherto unidentified vertebrate Hox14 genes (Hoxa14 in the Indonesian coelacanth Latimeria menadoensis and Hoxd14 in the horn shark Heterodontus francisci) were discovered as a result of sequence analysis of clusters that have been screened and characterized from BAC libraries (Danke et al., 2004; Powers and Amemiya, 2004). These genes are noteworthy as they are highly suggestive that the ancestral vertebrate HOX clusters possessed an additional bposteriorQ gene (Hox14), not the 13 paralog group genes as previously believed. The implication from all these data is that dynamic changes of Hox duplicates might have contributed to developmental diversification of morphologies in the evolution of bony fishes. Since Hox genes have been shown to regulate the axial patterning and limb development, evolutionary changes of Hox gene organization and their function will provide clues for deciphering the underlying genetic and evolutionary mechanisms for emerging phenotypes at respective evolutionary nodes (Wagner et al., 2003), i.e., the innovation of jaws in gnathostomes (jawed vertebrates) (Cohn, 2002; Takio et al., 2004) and a transition of fins to limbs in tetrapods (Clark, 2002) (Table 2).

240

T. Miyake, C.T. Amemiya / Comparative Biochemistry and Physiology, Part C 138 (2004) 233–244

Table 2 Broad classification of extant chordates, with a listing of phenotypes that are emergent at each node of chordate evolution

Table 2 (continued) Taxa

Notable characteristics, including those which are germane to humans and medicine

Taxa

Notable characteristics, including those which are germane to humans and medicine

Ray-finned fishes

Hemichordates

Neural tube (Holland and Graham, 1995; Hall, 1999), notochord (Hall, 1999), pharyngeal slits. Endodermal fore- and midgut (Lehman, 1983), expression of a series of genes that indicate initial evolution of neural crest primordial cells (BMP-7, (Hammerschmidt et al., 1996; Miya et al., 1996, 2001) and development of primordial facial structures (Pax-3, Pax-7; (Wada et al., 1996), the endostyle homologous with the thyroid gland (Dehal et al., 2002), development of the heart with expression of a series of genes analogous to development of the vertebrate heart (Dehal et al., 2002). Brain developed into three or more distinct regions, metameric organization of the pharyngeal arches (Lehman, 1983), putative cranial neural crest dcellsT (Holland and Garcia-Fernandez, 1996; Wada et al., 1996; Hall, 1999), segmented axial muscles in the trunk (Lehman, 1983). Three sensory organs (a single nose, eyes without eye lens and inner ears) derived from placodes (Forey and Janvier, 1993), the earliest sign of pituitary development (Gorbman and Tamarin, 1985; Sower, 1998). Cranial skeletons derived from neural crest and mesodermal cells (Northcutt and Gans, 1983; Hall, 1998; Smith and Hall, 1990, 1993), Meckel’s cartilage in the mandibular arch (Hall, 1984, 1998), a unique set of sensory and motor cranial nerves (Forey and Janvier, 1993; Janvier, 1996), cranial muscles including extrinsic eye muscles (Miyake et al., 1992; Holland et al., 1993), at least two semicircular canals in the labyrinth, development of neuromasts, atrium and ventricle of heart, neural innervation of the heart in adults, metameric axial skeletons derived from somites (Smith and Hall, 1990, 1993), nephrons as a functional excretory organ. Development of the jaws (palatoquadrate and Meckel’s cartilage in the lower and upper jaw, respectively) (Carroll, 1988), dentine and enameloid in teeth and scales (Smith and Hall, 1990, 1993; Hall, 1998; Miyake et al., 1999), type II collagen as the major matrix protein in cartilages Miyake et al., 1999), true lymphoid organs including thymus and a tissue equivalent to bone marrow (Hansen and Zapata, 1998; Zapata and Amemiya, 2000).

Tooth bearing dermal bones on the gape of the jaws (De Beer, 1985), the palate formed by the perichondrally ossified palatoquadrate and a series of dermal bones on the oral cavity not on the gape of the upper jaw (De Beer, 1985; Schultze, 1993), enamel in scales (Polypteridae and probably Amia and gars) but lost in most of ray-fin fishes (Sire, 1989; Zylberberg et al., 1997), mineralized cranial and axial skeletons (Smith and Hall, 1990, 1993), dermal fin rays made up by lepidotrichiae, very small vertebrate genomes (Brenner et al., 1993; Baxendale et al., 1995; Elgar et al., 1996). Endoskeletal supports in pectoral fins (a transition from multiple elements to single element (dhumerusT) articulating with girdle (Cloutier and Ahlberg, 1996), enamel in teeth (Smith, 1978), lung-like structures (Helfman et al., 1997). Features adapted for life on land: limbs, cranial and axial skeletons, respiration, water balance, sensory input (Nelson, 1984; Carroll, 1988). Extraembryonic membrane: allantois, chorion and amnion (Carroll, 1988), the articular–quadrate joint, development of the tympanum (Lombard and Bolt, 1979). Feathers, true flight capability associated with morphological changes, warm-bloodedness (Carroll, 1988). Enucleated erythrocytes, a lateral temporal fenestra in the skull, fur (Carroll, 1988).

Urochordates

Cephalochordates

Hagfish

Lamprey

Chimeras, sharks and rays

Lobe-finned fishes

Amphibians

Reptiles

Birds

Mammals

In many instances, the emergent characters are shared with higher vertebrates (humans, mice).

There is no doubt that comparative genomics can unveil hidden signatures in the genome and that such signatures may serve as targets for therapy and/or drug intervention in the case of humans (Stevens et al., 2001). This realization has largely been responsible for the enthusiasm for generating BAC resources by NIH and has also resulted in a large all-encompassing initiative, ENCODE (ENCyclopedia Of DNA Elements), which seeks to delineate all the noncoding regulatory elements in the human genome (Collins et al., 2003). The initiative utilizes comparative genomics on a very large scale to initially predict those DNA elements that putatively encode functions, and to develop methods to survey whether or not the predictions are accurate. While the community has previously relied on relatively labor-intensive means to evaluate the biological activity of conserved DNA elements (i.e., transgenesis, knock-in/knockout technology), such methods are clearly insufficient for the daunting abundance of data that are being generated by these large-scale efforts. Ironically,

T. Miyake, C.T. Amemiya / Comparative Biochemistry and Physiology, Part C 138 (2004) 233–244

although the human genome is completely sequenced, the difficult task in deciphering how the genome actually works is only in its infancy, a most exciting time indeed.

5. Summary BAC libraries are collections of large exogenous DNA inserts cloned into stable plasmid vectors and propagated in E. coli. The libraries are extremely useful for biological investigation and programs have been initiated to develop more BAC resources for the community. BACs have become exceedingly useful for the comparative genomic approach wherein orthologous regions are compared and contrasted between different taxa in order to begin to understand the logic of genome organization and regulation. This approach has garnered much interest in both the evolution and biomedical communities since it unveils a new paradigm for studying the regulation of the genome and how genomic changes in non-coding sequences may effect physiological and anatomical change.

Acknowledgements We thank the members of the Amemiya laboratory. Our laboratory is funded, in part, by National Institutes of Health (RR14085, HG02526-01), the National Science Foundation (IBN-0207870, IBN-0321461) and the United States Department of Energy (DE-FG03-01ER63273).

References Amemiya, C.T., Zon, L.I., 1999. Generation of a zebrafish P1 artificial chromosome library. Genomics 58, 211 – 213. Amemiya, C.T., Ota, T., Litman, G.W., 1996. Construction of P1 artificial chromosome (PAC) libraries from lower vertebrates. In: Lai, E., Birren, B. (Eds.), Analysis of Nonmammalian Genomes. Academic Press, San Diego, pp. 223 – 256. Amemiya, C.T., Zhong, T.P., Silverman, G.A., Fishman, M.C., Zon, L.I., 1999. Zebrafish YAC, BAC, and PAC genomic libraries. Methods Cell Biol. 60, 235 – 258. Amemiya, C.T., Amores, A., Ota, T., Mueller, G., Garrity, D., Postlethwait, J.H., Litman, G.W., 2001. Generation of a P1 artificial chromosome library of the Southern pufferfish. Gene 272, 283 – 289. Amores, A., Force, A., Yan, Y.L., Joly, L., Amemiya, C., Fritz, A., Ho, R.K., Langeland, J., Prince, V., Wang, Y.L., Westerfield, M., Ekker, M., Postlethwait, J.H., 1998. Zebrafish hox clusters and vertebrate genome evolution. Science 282, 1711 – 1714. Amores, A., Suzuki, T., Pomeroy, J., Singer, A., Amemiya, C.T., Postlethwait, J.H., 2004. Developmental roles of pufferfish Hox clusters and genome evolution in ray-fin fish. Genome Res. 14, 1 – 10. Baxendale, S., Abdulla, S., Elgar, G., Buck, D., Berks, M., Micklem, G., Durbin, R., Bates, G., Brenner, S., Beck, S., Lehrach, H., 1995. Comparative sequence analysis of the human and pufferfish Huntington’s disease genes. Nat. Genet. 10, 67 – 76. Brenner, S., Elgar, G., Sandford, R., Macrae, A., Venkatesh, B., Aparicio, S., 1993. Characterization of the pufferfish (Fugu) genome as a compact model vertebrate genome. Nature 366, 265 – 268.

241

Brown, C.T., Rust, A.G., Clarke, P.J.C., Pan, Z., Schilstra, M.J., De Buysscher, T., Griffin, G., Wold, B.J., Cameron, R.A., Davidson, E.H., Bolouri, H., 2002. New computational approaches for analysis of cisregulatory networks. Dev. Biol. 246, 86 – 102. Brownlie, A., Donovan, A., Pratt, S.J., Paw, B.H., Oates, A.C., Brugnara, C., Witkowska, H.E., Sassa, S., Zon, L.I., 1998. Positional cloning of the zebrafish sauternes gene: a model for congenital sideroblastic anaemia. Nat. Genet. 20, 244 – 250. Carroll, R.L., 1988. Vertebrate Paleontology and Evolution. Freeman, New York. Chen, R., Sodergren, E., Weinstock, G.M, Gibbs, R.A., 2004. Dynamic building of a BAC clone tiling path for the Rat Genome Sequencing Project. Genome Res. 14, 679 – 684. Chiu, C.H., Amemiya, C., Dewar, K., Kim, C.B., Ruddle, F.H., Wagner, G.P., 2002. Molecular evolution of the HoxA cluster in the three major gnathostome lineages. Proc. Natl. Acad. Sci. U. S. A. 99, 5492 – 5497. Chiu, C.H., Dewar, K., Wagner, G.P., Takahashi, K., Ruddle, F.H., Ledje, C., Bartsch, P., Scemama, J.L., Stellwag, E., Fried, C., Prohaska, S.J., Stadler, P.F., Amemiya, C.T., 2004. Bichir HoxA cluster sequence reveals surprising trends in ray-finned fish genomic evolution. Genome Res. 14, 11 – 17. Clark, J.A., 2002. Gaining ground: the origin and early evolution of tetrapods. Indiana Univ. Press, Bloomington. Cloutier, R., Ahlberg, P.E., 1996. Morphology, characters, and the interrelationships of basal sarcopterygians. In: Stiassney, M.L., Parenti, L.R., Johnson, G.D. (Eds.), Interrelationships of Fishes. Academic Press, San Diego, pp. 445 – 479. Cohn, M.J., 2002. Lamprey Hox genes and the origin of jaws. Nature 416, 386 – 387. Collins, F.S., Green, E.D., Guttmacher, A.E., Guyer, M.S., 2003. A vision for the future of genomics research. Nature 422, 835 – 847. Danke, J., Miyake, T., Powers, T., Schein, J., Shin, H., Bosdet, I., Erdmann, M., Caldwell, R., Amemiya, C.T., 2004. Genome resource for the indonesian coelacanth. Latimeria menadoensis. J. Exp. Zool. 301A, 228 – 234. de Beer, G.R., 1985. The Development of the Vertebrate Skull. The University of Chicago Press, Chicago. Dehal, P., Satou, Y., Campbell, R.K., Chapman, J., Degnan, B., De Tomaso, A., Davidson, B., Di Gregorio, A., Gelpke, M., Goodstein, D.M., Harafuji, N., Hastings, K.E., Ho, I., Hotta, K., Huang, W., Kawashima, T., Lemaire, P., Martinez, D., Meinertzhagen, I.A., Necula, S., Nonaka, M., Putnam, N., Rash, S., Saiga, H., Satake, M., Terry, A., Yamada, L., Wang, H.G., Awazu, S., Azumi, K., Boore, J., Branno, M., DeSantis, R., Doyle, S., Francino, P., Keys, D.N., Haga, S., Hayashi, H., Hino, K., Imai, K.S., Inaba, K., Kano, S., Kobayashi, K., Kobayashi, M., Lee, B.I., Makabe, K.W., Manohar, C., Matassi, G., Medina, M., Mochizuki, Y., Mount, S., Morishita, T., Miura, S., Nakayama, A., Nishizaka, S., Nomoto, H., Ohta, F., Oishi, K., Rigoutsos, I., Sano, M., Sasaki, A., Sasakura, Y., Shoguchi, E., Shin-i, T., Spagnuolo, A., Stainier, D., Suzuki, M.M., Tassy, O., Takatori, N., Tokuoka, M., Yagi, K., Yoshizaki, F., Wada, S., Zhang, C., Hyatt, P.D., Larimer, F., Detter, C., Doggett, N., Glavina, T., Hawkins, T., Richardson, P., Lucas, S., Kohara, Y., Levine, M., Satoh, N., Rokhsar, D.S., 2002. The draft genome of Ciona intestinalis: insights into chordate and vertebrate origins. Science 298, 2157 – 2167. DiLeone, R.J., Marcus, G.A., Johnson, M.D., Kingsley, D.M., 2000. Efficient studies of long-distance Bmp5 gene regulation using bacterial artificial chromosomes. Proc. Natl. Acad. Sci. U. S. A. 97, 1612 – 1617. Donovan, A., Brownlie, A., Zhou, Y., Shepard, J., Pratt, S.J., Moynihan, J., Paw, B.H., Drejer, A., Barut, B., Zapata, A., Law, T.C., Brugnara, C., Lux, S.E., Pinkus, G.S., Pinkus, J.L., Kingsley, P.D., Palis, J., Fleming, M.D., Andrews, N.C., Zon, L.I., 2000. Positional cloning of zebrafish ferroportin1 identifies a conserved vertebrate iron exporter. Nature 403, 776 – 781. Elgar, G., Sandford, R., Aparicio, S., Macrae, A., Venkatesh, B., Brenner, S., 1996. Small is beautiful: comparative genomics with the pufferfish (Fugu rubripes). Trends Genet. 12, 145 – 150.

242

T. Miyake, C.T. Amemiya / Comparative Biochemistry and Physiology, Part C 138 (2004) 233–244

Force, A., Lynch, M., Pickett, F.B., Amores, A., Yan, Y.L., Postlethwait, J.H., 1999. Preservation of duplicate genes by complementary, degenerative mutations. Genetics 151, 1531 – 1545. Force, A., Shashikant, C., Stadler, P., and Amemiya, C.T., Comparative genomics, cis-regulatory elements, and gene duplication. In Detrich, H.W., III, Westerfield,M., and Zon, L.I., (eds.), The Zebrafish. 2nd Edition, Academic Press, Los Angeles. (In press). Forey, P., Janvier, P., 1993. Agnathans and the origin of jawed vertebrates. Nature 361, 129 – 134. Frengen, E., Weichenhan, D., Zhao, B., Osoegawa, K., van Geel, M., de Jong, P.J., 1999. A modular, positive selection bacterial artificial chromosome vector with multiple cloning sites. Genomics 58, 250 – 253. Froschauer, A., Korting, C., Katagiri, T., Aoki, T., Asakawa, S., Shimizu, N., Schartl, M., Volff, J.N., 2002. Construction and initial analysis of bacterial artificial chromosome (BAC) contigs from the sex-determining region of the platyfish Xiphophorus maculatus. Gene 295, 247 – 254. Gong, S., Zheng, C., Doughty, M.L., Losos, K., Didkovsky, N., Schambra, U.B., Nowak, N.J., Joyner, A., Leblanc, G., Hatten, M.E., Heintz, N., 2003. A gene expression atlas of the central nervous system based on bacterial artificial chromosomes. Nature 425, 917 – 925. Gorbman, A., Tamarin, A., 1985. Early development of oral, olfactory and adenohypophyseal structures of reproduction in hagfish. In: Foreman, R.E., A.G.J.M.D.a.R.O. (Eds.), Evolutionary Biology of Primitive Fishes. Plenum, New York, pp. 165 – 185. Hall, B.K., 1984. Developmental processes underlying heterochrony as an evolutinary mechanism. Can. J. Zool. 62, 1 – 7. Hall, B.K., 1998. Evolutionary Developmental Biology. Chapman & Hall, London. Hall, B.K., 1999. The Neural Crest in Development and Evolution. Springer-Verlag, New York. Hammerschmidt, M., Serbedzija, G.N., McMahon, A.P., 1996. Genetic analysis of dorsoventral pattern formation in the zebrafish: requirement of a BMP-like ventralizing activity and its dorsal repressor. Genes Dev. 10, 2452 – 2461. Hansen, J.D., Zapata, A.G., 1998. Lymphocyte development in fish and amphibians. Immunol. Rev. 166, 199 – 220. Helfman, G.S., Collette, B.B., Facey, D.E., 1997. The diversity of fishes. Blackwell, Malden. Holland, P.W., Graham, A., 1995. Evolution of regional identity in the vertebrate nervous system. Perspect. Dev. Neurobiol. 3, 17 – 27. Holland, P.W., Garcia-Fernandez, J., 1996. Hox genes and chordate evolution. Dev. Biol. 173, 382 – 395. Holland, N.D., Holland, L.Z., Honma, Y., Fujii, T., 1993. Engrailed expression during develpment of a lamprey, Lampetra japonica: a possible clue to homologies between agnathan and gnathostome muscles of the mandibular arch. Dev. Growth Differ. 35, 153 – 160. Ioannou, P.A., Amemiya, C.T., Garnes, J.A., Kroisel, P., Shizuya, H., Chen, C., Batzer, M.A., de Jong, P.J., 1994. A new bacteriophage P1-derived vector for the propagation of large human DNA fragments. Nat. Genet. 6, 84 – 89. Janvier, P., 1996. In: Hall, B.K., Hanken, J.Patterns of Diversity in the Skull of Jawless Fishes vol. 2. The University of Chicago Press, Chicago, pp. 131 – 188. Jessen, J.R., Willett, C.E., Lin, S., 1999. Artificial chromosome transgenesis reveals long-distance negative regulation of rag1 in zebrafish [letter]. Nat. Genet. 23, 15 – 16. Juan, A.H., Ruddle, F.H., 2003. Enhancer timing of Hox gene expression: deletion of the endogenous Hoxc8 early enhancer. Development 130, 4823 – 4834. Katagiri, T., Asakawa, S., Minagawa, S., Shimizu, N., Hirono, I., Aoki, T., 2001. Construction and characterization of BAC libraries for three fish species; rainbow trout, carp and tilapia. Anim. Genet. 32, 200 – 204. Kim, C.B., Amemiya, C., Bailey, W., Kawasaki, K., Mezey, J., Miller, W., Minoshima, S., Shimizu, N., Wagner, G., Ruddle, F., 2000. Hox cluster genomics in the horn shark, Heterodontus francisci. Proc. Natl. Acad. Sci. U. S. A. 97, 1655 – 1660.

Krzywinski, M., Wallis, J., Gosele, C., Bosdet, I., Chiu, R., Graves, T., Hummel, O., Layman, D., Mathewan, C., Wye, N., Zhu, B., Albracht, D., Asano, J., Barber, S., Brown-John, M., Chan, S., Chand, S., Cloutier, A., Davito, J., Fjell, C., Gaige, T., Ganten, D., Girn, N., Guggenheimer, K., Himmelbauer, H., Kreiter, T., Leach, S., Lee, D., Lehrach, H., Mayo, M., Mead, K., Olson, T., Pandoh, P., Prabhu, A.-L., Shin, H., Tanzer, S., Thompson, J., Tsai, M., Walker, J., Yang, G., Sekhon, M., Hillier, L., Zimdahi, H., Marziali, A., Ososegawa, K., Zhao, S., Siddiqui, A., de Jong, P.J., Warren, W., Mardis, E., McPherson, J.D., Wilson, R., Hubner, N., Jones, S., Marra, M., Schein, J., 2004. Integrated and sequence-ordered BAC- and YAC-based physical maps for the rat genome. Genome Res. 14, 765 – 779. Lander, E.S., Linton, L.M., Birren, B., Nusbaum, C., Zody, M.C., Baldwin, J., Devon, K., Dewar, K., Doyle, M., FitzHugh, W., Funke, R., Gage, D., Harris, K., Heaford, A., Howland, J., Kann, L., Lehoczky, J., LeVine, R., McEwan, P., McKernan, K., Meldrim, J., Mesirov, J.P., Miranda, C., Morris, W., Naylor, J., Raymond, C., Rosetti, M., Santos, R., Sheridan, A., Sougnez, C., Stange-Thomann, N., Stojanovic, N., Subramanian, A., Wyman, D., Rogers, J., Sulston, J., Ainscough, R., Beck, S., Bentley, D., Burton, J., Clee, C., Carter, N., Coulson, A., Deadman, R., Deloukas, P., Dunham, A., Dunham, I., Durbin, R., French, L., Grafham, D., Gregory, S., Hubbard, T., Humphray, S., Hunt, A., Jones, M., Lloyd, C., McMurray, A., Matthews, L., Mercer, S., Milne, S., Mullikin, J.C., Mungall, A., Plumb, R., Ross, M., Shownkeen, R., Sims, S., Waterston, R.H., Wilson, R.K., Hillier, L.W., McPherson, J.D., Marra, M.A., Mardis, E.R., Fulton, L.A., Chinwalla, A.T., Pepin, K.H., Gish, W.R., Chissoe, S.L., Wendl, M.C., Delehaunty, K.D., Miner, T.L., Delehaunty, A., Kramer, J.B., Cook, L.L., Fulton, R.S., Johnson, D.L., Minx, P.J., Clifton, S.W., Hawkins, T., Branscomb, E., Predki, P., Richardson, P., Wenning, S., Slezak, T., Doggett, N., Cheng, J.F., Olsen, A., Lucas, S., Elkin, C., Uberbacher, E., Frazier, M., Gibbs, R.A., Muzny, D.M., Scherer, S.E., Bouck, J.B., Sodergren, E.J., Worley, K.C., Rives, C.M., Gorrell, J.H., Metzker, M.L., Naylor, S.L., Kucherlapati, R.S., Nelson, D.L., Weinstock, G.M., Sakaki, Y., Fujiyama, A., Hattori, M., Yada, T., Toyoda, A., Itoh, T., Kawagoe, C., Watanabe, H., Totoki, Y., Taylor, T., Weissenbach, J., Heilig, R., Saurin, W., Artiguenave, F., Brottier, P., Bruls, T., Pelletier, E., Robert, C., Wincker, P., Smith, D.R., Doucette-Stamm, L., Rubenfield, M., Weinstock, K., Lee, H.M., Dubois, J., Rosenthal, A., Platzer, M., Nyakatura, G., Taudien, S., Rump, A., Yang, H., Yu, J., Wang, J., Huang, G., Gu, J., Hood, L., Rowen, L., Madan, A., Qin, S., Davis, R.W., Federspiel, N.A., Abola, A.P., Proctor, M.J., Myers, R.M., Schmutz, J., Dickson, M., Grimwood, J., Cox, D.R., Olson, M.V., Kaul, R., Raymond, C., Shimizu, N., Kawasaki, K., Minoshima, S., Evans, G.A., Athanasiou, M., Schultz, R., Roe, B.A., Chen, F., Pan, H., Ramser, J., Lehrach, H., Reinhardt, R., McCombie, W.R., de la, B.M., Dedhia, N., Blocker, H., Hornischer, K., Nordsiek, G., Agarwala, R., Aravind, L., Bailey, J.A., Bateman, A., Batzoglou, S., Birney, E., Bork, P., Brown, D.G., Burge, C.B., Cerutti, L., Chen, H.C., Church, D., Clamp, M., Copley, R.R., Doerks, T., Eddy, S.R., Eichler, E.E., Furey, T.S., Galagan, J., Gilbert, J.G., Harmon, C., Hayashizaki, Y., Haussler, D., Hermjakob, H., Hokamp, K., Jang, W., Johnson, L.S., Jones, T.A., Kasif, S., Kaspryzk, A., Kennedy, S., Kent, W.J., Kitts, P., Koonin, E.V., Korf, I., Kulp, D., Lancet, D., Lowe, T.M., McLysaght, A., Mikkelsen, T., Moran, J.V., Mulder, N., Pollara, V.J., Ponting, C.P., Schuler, G., Schultz, J., Slater, G., Smit, A.F., Stupka, E., Szustakowki, J., Thierry-Mieg, D., Wagner, L., Wallis, J., Wheeler, R., Williams, A., Wolf, Y.I., Wolfe, K.H., Yang, S.P., Yeh, R.F., Collins, F., Guyer, M.S., Peterson, J., Felsenfeld, A., Wetterstrand, K.A., Patrinos, A., Morgan, M.J., 2001. Initial sequencing and analysis of the human genome. Nature 409, 860 – 921. Lehman, H.E., 1983. Chordate Development: A Practical Textbook with Atlases and Techniques for Experimental and Descriptive Embryology. Hunter Textbooks, Winston-Salem, NC. Lombard, R.E., Bolt, J.R., 1979. Evolution of the tetrapod ear: an analysis and interpretation. Biol. J. Linn. Soc. Lond. 11, 19 – 76. Mahairas, G.G., Wallace, J.C., Smith, K., Swartzell, S., Holzman, T.,

T. Miyake, C.T. Amemiya / Comparative Biochemistry and Physiology, Part C 138 (2004) 233–244 Keller, A., Shaker, R., Furlong, J., Young, J., Zhao, S., Adams, M.D., Hood, L., 1999. Sequence-tagged connectors: a sequence approach to mapping and scanning the human genome. Proc. Natl. Acad. Sci. U. S. A. 96, 9739 – 9744. Martinez, P., Amemiya, C.T., 2002. Genomics of the Hox gene cluster. Comp. Biochem. Physiol., Part B Biochem. Mol. Biol. 133, 571 – 580. Miya, T., Morita, K.U.N., Satoh, N., 1996. An ascidian homologue of vertebrate BMPs 5–8 is expressed in the midline of the anterior neuroectoderm and in the midline of the ventral epidermis of the embryo. Mech. Dev. 57, 181 – 190. Miya, T., Morita, K., Suzuki, A., Ueno, N., Satoh, N., 2001. Functional analysis of an ascidian homologue of vertebrate BMP-2/BMP-4 suggests its role in the inhibition of neural fate specification. Development 124, 5149 – 5159. Miyake, T., McEachran, J.D., Hall, B.K., 1992. Edgeworth’s legacy of cranial muscle development with an analysis of muscles in the ventral gill arch region of batoid fishes (Chondrichthyes: Batoidea). J. Morphol. 212, 213 – 256. Miyake, T., Vaglia, J.L., Taylor, L.H., Hall, B.K., 1999. Development of dermal denticles in skates (Chondrichthyes Batoidea): patterning and cellular differentiation. J. Morphol. 241, 61 – 81. Miyake, T., Sumiyama, K., Dickson, M., Grimwood, J., Irvine, S., Schmutz, J., Stuart, A., Myers, R.M., Ruddle, F.H., Amemiya, C.T., 2002. Genomic identification and analysis of shared cis-regulatory elements in a developmentally critical homeobox cluster. A Meeting on Evolution of Developmental Diversity, Cold Spring Harbor Laboratory, April 17–April 21, New York. pp. 69. (Abstract). Mortlock, D.P., Guenther, C., Kingsley, D.M., 2003. A general approach for identifying distant regulatory elements applied to the Gdf6 gene. Genome Res. 13, 2069 – 2081. Nelson, J.S., 1984. Fishes of the World. Wiley, New York. Ness, S.R., Terpstra, W., Krzywinski, M., Marra, M.A., Jones, S.J., 2002. Assembly of fingerprint contigs: parallelized FPC. Bioinformatics 18, 484 – 485. Northcutt, R.G., Gans, C., 1983. The genesis of neural crest and epidermal placodes: a reinterpretation of vertebrate origins. Q. Rev. Biol. 58, 1 – 28. Osoegawa, K., Woon, P.Y., Zhao, B., Frengen, E., Tateno, M., Catanese, J.J., de Jong, P.J., 1998. An improved approach for construction of bacterial artificial chromosome libraries. Genomics 52, 1 – 8. Osoegawa, K., de Jong, P., Frengen, E., Ioannou, P., 1999. Construction of Bacterial Artificial Chromosome (BAC/PAC) Libraries. In: Dracopoli, N.C., Haines, J.L., Korf, B.R., Moir, D.T., Morton, C.C., Seidman, C.E., Seidman, J.G., Smith, D.R. (Eds.), Current Protocols in Human Genetics. Wiley, New York, pp. 5.15.1 – 5.15.32. Pancer, Z., Amemiya, C.T., Ehrhardt, G.R.A., Ceitlin, J., Gartland, G.L., Cooper, M.D., 2004. Somatic diversification of variable lymphocyte receptors in the agnathan sea lamprey. Nature 430, 174 – 180. Pennacchio, L.A., Rubin, E.M., 2001. Genomic strategies to identify mammalian regulatory sequences. Nat. Rev., Genet. 2, 100 – 109. Pennacchio, L.A., Rubin, E.M., 2003. Comparative genomic tools and databases: providing insights into the human genome. J. Clin. Invest. 111, 1099 – 1106. Powers, T.P., Amemiya, C.T., 2004. Evidence for Hox 14 paralog group in vertebrates. Curr. Biol. 14, R183 – R184. Romanov, M.N., Price, J.A., Dodgson, J.B., 2003. Integration of animal linkage and BAC contig maps using overgo hybridization. Cytogenet. Genome Res. 102, 277 – 281. Ross, M.T., LaBrie, S., McPherson, J., Stanton, V.P., 1999. Screening largeinsert libraries by hybridization. In: Boyl, Ann (Ed.), Current Protocols in Human Genetics. Wiley, New York, NY, pp. 5.6.1 – 5.6.52. Schultze, H.-P., 1993. Patterns of diversity in the skulls of jawed fishes. In: Hall, B.K., Hanken, J. (Eds.), The Skull. The University of Chicago Press, Chicago, pp. 189 – 254. Shashikant, C.S., Kim, C.B., Borbely, M.A., Wang, W.C., Ruddle, F.H., 1998. Comparative studies on mammalian Hoxc8 early enhancer

243

sequence reveal a baleen whale-specific deletion of a cis-acting element. Proc. Natl. Acad. Sci. U. S. A. 95, 15446 – 15451. Shizuya, H., Kim, U.-G., Mancino, V., Slepak, T., Tachiiri, Y., Simon, M., 1992. Cloning and stable maintenance of 300-kilobase-pair fragments of human DNA in Escherichia coli using an F-factor-based vector. Proc. Natl. Acad. Sci. U. S. A. 89, 8794 – 8797. Sire, J.-Y., 1989. Scales in young Polypterus senegalus are elasmoid: new phylogenetic implications. Am. J. Anat. 186, 315 – 323. Smith, M.M., 1978. Enamel in the oral teeth of Latimeria chalumnae (Pisces: Actinistia): a scanning electron microscopy study. J. Zool., London 185, 355 – 369. Smith, M.M., Hall, B.K., 1990. Developmental and evolutionary origins of vertebrate skeletogenic and odontogenic tissues. Biol. Rev. Camb. Philos. Soc. 65, 277 – 374. Smith, M.M., Hall, B.K., 1993. A developmental model for evolution of the vertebrate exoskeleton and teeth: the role of cranial and trunk neural crest. Evol. Biol. 27, 387 – 448. Sower, S.T., 1998. Brain and pituitary hormones of lampresy, recent findings and their evolutionary significance. Am. Zool. 38, 15 – 38. Stevens, R.C., Yokoyama, S., Wilson, I.A., 2001. Global efforts in structural genomics. Science 294, 89 – 92. Strong, S.J., Mueller, M.G., Litman, R.T., Hawke, N.A., Haire, R.N., Miracle, A.L., Rast, J.P., Amemiya, C.T., Litman, G.W., 1999. A novel multigene family encodes diversified variable regions. Proc. Natl. Acad. Sci. U. S. A. 96, 15080 – 15085. Sumiyama, K., Ruddle, F.H., 2003. Regulation of Dlx3 gene expression in visceral arches by evolutionarily conserved enhancer elements. Proc. Natl. Acad. Sci. U. S. A. 100, 4030 – 4034. Sumiyama, K., Irvine, S.Q., Stock, D.W., Weiss, K.M., Kawasaki, K., Shimizu, N., Shashikant, C.S., Miller, W., Ruddle, F.H., 2002. Genomic structure and functional control of the Dlx3–7 bigene cluster. Proc. Natl. Acad. Sci. U. S. A. 99, 780 – 785. Takio, Y., Pasqualettis, M., Kuraku, S., Hirano, S., Rijli, F.M., Kuratani, S., 2004. Evolutionary biology: lamprey Hox genes and the evolution of jaws. Nature 429, 1 – 2. Thomas, J.W., Prasad, A.B., Summers, T.J., Lee-Lin, S.Q., Maduro, V.V., Idol, J.R., Ryan, J.F., Thomas, P.J., McDowell, J.C., Green, E.D., 2002. Parallel construction of orthologous sequence-ready clone contig maps in multiple species. Genome Res. 12, 1277 – 1285. Thomas, J.W., Touchman, J.W., Blakesley, R.W., Bouffard, G.G., Beckstrom-Sternberg, S.M., Margulies, E.H., Blanchette, M., Thomas, P.J., McDowell, J.C., Maskeri, B., Hansen, N.F., Schwartz, M.S., Weber, R.J., Kent, W.J., Karolchik, D., Bruen, T.C., Bevan, R., Cutler, D.J., Schwartz, S., Elnitski, L., Idol, J.R., Prasad, A.B., Lee-Lin, S.Q., Maduro, V.V., Summers, T.J., Portnoy, M.E., Dietrich, N.L., Akhter, N., Ayele, K., Benjamin, B., Cariaga, K., Brinkley, C.P., Brooks, S.Y., Granite, S., Guan, X., Gupta, J., Haghighi, P., Ho, S.L., Huang, M.C., Karlins, E., Laric, P.L., Legaspi, R., Lim, M.J., Maduro, Q.L., Masiello, C.A., Mastrian, S.D., McCloskey, J.C., Pearson, R., Stantripop, S., Tiongson, E.E., Tran, J.T., Tsurgeon, C., Vogt, J.L., Walker, M.A., Wetherby, K.D., Wiggins, L.S., Young, A.C., Zhang, L.H., Osoegawa, K., Zhu, B., Zhao, B., Shu, C.L., de Jong, P.J., Lawrence, C.E., Smit, A.F., Chakravarti, A., Haussler, D., Green, P., Miller, W., Green, E.D., 2003. Comparative analyses of multi-species sequences from targeted genomic regions. Nature 424, 788 – 793. Thorgaard, G.H., Bailey, G.S., Williams, D., Buhler, D.R., Kaattari, S.L., Ristow, S.S., Hansen, J.D., Winton, J.R., Bartholomew, J.L., Nagler, J.J., Walsh, P.J., Vijayan, M.M., Devlin, R.H., Hardy, R.W., Overturf, K.E., Young, W.P., Robison, B.D., Rexroad, C., Palti, Y., 2002. Status and opportunities for genomics research with rainbow trout. Comp. Biochem. Physiol., B 133, 609 – 646. Wada, H., Holland, P.W.H., Satoh, N., 1996. Origin of patterning in neural tubes. Nature 384, 123. Wagner, G.P., Amemiya, C.T., Ruddle, F.H., 2003. Hox cluster duplications and the opportunity for evolutionary novelties. Proc. Natl. Acad. Sci. U. S. A. 100, 14603 – 14606. Watanabe, M., Kobayashi, N., Fujiyama, A., Okada, N., 2003. Construction

244

T. Miyake, C.T. Amemiya / Comparative Biochemistry and Physiology, Part C 138 (2004) 233–244

of a BAC library for Haplochromis chilotes, a cichlid fish from Lake Victoria. Genes & Genet. Syst. 78, 103 – 105. Yoder, J.A., Nielsen, M.E., Amemiya, C.T., Litman, G.W., 2001. Zebrafish as an immunological model system. Microbes Infect. 4, 1469 – 1478. Zapata, A.G., Amemiya, C.T., 2000. Phylogeny of lower vertebrates and their immunological structures. In: Du Pasquier, L. (Ed.), Origin and

Evolution of the Vertebrate Immune System. Springer-Verlag, Heidelberg, pp. 67 – 110. Zylberberg, L., Sire, J.Y., Nanci, A., 1997. Immunodetection of amelogenin-like proteins in the ganoine of experimentally regenerating scales of Calamoichthys calabaricus, a primitive actinopterygian fish. Anat. Rec. 249, 86 – 95.