Gene, 161 (1995) 63-67 © 1995 Elsevier Science B.V. All rights reserved. 0378-1119/95/$09.50
63
GENE 09018
Sequences of nifX, nifW, nifZ, nifB and two ORF in the Frankia nitrogen fixation gene cluster (Actinomycete; actinorhizal; root nodule; nitrogenase; n/f)
O l i v i a T. H a r r i o t t *, T h o m a s J. H o s t e d * a n d D a v i d R. B e n s o n Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT06268-3044, USA Received by C.R. Hutchinson: 24 January 1995; Revised/Accepted: 16 March/28 March 1995; Received at publishers: 25 April 1995
SUMMARY
The actinomycete Frankia alni fixes N2 in root nodules of several non-leguminous plants. It is one of the few known Nz-fixing members of the high-GC Gram + lineage of prokaryotes. Thus, we have undertaken a study of its nitrogen fixation gene (n/f) organization to compare with that of the more extensively characterized proteobacteria. A cosmid (pFN1) containing the n/f region of Fa CpI1 was isolated from a cosmid library using the nifHDK genes of Fa CpI1 as a probe. A 4.5-kb BamHI fragment that mapped downstream from the previously characterized n/fHDK genes was cloned and sequenced. Based on nt and aa sequence similarities to n/f from other Nz-fixing bacteria, eight ORF were identified and designated nifX, off3, orfl, nifW, n/fZ, nifB, orf2 and n/fU. A region that hybridized to Rhizobium meliloti and Klebsiella pneumoniae n/fA did not appear to contain a n/fA-like gene. We have revised the map of the Fa n/f region to reflect current information.
[NTRODUCTION
Strains of Frankia are strictly aerobic euactinomycetes that form mutualistic N2-fixing root nodules on some nonleguminous plants (actinorhizal plants) (Benson and Silvester, 1993). At least five n/f genes (HDKAB) have been detected by heterologous hybridization in Frankia (Simonet et al., 1988; Mullin and An, 1990), and nifHD and K, that encode the three polypeptides of the nitroCorrespondence to: Dr. D.R. Benson, Department of Molecular and Cell Biology, Box U-44, University of Connecticut, Storrs, CT 06279-3044, USA. Tel. (1-203) 486-4258; Fax (1-203) 486-1784; e-mail:
[email protected] * Present addresses: (O.T.H.) Center for Agricultural Molecular Biology, Rutgers University, Cook College, New Brunswick, NJ 08903, USA. Tel. (1-908) 932-1586; (T.J.H.) Eli Lilly Co., Lilly Corporate Center, Indianapolis, IN 46285, USA. Tel. (1-317) 276-5318.
genase complex, have been sequenced (Normand and Bousquet, 1989; Normand et al., 1992; Hirsch et al., 1995). NifB from Kp synthesizes a low-molecular-mass Fe-S cluster precursor of the FeMo cofactor of nitrogenase (Shah et al., 1994). NifA activates c~54-mediated transcription of n/f and some fix operons in response to low p O 2 and/or nitrogen starvation in diazotrophs from the at and 7 groups of the purple bacteria (Dean and Jacobson, 1992). This report describes the characterization of a 4.5-kb DNA fragment containing eight Frankia nif genes. Although heterologous nifA probes hybridized to some cloned fragments of this n/f region, no nt or aa similarity to n/fA is present. This region is linked to the structural genes for the nitrogenase Fe protein and MoFe protein.
EXPERIMENTAL AND DISCUSSION Abbreviations: aa, amino acid(s); Ac, Azorhizobium caulinodans; Av, Azotobacter vinelandii; bp, base pair(s); Fa, Frankia alni; kb, kilobase(s) or 1000 bp; Kp, Klebsiella pneumoniae; nif, genes encoding proteins necessary for nitrogen fixation; nt, nucleotide(s); ORF (Orf), open reading frame(s); Rc, Rhodobacter capsulatus; Rm, Rhizobium meliloti. SSD1 0378-1119(95)00300-2
(a) Isolation of the nif region of Fa CpI1 Plasmids and strains used in this study are listed in Table I. The plasmid pFN1 was isolated from a pLAFR3
64 TABLE I Bacterial strains and plasmids Strain or plasmida Strains: Fa CpI1 E. coli XL1-Blue Plasmids: pBluescriptlI pFR140 pFN1 pFN2.1 pFNB4.5 pJE294 pCE300 pFQ149
Genotype or properties
Reference
Comptonia peregrina isolate, wild type supE44 hsdR17 recA1 endA1 gyrA 46 thi relA1 lac IF' proAB + lacI lacZAM15 Tnl0(TcR)]
(Callaham et al., 1978) Stratagene, La Jolla, CA, USA
Am ~ Tc R, pUC18-derived cloning vector Am R (pUC19), 7.1-kb BamHI insert with nifHDK Tc R [pLAFR3], 28-kb BamHI insert pBluescriptSK-, 2.1-kb SstI insert pBluescriptSK-, 4.5-kb BamHI insert Kp nifA Rm nifA Fa ArI3 4.5-kb n/f clone containing nifB
Stratagene (Rochefort and Benson, 1990) (Hosted, 1992) This study This study (Santero et al., 1989) C. Earl (Simonet et al., 1988)
a Escherichia coli strains were grown at 37°C in Terrific Broth or 2 x YT medium. When required ampicillin (100 gg/ml) or tetracycline (10 pg/ml) was added. Fa CpI1 was grown in defined succinate medium (Noridge and Benson, 1986).
cosmid library ofFa CpI1 genomic DNA by probing with CpI1 nifHDK genes in pFR140. A 4.5 kb BamHI fragment was subcloned from pFN1 and designated pFNB4.5 (Fig. 1). In its restriction map, and hybridization pattern to heterologous nifA probes, the insert in pFNB4.5 is identical to that in pFQ149, a clone containing the homologous region from the closely related Fa ArI3 (Simonet et al., 1988). pFNB4.5 hybridized strongly to appropriately sized bands of restriction digested Fa CpI1 genomic DNA confirming that pFNB4.5 originated from Fa CpI1 (not shown).
(b) Sequence analysis of pFNB4.5 The strategy used to sequence pFNB4.5 and the resulting sequence are recorded in Figs. 1 and 2. High-%G + C Gram ÷ bacteria like Frankia generally have a G or C in the third codon position. Thus, ORF were initially circumscribed on the basis of third position bias." ORF were identified by the similarity of their inferred aa sequences to aa sequences of n/f genes from other Ne-fixing bacteria. Altogether, eight n/f genes were identified including nifX, [ HD
I
E
I
I I
E
KB H
l,
5kb
K
E~,,,
I', ',',
KB EB H K
I. . . . . . . . . . . . . . . . . . I
.I
I
II
B
KH
I.
. . . . . . .
,,
B
SH
I
K
P
P K
sP
s
SB
I
pFN1
"l i
pFNB4.5
I
5OO bp
Fig. 1. Physical and genetic map of pFNB4.5 and its position relative to nifHDK. B, BamHI; Bg, BgllI; H, HindlII; K, KpnI; P, PstI; S, SstI. The area under the bracket spanning nifZ and nifB represents the region that hybridized to Rm and Kp nifA. Stippled boxes represent known genes whose sequences have not been reported.
orfl, off3, nifW, nifZ, nifB, orf2 and nifU. All are transcribed in the same direction and, with the exception of
nifB, are preceded by G G A G or G A G G potential ShineDalgarno regions. The most probable Shine-Dalgarno region for nifB is G G A C G beginning 12 bp before the putative G T G start codon. The genes reported here overlap in six instances. Sufficient sequence information was obtained to identify orf2 and nifU but is not reported because the complete sequence of both strands was not obtained. The characteristics of each gene product are listed in Table I.
(c) nifX, off3, orfl, nifW and nifZ The Fa CpI1 nifX gene is upstream from off3 as in Av and Rc (Jacobson et al., 1989; Moreno-Vivian et al., 1989). Overlapping the nifX GTG start codon is a TGA stop codon in frame with upstream codons having > 90% GC bias in the third position, probably indicating an additional ORF. Frankia NifX has 30% predicted aa sequence identity to Rc, 27% to Av and 28% to Kp nifX genes. Accounting for conservative substitutions, Fa NifX has 54% similarity to NifX from all three microorganisms. The role of NifX is unclear but has been suggested to help negatively regulate nitrogenase synthesis (Gosink et al., 1990). A 444-bp ORF that overlaps nifX by four nt shows 40% predicted aa sequence identity, and 62% aa similarity, to Orf3 of Av and Orf4 of Rc (Jacobson et al., 1989; Moreno-Vivian et al., 1989). Overlapping the 3' end of orf3 by 11 nt, is a 249-bp ORF with 33% derived aa sequence identity, and 55% aa sequence similarity, to Ac Orfl (Arigoni et al., 1991). The functions of Orf3 and Orfl are not known.
65 ~GCt~3C~CQC QACGOCAGA,ACC CGAC C ~ C T C C,AC.~.~C CTGTT C ~
G
~
CC C G ~
M
~GAT
L
C
K
~
I
~ A C
~
A
F
A
T
C<~GTCACACCT G T T C G A G A C C C G G C T G C T C G A G O C G G C G T C G G A C G A C G A G C A G G A C A A ~ T ~ G S H L F E T R L L E A A S D D E Q D K I E S
A R
G L
~ A
nlfX
~
A
T
D
P
G S
~
G A C T C G T C G T G C T C GACAACGC(3C ~ A C ~ R L V V L D N A L R D V
A Q
~ R
CC-CC CC,C G A T C A C T C C TGAGCCGCCAC, C C C C G G A C A C C G C G A G C C G C G A C G C C G A G ~ V A R D D S *
E
M
T
P
E
P
P
A
P
D
T
A
S
R
D
A
E
D
Q
~
A
C
R
A
G
v
~ F
G
F
N
~ ~ ~
N
L
D
A
D
L
P
I
G
W
E
G
L
L
D
V
A
R
R
T
Y
D
A
Y
A
E
I
A
A
D
~ C
A
P
L
D
Q
v
V
S
Y
A
~ E
S
E
~
A
~
A
Q
G
A N
G
R
~ K
A
A
C
Q
L
D ~
E
K
T
A
R
G
F
E
~ A
~ L
L
R
K
L
E
K
H
S
~ E
A
S
R
G C A G T A C T T C G A C C T G C T G G A G G T C G A C T A C G A C C C G C G G G T CGTGC,C E Q Y F D L L E V D Y D P R V V A
GGCATCTCCTC, C G T G A C T A C C C ~ G A C C ~ C T C G T C C G C T C C T A C G A G G C G T T C A C C A A C G C C R H L L R D Y R D A L V R S Y E A F T N A G
~ V
G A
~ R
~ A C A T ~ L H I L
G C G C L D H
G C ~ A C R L F K
N
E
I
L
V
E
A
G
T
R
G
Q
v
I
N
V
G
L
Y
L
Q
A C A T ~ T A ~ A T ~
E
H
I
V
Y
A
I
~ ~ A T V L K
~ D
~
R
~ A
G A
~ K
C V
Q
~ P
A
~ A
S
Q
T
A T
G
A
A
N
G
~
A
K
L
T
C
~
S
A
R
I
~
F
E
L
V
G
~
Q
~ A
~ G ~ P A K
E
~ A C ~ A A T P
CGGCACGCC CC~GCGAAGGACGGT C G A C G G T G A C G G G G G A G C C G G C G T T C GC C G C C G C ~ C C ~ C G ~ G T P G A K D G R R * nif8 M T G E P A F A A A P A G G L
~
~
P
~ A
A
R
R
G TGCA GG AC CCGGAGATCGCC GA GAAGATCGCCAACCACCCGTGCTA CA C G G C C G A G G C G C A C C A G T A C T A C G C C C G C A ~ V Q D P E I A E K I A N H P C Y T A E A H Q Y Y A R M
H
G K
~
~ P
C A
~ T
A P
~ A
~
A
W
R
450 132
T
~
~ ~A
600 143 39
A
P
F
V
V
T
I
L
~ L
N
S
E
G
89
~ A
A S
W
900 139
750
~
A
K
L
T
I
E
R
~AC~CC
R
L
K
T
T
D
A
G
Q
L
D
L
D
~ A
A G
G A
~ D
~
H
~
~ ~C D
L
~
1050 147 45
A
~
~A
1200 82
C
D
F
~
E
F
13
N
~ P
1350 63
~ C G A C A T ~ A C ~ V P T S E I T V
1500 113
~AC ~ ~AC~
N
D
G
A
L
E
R
T
Y
~ ~
L
~ P
~
~ ~ P P
A
T
C
C
A
Q
A
P
A
S
A
G
~ G
~ C
K
T
D
P
E
A
A
~ K A
V
~ A
V
A
~ ~ P G
~ C
N
~ I
Q
C
N
F
C
E
~AT ~
A
I
S
~ A G ~
L
A
A
~GAT~
P
~ E
S ~
32 300 82
~ D
~
R
K
~
K
150
E
*
~
C
V
S
E
C G C G A C G C A G G C G G A G C C C G C G G C C A ~ G C C C A C C G C C G C C G C G G C G G C G G A G G G C T C G G C C C C G G C C G C ~ ~ A ~ ~ C C ~ ~ A C ~ ~ G ~ A T Q A E P A A T P T A A A A A E G S A P A A V L V A E K V P A E P A A H G A G G H A A K GTC CGCGAACTGCAAGTCC GC CGCA GC GGAC GC CGAGCCCGCGGTAC CGAGCACGOC~ S A N C K S A A A D A E P A V P S T P
P
L T
T
V
A
C
A
L
~G
G
~
~ ~
~ T
~ AG~ ~ P Q A A
A
~
M
~ A
~ A
~ A ~ ~ ~ ~ A C ~ C A K H F A G E L A K L Q
A G C C,C92C G G G C T C G A C C C A G C C A T C G G A A A C G T C C GAGCC~ACGG/~C-GAC~C~ C G A T ~ C C A C ~ C AC ~ A C ~ A C ~ C G ~ A T Q ~ P G S T Q P S E T S E R T E E A R * M T T N T Y D V G D V V C G A G A T C C T G G T C G A GC-CC G G C A C C C G C G G T C A G G T C A T C A A C G T C G G C C T G T A C C T G C A G G A ~
G
C T
~ ~ ~ ~
M A CC GCTGCT CCACCGC~ N R C S T A
V
D
C
~
S
E
~
C
K
A
T
y
R
A A
G
V
~ ~ E P
F
~
D
A
A
~ ~
F
G
~
~AC~AC
R
I
R
~ A
A
D
~
A
~ ~ c ~ L
~
~ ~ ~ ~ ~
R
~
I
T
~
R
~
~ ~ ~ g V L ~
C
C
L
G
L
W
T
~
~ E
G
I
~
Q
~
F
C
A
~ ~ A A T
~
H
~
L
D
~ S
L
V
~
G A G G A C C T C C C G A T C G G C T G G G A G G G C C T G C T C G A C G T C O C C C G C C G G A CC T A T G A C G C C T A T G C C G A G A T ~ C ~ E
A
P
GK3CT G C G G T C G T T C T A C C A G G C C G T C ~ R L R S F Y Q A V A
L
A
~ A C ~ A C
T
~
R
L
~
V ~
A G G A G C A G C G C C G C A C G A T C G C G A C C A A C T G C G A C C T C G A T A T C G C C A C CC,C ~ C C K E Q R R T I A T N C D L D I A T A A
D
~
~
G
A G C G T C C ~ G C G G A C - C T C*AAC~AC, A C G C G G C A T G A G C G A G G C G A A G G A ~ ~ S V A A E L K G D A A * E/f3 M S E A K D F L
T CC,G T C G G G T G A T C A T T T T T C ~ T C F G R V I I F A G
T
C
D ~
AAGATCCACCC GGTGAAGGTGGC CAAGGGCACC AGCG TGGAGGAC CT GC T C A C C C G C T A C G ~ A C ~ A C K I H P V K V A K G T $ V E D L L T R Y V R D A G R ~ ~
A
G
C P
G
~ ~A
D
A
D
82
T
C
K
H
G
1950 132
K
T
V
G
A
2100 182
C
C
T
S
A
P
2250 192 40
~ C ~ G ~ C G A C ~ C N R K F D C
A
~C N
2400 90
T
F
R
2550 140
P
E
V
G
2700 190
~ T G A T N S V M
I
P
G
2850 240
E
L
K
3000 290
T
A
G
T
~
S
A
~
C
G
C
G A G A G C C G G C c G G G C G T C A C G A G C A C C C T G C T G A C T C C C G A G G A C G C G C T C G C C A A G G T ~ C ~ ~ A ~ T ~ ~ ~ ~ ~ G A C ~ C E
S
R
P
G
V
T
S
T
L
L
T
P
E
D
A
L
A
K
V
N
V
V
A
S
E
I
K
Q
A C G A T C G A G C T G G T G G C C C G G G A C T G C C C G G A C A T C A A G C T G T G C C T GT C G A C C A A C GGCCTC, C G C C T G C C G G A G ~ T I E L V A R D C P D I K L C L S T N G L R L P E F V D GAC,CC,C,A T C T A C C C G T G G G T C C , C C T G G C G GGC,C A A G C G G T A C A C C G G C C G G G A G G C C T C G A A G A T C C ~ A G ~ A G ~ E R I Y P W V A W R G K R Y T G R E A S K I L S E Q Q
M
S
V
b
G
I
~AT ~ R I A
~ D
D
N
V
G
D
~ CCAC~CGAT D H V T
I
A
N
~ ~AT~CAT T I N
M
P
K
~ T C ~ I D
P ~
A
L
T
E
~ R
K
I
C L
C
K
V
ATcAACGACGAC~CATCTCGTCGAGGTCTC~CGGACCGTCAAGGGGCTCGGGC,CCTTCCTGCACAACG~AT~~~AC~~CA~G~CCCC~AG~G I N D E H L V E V 5 R T V K G L G A F L H N V M P L V S A
P
E
H
G
T
V
F
G
L
T
G
Q
R
G
P
T
P
Q
GCCCTGCAGGA CC GG TGCGAGCAGG AC GACGGCGCCGAGAT GAACATGA TG CGCCACTGCCGGCAGT GC ~ A L Q D R C E Q D P G A E M N M M R H C R Q C R
G
~ L
L
G
E
D
R
A G
C D
D
F
~ L
A ~ P E T
F
Q
G
~ ~ R E
~ ~ T C ~ L N I
A
T
D
V
V
~ R
C C ~ P A E
G
A
G
W
T
R
T A C G A C C T C C~CC,G C C G C C A G C A G G C G C A C G A G G A G A T C G A G C G C TGC,C G C A C C G A G G T C G C C G C G A C C ~ A G A C Y D L A G R Q Q A H E E I E R W R T E V A A T R Q
T
CTCGTCGCGGTCGCCAC L
V
A
V
A
CAAGGGCA GC GGCG TC GT CA AC CA GCACTTCCGCCACGCCGGCGAG T
K
G
S
G
V
V
N
Q
H
F
G
H
A
G
~CTGGAT E
F
W
~ A
A V
CTAC GA G G ~ I
Y
E
A
~
~
~
~ G
~
P
~ G
C
A A
G
A
~ R
C
~
~
R
T
A
C
L
T
L
A
D
~
P
L
L
~
P
G
~
~
G
A
A
~
A
V
P ~
C Q
~ C ~ ~ C p A A C
~ D
A V
~ E
P
~
R
Y
C D
~ C C
~ N
G A C G A G G A C G C C T C C A A G C T C G A C C C ' C A C C A T C G A G A T G T T G T C C G A C T G C G C C G C T G T G C ~ ~ ~ A ~ A ~ A T ~ T C ~ T C T A T G A C ~ C ~ G ~ G D E D A S g L D R T I E M L S D C A A V L C S K I G L G P R E A L E E A G I E P V E I Y D G T G G C C G A A A T C G G C G C C C G C C T C G T C A C G G A C C GCGCCCCAGCC, G A C,G T A G G C A C C C C A T G A C C G C G A C C G C C A C C ~ C A V A E I G A R b V T D R A P A E V G T P * off2 M T A T A T V
~ ~C~AC~AC~G I
E
L
T
D
K
1650 132 32 1800
E
~ A
~
I
G
~
~
~ ~ C I V
3150 340
~C V
3300 390
~ ~ ~AC ~C
3450
P
S
V
D
C
440
3600 L
I
D
K
A
490
3698 510 13
Fig. 2. Nucleotide sequence of pFNB4.5 encoding the nifX-orf3-orfl-n!fW-nifZ-nifB region from Fa CpI1 and deduced aa sequences. Putative ShineDalgarno sequences are underlined. Methods: For sequencing, overlapping deletion clones were created for subclone pFN2.1 (for the noncoding strand) and pFNB4.5 (for the coding strand) using the Erase-a-Base system (Promega, Madison, WI, USA). Restriction fragments of pFNB4.5 were subcloned into pBluescript II vectors. Single stranded DNA was prepared using phage VCSM 13 (Stratagene, La Jolla, CA, USA). Plasmid isolation, restriction endonuclease digestions, and DNA cloning were carried out using standard techniques (Sambrook et al., 1989: Lee and Rasheed, 1990). DNA was sequenced using TAQuence 2.0 (US Biochemical, Cleveland, OH, USA) with 7-deaza-2'-dGTP in place of dGTP to minimize secondary structure formation in the high-% G+C Fa DNA. The insert in pFNB4.5 was sequenced in both directions except for a short segment of nifX where high GC content made reading of one strand difficult. In this case, redundant sequences were obtained using both 35S-labelled nucleotides and automated laser fluorescence (ALF) sequencing and the sequence derived was unambiguous. Programs used to analyze the data were from the GCG Sequence Analysis Software Package (Genetics Computer Group, Madison, WI, USA). Sequences used in the aa sequence alignments were obtained from GenBank, release 72.0 (Intelligenetics, Mountain View, CA, USA). This sequence has been deposited with GenBank {accession No. L29299).
Fa N i f W has 39% identity and 65% similarity to Ac N i f W (Arigoni et al., 1991), 28% identity and 55% similarity to Av N i f W (Jacobson et al., 1989) and 28% identity and 49% similarity to Kp N i f W (Paul and Merrick, 1987). N i f W m a y play a role in homocitrate processing
( M a s e p o h l et al., 1993). Homocitrate is an organic component of the F e M o cofactor in nitrogenase ( H o o v e r et al., 1989; Wang et al., 1991; Meijer and Tabita, 1992). Fa NifZ has 29% identity and 47% similarity to Av NifZ, and 24% identity and 45% similarity to Kp NifZ.
66 Several direct repeats found at the 3' end of nifZ are reflected in the aa sequence as an Ala + Pro-rich region. The role of NifZ is unclear, although studies in Av suggest that NifZ might be required for the maturation of the MoFe protein (Jacobson et al., 1989). (d) nifB NifB is essential for nitrogen fixation and is highly conserved (Dean and Jacobson, 1992). It synthesizes an Fe-S cluster that forms part of the FeMo-cofactor that is present at the active site of the molybdenum-iron protein of nitrogenase (Shah et al., 1994). Fa nifB has a predicted molecular mass of 55 078 Da (see Table II), a value that is close to that of other NifB proteins. It is most similar to Anabaena NifB with 58% aa identity and 75% similarity, followed by Av NifB (53% and 69%), Rm NifB (50% and 66%) and Kp NifB (49% and 69%). The nine conserved cysteine residues found in other NifB proteins are also present in Fa NifB. The start of nifB overlaps the 3' end of nifZ by 2 nt, suggesting that nifB is cotranscribed with nifX, off3, off1, nifW and nifZ. However, several repeated sequences, ranging in length from 11 to 33 nt, and within the 3' region of nifZ, may suggest the presence of a regulatory region such as occur in promoter regions of genes in Streptomyces sp. (Delic et al., 1992). In the only other NifB sequence from a Gram + organism, Clostridium pasteurianurn, nifB is fused to the 3' end of nifN and is transcribed as part of the nifEN-B operon (Dean and Jacobson, 1992; J.-S. Chen, personal communication). In the Gram-diazotrophs Av, Kp, Anabaena sp. strain 7120 and Rc, nifB is transcribed from its own promoter (Buikema et al., 1987; Joerger and Bishop, 1988; Klipp et al., 1988; Masepohl et al., 1988; Mulligan and Haselkorn, 1989). Sequence of the region directly downstream from nifB indicates the presence of two other ORF. We have desigTABLE II Properties of predicted polypeptides from pFNB4.5 genes ORF
Positions a (nt)
N u m b e r of aa b
Molecular mass b (Da)
pI
n~X off3 orfl n~W n~Z n~B
55-486 483 926 916-1164 1161 1559 556-2134 2131 3663
143 147 82 132 192 510
15407 16 166 9081 14822 19002 55 078
6.78 4.73 4.74 5.08 5.14 5.06
N u m b e r s refer to positions in the sequence shown in Fig. 2. Includes the first position in the initiation codon and the third position in the stop codon. u Includes the N-terminal Met.
nated one as off2; it has 42% sequence identity and 64% similarity to off6 from Av. The other ORF was designated nifU due to its predicted aa similarity to the carboxyl domain of NifU from other diazotrophs. Truncated forms of nifU have been reported from Rc and Bradyrhizobium japonicum (Dean and Jacobson, 1992). The reading frame bias and aa alignments, including placement of conserved cysteine residues, are consistent with these designations. (e) Hybridization to Rm and Kp nifA Plasmid pFNB4.5 has a restriction map identical to that of pFQ149 (Simonet et al., 1988). pFQ149 contains a 4.5-kb BamHI insert from Fa strain ArI3 that hybridized with Rm nifA anff,nifB. We probed CpI1 chromosomal DNA, pFNB4.5 and pFN1 with a 945 bp PstIEcoRV fragment of pJE294 containing an internal fragment of Kp nifA. Kp nifA did not hybridize to digested Fa CpI1 chromosomal DNA but did hybridize to the 4.5-kb BamHI fragment of pFN1 and pFNB4.5, the 3.5-kb HindIII-BamHI and 1.2-kb PstI fragments of pFNB4.5 (Fig. 1). Identical results were obtained when Rm nifA from pCE300 was used as a probe (not shown). These observations confirm previously reported hybridization results (Simonet et al., 1988). The absence of nifA in pFNB4.5 suggests that the hybridization obtained with cloned fragments of pFNB4.5 was artefactual. Short regions (25-50 nt) present upstream from nifB with 65-70% similarity to nifA probes might be sufficient to allow some binding to highcopy-number Frankia nif DNA sequences on Southern blots. Therefore, we conclude that a conserved nifA is not present in Fa CpI1, although a significantly diverged homologue cannot be ruled out,
(f) Conclusions (1) The nifX-orf3-orfl-nifW-nifZ-nifB-orf2-nifU region of Fa CpIl is about 3 kb away from the nifH-nifD-nifK region. (2) n/fA-hybridizable DNA was detected in the nifZ region but an ORF corresponding to nifA was not found. Therefore Frankia strains have not yet been shown to possess a nifA homologue.
ACKNOWLEDGEMENTS
We thank P.W. Betts for his expert technical assistance, P. Normand for pFQ149 and T. Hoover for pJES294.
REFERENCES Arigoni, F., Kaminski, P.A., Hennecke, H. and Elmerich, C.: Nucleotide sequence of thefixABC region of Azorhizobium caulinodans ORS571:
67 similarity of thefixB product with eukaryotic flavoproteins, characterization offixX, and identification of n/fW. Mol. Gen. Genet. 225 (1991) 514-520. Benson, D.R. and Silvester, W.B.: Biology of Frankia strains, actinomycete symbionts of actinorhizal plants. Microbiol. Rev. 57 (1993) 293-319. Buikema, W.J., Klingensmith, J.A., Gibbons, S.L. and Ausubel, F.M.: Conservation of structure and location of Rhizobium meliloti and Klebsiella pneumoniae nifB genes. J. Bacteriol. 169 (1987) 1120 1126. Callaham, D., DelTredici, P. and Torrey, J.G.: Isolation and cultivation in vitro of the actinomycete causing root nodulation in Comptonia. Science 199 (1978) 899-902. Dean, D.R. and Jacobson, M.R.: Biochemical genetics of nitrogenase. In: Stacey, G., Burris, R.H. and Evans, H.J. (Eds.), Biological Nitrogen Fixation. Chapman and Hall, New York, NY, 1992, pp. 763-834. Delic, I., Robbins, P. and Westpheling, J.: Direct repeat sequences are implicated in the regulation of two Streptomyces chitinase promoters that are subject to carbon catabolite control. Proc. Natl. Acad. Sci. USA 89 (1992) 1885-1889. Gosink, M.M., Franklin, N.M. and Roberts, G.P.: The product of the Klebsiella pneumoniae nifX gene is a negative regulator of the nitrogen fixation (nif) regulon. J. Bacteriol. 172 (1990) 1441-1447. Hirsch, A.M., McKhann, H.I., Reddy, A., Liao, J., Fang, Y. and Marshall, C.R.: Assessing horizontal transfer of nifHDK genes in eubacteria: nucleotide sequence of nifK from Frankia strain HFPCcI3. Mol. Biol. Evol. 12 (1995) 16-27. Hoover, T.R., Imperial, J., Ludden, P.W. and Shah, V.K.: Homocitrate is a component of the iron-molybdenum cofactor of nitrogenase. Biochemistry 28 (1989) 2768 2771. Hosted, T.J.: Studies on Frankia Symbiotic Genes. Ph. D. Thesis, University of Connecticut, Storrs, CT, 1992. Jacobson, M.R., Cash, V.L., Weiss, M.C., Laird, N.F., Newton, W.E. and Dean, D.R.: Biochemical and genetic analysis of the nifUS VWZM cluster from Azotobacter vinelandii. Mol. Gen. Genet. 219 (1989) 49 57. Joerger, R.D. and Bishop, P.E.: Nucleotide sequence and genetic analysis of the nifB-nifQ region from Azotobacter vinelandii. J. Bacteriol. 170 (1988) 1475-1487. Klipp, W., Masepohl, B. and Puhler, A.: Identification and mapping of nitrogen fixation genes of Rhodobacter capsulatus: duplication of a niJA-nifB region. J. Bacteriol. 170 (1988) 693-699. Lee, S. and Rasheed, S.: A simple procedure for maximum yield of highquality plasmid DNA. Biotechniques 9 (1990) 676-679. Masepohl, B., Angermuller, S., Hennecke, S., Hubner, P., MorenoVivian, C. and Klipp, W.: Nucleotide sequence and genetic analysis of the Rhodobacter capsulatus orf6-nifUISVW gene region: possible role of NifW in homocitrate processing. Mol. Gen. Genet. 238 (1993) 369 382.
Masepohl, B., Klipp, W. and Pfihler, A.: Genetic characterization and sequence analysis of the duplicated nifA/nifB gene region of Rhodobacter capsulatus. Mol. Gen. Genet. 212 (1988) 27-37. Meijer, W.G. and Tabita, R.: Isolation and characterization of the nifUSVW-rpoN gene cluster from Rhodobacter sphaeroides. J. Bacteriol. 174 (1992) 3855-3866. Moreno-Vivian, C., Schmehl, M., Masepohl, B., Arnold, W. and Klipp, W.: DNA sequence and genetic analysis of the Rhodobacter capsulatus nifENX gene region: homology between NifX and NifB suggests involvement of NifX in processing of the iron-molybdenum cofactor. Mol. Gen. Genet. 216 (1989) 353-363. Mulligan, M.E. and Haselkorn, R.: Nitrogen fixation (nif) genes of the cyanobacterium Anabaena species strain PCC 7120. J. Biol. Chem 264 (1989) 19200-19207. Mullin, B.C. and An, C.S.: The molecular genetics of Frankia. In: Schwintzer, C.R. and Tjepkema, J.D. (Eds.), The Biology of Frankia and Actinorhizal Plants. Academic Press, San Diego, CA, 1990, pp. 195 214. Noridge, N.A. and Benson, D.R.: Isolation and nitrogen-fixing activity of Frankia sp. strain CpI1 vesicles. J. Bacteriol. 166 (1986) 301-305. Normand, P. and Bousquet, J.: Phylogeny of nitrogenase sequences in Frankia and other nitrogen-fixing microorganisms. J. Mol. Evol. 29 (1989) 436-447. Normand, P., Gouy, M., Cournoyer, B. and Simonet, P.: Nucleotide sequence of nifD from Frankia alni strain ArI3: phylogenetic inferences. Mol. Biol. Evol. 9 (1992) 495-506. Paul, W. and Merrick, M.: The nucleotide sequence of the nifM gene of Klebsiella pneumoniae and identification of a new n/fgene: nifZ. Eur. J. Biochem. 170 (1987) 259-265. Rochefort, D.A. and Benson, D.R.: Molecular cloning, sequencing, and expression of the glutamine synthetase II (glnlI) gene from the actinomycete root nodule symbiont Frankia sp. strain CpI1. J. Bacteriol. 172 (1990) 5335 5342. Sambrook, J., Fritsch, E.F. and Maniatis, T.: Molecular Cloning. A Laboratory Manual, 2nd ed. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989. Santero, E., Hoover, T., Keener, J. and Kustu, S.: In vitro activity of the nitrogen fixation regulatory protein NifA. Proc. Natl. Acad. Sci. USA 86 (1989) 7346-7350. Shah, V.K., Allen, J.R., Spangler, N.J. and Ludden, P.W.: In vitro synthesis of the iron-molybdenum cofactor of nitrogenase. J. Biol. Chem. 269 (1994) 1154-1158. Simonet, P., Normand, P. and Bardin, R.: Heterologous hybridization of Frankia DNA to Rhizobium meliloti and Klebsiella pneumoniae n/f genes. FEMS Microbiol. Lett. 55 (1988) 141-146. Wang, S., Dean, D.R., Chen, J. and Johnson, J.L.: The N-terminal and C-terminal portions of NifV are encoded by two different genes in Clostridium pasteurianum. J. Bacteriol. 173 (1991) 3041-3046.