TY - JOUR
T1 - Structural analysis of Arabidopsis thaliana Chromosome 5. III. Sequence features of the regions of 1,191,918 bp covered by seventeen physically assigned P1 clones
AU - Nakamura, Yasukazu
AU - Sato, Shusei
AU - Kaneko, Takakazu
AU - Kotani, Hirokazu
AU - Asamizu, Erika
AU - Miyajima, Nobuyuki
AU - Tabata, Satoshi
PY - 1997
Y1 - 1997
N2 - A total of 17 P1 and TAC clones each containing a marker(s) specifically mapped on chromosome 5 were isolated from P1 and TAC libraries of the Arabidopsis thaliana Columbia genome, and their nucleotide sequences were determined according to the shot gun-based strategy and precisely located on the physical map of chromosome 5. The total length of the clones sequenced in this study was 1,191,918 bp. As we have previously reported the sequence of 2,662,078 bp by analysis of 33 P1 clones, the total length of the sequences of chromosome 5 determined so far is now 3,853,996 bp. The sequences determined in this study were subjected to similarity search against protein and EST databases and analysis with computer programs for gene modeling, and a total of 310 potential protein-coding genes and/or gene segments with known or predicted functions were identified. The positions of exons which do not show apparent similarity to known genes were also predicted by computer-aided analysis. An average density of the assigned genes and/or gene segments was 1 gene/3,845 bp. Introns were identified in 78% of the potential protein genes, and the average number per gene and the average length of the introns were 3.7 and 185 bp, respectively. The numbers of the Arabidopsis ESTs matched to each of the predicted genes have been counted to monitor the transcription level. The sequence data and gene information are available on the World Wide Web database KAOS (Kazusa Arabidopsis data Opening Site) at http://www.kazusa.or.jp/arabi/.
AB - A total of 17 P1 and TAC clones each containing a marker(s) specifically mapped on chromosome 5 were isolated from P1 and TAC libraries of the Arabidopsis thaliana Columbia genome, and their nucleotide sequences were determined according to the shot gun-based strategy and precisely located on the physical map of chromosome 5. The total length of the clones sequenced in this study was 1,191,918 bp. As we have previously reported the sequence of 2,662,078 bp by analysis of 33 P1 clones, the total length of the sequences of chromosome 5 determined so far is now 3,853,996 bp. The sequences determined in this study were subjected to similarity search against protein and EST databases and analysis with computer programs for gene modeling, and a total of 310 potential protein-coding genes and/or gene segments with known or predicted functions were identified. The positions of exons which do not show apparent similarity to known genes were also predicted by computer-aided analysis. An average density of the assigned genes and/or gene segments was 1 gene/3,845 bp. Introns were identified in 78% of the potential protein genes, and the average number per gene and the average length of the introns were 3.7 and 185 bp, respectively. The numbers of the Arabidopsis ESTs matched to each of the predicted genes have been counted to monitor the transcription level. The sequence data and gene information are available on the World Wide Web database KAOS (Kazusa Arabidopsis data Opening Site) at http://www.kazusa.or.jp/arabi/.
KW - Arabidopsis thaliana chromosome 5
KW - Gene prediction
KW - Genomic sequence
KW - P1 genomic library
KW - TAC genomic library
UR - http://www.scopus.com/inward/record.url?scp=0031592995&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0031592995&partnerID=8YFLogxK
U2 - 10.1093/dnares/4.6.401
DO - 10.1093/dnares/4.6.401
M3 - Article
C2 - 9501997
AN - SCOPUS:0031592995
SN - 1340-2838
VL - 4
SP - 401
EP - 404
JO - DNA Research
JF - DNA Research
IS - 6
ER -