TY - JOUR
T1 - In silico analysis of 3′-end-processing signals in aspergillus oryzae using expressed sequence tags and genomic sequencing data
AU - Tanaka, Mizuki
AU - Sakai, Yoshifumi
AU - Yamada, Osamu
AU - Shintani, Takahiro
AU - Gomi, Katsuya
PY - 2011/6
Y1 - 2011/6
N2 - To investigate 3′-end-processing signals in Aspergillus oryzae, we created a nucleotide sequence data set of the 3′-untranslated region (3′ UTR) plus 100 nucleotides (nt) sequence downstream of the poly(A) site using A. oryzae expressed sequence tags and genomic sequencing data. This data set comprised 1065 sequences derived from 1042 unique genes. The average 3′ UTR length in A. oryzae was 241 nt, which is greater than that in yeast but similar to that in plants. The 3′ UTR and 100 nt sequence downstream of the poly(A) site is notably U-rich, while the region located 1530 nt upstream of the poly(A) site is markedly A-rich. The most frequently found hexanucleotide in this A-rich region is AAUGAA, although this sequence accounts for only 6 of all transcripts. These data suggested that A. oryzae has no highly conserved sequence element equivalent to AAUAAA, a mammalian polyadenylation signal. We identified that putative 3′-end-processing signals in A. oryzae, while less well conserved than those in mammals, comprised four sequence elements: the furthest upstream U-rich element, A-rich sequence, cleavage site, and downstream U-rich element flanking the cleavage site. Although these putative 3′-end-processing signals are similar to those in yeast and plants, some notable differences exist between them.
AB - To investigate 3′-end-processing signals in Aspergillus oryzae, we created a nucleotide sequence data set of the 3′-untranslated region (3′ UTR) plus 100 nucleotides (nt) sequence downstream of the poly(A) site using A. oryzae expressed sequence tags and genomic sequencing data. This data set comprised 1065 sequences derived from 1042 unique genes. The average 3′ UTR length in A. oryzae was 241 nt, which is greater than that in yeast but similar to that in plants. The 3′ UTR and 100 nt sequence downstream of the poly(A) site is notably U-rich, while the region located 1530 nt upstream of the poly(A) site is markedly A-rich. The most frequently found hexanucleotide in this A-rich region is AAUGAA, although this sequence accounts for only 6 of all transcripts. These data suggested that A. oryzae has no highly conserved sequence element equivalent to AAUAAA, a mammalian polyadenylation signal. We identified that putative 3′-end-processing signals in A. oryzae, while less well conserved than those in mammals, comprised four sequence elements: the furthest upstream U-rich element, A-rich sequence, cleavage site, and downstream U-rich element flanking the cleavage site. Although these putative 3′-end-processing signals are similar to those in yeast and plants, some notable differences exist between them.
KW - 3′-end processing signal
KW - 3′-untranslated region (3′ UTR)
KW - Aspergillus oryzae
KW - polyadenylation signal
UR - http://www.scopus.com/inward/record.url?scp=79958855346&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79958855346&partnerID=8YFLogxK
U2 - 10.1093/dnares/dsr011
DO - 10.1093/dnares/dsr011
M3 - Article
C2 - 21586533
AN - SCOPUS:79958855346
SN - 1340-2838
VL - 18
SP - 189
EP - 200
JO - DNA Research
JF - DNA Research
IS - 3
ER -