Table 2 The motifs (tetra-, penta-nucleotides) extracted from different
data sets (u > 1.64).
|
tetra- penta- |
Motifs in upstream regions: ATGT TACC CCAA CCAT CCCA
CCAG GTAC CACC ACCC TCCG CGGC CCGT TGGA GGAA AGGT GGGC TAGG CTAG GCCT CTCC
GCGC [CCGG GGCC] TGTA ACCA TCCC CAGA AGAG GAGC ACTG CTGC TGAA GAAA AACC CGCT GGCT GAGG
ACGC CGGG GGCA CAGG CCGC TCTC GGAC CGCC CGCG [GCTG] TATTG ATGTA ATACC CCATG
ACATT TGTAC ACACC CACCC TGAAA AACCC ACCCA CCCAT CCAAG ATCCG GCCGT CCGTA CGTAC
ACCGT CCGTC AAATG AGGTA CAAAC CGCTC CGGGC AAATT TATGG TCTAG CAGTG AGTGT CCTAG
GTGGA GGATG ACGGA CTCCG TCCCA CCCAC CCTGG GGTGG CATAC CGGCA GCACG CGCGC [CCCCC CCGGC GGCAG CCTAT
GGGAG GCCTA CCTAC AGGCC GGCCC GCCCT GCTAG TCCGC CCGCC CTCCA CGGCC GCTGG GGGCG] ACCAA CCCAG CCAGA CAGAC ACATC CCCAA AACTG AAACC CGGCT GAGGT CACGC
CTCGG GCAGG GTTCC TTCCC CCCGC CCATC TCTCT GCCTC CTCTC CTCGC ACTGG CTGGA TCACC
CACCA CCTCC CGTCC CGCCT CTGGC GGAAG GCGTC GTCCG GCGCA GCCGC ACGCG ACGGG TCGCG [AATCT CAAGA GACAG CAGAG AGAGC ACTGC TTGAA TCCGG GGCTC GGAAA GGCTG
TCCCG CCGCT AGTAG TAGCT CTTTC TGCTG TCAGT AGCCT ACCAG CGGAA GCTGC GCCGA TGGGC
CAGGC GGACT CTCAG CAGCG TCTGC CTGCG GGCGC GCTCC GCGCT] |
|
tetra- penta- |
Motifs in introns: TTAA TAAA
AAAT AATT TTAT TATT TGGT TTCA GAAT ATTA AGAC ATAT TATC CATG CACG GGTC TAAAA AAATT TATTT ATTTG TAAAT ATTGA ATTTC TTTCA AGAAT AATTA TATCG
ATTCA AATAT TATTC CAAGC ATATC GATAA TCACG CACGC CTATT ACCAC TGCTA ATTAT ATAGT
TATTA ACGTG GATTG TGGTG [TTAAA GAATT] |
The bolds are potential positive transcriptional regulatory elements of
“set I upstream”, i.e. they are significantly over-represented in “set I
upstream” relative to in “set II upstream”, and others are potential general
transcriptional regulatory motifs. All of the motifs agree with “test sites”, except
for those in bracket [ ]. The motifs in introns are all potential positive
transcriptional regulatory elements, obtained by comparing “set I intron” to “set
II intron”. The reverse complements are not listed.