Please answer the following two questions on Sequence X:
1) DNA sequence databases
1. What is the EMBL-Bank accession number for this sequence (use BLAST)?
2. Which gene does this sequence code for?
3. In which taxonomic division and species does this sequence derive from?
4. How many TATA signals are in the promoter region of this gene and what are their co-ordinates? Are the TATA box sequences identical?
5. What database cross-references exist for this sequence?
2) Genome databases
1. What would be an appropriate genome database for the species from which this sequence derives?
2. What is the accession number for the related gene/locus in that genome database?
3. On what chromosome is this gene located?
4. What are Standard Names and the Systematic Names of the upstream and downstream (neighbouring) genes for this gene?
Here is a nucleotide sequence in fasta format:
>Sequence X
tcgagggtatctcttatcggtattactttcttatcaatgaaaaatttctgccagggaaaa
tgcgcccgctttttttccggccatccttactcgctgtcgcatacaaaatagcgcctctaa
tctagttgcgataaggaatgtgtatgtgtaattgaagatccaggatgttttccttttcag
ggagatgagaaggaataataggatggattgaccgctttgctgtcacgtcgataaggttcc
tttaaagattgtgtccaatgattagcatagagaggtagagtatcagagaaacaagtttgt
aatcgagaaacttgatctgctagtgttgagcatagaaggctaggaaaacatggggaagaa
aaaaaaagtataaataattagcttgatgagtagtttgaatatatatgttactttagtttc
cctttttgaccttttatattcatctacatcttgtgatataaaacatcaacaaagacgaga
atggctaacgacgctctaagtgctattttcagtaatccttcgaggaaaggtgtccaaccc
tccacatctattgtgtcatatacaaacaatgaagatgatattatagatgtggaaaatggg
aagttcaacaagaacaagaatatcaacactaatgtgtatgtggacaactcctcaatagag
gagagcgaagtcgtgcccttgcccgaaacaaagtccatctggagtaaaatatactacgat
ttcattgtgttagacaagacaactttgaatgtttcgttgaaagagtcgttcttgtataac
agagacttgaaaccggttgaagaagaaagaaggtgttggtcctggttcaattacttatat
ttctggctggcagactgtttcaatattaacacatggcaaatagctggtacaggtctacaa
ctaggtctgaattggtggcaatgttggcttacagtttggatcggctacacttttgcaggt
atcttcgtagtattgaactcgagatttggttccgcatatcacttatctttccctattact
gttagggcctcatttggtatattcttttctatgtggccgattataaatcgtgtcgtgatg
gctatagtatggtatgccgtgcaagcctggttaggtgcaacgcccgtggcactgatgcta
aaatctatttttggcaagaatctggaagatagaatcccaaaccattttggttctccaaat
agcactacttttgaattcatgtgtttctttatattttgggtggtcagtataccatttgtc
ctagtggctcctcataaaatcaggcatttattcacagtaaaagcagctttgatccccttc
gcagcctttggatttttaatctgggctttgaagaaatcgcacggtaaaattgagttgggg
acgctgaatgattattcacctcatggttccgaattttcatggatattcgttagatcccta
atggcctgtgttgctaactttgccgctttgattatcaacgcccctgacttcggtagattt
gccaaaaatcctcaagcgtctttgtggccacaattggttgccatcccattgttcttcgcc
ataacatgtttgatcggtatcattgttactgcggccggttatcacttatatggggttaac
tattggtcaccactggatgtacttggtcaatttttggagaccacttacaccagaggtact
agggctggtgtgttcttgatttcttttgtatttgccttagctcaactgggtacaaacatt
tctgccaactctctggcatgtggtgctgatatgacggctttgtttccaagatatattaat
attagaagaggttctttattctgtgtggcaatggctctatgtatctgtccatggaactta
atggccagttcaagtaagttcaccagcgctttgggtgcttatgcaattttcctttccagt
attgctggtgtcatttgcgcggattatttcgtagtaagaagaggatatgtgaaattaaca
catttattcctggcacagaagggttccttttacatgtttggaaacaaatttggtgccaat
tggagggcctttgttgcgtatatttgcggtatcgctccaaatttaccaggttttataggt
gatgttggagctccaaaaattacggtttcagagggtgcaatgaggttatactatttaggt
tatccggtaggtttctttattagtgcggtgatatacctcatattatgttacttttttcct
gtccctggtactcccgtaaccaattttctgacagagaaaggatggttccaaagatgggct
tatgttgaggacttcgagcaagattggaagaatgagttacgtagggatgacctctgcgat
gacacagtcagtatctatgatggcaccgaggaaaagatagtttactaaataaaaatccca
atttattctcttaccagtagtgatcaggagaattgataatttcttcaaaacaccttaata
ttcgttctgtgtatttttatgtacctttcttaatcaaatttatcagcttcatacatgcat
tcccgtctgaataatgtactgatgattagctaaaccgtcatcaacctaccaaacttgcgc
ttgtttgagcctatcagaactccggataattccgacatcttatctttgact