Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005444633 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 619138 |
Sequences flagged as poor quality | 0 |
Sequence length | 251 |
%GC | 44 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
AATTAGGCTGTGGGTGGTTGTGTTGATTCAAATTATGTGTTTTTTGGAAA | 8710 | 1.406794607987234 | No Hit |
AAATCTTACCCCGCCTGTTTACCAAAAACATCACCTCTAGCATCACCAGT | 8629 | 1.3937119026775937 | No Hit |
GGCAGGTCAATTTCACTGGTTAAAAGTAAGAGACAGCTGAACCCTCGTGG | 8031 | 1.2971260042187687 | No Hit |
GCCATACTAGTCTTTGCCGCCTGCGAAGCAGCGGTGGGCCTAGCCCTACT | 6873 | 1.110091772755024 | No Hit |
AATTAGGCTGTGGGTAGAAGTAGAGGTTAAGGAGGGTGATGGTGGCTATG | 1126 | 0.18186575529203505 | No Hit |
AATTAGGCTGTGGGTGTGATAGGTGGCACGGAGAATTTTGGATTCTCAGG | 658 | 0.1062767912807807 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
TTCGGCC | 10 | 0.0014760375 | 245.00002 | 245 |
CGTACCC | 35 | 4.9112714E-11 | 210.00002 | 5 |
GTACCCC | 30 | 4.802132E-9 | 204.16666 | 6 |
TCGTACC | 60 | 0.0 | 204.16666 | 4 |
CGTACCT | 25 | 4.7481626E-7 | 196.0 | 5 |
GGCTGTA | 130 | 0.0 | 188.46153 | 6 |
TACCTCG | 45 | 2.8376235E-10 | 163.33334 | 7 |
GCGTCGG | 130 | 0.0 | 141.34616 | 245 |
AGGCTGT | 2755 | 0.0 | 140.50816 | 5 |
TAGTCTT | 1370 | 0.0 | 136.80658 | 8 |
ATCTTAC | 1590 | 0.0 | 125.58176 | 3 |
ATTAGGC | 3310 | 0.0 | 125.460724 | 2 |
GTACCTC | 40 | 4.9283244E-6 | 122.50001 | 6 |
CGGATTG | 50 | 1.0148142E-7 | 122.5 | 245 |
CAGGTCA | 1745 | 0.0 | 121.447 | 3 |
GTCAATT | 1750 | 0.0 | 119.7 | 6 |
TTAGGCT | 3440 | 0.0 | 117.51453 | 3 |
GCAGGTC | 1895 | 0.0 | 113.126656 | 2 |
CTTACCC | 1755 | 0.0 | 113.07692 | 5 |
AGTCTTT | 1675 | 0.0 | 111.89553 | 9 |