Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005002663 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 501476830 |
Sequences flagged as poor quality | 0 |
Sequence length | 151 |
%GC | 40 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
ATCGGAAGAGCACACGTCTGAACTCCAGTCACCTGAAGCTATCTCGTATG | 915917 | 0.18264393192403325 | TruSeq Adapter, Index 19 (97% over 37bp) |
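The Percentage column above appears to be the Count divided by Total Sequences from Basic Statistics, times 100. A minimal sketch of that arithmetic follows; the variable and function names are illustrative and not part of the report.

```python
# Values copied from the Basic Statistics and Overrepresented sequences tables.
total_sequences = 501476830
adapter_count = 915917


def percent_of_total(count, total):
    """Return count expressed as a percentage of total."""
    return 100.0 * count / total


# Assumption: Percentage = 100 * Count / Total Sequences.
adapter_percentage = percent_of_total(adapter_count, total_sequences)
print(f"{adapter_percentage:.4f}%")
```

Under that assumption, roughly 1 read in 550 carries this adapter-derived sequence.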
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
ATCGGAA | 218435 | 0.0 | 78.530426 | 1 |
TCGGAAG | 233505 | 0.0 | 73.29477 | 2 |
CGGAAGA | 259510 | 0.0 | 65.86517 | 3 |
AGAGCAC | 554930 | 0.0 | 31.191368 | 7 |
AGCACAC | 598800 | 0.0 | 28.884323 | 9 |
GAGCACA | 606910 | 0.0 | 28.568895 | 8 |
GAAGAGC | 610625 | 0.0 | 28.462746 | 5 |
TATGCCG | 182255 | 0.0 | 20.051212 | 45-49 |
AAGAGCA | 894190 | 0.0 | 19.663675 | 6 |
CTCGTAT | 171710 | 0.0 | 18.615767 | 40-44 |
CGTATGC | 198150 | 0.0 | 18.245932 | 45-49 |
TCTCGTA | 175460 | 0.0 | 18.20303 | 40-44 |
GGAAGAG | 998345 | 0.0 | 17.827421 | 4 |
ATGCCGT | 208610 | 0.0 | 17.34843 | 45-49 |
TATCTCG | 182625 | 0.0 | 17.23402 | 40-44 |
GCCGTCT | 212085 | 0.0 | 16.76655 | 50-54 |
TGCCGTC | 208070 | 0.0 | 16.276445 | 45-49 |
ATCTCGT | 199920 | 0.0 | 16.032475 | 40-44 |
TCGTATG | 188405 | 0.0 | 15.844952 | 40-44 |
ACACGTC | 234830 | 0.0 | 15.682719 | 10-14 |
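Many of the enriched 7-mers above look like fragments of the overrepresented TruSeq adapter sequence. The sketch below tests that with plain substring matching against the 50-base sequence reported under Overrepresented sequences; the variable names are illustrative. K-mers that do not match may still come from the adapter if it continues past the 50 bases shown, which is an interpretation and not something the report states.

```python
# Overrepresented sequence as listed in the report (50 bases).
overrepresented = "ATCGGAAGAGCACACGTCTGAACTCCAGTCACCTGAAGCTATCTCGTATG"

# K-mers copied from the Kmer Content table, in table order.
kmers = [
    "ATCGGAA", "TCGGAAG", "CGGAAGA", "AGAGCAC", "AGCACAC",
    "GAGCACA", "GAAGAGC", "TATGCCG", "AAGAGCA", "CTCGTAT",
    "CGTATGC", "TCTCGTA", "GGAAGAG", "ATGCCGT", "TATCTCG",
    "GCCGTCT", "TGCCGTC", "ATCTCGT", "TCGTATG", "ACACGTC",
]

# Split the k-mers by whether they occur inside the overrepresented sequence.
inside = [k for k in kmers if k in overrepresented]
outside = [k for k in kmers if k not in overrepresented]

print("contained in overrepresented sequence:", inside)
print("not contained:", outside)
```

The unmatched k-mers overlap one another and the tail of the 50-base sequence (`...CGTATG`), which is consistent with a longer adapter read-through. A trimming step would normally be checked against the full adapter sequence, not this truncated one.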