Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00002855818 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 372470425 |
Sequences flagged as poor quality | 0 |
Sequence length | 151 |
%GC | 40 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
ATCGGAAGAGCACACGTCTGAACTCCAGTCACGAATTCGTATCTCGTATG | 1066727 | 0.2863924028330571 | TruSeq Adapter, Index 7 (97% over 34bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
ATCGGAA | 202870 | 0.0 | 98.31106 | 1 |
TCGGAAG | 214000 | 0.0 | 93.14979 | 2 |
CGGAAGA | 227595 | 0.0 | 87.39731 | 3 |
AGAGCAC | 457125 | 0.0 | 43.890373 | 7 |
AGCACAC | 488270 | 0.0 | 41.09768 | 9 |
GAGCACA | 494645 | 0.0 | 40.55358 | 8 |
GAAGAGC | 504765 | 0.0 | 39.860344 | 5 |
AAGAGCA | 723230 | 0.0 | 28.096369 | 6 |
GGAAGAG | 788475 | 0.0 | 25.899773 | 4 |
TATGCCG | 173140 | 0.0 | 23.157713 | 45-49 |
TCGTATC | 183715 | 0.0 | 21.84655 | 35-39 |
CGTATGC | 181870 | 0.0 | 21.82928 | 45-49 |
CTCGTAT | 158975 | 0.0 | 21.313301 | 40-44 |
TCTCGTA | 160995 | 0.0 | 21.00626 | 40-44 |
GTCACGA | 192695 | 0.0 | 20.974249 | 25-29 |
ATGCCGT | 191665 | 0.0 | 20.8491 | 45-49 |
TGCCGTC | 190770 | 0.0 | 20.622398 | 45-49 |
TATCTCG | 164440 | 0.0 | 20.328136 | 40-44 |
GCCGTCT | 192355 | 0.0 | 20.302145 | 50-54 |
TCGTATG | 168065 | 0.0 | 19.814629 | 40-44 |