Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005101152 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 69401599 |
Sequences flagged as poor quality | 0 |
Sequence length | 151 |
%GC | 41 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACTTGGACTGATCTCGTAT | 302217 | 0.43546114838074557 | TruSeq Adapter, Index 3 (97% over 37bp) |
ATCGGAAGAGCACACGTCTGAACTCCAGTCACTTGGACTGATCTCGTATG | 154074 | 0.22200353049502503 | TruSeq Adapter, Index 3 (97% over 36bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GATCGGA | 60550 | 0.0 | 81.2719 | 1 |
ATCGGAA | 79805 | 0.0 | 61.653027 | 2 |
TCGGAAG | 81855 | 0.0 | 60.04496 | 3 |
CGGAAGA | 85000 | 0.0 | 57.83182 | 4 |
AGAGCAC | 123275 | 0.0 | 39.951656 | 8 |
GAGCACA | 131930 | 0.0 | 37.352364 | 9 |
GAAGAGC | 134220 | 0.0 | 36.818115 | 6 |
AAGAGCA | 170010 | 0.0 | 29.212292 | 7 |
GGAAGAG | 188605 | 0.0 | 26.57085 | 5 |
TATGCCG | 59990 | 0.0 | 24.988142 | 45-49 |
CTCGTAT | 61650 | 0.0 | 24.21895 | 40-44 |
CGTATGC | 62000 | 0.0 | 24.044746 | 45-49 |
TCTCGTA | 59325 | 0.0 | 23.97545 | 40-44 |
ATGCCGT | 64250 | 0.0 | 23.245586 | 45-49 |
GCCGTCT | 64655 | 0.0 | 22.828037 | 50-54 |
ATCTCGT | 63570 | 0.0 | 22.326551 | 40-44 |
CGTCTTC | 68575 | 0.0 | 21.69016 | 50-54 |
CCGTCTT | 69330 | 0.0 | 21.38075 | 50-54 |
CGTCTGA | 74280 | 0.0 | 20.033417 | 15-19 |
AGCACAC | 128730 | 0.0 | 19.973858 | 9 |