Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005189291 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 30120318 |
Sequences flagged as poor quality | 0 |
Sequence length | 35-51 |
%GC | 46 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN | 25633858 | 85.10487173475393 | No Hit |
CGGAAGAGCACACGTCTGAACTCCAGTCACGTCGTAGAATCTCGTATGCCG | 119720 | 0.39747256320467794 | TruSeq Adapter, Index 20 (97% over 34bp) |
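
The Percentage column above appears to be each sequence's Count divided by the Total Sequences value from Basic Statistics (30120318), times 100. A minimal sketch of that arithmetic follows, assuming this is how the figures were derived; the function name and row labels are hypothetical and are not part of the report.

```python
def overrepresented_percentage(count: int, total_sequences: int) -> float:
    """Return count as a percentage of total_sequences (assumed derivation)."""
    if total_sequences <= 0:
        raise ValueError("total_sequences must be positive")
    return 100.0 * count / total_sequences


# Values copied from the Basic Statistics and Overrepresented sequences tables.
TOTAL_SEQUENCES = 30120318
COUNTS = {
    "all-N read": 25633858,           # hypothetical label for the 35-base N sequence
    "TruSeq adapter read": 119720,    # hypothetical label for the Index 20 adapter hit
}

percentages = {
    label: overrepresented_percentage(count, TOTAL_SEQUENCES)
    for label, count in COUNTS.items()
}

for label, pct in percentages.items():
    print(f"{label}: {pct:.5f}%")
```

Under that assumption, about 85.1% of the reads in this file consist entirely of N calls, which is worth keeping in mind when reading the remaining modules.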
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
TATGCCG | 31095 | 0.0 | 43.855896 | 45 |
GATCGGA | 14255 | 0.0 | 43.685535 | 1 |
ATCGGAA | 14110 | 0.0 | 42.860363 | 2 |
TCGGAAG | 13015 | 0.0 | 42.406403 | 3 |
CGGAAGC | 4215 | 0.0 | 36.24819 | 4 |
TGCCGTC | 6645 | 0.0 | 34.47412 | 45 |
GACACGT | 2570 | 0.0 | 33.446163 | 9 |
AGACACG | 2655 | 0.0 | 32.291866 | 8 |
CGGAAGG | 1970 | 0.0 | 31.203453 | 4 |
GTATGCC | 33115 | 0.0 | 30.81787 | 44 |
GGAAGAG | 21785 | 0.0 | 30.285904 | 2 |
ACACGTC | 25205 | 0.0 | 29.261444 | 10 |
GGAAGAC | 3890 | 0.0 | 28.908733 | 5 |
AGAGCAC | 22625 | 0.0 | 28.788866 | 5 |
CACACGT | 22685 | 0.0 | 28.78176 | 9 |
AAGCACG | 1930 | 0.0 | 28.729902 | 7 |
GAGCACA | 22905 | 0.0 | 28.641424 | 6 |
CACGTCT | 28270 | 0.0 | 28.34968 | 11 |
CGTAACT | 530 | 0.0 | 28.150192 | 16 |
CGGAAGA | 22770 | 0.0 | 28.122593 | 1 |