Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005740523 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 20888752 |
Sequences flagged as poor quality | 0 |
Sequence length | 51 |
%GC | 50 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACCGTACGTAATCTCGTATG | 60599 | 0.29010349684844744 | TruSeq Adapter, Index 22 (97% over 40bp) |
AGATCGGAAGAGCACACGTCTGAACTCCAGTCACCGTACGTAATCTCGTAT | 23593 | 0.11294595292241488 | TruSeq Adapter, Index 22 (97% over 40bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
TCGTATG | 7380 | 0.0 | 39.905804 | 45 |
CGTACGT | 9895 | 0.0 | 29.96763 | 34 |
TACGTAA | 9880 | 0.0 | 29.89927 | 36 |
CGTAATC | 9885 | 0.0 | 29.542744 | 38 |
ACGTAAT | 10070 | 0.0 | 29.268106 | 37 |
CTCGTAT | 10105 | 0.0 | 28.96635 | 44 |
GTACGTA | 10490 | 0.0 | 28.22495 | 35 |
ACCGTAC | 10615 | 0.0 | 27.85019 | 32 |
ACACGTC | 10865 | 0.0 | 27.18866 | 13 |
GATCGGA | 11375 | 0.0 | 26.764076 | 1 |
GTCACCG | 11540 | 0.0 | 25.578838 | 29 |
CACGTCT | 11605 | 0.0 | 25.513119 | 14 |
CACCGTA | 11760 | 0.0 | 25.234241 | 31 |
CACACGT | 12015 | 0.0 | 24.86721 | 12 |
GTAATCT | 11860 | 0.0 | 24.793835 | 39 |
AGATCGG | 4740 | 0.0 | 24.758064 | 1 |
TCGGAAG | 12280 | 0.0 | 24.733648 | 3 |
TGATACG | 1440 | 0.0 | 24.52957 | 3 |
ATACGGC | 1490 | 0.0 | 24.310417 | 5 |
TCTCGTA | 12585 | 0.0 | 23.937576 | 43 |