Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005418791 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 18609419 |
Sequences flagged as poor quality | 0 |
Sequence length | 125 |
%GC | 33 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
CGGGCGCGGTGGTTTACGTTTGTAATTTTAGTATTTTGGGAGGTCGAGGC | 70607 | 0.3794153917432887 | No Hit |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACCTTGTAATCTCGTATGC | 34691 | 0.18641635184849134 | TruSeq Adapter, Index 12 (100% over 50bp) |
CGGTTAATTTTTTGTATTTTTAGTAGAGACGGGGTTTTATCGTGTTAGTT | 26964 | 0.14489436773926148 | No Hit |
TGGAGTAGTAAGTTATAATATGGGAGATTATTTTGAAGTTTGGTAGGATA | 23185 | 0.12458744681926932 | No Hit |
CGGAATGGAATGGAATGGAATGGAATGGAATGAAATGTAATGGATTTAAT | 19882 | 0.10683837039727033 | No Hit |
CGGGCGCGGTGGTTTACGTTTGTAATTTTAGTATTTTGGGAGGTCGAGGT | 18829 | 0.10117994548889463 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CGGGCGC | 36495 | 0.0 | 115.03986 | 1 |
CGGGTGC | 11890 | 0.0 | 114.315315 | 1 |
CGGTTAA | 99870 | 0.0 | 112.36262 | 1 |
CGGGTAC | 14360 | 0.0 | 108.61212 | 1 |
GGGCGCG | 43475 | 0.0 | 103.072655 | 2 |
CGGCTAA | 715 | 0.0 | 102.329094 | 1 |
CGGGTAT | 40540 | 0.0 | 96.22495 | 1 |
CGGGCAT | 260 | 0.0 | 96.089516 | 1 |
GGCGCGG | 45110 | 0.0 | 95.97329 | 3 |
CGCGGTG | 44945 | 0.0 | 95.61142 | 5 |
CGGGTGT | 35205 | 0.0 | 95.32982 | 1 |
CGGATAC | 1935 | 0.0 | 95.29734 | 1 |
GCGCGGT | 44960 | 0.0 | 94.48212 | 4 |
CGGGCGT | 51740 | 0.0 | 93.98561 | 1 |
CGGATGC | 1340 | 0.0 | 91.445526 | 1 |
CGGAATA | 15675 | 0.0 | 91.41749 | 1 |
CGGGTTT | 88385 | 0.0 | 90.9371 | 1 |
CGGAAAT | 10740 | 0.0 | 89.66906 | 1 |
CGGTAAT | 13180 | 0.0 | 89.361374 | 1 |
CGGTGGT | 80490 | 0.0 | 89.31102 | 7 |