Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005418990 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 19370313 |
Sequences flagged as poor quality | 0 |
Sequence length | 125 |
%GC | 33 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
CGGGCGCGGTGGTTTACGTTTGTAATTTTAGTATTTTGGGAGGTCGAGGC | 85480 | 0.44129385002710075 | No Hit |
CGGAATGGAATGGAATGGAATGGAATGGAATGAAATGTAATGGATTTAAT | 24831 | 0.12819101064603344 | No Hit |
CGGTTAATTTTTTGTATTTTTAGTAGAGACGGGGTTTTATCGTGTTAGTT | 23896 | 0.12336403650266262 | No Hit |
CGGGCGCGGTGGTTTACGTTTGTAATTTTAGTATTTTGGGAGGTCGAGGT | 21544 | 0.11122174432596932 | No Hit |
CGGAATAGAATGGAATGGAATGGAATGGAACGGAATGGAATGGAATGGAA | 21020 | 0.10851657379000534 | No Hit |
CGGAATGGAATGGAATGGAATGGAATGGAATGGAATGGAATGGAATGGAA | 20382 | 0.1052228737862935 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CGGGCGC | 41620 | 0.0 | 114.32491 | 1 |
CGGGTGC | 12725 | 0.0 | 113.188034 | 1 |
CGGTTAA | 90050 | 0.0 | 110.75189 | 1 |
CGGGTAC | 16590 | 0.0 | 108.99625 | 1 |
GGGCGCG | 48180 | 0.0 | 103.50374 | 2 |
CGGCTAA | 730 | 0.0 | 100.640366 | 1 |
CGGATAC | 1945 | 0.0 | 98.39121 | 1 |
CGGGTAT | 49165 | 0.0 | 97.298485 | 1 |
GGCGCGG | 49770 | 0.0 | 96.465645 | 3 |
CGCGGTG | 49375 | 0.0 | 96.117516 | 5 |
GCGCGGT | 48915 | 0.0 | 95.91091 | 4 |
CGGGTGT | 40800 | 0.0 | 93.72213 | 1 |
CGGGCGT | 65340 | 0.0 | 93.70499 | 1 |
CGGAATA | 19565 | 0.0 | 92.816246 | 1 |
CGGTAAT | 15880 | 0.0 | 91.52093 | 1 |
CGGATGC | 1285 | 0.0 | 91.29257 | 1 |
GGGAGGC | 45845 | 0.0 | 89.61019 | 2 |
CGGGCAT | 430 | 0.0 | 89.56086 | 1 |
CGGGTTT | 88325 | 0.0 | 89.477325 | 1 |
GCGGTGG | 70525 | 0.0 | 89.17946 | 6 |