Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005418732 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 17500767 |
Sequences flagged as poor quality | 0 |
Sequence length | 125 |
%GC | 31 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
CGGGCGCGGTGGTTTACGTTTGTAATTTTAGTATTTTGGGAGGTCGAGGC | 35945 | 0.2053909980059731 | No Hit |
CGGAATGGAATGGAATGGAATGGAATGGAATGAAATGTAATGGATTTAAT | 25411 | 0.14519935040561363 | No Hit |
TGGAATGGAATGGAATGGAATGGAATGGAATGAAATGTAATGGATTTAAT | 24574 | 0.14041670287936522 | No Hit |
TGGAATAGAATGGAATGGAATGGAATGGAATGGAATGGAATGGAATGGAA | 24533 | 0.14018242743303763 | No Hit |
CGGAATGGAATGGAATGGAATGGAATGGAATGGAATGGAATGGAATGGAA | 19126 | 0.10928663869417837 | No Hit |
TGGAATGGAATGGAATGGAATGGAATGGAATGGAATGGAATGGAATGGAA | 18760 | 0.107195301783059 | No Hit |
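The Percentage column above appears to be each sequence's Count divided by the Total Sequences value from Basic Statistics (17500767), times 100. The following is a minimal sketch of that arithmetic, not FastQC's own code; the function name `percentage_of_total` and the constant name are illustrative assumptions.

```python
# Assumption: Percentage = 100 * Count / Total Sequences, which is
# consistent with the figures in the tables above.
TOTAL_SEQUENCES = 17500767  # "Total Sequences" from Basic Statistics


def percentage_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of the total number of sequences."""
    return 100.0 * count / total


# Counts copied from the Overrepresented sequences table, in row order.
counts = [35945, 25411, 24574, 24533, 19126, 18760]

for count in counts:
    print(count, round(percentage_of_total(count), 4))
```

Rounded to four decimal places, the results match the reported percentages for all six rows.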
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CGGGTGC | 8935 | 0.0 | 113.19505 | 1 |
CGGGCGC | 21310 | 0.0 | 112.3845 | 1 |
CGGGCAT | 435 | 0.0 | 111.04318 | 1 |
CGGTTAA | 49435 | 0.0 | 108.435875 | 1 |
CGGGTAC | 7825 | 0.0 | 105.01734 | 1 |
CGGGGAC | 2680 | 0.0 | 100.577255 | 1 |
CGGGCTT | 615 | 0.0 | 98.90567 | 1 |
GGGCGCG | 26615 | 0.0 | 98.554924 | 2 |
CGGCTAA | 435 | 0.0 | 97.33415 | 1 |
CGGGTAA | 11425 | 0.0 | 97.13737 | 1 |
CGGGTGT | 28270 | 0.0 | 96.92948 | 1 |
CGGGCTA | 185 | 0.0 | 96.70427 | 1 |
CGGTCTA | 100 | 0.0 | 95.41489 | 1 |
GGGAGGC | 40735 | 0.0 | 95.21597 | 2 |
CGGGCAC | 135 | 0.0 | 92.764465 | 1 |
CGGGAGG | 88440 | 0.0 | 91.7373 | 1 |
CGGGTAT | 26285 | 0.0 | 91.20407 | 1 |
CGGGTTT | 54830 | 0.0 | 90.20735 | 1 |
GGCGCGG | 28065 | 0.0 | 90.133606 | 3 |
CGGTACT | 60 | 0.0 | 89.451454 | 1 |