Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005416159 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 18110107 |
Sequences flagged as poor quality | 0 |
Sequence length | 125 |
%GC | 32 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
CGGGCGCGGTGGTTTACGTTTGTAATTTTAGTATTTTGGGAGGTCGAGGC | 71023 | 0.39217327650245243 | No Hit |
TGGATGTTAGAGGGGTGTTTTGGGTAATTTTTGGGATTTAGAAGTGAAAG | 52662 | 0.29078790092184437 | No Hit |
TGGAGTAGTAAGTTATAATATGGGAGATTATTTTGAAGTTTGGTAGGATA | 51936 | 0.28677908970940924 | No Hit |
TGGGTTTTGTTATTTTAATAAATTTTGTTTTTGGGTGGGTGTGGGTATAA | 47036 | 0.2597223749147368 | No Hit |
CGGTTAATTTTTTGTATTTTTAGTAGAGACGGGGTTTTATCGTGTTAGTT | 33701 | 0.18608945822352127 | No Hit |
CGGTTAATTTTTTGTATTTTTAGTAGAGACGGGGTTTTATTTTGTTAGTT | 20005 | 0.11046317948314717 | No Hit |
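
The Percentage column above is consistent with each Count divided by the Total Sequences value from Basic Statistics (18110107), times 100. The following is a minimal sketch of that check, not FastQC's own code; the names `TOTAL_SEQUENCES`, `overrepresented` and `percentage_of_total` are hypothetical and introduced only for illustration.

```python
# Sanity check: recompute the Percentage column from Count and Total Sequences.
# Assumption: percentage = 100 * count / total sequences in the file.

TOTAL_SEQUENCES = 18110107  # "Total Sequences" row in Basic Statistics

# (sequence, count, reported percentage) copied from the table above
overrepresented = [
    ("CGGGCGCGGTGGTTTACGTTTGTAATTTTAGTATTTTGGGAGGTCGAGGC", 71023, 0.39217327650245243),
    ("TGGATGTTAGAGGGGTGTTTTGGGTAATTTTTGGGATTTAGAAGTGAAAG", 52662, 0.29078790092184437),
    ("TGGAGTAGTAAGTTATAATATGGGAGATTATTTTGAAGTTTGGTAGGATA", 51936, 0.28677908970940924),
    ("TGGGTTTTGTTATTTTAATAAATTTTGTTTTTGGGTGGGTGTGGGTATAA", 47036, 0.2597223749147368),
    ("CGGTTAATTTTTTGTATTTTTAGTAGAGACGGGGTTTTATCGTGTTAGTT", 33701, 0.18608945822352127),
    ("CGGTTAATTTTTTGTATTTTTAGTAGAGACGGGGTTTTATTTTGTTAGTT", 20005, 0.11046317948314717),
]


def percentage_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of the total number of sequences."""
    return 100.0 * count / total


for sequence, count, reported in overrepresented:
    recomputed = percentage_of_total(count)
    # Show the first 12 bases only, to keep the output readable
    print(f"{sequence[:12]}...  count={count:>6}  "
          f"recomputed={recomputed:.6f}%  reported={reported:.6f}%")
```

Rounding to a few decimal places, as in the printed output, is usually enough when quoting these values.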
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CGGGCGC | 33555 | 0.0 | 114.59708 | 1 |
CGGGTGC | 10810 | 0.0 | 114.12921 | 1 |
CGGTTAA | 121095 | 0.0 | 113.26474 | 1 |
CGGGTAC | 13595 | 0.0 | 107.5239 | 1 |
CGGCTAA | 785 | 0.0 | 106.95035 | 1 |
GGGCGCG | 38870 | 0.0 | 102.53574 | 2 |
GGCGCGG | 40315 | 0.0 | 95.94766 | 3 |
CGGGTGT | 33070 | 0.0 | 95.715775 | 1 |
CGGAATA | 16135 | 0.0 | 95.394714 | 1 |
CGGGCGT | 46945 | 0.0 | 95.22866 | 1 |
CGGGTAT | 38420 | 0.0 | 95.033676 | 1 |
CGCGGTG | 41050 | 0.0 | 94.09796 | 5 |
GCGCGGT | 40440 | 0.0 | 93.82664 | 4 |
CGGGCAT | 280 | 0.0 | 93.56801 | 1 |
CGGGTTT | 89660 | 0.0 | 91.772095 | 1 |
CGGTAAT | 13260 | 0.0 | 91.60504 | 1 |
CGGATAA | 9980 | 0.0 | 90.50817 | 1 |
CGGATGC | 1335 | 0.0 | 90.09545 | 1 |
CGGAATT | 22010 | 0.0 | 89.49076 | 1 |
CGGTGGT | 74385 | 0.0 | 88.690414 | 7 |