Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005418959 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 17037608 |
Sequences flagged as poor quality | 0 |
Sequence length | 125 |
%GC | 34 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
CGGGCGCGGTGGTTTACGTTTGTAATTTTAGTATTTTGGGAGGTCGAGGC | 35477 | 0.20822758687721893 | No Hit |
TGGAGTAGTAAGTTATAATATGGGAGATTATTTTGAAGTTTGGTAGGATA | 24576 | 0.14424560067352177 | No Hit |
TGGATGTTAGAGGGGTGTTTTGGGTAATTTTTGGGATTTAGAAGTGAAAG | 20058 | 0.11772779371376546 | No Hit |
CGGTTAATTTTTTGTATTTTTAGTAGAGACGGGGTTTTATCGTGTTAGTT | 19492 | 0.11440573113314968 | No Hit |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACATGTCAGAATCTCGTAT | 18273 | 0.10725097091094009 | TruSeq Adapter, Index 15 (97% over 40bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CGGTTAA | 89970 | 0.0 | 112.048904 | 1 |
CGGGCGC | 24285 | 0.0 | 109.18245 | 1 |
CGGGTGC | 8825 | 0.0 | 108.783455 | 1 |
CGGGTAC | 10105 | 0.0 | 104.89888 | 1 |
GGGCGCG | 29240 | 0.0 | 97.27381 | 2 |
CGGGTAT | 34205 | 0.0 | 91.99473 | 1 |
CGGGTGT | 31280 | 0.0 | 91.02646 | 1 |
CGGGTTT | 90935 | 0.0 | 90.537506 | 1 |
CGGGCGT | 40685 | 0.0 | 90.259735 | 1 |
CGCGGTG | 30300 | 0.0 | 89.88009 | 5 |
GGCGCGG | 30800 | 0.0 | 89.46896 | 3 |
CGGAATA | 13275 | 0.0 | 88.27839 | 1 |
GCGCGGT | 30640 | 0.0 | 87.87804 | 4 |
CGGAAAT | 8615 | 0.0 | 87.87697 | 1 |
CGGTAAT | 10790 | 0.0 | 85.71815 | 1 |
CGGAATT | 17830 | 0.0 | 85.32037 | 1 |
CGGTGGT | 56870 | 0.0 | 83.08259 | 7 |
CGGATAA | 6755 | 0.0 | 82.822 | 1 |
CGGAGTG | 9535 | 0.0 | 82.45659 | 1 |
CGGGATA | 9460 | 0.0 | 82.103676 | 1 |