Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005418821 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 17986409 |
Sequences flagged as poor quality | 0 |
Sequence length | 125 |
%GC | 33 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
CGGGCGCGGTGGTTTACGTTTGTAATTTTAGTATTTTGGGAGGTCGAGGC | 58151 | 0.32330522451702287 | No Hit |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACCTTGTAATCTCGTATGC | 38975 | 0.21669139181701028 | TruSeq Adapter, Index 12 (100% over 50bp) |
CGGTTAATTTTTTGTATTTTTAGTAGAGACGGGGTTTTATCGTGTTAGTT | 29312 | 0.16296749395613097 | No Hit |
TGGAATGGAATGGAATGGAATGGAATGGAATGAAATGTAATGGATTTAAT | 23237 | 0.12919199157541675 | No Hit |
CGGAATATTTTTATATAAAATTAAGATAGAAGTATTTTCGGAAATATTTT | 18839 | 0.10474019577782313 | No Hit |
TGGGTGTGGTGGTTTATGTTTGTAATTTTAGTATTTTGGGAGGTTGAGGT | 18610 | 0.10346701223129086 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CGGGCGC | 29955 | 0.0 | 114.01935 | 1 |
CGGTTAA | 106280 | 0.0 | 113.67787 | 1 |
CGGGTGC | 9590 | 0.0 | 112.16055 | 1 |
CGGGTAC | 11865 | 0.0 | 107.30164 | 1 |
CGGCTAA | 790 | 0.0 | 103.92316 | 1 |
GGGCGCG | 36330 | 0.0 | 100.307755 | 2 |
CGCGGTG | 36610 | 0.0 | 96.0473 | 5 |
CGGAATA | 13895 | 0.0 | 94.36553 | 1 |
GGCGCGG | 37280 | 0.0 | 94.32113 | 3 |
GCGCGGT | 36755 | 0.0 | 93.80745 | 4 |
CGGGTAT | 34855 | 0.0 | 93.70605 | 1 |
CGGGTGT | 30850 | 0.0 | 93.41342 | 1 |
CGGGCGT | 43780 | 0.0 | 91.92896 | 1 |
CGGATAC | 1540 | 0.0 | 90.783615 | 1 |
CGGGCAT | 250 | 0.0 | 90.4282 | 1 |
CGGGTTT | 83340 | 0.0 | 90.28058 | 1 |
CGGTGGT | 65595 | 0.0 | 89.23948 | 7 |
CGGATGC | 1185 | 0.0 | 88.35979 | 1 |
GCGGTGG | 53955 | 0.0 | 88.01074 | 6 |
CGGTAAT | 11005 | 0.0 | 87.6301 | 1 |