Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005416001 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 20582351 |
Sequences flagged as poor quality | 0 |
Sequence length | 125 |
%GC | 33 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACTGACCAATCTCGTATGC | 87479 | 0.4250194742087529 | TruSeq Adapter, Index 4 (100% over 50bp) |
CGGGCGCGGTGGTTTACGTTTGTAATTTTAGTATTTTGGGAGGTCGAGGC | 78810 | 0.3829008649206303 | No Hit |
CGGTTAATTTTTTGTATTTTTAGTAGAGACGGGGTTTTATCGTGTTAGTT | 33439 | 0.1624644337277117 | No Hit |
TGGAGTAGTAAGTTATAATATGGGAGATTATTTTGAAGTTTGGTAGGATA | 27073 | 0.13153502240827591 | No Hit |
TGGATGTTAGAGGGGTGTTTTGGGTAATTTTTGGGATTTAGAAGTGAAAG | 24556 | 0.11930609870563379 | No Hit |
CGGGCGCGGTGGTTTACGTTTGTAATTTTAGTATTTTGGGAGGTCGAGGT | 20676 | 0.10045499661336064 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CGGGCGC | 39365 | 0.0 | 114.43572 | 1 |
CGGGTGC | 12800 | 0.0 | 113.4373 | 1 |
CGGTTAA | 119415 | 0.0 | 112.23889 | 1 |
CGGGTAC | 16070 | 0.0 | 109.61043 | 1 |
GGGCGCG | 45675 | 0.0 | 103.92784 | 2 |
CGGCTAA | 680 | 0.0 | 101.5138 | 1 |
GGCGCGG | 47190 | 0.0 | 97.33871 | 3 |
CGCGGTG | 47260 | 0.0 | 96.955345 | 5 |
GCGCGGT | 46950 | 0.0 | 95.94824 | 4 |
CGGGCGT | 56595 | 0.0 | 95.57886 | 1 |
CGGGTGT | 39285 | 0.0 | 94.61309 | 1 |
CGGGTAT | 46195 | 0.0 | 94.37305 | 1 |
CGGAATA | 17710 | 0.0 | 93.74792 | 1 |
CGGATGC | 1560 | 0.0 | 92.31383 | 1 |
CGGGTTT | 104470 | 0.0 | 92.255486 | 1 |
CGGTAAT | 14590 | 0.0 | 91.28108 | 1 |
CGGTGGT | 87260 | 0.0 | 89.62345 | 7 |
GCGGTGG | 69460 | 0.0 | 89.45687 | 6 |
CGGAATT | 23630 | 0.0 | 87.310425 | 1 |
CGGGTAA | 10390 | 0.0 | 87.2289 | 1 |