Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005418887 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 22929299 |
Sequences flagged as poor quality | 0 |
Sequence length | 125 |
%GC | 33 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
CGGGCGCGGTGGTTTACGTTTGTAATTTTAGTATTTTGGGAGGTCGAGGC | 86353 | 0.37660549500444823 | No Hit |
CGGTTAATTTTTTGTATTTTTAGTAGAGACGGGGTTTTATCGTGTTAGTT | 44971 | 0.19612897891034525 | No Hit |
CGGAATGGAATGGAATGGAATGGAATGGAATGAAATGTAATGGATTTAAT | 34123 | 0.14881833064325256 | No Hit |
CGGAATGGAATGGAATGGAATGGAATGGAATGGAATGGAATGGAATGGAA | 27990 | 0.12207089279092222 | No Hit |
CGGTTAATTTTTTGTATTTTTAGTAGAGACGGGGTTTTATTTTGTTAGTT | 26249 | 0.11447798731221569 | No Hit |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACCGATGTATCTCGTATGC | 24907 | 0.10862521353138618 | TruSeq Adapter, Index 2 (100% over 50bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CGGGCGC | 42495 | 0.0 | 114.988945 | 1 |
CGGTTAA | 151835 | 0.0 | 113.513054 | 1 |
CGGGTGC | 12810 | 0.0 | 113.405914 | 1 |
CGGGTAC | 15990 | 0.0 | 111.09142 | 1 |
CGGCTAA | 1125 | 0.0 | 111.04678 | 1 |
GGGCGCG | 50345 | 0.0 | 103.27804 | 2 |
GGCGCGG | 52125 | 0.0 | 96.68001 | 3 |
CGCGGTG | 52800 | 0.0 | 95.11638 | 5 |
CGGGTAT | 45105 | 0.0 | 94.48657 | 1 |
GCGCGGT | 52420 | 0.0 | 94.455864 | 4 |
CGGAATA | 24235 | 0.0 | 93.96544 | 1 |
CGGGTGT | 40510 | 0.0 | 93.6469 | 1 |
CGGGCAT | 375 | 0.0 | 93.59657 | 1 |
CGGGCGT | 60680 | 0.0 | 92.00849 | 1 |
CGGGTTT | 111790 | 0.0 | 90.23714 | 1 |
CGGTAAT | 15300 | 0.0 | 90.089424 | 1 |
CGGTGGT | 92525 | 0.0 | 89.0183 | 7 |
CGGATAC | 2060 | 0.0 | 88.94524 | 1 |
GCGGTGG | 75305 | 0.0 | 88.70773 | 6 |
CGGATAA | 11770 | 0.0 | 87.69245 | 1 |