Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005416519 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 11683785 |
Sequences flagged as poor quality | 0 |
Sequence length | 125 |
%GC | 33 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
CGGGCGCGGTGGTTTACGTTTGTAATTTTAGTATTTTGGGAGGTCGAGGC | 62444 | 0.5344500947252967 | No Hit |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACGTCCGCACATCTCGTAT | 56041 | 0.479647648428998 | TruSeq Adapter, Index 18 (97% over 40bp) |
CGGTTAATTTTTTGTATTTTTAGTAGAGACGGGGTTTTATCGTGTTAGTT | 17642 | 0.15099558918620978 | No Hit |
CGGGCGCGGTGGTTTACGTTTGTAATTTTAGTATTTTGGGAGGTCGAGGT | 15013 | 0.12849431926383445 | No Hit |
CGGAATGGAATGGAATGGAATGGAATGGAATGAAATGTAATGGATTTAAT | 14739 | 0.1261491888116736 | No Hit |
CGGAATGGAATGGAATGGAATGGAATGGAATGGAATGGAATGGAATGGAA | 12390 | 0.10604440256303928 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CGGGCGC | 29515 | 0.0 | 115.90745 | 1 |
CGGGTGC | 8835 | 0.0 | 114.274445 | 1 |
CGGGTAC | 10905 | 0.0 | 113.18704 | 1 |
CGGTTAA | 63445 | 0.0 | 112.51046 | 1 |
GGGCGCG | 33285 | 0.0 | 106.48594 | 2 |
CGGCTAA | 390 | 0.0 | 100.86062 | 1 |
GGCGCGG | 34595 | 0.0 | 99.39951 | 3 |
CGCGGTG | 34525 | 0.0 | 98.63556 | 5 |
CGGGTAT | 31435 | 0.0 | 98.30544 | 1 |
GCGCGGT | 34240 | 0.0 | 98.065796 | 4 |
CGGGCGT | 41085 | 0.0 | 95.01678 | 1 |
CGGGTGT | 26415 | 0.0 | 94.9891 | 1 |
CGGATGC | 860 | 0.0 | 94.943306 | 1 |
CGGGTTT | 57665 | 0.0 | 92.04764 | 1 |
GCGGTGG | 49020 | 0.0 | 91.874054 | 6 |
CGGAATA | 10835 | 0.0 | 91.695694 | 1 |
CGGTAAT | 8405 | 0.0 | 91.68601 | 1 |
CGGTGGT | 60850 | 0.0 | 91.572235 | 7 |
CGGATAC | 1150 | 0.0 | 90.69482 | 1 |
GGGAGGC | 28305 | 0.0 | 87.69477 | 2 |