Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005418209 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 18232346 |
Sequences flagged as poor quality | 0 |
Sequence length | 125 |
%GC | 32 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
CGGGCGCGGTGGTTTACGTTTGTAATTTTAGTATTTTGGGAGGTCGAGGC | 37123 | 0.2036106598679073 | No Hit |
CGGAATGGAATGGAATGGAATGGAATGGAATGAAATGTAATGGATTTAAT | 25930 | 0.14221976700091146 | No Hit |
TGGAGTAGTAAGTTATAATATGGGAGATTATTTTGAAGTTTGGTAGGATA | 23766 | 0.13035075135147173 | No Hit |
CGGAATAGAATGGAATGGAATGGAATGGAACGGAATGGAATGGAATGGAA | 23316 | 0.12788261038924997 | No Hit |
CGGAATGGAATGGAATGGAATGGAATGGAATGGAATGGAATGGAATGGAA | 20811 | 0.11414329236621552 | No Hit |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACCCGTCCCGATCTCGTAT | 19210 | 0.10536219529839988 | TruSeq Adapter, Index 16 (97% over 40bp) |
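
The Percentage column appears to be each sequence's Count expressed as a share of Total Sequences (18232346) from Basic Statistics. The sketch below is a minimal check of that assumed relationship against two rows of the table. The helper name `percentage_of_total` is hypothetical and is not part of FastQC.

```python
# Total Sequences as reported under Basic Statistics.
TOTAL_SEQUENCES = 18232346

# (Count, reported Percentage) copied from two rows of the table above.
ROWS = {
    "CGGGCGCGGTGGTTTACGTTTGTAATTTTAGTATTTTGGGAGGTCGAGGC": (37123, 0.2036106598679073),
    "GATCGGAAGAGCACACGTCTGAACTCCAGTCACCCGTCCCGATCTCGTAT": (19210, 0.10536219529839988),
}


def percentage_of_total(count: int, total: int) -> float:
    """Return count as a percentage of total (assumed definition of the column)."""
    return 100.0 * count / total


for sequence, (count, reported) in ROWS.items():
    computed = percentage_of_total(count, TOTAL_SEQUENCES)
    # Show the recomputed value next to the value printed in the report.
    print(f"{sequence[:12]}... computed={computed:.6f} reported={reported:.6f}")
```

If the assumption holds, the computed and reported values should agree to within floating-point rounding.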
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CGGGTGC | 10835 | 0.0 | 113.26698 | 1 |
CGGGCGC | 22540 | 0.0 | 108.018616 | 1 |
CGGTTAA | 52815 | 0.0 | 107.183655 | 1 |
CGGGTAC | 9250 | 0.0 | 105.94618 | 1 |
CGGCTAA | 570 | 0.0 | 101.87679 | 1 |
CGGGGAC | 3880 | 0.0 | 96.741806 | 1 |
CGGGTCT | 695 | 0.0 | 95.612915 | 1 |
CGGGTAA | 11680 | 0.0 | 94.360306 | 1 |
CGGGTCA | 280 | 0.0 | 94.074745 | 1 |
CGGGTGT | 34855 | 0.0 | 94.07105 | 1 |
GGGCGCG | 27790 | 0.0 | 93.57823 | 2 |
CGGGCTT | 685 | 0.0 | 92.63896 | 1 |
CGGGCAT | 395 | 0.0 | 92.45089 | 1 |
CGGGTCC | 315 | 0.0 | 91.224 | 1 |
GGGAGGC | 39435 | 0.0 | 91.03865 | 2 |
CGGGTAT | 32635 | 0.0 | 90.91301 | 1 |
CGGAATA | 20485 | 0.0 | 90.74109 | 1 |
CGGATGC | 1315 | 0.0 | 90.14005 | 1 |
CGGATTG | 13495 | 0.0 | 90.09805 | 1 |
GGTTACT | 1490 | 0.0 | 89.99851 | 2 |