Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005419032 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 18333324 |
Sequences flagged as poor quality | 0 |
Sequence length | 125 |
%GC | 33 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
CGGGCGCGGTGGTTTACGTTTGTAATTTTAGTATTTTGGGAGGTCGAGGC | 68188 | 0.3719347348031377 | No Hit |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACGTCCGCACATCTCGTAT | 38020 | 0.20738192375807027 | TruSeq Adapter, Index 18 (97% over 40bp) |
CGGTTAATTTTTTGTATTTTTAGTAGAGACGGGGTTTTATCGTGTTAGTT | 29075 | 0.15859098982814027 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CGGGCGC | 34490 | 0.0 | 115.15074 | 1 |
CGGGTGC | 11570 | 0.0 | 114.67918 | 1 |
CGGTTAA | 105545 | 0.0 | 112.915215 | 1 |
CGGGTAC | 13545 | 0.0 | 108.724304 | 1 |
CGGCTAA | 670 | 0.0 | 103.47803 | 1 |
GGGCGCG | 40170 | 0.0 | 103.2878 | 2 |
CGGGTAT | 41860 | 0.0 | 96.204796 | 1 |
GGCGCGG | 41745 | 0.0 | 96.01101 | 3 |
CGGGTGT | 35620 | 0.0 | 95.38971 | 1 |
CGCGGTG | 41935 | 0.0 | 95.10757 | 5 |
GCGCGGT | 41610 | 0.0 | 94.43412 | 4 |
CGGGTTT | 99640 | 0.0 | 92.81835 | 1 |
CGGGCGT | 52335 | 0.0 | 92.629036 | 1 |
CGGAATA | 15400 | 0.0 | 91.824585 | 1 |
CGGTAAT | 12805 | 0.0 | 91.06314 | 1 |
CGGAATT | 22270 | 0.0 | 88.29593 | 1 |
CGGATAA | 8390 | 0.0 | 88.262115 | 1 |
CGGATGC | 1265 | 0.0 | 87.87946 | 1 |
GCGGTGG | 62370 | 0.0 | 87.264946 | 6 |
CGGTGGT | 77420 | 0.0 | 87.192245 | 7 |