Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005418913 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 19364065 |
Sequences flagged as poor quality | 0 |
Sequence length | 125 |
%GC | 34 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
CGGGCGCGGTGGTTTACGTTTGTAATTTTAGTATTTTGGGAGGTCGAGGC | 56213 | 0.29029545191053635 | No Hit |
TGGAGTAGTAAGTTATAATATGGGAGATTATTTTGAAGTTTGGTAGGATA | 31679 | 0.16359684807916108 | No Hit |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACCAGATCATCTCGTATGC | 30049 | 0.15517919403802868 | TruSeq Adapter, Index 7 (100% over 50bp) |
CGGTTAATTTTTTGTATTTTTAGTAGAGACGGGGTTTTATCGTGTTAGTT | 22772 | 0.11759927473905918 | No Hit |
TGGATGTTAGAGGGGTGTTTTGGGTAATTTTTGGGATTTAGAAGTGAAAG | 22651 | 0.11697440594214076 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CGGTTAA | 96985 | 0.0 | 111.61124 | 1 |
CGGGCGC | 34680 | 0.0 | 111.17493 | 1 |
CGGGTGC | 10535 | 0.0 | 111.05409 | 1 |
CGGGTAC | 13170 | 0.0 | 107.70602 | 1 |
GGGCGCG | 40675 | 0.0 | 100.77407 | 2 |
CGCGGTG | 41320 | 0.0 | 94.54079 | 5 |
GGCGCGG | 41985 | 0.0 | 94.348305 | 3 |
CGGGTAT | 43355 | 0.0 | 93.961105 | 1 |
CGGGCGT | 56075 | 0.0 | 93.95757 | 1 |
GCGCGGT | 41320 | 0.0 | 93.74972 | 4 |
CGGGTGT | 37100 | 0.0 | 91.32829 | 1 |
CGGGTTT | 101565 | 0.0 | 90.89838 | 1 |
CGGTGGT | 74390 | 0.0 | 86.87157 | 7 |
CGGAATT | 20670 | 0.0 | 86.32967 | 1 |
GCGGTGG | 61280 | 0.0 | 86.08274 | 6 |
CGGTAAT | 11790 | 0.0 | 85.98829 | 1 |
CGGAAAT | 9765 | 0.0 | 85.08232 | 1 |
CGGAATA | 13925 | 0.0 | 83.67589 | 1 |
CGGATAA | 7890 | 0.0 | 82.111 | 1 |
GGGCGTG | 38580 | 0.0 | 81.82778 | 2 |