Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005304720 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 38641814 |
Sequences flagged as poor quality | 0 |
Sequence length | 75 |
%GC | 33 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
CGGAATGGAATGGAATGGAATGGAATGGAATGAAATGTAATGGATTTAATTTGATTGTAATGGAATGGAATAGAA | 100488 | 0.26004990345432544 | No Hit |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACCGGAATATCTCGTATGCCGTCTTCTGCTTGAAAAAAAAAAAA | 63866 | 0.16527691997068253 | RNA PCR Primer, Index 37 (100% over 43bp) |
CGGAATGGAATGGAATGGAATGGAATGTAAAGTAATGGAATTAATTTGATTGTAATGGAATGGAATGGAATGGAA | 51384 | 0.13297512378689055 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CGGCATA | 100 | 0.0 | 69.19469 | 1 |
CCGTCAT | 15 | 0.0023561749 | 68.999664 | 50 |
TCGGACC | 25 | 3.0966603E-6 | 68.998764 | 44 |
CGCGTCC | 15 | 0.0023563805 | 68.998146 | 63 |
ACCGTGC | 15 | 0.0023566345 | 68.99627 | 57 |
CGGCTAA | 350 | 0.0 | 67.217705 | 1 |
CGGGTGC | 19505 | 0.0 | 66.604996 | 1 |
CGGTTAA | 113105 | 0.0 | 65.05604 | 1 |
CGGGTAC | 18690 | 0.0 | 64.47435 | 1 |
CGGGCGC | 45275 | 0.0 | 63.730946 | 1 |
CGGGCAA | 105 | 0.0 | 62.604725 | 1 |
CGGGTTT | 163130 | 0.0 | 60.28714 | 1 |
CGGCATT | 380 | 0.0 | 60.090126 | 1 |
CGGCAAT | 75 | 0.0 | 59.96874 | 1 |
GGGAGGC | 86270 | 0.0 | 59.453087 | 2 |
AACGATC | 720 | 0.0 | 59.4163 | 60 |
CAAGATC | 105 | 0.0 | 59.141575 | 25 |
TACGCCG | 35 | 3.2909156E-7 | 59.1405 | 31 |
AGATCGC | 28580 | 0.0 | 58.81056 | 27 |
GTGAGTC | 80145 | 0.0 | 58.78856 | 19 |