Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005754235 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 410657067 |
Sequences flagged as poor quality | 0 |
Sequence length | 151 |
%GC | 40 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
ATCGGAAGAGCACACGTCTGAACTCCAGTCACGAATTCGTATCTCGTATG | 839286 | 0.20437636837259154 | TruSeq Adapter, Index 7 (97% over 34bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
ATCGGAA | 259145 | 0.0 | 62.838337 | 1 |
TCGGAAG | 270165 | 0.0 | 59.936794 | 2 |
CGGAAGA | 291180 | 0.0 | 55.597908 | 3 |
AGAGCAC | 516535 | 0.0 | 31.749899 | 7 |
AGCACAC | 546585 | 0.0 | 29.97128 | 9 |
GAGCACA | 557315 | 0.0 | 29.40854 | 8 |
GAAGAGC | 570900 | 0.0 | 28.758255 | 5 |
AAGAGCA | 792940 | 0.0 | 20.961239 | 6 |
TATGCCG | 178765 | 0.0 | 19.321424 | 45-49 |
GGAAGAG | 889060 | 0.0 | 18.795595 | 4 |
CGTATGC | 190705 | 0.0 | 17.951319 | 45-49 |
CTCGTAT | 166350 | 0.0 | 17.940819 | 40-44 |
TCGTATC | 192750 | 0.0 | 17.877588 | 35-39 |
TCTCGTA | 166420 | 0.0 | 17.640585 | 40-44 |
ATGCCGT | 200010 | 0.0 | 17.090084 | 45-49 |
TATCTCG | 172475 | 0.0 | 16.854862 | 40-44 |
GCCGTCT | 201080 | 0.0 | 16.714474 | 50-54 |
GTCACGA | 212420 | 0.0 | 16.372616 | 25-29 |
ATTCGTA | 211270 | 0.0 | 16.319353 | 35-39 |
TGCCGTC | 199290 | 0.0 | 15.909393 | 45-49 |