Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005310429 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 15914526 |
Sequences flagged as poor quality | 0 |
Sequence length | 51 |
%GC | 41 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
ATCGGAAGAGCACACGTCTGAACTCCAGTCACGTCCGCACATCTCGTATGC | 241747 | 1.5190336174636931 | TruSeq Adapter, Index 18 (97% over 39bp) |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACGTCCGCACATCTCGTATG | 47685 | 0.2996319211769172 | TruSeq Adapter, Index 18 (97% over 40bp) |
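
The Percentage column appears to be each sequence's Count divided by Total Sequences from Basic Statistics, times 100. A minimal sketch that reproduces the two values above under that assumption; the names `total_sequences`, `overrepresented`, and `percentage` are illustrative and not part of the report:

```python
# Values copied from the Basic Statistics and Overrepresented sequences tables.
total_sequences = 15914526

overrepresented = {
    "ATCGGAAGAGCACACGTCTGAACTCCAGTCACGTCCGCACATCTCGTATGC": 241747,
    "GATCGGAAGAGCACACGTCTGAACTCCAGTCACGTCCGCACATCTCGTATG": 47685,
}


def percentage(count: int, total: int) -> float:
    """Share of all reads made up by one sequence, in percent (assumed definition)."""
    return 100.0 * count / total


for seq, count in overrepresented.items():
    print(f"{seq}\t{count}\t{percentage(count, total_sequences):.4f}%")
```

Together the two sequences account for roughly 1.8% of the reads under this definition.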
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CGTATGC | 29575 | 0.0 | 41.08563 | 45 |
GATCGGA | 5670 | 0.0 | 40.688747 | 1 |
ATCGGAA | 33065 | 0.0 | 37.36775 | 1 |
TCGGAAG | 33420 | 0.0 | 36.92207 | 2 |
CGGAAGA | 34160 | 0.0 | 36.109066 | 3 |
ACGTCCG | 34380 | 0.0 | 35.366386 | 31 |
CGTCCGC | 34465 | 0.0 | 35.31224 | 32 |
GTCCGCA | 34530 | 0.0 | 35.15456 | 33 |
ACACGTC | 34790 | 0.0 | 35.14402 | 12 |
TCCGCAC | 34485 | 0.0 | 35.128677 | 34 |
CGTCTGA | 34790 | 0.0 | 35.118156 | 15 |
CTCGTAT | 34470 | 0.0 | 35.112988 | 43 |
TCGTATG | 34670 | 0.0 | 34.982906 | 44 |
ACGTCTG | 35025 | 0.0 | 34.876106 | 14 |
GTCACGT | 34845 | 0.0 | 34.86215 | 28 |
CCGCACA | 34705 | 0.0 | 34.79623 | 35 |
AGTCACG | 34895 | 0.0 | 34.792854 | 27 |
TCTCGTA | 32480 | 0.0 | 34.722298 | 42 |
TCACGTC | 35105 | 0.0 | 34.674435 | 29 |
CACGTCT | 35320 | 0.0 | 34.60392 | 13 |
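
Many of the 7-mers above look like windows of the first overrepresented sequence, with the Max Obs/Exp Position matching the window's start. The sketch below checks that for a few rows, assuming positions are 1-based start offsets within the read; `find_positions` and `reported` are hypothetical helper names, not FastQC output:

```python
# First overrepresented sequence from the report (51 bases).
read = "ATCGGAAGAGCACACGTCTGAACTCCAGTCACGTCCGCACATCTCGTATGC"

# A few (k-mer, Max Obs/Exp Position) pairs copied from the Kmer Content table.
reported = {
    "CGTATGC": 45,
    "ATCGGAA": 1,
    "ACGTCCG": 31,
    "ACACGTC": 12,
    "CGTCTGA": 15,
}


def find_positions(seq: str, kmer: str) -> list[int]:
    """Return every 1-based start position at which kmer occurs in seq."""
    k = len(kmer)
    return [i + 1 for i in range(len(seq) - k + 1) if seq[i:i + k] == kmer]


for kmer, pos in reported.items():
    hits = find_positions(read, kmer)
    status = "matches" if pos in hits else "does not match"
    print(f"{kmer}: reported {pos}, found at {hits} ({status})")
```

`GATCGGA` is not a window of this sequence; it is the first 7 bases of the second overrepresented sequence.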