Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005788463 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 36228588 |
Sequences flagged as poor quality | 0 |
Sequence length | 101 |
%GC | 51 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACATCACGATCTCGTATGC | 200564 | 0.553607002293327 | TruSeq Adapter, Index 1 (100% over 50bp) |
AGATCGGAAGAGCACACGTCTGAACTCCAGTCACATCACGATCTCGTATG | 90097 | 0.2486903436589911 | TruSeq Adapter, Index 1 (100% over 49bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CTCGTAT | 40775 | 0.0 | 37.20058 | 42-43 |
TATGCCG | 43545 | 0.0 | 37.026398 | 46-47 |
CGTATGC | 43625 | 0.0 | 36.921833 | 44-45 |
ATCTCGT | 45245 | 0.0 | 33.456924 | 40-41 |
CACGATC | 44865 | 0.0 | 33.209152 | 36-37 |
ATCACGA | 50470 | 0.0 | 31.698408 | 34-35 |
CCGTCTT | 54250 | 0.0 | 29.832653 | 50-51 |
TGCCGTC | 55400 | 0.0 | 29.383389 | 48-49 |
CGATCTC | 54475 | 0.0 | 27.672243 | 38-39 |
TCGTATG | 41645 | 0.0 | 27.594198 | 42-43 |
CGTCTGA | 62485 | 0.0 | 26.087296 | 16-17 |
TCACATC | 62855 | 0.0 | 25.32708 | 30-31 |
CTTGAAA | 64255 | 0.0 | 24.969482 | 60-61 |
CACGTCT | 64990 | 0.0 | 24.960918 | 14-15 |
TGCTTGA | 65880 | 0.0 | 24.639725 | 58-59 |
TCTCGTA | 43740 | 0.0 | 24.545527 | 40-41 |
ACATCAC | 66010 | 0.0 | 24.15701 | 32-33 |
ACGATCT | 44010 | 0.0 | 23.963507 | 36-37 |
TCACGAT | 47170 | 0.0 | 23.731647 | 34-35 |
TCTGAAC | 69025 | 0.0 | 23.722343 | 18-19 |