Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005727706 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 5239822 |
Sequences flagged as poor quality | 0 |
Sequence length | 51 |
%GC | 39 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACCCGTCCCGATCTCGTATG | 104888 | 2.001747387602098 | TruSeq Adapter, Index 16 (97% over 40bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GATCGGA | 12200 | 0.0 | 43.585396 | 1 |
CGTCCCG | 11780 | 0.0 | 43.069176 | 35 |
ATCGGAA | 12405 | 0.0 | 43.057545 | 2 |
TCGGAAG | 12405 | 0.0 | 42.948723 | 3 |
ACACGTC | 12265 | 0.0 | 42.815258 | 13 |
GTCCCGA | 11795 | 0.0 | 42.7855 | 36 |
CGTCTGA | 12295 | 0.0 | 42.783985 | 16 |
CCCGATC | 11660 | 0.0 | 42.740585 | 38 |
CGGAAGA | 12500 | 0.0 | 42.660343 | 4 |
ACCCGTC | 12075 | 0.0 | 42.59458 | 32 |
TCCCGAT | 11840 | 0.0 | 42.56588 | 37 |
CCGTCCC | 11930 | 0.0 | 42.54651 | 34 |
CACCCGT | 12090 | 0.0 | 42.467297 | 31 |
CCGATCT | 11720 | 0.0 | 42.464184 | 39 |
CCCGTCC | 12090 | 0.0 | 42.44869 | 33 |
TCACCCG | 12125 | 0.0 | 42.38182 | 30 |
CACGTCT | 12395 | 0.0 | 42.36621 | 14 |
ACGTCTG | 12475 | 0.0 | 42.14863 | 15 |
TCTCGTA | 12005 | 0.0 | 42.03787 | 43 |
GCACACG | 12555 | 0.0 | 41.98758 | 11 |