Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005068470 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 4615974 |
Sequences flagged as poor quality | 0 |
Sequence length | 101 |
%GC | 44 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN | 4781 | 0.10357510679219596 | No Hit |
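
The Percentage column above appears to be the sequence count expressed as a share of Total Sequences (4615974). The report itself does not state the formula, so the sketch below treats that as an assumption and simply checks it against the reported figures. The function name `percent_of_total` is hypothetical and is not part of FastQC.

```python
# Minimal sketch, assuming Percentage = Count / Total Sequences * 100.
# This formula is an assumption checked against the report's own numbers;
# it is not taken from FastQC's source or documentation.

TOTAL_SEQUENCES = 4615974    # "Total Sequences" in Basic Statistics
OVERREPRESENTED_COUNT = 4781 # Count for the all-N sequence


def percent_of_total(count: int, total: int) -> float:
    """Return count as a percentage of total (hypothetical helper)."""
    if total <= 0:
        raise ValueError("total must be positive")
    return count / total * 100.0


pct = percent_of_total(OVERREPRESENTED_COUNT, TOTAL_SEQUENCES)
print(f"{pct:.4f}%")
```

Under that assumption the computed value agrees with the reported 0.1036% (rounded), so roughly one read in a thousand is a run of uncalled bases.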
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GGCGCGT | 845 | 0.0 | 24.172375 | 4 |
GGCGATC | 610 | 0.0 | 24.138285 | 2 |
ACGTGCG | 515 | 0.0 | 21.222576 | 8 |
GCGCGTG | 1050 | 0.0 | 19.454908 | 5 |
TGGCGCG | 1130 | 0.0 | 19.335644 | 3 |
GCGTACG | 140 | 3.573041E-8 | 18.625292 | 94-95 |
CGCGTGC | 1120 | 0.0 | 18.241552 | 6 |
CGTACGG | 125 | 4.650621E-6 | 17.067543 | 94-95 |
GCGCGCG | 390 | 3.0013325E-10 | 17.055906 | 6 |
CGATCTG | 930 | 0.0 | 16.85538 | 4 |
CGCGCGC | 395 | 3.601599E-10 | 16.84129 | 7 |
CGTGCGC | 680 | 0.0 | 16.772898 | 9 |
TAACACG | 775 | 0.0 | 15.323073 | 4 |
TATCGCG | 125 | 8.65696E-5 | 15.213218 | 22-23 |
CGCACGA | 430 | 0.0 | 14.92673 | 28-29 |
GGGCGAT | 1040 | 0.0 | 14.614266 | 1 |
GGTGGCG | 1605 | 0.0 | 14.500448 | 1 |
TCGCACG | 435 | 0.0 | 14.208517 | 26-27 |
CGCGAGT | 435 | 0.0 | 13.658769 | 12-13 |
TGCGACA | 1085 | 0.0 | 13.575109 | 6 |