Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00006071637 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 9374796 |
Sequences flagged as poor quality | 0 |
Sequence length | 51 |
%GC | 43 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACGCCAATATCTCGTATGCC | 119152 | 1.2709823232420205 | TruSeq Adapter, Index 6 (100% over 51bp) |
ATCGGAAGAGCACACGTCTGAACTCCAGTCACGCCAATATCTCGTATGCCG | 35197 | 0.37544283630278463 | TruSeq Adapter, Index 6 (100% over 51bp) |
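
The Percentage column above appears to be each sequence's Count divided by Total Sequences from Basic Statistics, times 100. The following is a minimal sketch of that arithmetic, not FastQC's own code; the names `total_sequences`, `overrepresented`, and `percentage` are illustrative and do not come from the report.

```python
# Values copied from the report; variable names are illustrative assumptions.
total_sequences = 9374796  # "Total Sequences" in Basic Statistics

overrepresented = {
    "GATCGGAAGAGCACACGTCTGAACTCCAGTCACGCCAATATCTCGTATGCC": 119152,
    "ATCGGAAGAGCACACGTCTGAACTCCAGTCACGCCAATATCTCGTATGCCG": 35197,
}


def percentage(count, total):
    """Share of all reads, in percent (assumed formula: 100 * count / total)."""
    return 100.0 * count / total


percentages = {
    seq: percentage(count, total_sequences)
    for seq, count in overrepresented.items()
}

for seq, pct in percentages.items():
    print(f"{seq[:12]}...  {pct:.4f}%")
```

Under this assumption the two adapter-derived sequences together account for roughly 1.65% of all reads, and the second sequence is the first shifted by one base.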
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GATCGGA | 13000 | 0.0 | 43.21655 | 1 |
TATGCCG | 5320 | 0.0 | 42.18188 | 45 |
CGTCACG | 470 | 0.0 | 33.98374 | 27 |
CGCCAAT | 17335 | 0.0 | 31.897877 | 33 |
ACGCCAA | 17590 | 0.0 | 31.486614 | 32 |
TCTCGTA | 17950 | 0.0 | 30.842758 | 41 |
AGTCACG | 18010 | 0.0 | 30.839771 | 28 |
TATCTCG | 17950 | 0.0 | 30.830061 | 39 |
ATCGGAA | 18250 | 0.0 | 30.787033 | 2 |
CTCGTAT | 18010 | 0.0 | 30.727518 | 42 |
TCGGAAG | 18300 | 0.0 | 30.714077 | 3 |
GTCACGC | 18050 | 0.0 | 30.684187 | 29 |
CGTATGC | 18080 | 0.0 | 30.60855 | 44 |
TCGTATG | 18150 | 0.0 | 30.540081 | 43 |
ACACGTC | 18440 | 0.0 | 30.535404 | 13 |
ATCTCGT | 18145 | 0.0 | 30.523699 | 40 |
CGGAAGA | 18620 | 0.0 | 30.391582 | 4 |
CGTCTGA | 18595 | 0.0 | 30.208288 | 16 |
ACGTCTG | 18630 | 0.0 | 30.151537 | 15 |
CACGTCT | 18775 | 0.0 | 29.918674 | 14 |
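
Most of the enriched k-mers listed above look like fragments of the TruSeq adapter read reported under Overrepresented sequences. The sketch below checks that for a handful of rows, assuming the Max Obs/Exp Position column is the 1-based start position of the k-mer within the read. The helper `first_position` and the selection of rows are illustrative, not part of FastQC.

```python
# Most abundant overrepresented sequence, copied from the report.
adapter_read = "GATCGGAAGAGCACACGTCTGAACTCCAGTCACGCCAATATCTCGTATGCC"

# A few (k-mer, reported position) pairs copied from the Kmer Content table.
reported = {
    "GATCGGA": 1,
    "ACACGTC": 13,
    "AGTCACG": 28,
    "CGCCAAT": 33,
    "CGTATGC": 44,
}


def first_position(kmer, sequence):
    """Return the 1-based start of the first occurrence, or None if absent."""
    idx = sequence.find(kmer)
    return idx + 1 if idx >= 0 else None


# True where the k-mer sits in the adapter read at the reported position.
matches = {
    kmer: first_position(kmer, adapter_read) == pos
    for kmer, pos in reported.items()
}

for kmer, ok in matches.items():
    print(kmer, "matches reported position" if ok else "does not match")
```

Not every row is an exact fragment of this read: `CGTCACG` (count 470) does not occur in it, and `TATGCCG` at position 45 matches the second overrepresented sequence, which is shifted by one base.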