Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00002855695 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 333387407 |
Sequences flagged as poor quality | 0 |
Sequence length | 151 |
%GC | 40 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
ATCGGAAGAGCACACGTCTGAACTCCAGTCACAGCGATAGATCTCGTATG | 1697471 | 0.5091587037659164 | TruSeq Adapter, Index 1 (97% over 35bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
ATCGGAA | 313330 | 0.0 | 98.42111 | 1 |
TCGGAAG | 325465 | 0.0 | 94.49269 | 2 |
CGGAAGA | 336840 | 0.0 | 91.1408 | 3 |
AGAGCAC | 537820 | 0.0 | 57.433617 | 7 |
AGCACAC | 565855 | 0.0 | 54.584854 | 9 |
GAGCACA | 573990 | 0.0 | 53.87246 | 8 |
GAAGAGC | 584305 | 0.0 | 52.889603 | 5 |
AAGAGCA | 775665 | 0.0 | 40.00413 | 6 |
GGAAGAG | 858130 | 0.0 | 36.31689 | 4 |
TATGCCG | 252950 | 0.0 | 24.700426 | 45-49 |
CTCGTAT | 230065 | 0.0 | 23.738958 | 40-44 |
CGTATGC | 260625 | 0.0 | 23.686047 | 45-49 |
TCTCGTA | 232350 | 0.0 | 23.401316 | 40-44 |
AGCGATA | 265380 | 0.0 | 23.375933 | 30-34 |
CGATAGA | 264515 | 0.0 | 23.145863 | 35-39 |
ATGCCGT | 270140 | 0.0 | 22.941914 | 45-49 |
GCGATAG | 266725 | 0.0 | 22.895565 | 30-34 |
TCGTATG | 237135 | 0.0 | 22.600245 | 40-44 |
GCCGTCT | 269290 | 0.0 | 22.595093 | 50-54 |
TGCCGTC | 269580 | 0.0 | 22.592205 | 45-49 |