Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00004962578 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 461707081 |
Sequences flagged as poor quality | 0 |
Sequence length | 151 |
%GC | 40 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
ATCGGAAGAGCACACGTCTGAACTCCAGTCACTCCGCGAAATCTCGTATG | 592295 | 0.12828371588262472 | TruSeq Adapter, Index 6 (97% over 35bp) |
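
The Percentage column appears to be the sequence count divided by Total Sequences from Basic Statistics. The sketch below recomputes it as a sanity check; the variable names are illustrative and not part of the FastQC output.

```python
# Values copied from the tables above.
total_sequences = 461707081        # "Total Sequences" in Basic Statistics
overrepresented_count = 592295     # "Count" for the TruSeq Adapter, Index 6 hit

# Assumption: Percentage = count / total sequences * 100.
percentage = overrepresented_count / total_sequences * 100

print(f"{percentage:.4f}% of reads match the overrepresented sequence")
```

About 0.13% of reads start with this adapter-like sequence, which agrees with the reported value.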
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
ATCGGAA | 180205 | 0.0 | 59.1877 | 1 |
TCGGAAG | 194240 | 0.0 | 54.978268 | 2 |
CGGAAGA | 215740 | 0.0 | 49.43543 | 3 |
AGAGCAC | 476265 | 0.0 | 22.908293 | 7 |
CGCGAAA | 94465 | 0.0 | 22.196323 | 35-39 |
TCCGCGA | 98465 | 0.0 | 21.518078 | 30-34 |
AGCACAC | 509615 | 0.0 | 21.420008 | 9 |
GAGCACA | 520175 | 0.0 | 21.053442 | 8 |
CCGCGAA | 100910 | 0.0 | 20.752409 | 30-34 |
GAAGAGC | 532245 | 0.0 | 20.640497 | 5 |
TATGCCG | 112935 | 0.0 | 19.236544 | 45-49 |
CGTATGC | 126280 | 0.0 | 17.109531 | 45-49 |
CTCGTAT | 115855 | 0.0 | 16.808367 | 40-44 |
CTCCGCG | 127765 | 0.0 | 16.764996 | 30-34 |
GCGAAAT | 128625 | 0.0 | 16.741037 | 35-39 |
TCTCGTA | 120295 | 0.0 | 16.133753 | 40-44 |
ACTCCGC | 140380 | 0.0 | 15.467104 | 30-34 |
ATGCCGT | 141785 | 0.0 | 15.41948 | 45-49 |
TGCCGTC | 141575 | 0.0 | 15.246775 | 45-49 |
GCCGTCT | 145665 | 0.0 | 14.7831955 | 50-54 |
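
Most of the enriched k-mers look like fragments of the overrepresented sequence reported above. The following sketch checks this by locating each k-mer within that 50-base sequence; `locate_kmers` is a hypothetical helper written for this illustration, not a FastQC function.

```python
# Overrepresented sequence from the table above (50 bases).
ADAPTER_HIT = "ATCGGAAGAGCACACGTCTGAACTCCAGTCACTCCGCGAAATCTCGTATG"

# K-mers from the Kmer Content table, in the order listed.
KMERS = [
    "ATCGGAA", "TCGGAAG", "CGGAAGA", "AGAGCAC", "CGCGAAA",
    "TCCGCGA", "AGCACAC", "GAGCACA", "CCGCGAA", "GAAGAGC",
    "TATGCCG", "CGTATGC", "CTCGTAT", "CTCCGCG", "GCGAAAT",
    "TCTCGTA", "ACTCCGC", "ATGCCGT", "TGCCGTC", "GCCGTCT",
]


def locate_kmers(kmers, reference):
    """Map each k-mer to its 1-based start in reference, or None if absent."""
    positions = {}
    for kmer in kmers:
        index = reference.find(kmer)  # str.find returns -1 when not found
        positions[kmer] = index + 1 if index >= 0 else None
    return positions


positions = locate_kmers(KMERS, ADAPTER_HIT)
unmatched = [kmer for kmer, pos in positions.items() if pos is None]

for kmer, pos in positions.items():
    print(kmer, pos if pos is not None else "not within the 50-base sequence")
```

The k-mers that are found start at positions consistent with the Max Obs/Exp Position column (for example, ATCGGAA at 1 and AGAGCAC at 7). The ones that are not found are all reported at positions 45 and beyond, so they presumably cover adapter bases past the end of the 50-base overrepresented sequence.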