Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00004962562 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 397938954 |
Sequences flagged as poor quality | 0 |
Sequence length | 151 |
%GC | 40 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
ATCGGAAGAGCACACGTCTGAACTCCAGTCACGAATTCGTATCTCGTATG | 3551690 | 0.8925213187347323 | TruSeq Adapter, Index 7 (97% over 34bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
ATCGGAA | 589310 | 0.0 | 99.34981 | 1 |
TCGGAAG | 600510 | 0.0 | 97.63241 | 2 |
CGGAAGA | 623750 | 0.0 | 93.929596 | 3 |
AGAGCAC | 812965 | 0.0 | 71.58155 | 7 |
AGCACAC | 833635 | 0.0 | 69.737526 | 9 |
GAGCACA | 845370 | 0.0 | 68.82174 | 8 |
GAAGAGC | 883600 | 0.0 | 66.18498 | 5 |
AAGAGCA | 1095360 | 0.0 | 53.423553 | 6 |
GGAAGAG | 1184535 | 0.0 | 49.877293 | 4 |
TATGCCG | 477390 | 0.0 | 24.702564 | 45-49 |
CTCGTAT | 469075 | 0.0 | 24.378565 | 40-44 |
TCTCGTA | 474445 | 0.0 | 24.205624 | 40-44 |
CGTATGC | 485705 | 0.0 | 23.996378 | 45-49 |
TCGTATC | 494880 | 0.0 | 23.77351 | 35-39 |
TATCTCG | 483750 | 0.0 | 23.51553 | 40-44 |
TCGTATG | 485790 | 0.0 | 23.166367 | 40-44 |
GTCACGA | 512360 | 0.0 | 23.149237 | 25-29 |
ATGCCGT | 508990 | 0.0 | 23.124783 | 45-49 |
ATCTCGT | 498845 | 0.0 | 23.078623 | 40-44 |
TGCCGTC | 508695 | 0.0 | 22.702393 | 45-49 |