Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00004973620 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 431894811 |
Sequences flagged as poor quality | 0 |
Sequence length | 151 |
%GC | 41 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
ATCGGAAGAGCACACGTCTGAACTCCAGTCACGAGATTCCATCTCGTATG | 684974 | 0.15859741366515284 | TruSeq Adapter, Index 7 (97% over 37bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
ATCGGAA | 347215 | 0.0 | 39.954666 | 1 |
TCGGAAG | 356575 | 0.0 | 38.774822 | 2 |
CGGAAGA | 371495 | 0.0 | 37.21331 | 3 |
AGAGCAC | 596910 | 0.0 | 23.48078 | 7 |
AGCACAC | 621655 | 0.0 | 22.64174 | 9 |
GAGCACA | 638255 | 0.0 | 22.01993 | 8 |
GAAGAGC | 655555 | 0.0 | 21.512897 | 5 |
TATGCCG | 171060 | 0.0 | 16.218851 | 45-49 |
AAGAGCA | 889345 | 0.0 | 16.0948 | 6 |
CGTATGC | 187330 | 0.0 | 14.706507 | 45-49 |
GGAAGAG | 1000685 | 0.0 | 14.481493 | 4 |
ATGCCGT | 197480 | 0.0 | 14.062214 | 45-49 |
TGCCGTC | 199620 | 0.0 | 13.818502 | 45-49 |
GCCGTCT | 202505 | 0.0 | 13.573137 | 50-54 |
CTCGTAT | 165225 | 0.0 | 13.468883 | 40-44 |
TCTCGTA | 171900 | 0.0 | 12.929852 | 40-44 |
CGTCTTC | 228150 | 0.0 | 12.317527 | 50-54 |
TCGTATG | 180095 | 0.0 | 12.291584 | 40-44 |
GTCACGA | 231340 | 0.0 | 12.119545 | 25-29 |
CCGTCTT | 232995 | 0.0 | 12.054545 | 50-54 |