Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00004973546 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 438316898 |
Sequences flagged as poor quality | 0 |
Sequence length | 151 |
%GC | 40 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
ATCGGAAGAGCACACGTCTGAACTCCAGTCACTCCGCGAAATCTCGTATG | 2506783 | 0.571911101634051 | TruSeq Adapter, Index 6 (97% over 35bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
ATCGGAA | 559825 | 0.0 | 85.08241 | 1 |
TCGGAAG | 573675 | 0.0 | 82.74614 | 2 |
CGGAAGA | 588150 | 0.0 | 80.46419 | 3 |
AGAGCAC | 817970 | 0.0 | 57.78799 | 7 |
AGCACAC | 839575 | 0.0 | 56.282234 | 9 |
GAGCACA | 854475 | 0.0 | 55.339523 | 8 |
GAAGAGC | 876470 | 0.0 | 54.103752 | 5 |
AAGAGCA | 1117995 | 0.0 | 42.563946 | 6 |
GGAAGAG | 1222285 | 0.0 | 39.12211 | 4 |
TATGCCG | 392095 | 0.0 | 24.31425 | 45-49 |
CGCGAAA | 392630 | 0.0 | 23.916183 | 35-39 |
TCCGCGA | 397835 | 0.0 | 23.895641 | 30-34 |
CCGCGAA | 400580 | 0.0 | 23.498457 | 30-34 |
CGTATGC | 407205 | 0.0 | 23.081991 | 45-49 |
CTCGTAT | 353275 | 0.0 | 22.877747 | 40-44 |
ATGCCGT | 417595 | 0.0 | 22.640665 | 45-49 |
GCGAAAT | 420465 | 0.0 | 22.432207 | 35-39 |
TCTCGTA | 358715 | 0.0 | 22.418842 | 40-44 |
TGCCGTC | 418470 | 0.0 | 22.313047 | 45-49 |
GCCGTCT | 417885 | 0.0 | 22.137533 | 50-54 |