Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00003287448 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 417290818 |
Sequences flagged as poor quality | 0 |
Sequence length | 151 |
%GC | 40 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
ATCGGAAGAGCACACGTCTGAACTCCAGTCACCCGTCCCGATCTCGTATG | 1442864 | 0.3457694101479127 | TruSeq Adapter, Index 16 (97% over 39bp) |
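
The Percentage column above can be sanity-checked against the Total Sequences value from Basic Statistics. The sketch below assumes Percentage is simply 100 × Count / Total Sequences; the variable names are illustrative and not part of the report.

```python
# Values copied from the report tables.
total_sequences = 417290818   # "Total Sequences" in Basic Statistics
adapter_count = 1442864       # "Count" for the overrepresented sequence

# Assumption: Percentage = 100 * Count / Total Sequences.
percentage = 100 * adapter_count / total_sequences

# Print with a few decimals for comparison with the reported value.
print(f"{percentage:.4f}%")
```

Under that assumption, the computed value should agree with the reported 0.3457694101479127 to within rounding.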
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
ATCGGAA | 320585 | 0.0 | 83.61872 | 1 |
TCGGAAG | 332830 | 0.0 | 80.34865 | 2 |
CGGAAGA | 351330 | 0.0 | 76.09538 | 3 |
AGAGCAC | 582840 | 0.0 | 45.922802 | 7 |
AGCACAC | 606210 | 0.0 | 44.01638 | 9 |
GAGCACA | 621845 | 0.0 | 43.018055 | 8 |
GAAGAGC | 644495 | 0.0 | 41.757847 | 5 |
AAGAGCA | 883855 | 0.0 | 30.664087 | 6 |
GGAAGAG | 991120 | 0.0 | 27.532848 | 4 |
TATGCCG | 230450 | 0.0 | 23.70881 | 45-49 |
CCCGATC | 240585 | 0.0 | 22.525732 | 35-39 |
CGTCCCG | 237095 | 0.0 | 22.15629 | 30-34 |
CGTATGC | 245620 | 0.0 | 22.020792 | 45-49 |
CTCGTAT | 220105 | 0.0 | 22.01101 | 40-44 |
TCCCGAT | 246095 | 0.0 | 21.947153 | 35-39 |
GTCCCGA | 247895 | 0.0 | 21.655025 | 35-39 |
CCGATCT | 221590 | 0.0 | 21.63399 | 35-39 |
TCTCGTA | 224090 | 0.0 | 21.556824 | 40-44 |
ATGCCGT | 255325 | 0.0 | 21.297909 | 45-49 |
GCCGTCT | 257220 | 0.0 | 20.78266 | 50-54 |