Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005002698 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 398115887 |
Sequences flagged as poor quality | 0 |
Sequence length | 151 |
%GC | 41 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
ATCGGAAGAGCGTCGTGTAGGGAAAGAGTGTAGGCTATAGTGTAGATCTC | 562486 | 0.14128700169154515 | Illumina Single End PCR Primer 1 (96% over 33bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
AGCGTCG | 261420 | 0.0 | 35.07937 | 9 |
GAGCGTC | 290405 | 0.0 | 31.575226 | 8 |
ATCGGAA | 318010 | 0.0 | 30.1732 | 1 |
TCGGAAG | 332395 | 0.0 | 28.542437 | 2 |
AGAGCGT | 327175 | 0.0 | 28.181124 | 7 |
CGGAAGA | 353425 | 0.0 | 26.709747 | 3 |
AAGAGCG | 380205 | 0.0 | 24.714376 | 6 |
GAAGAGC | 632245 | 0.0 | 15.276539 | 5 |
CGCCGTA | 169375 | 0.0 | 11.0279 | 55-59 |
TCGCCGT | 175735 | 0.0 | 10.579293 | 55-59 |
GGAAGAG | 973920 | 0.0 | 10.266424 | 4 |
GCCGTAT | 186495 | 0.0 | 10.105727 | 55-59 |
GTCGCCG | 197970 | 0.0 | 9.762359 | 55-59 |
TGGTCGC | 209335 | 0.0 | 9.318985 | 50-54 |
CCGTATC | 193800 | 0.0 | 9.285694 | 55-59 |
CGTATCA | 202480 | 0.0 | 9.239201 | 60-64 |
GGTCGCC | 216610 | 0.0 | 8.763046 | 50-54 |
GTGGTCG | 226690 | 0.0 | 8.702748 | 50-54 |
GCGTCGT | 247490 | 0.0 | 7.6823974 | 10-14 |
CGTCGTG | 251365 | 0.0 | 7.596841 | 10-14 |