Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005316404 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 67394 |
Sequences flagged as poor quality | 0 |
Sequence length | 35-251 |
%GC | 44 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN | 230 | 0.34127667151378466 | No Hit |
ATCCTATATATCTTAATGGCACATGCAGCGCAAGTAGGTCTACAAGACGC | 77 | 0.11425349437635397 | No Hit |
ATCTTAGGGGCCATCAATTTCATCACAACAATTATCAATATAAAACCCCC | 77 | 0.11425349437635397 | No Hit |
ATTCTATACCAACACCTATTCTGATTTTTCGGTCACCCTGAAGTTTATAT | 72 | 0.10683443629996736 | No Hit |
TGCTAATACAATGCCAGTCAGGCCACCTACGGTGAAAAGAAAGATGAATC | 68 | 0.10089918983885807 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CCAGTTG | 115 | 8.080204E-6 | 110.3163 | 245 |
TATTCTG | 90 | 0.0018471903 | 73.1597 | 230-234 |
TACCAAC | 85 | 0.0019432992 | 72.24406 | 220-224 |
TTTCGGT | 95 | 0.0019465553 | 72.19135 | 240-244 |
GCCCGTC | 35 | 0.0031050537 | 64.29741 | 235-239 |
GTCCATA | 50 | 0.007405004 | 51.6152 | 1 |
TTCGGTC | 70 | 0.009020061 | 49.084187 | 240-244 |
GTGTTCC | 80 | 8.726141E-4 | 43.01267 | 9 |
ATATCTT | 80 | 8.726141E-4 | 43.01267 | 5 |
CCATACA | 125 | 3.8573708E-6 | 41.29216 | 2 |
CTGCCGG | 80 | 0.0025181137 | 39.644917 | 245 |
ACGGTCG | 80 | 0.007742501 | 39.644917 | 245 |
ACTAAGT | 90 | 0.0015574103 | 38.233482 | 8 |
TAACACT | 120 | 1.4720524E-4 | 35.843887 | 4 |
TAGGGGC | 120 | 1.4720524E-4 | 35.843887 | 5 |
GTGCTCA | 160 | 2.0961143E-5 | 32.259502 | 9 |
CATACAC | 270 | 8.62201E-10 | 31.861237 | 3 |
ATATATC | 110 | 0.004166717 | 31.28194 | 3 |
GGGGCCA | 110 | 0.004166717 | 31.28194 | 7 |
CGGCGGC | 80 | 2.6660267E-5 | 31.17915 | 220-224 |