Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00003249824 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 405339183 |
Sequences flagged as poor quality | 0 |
Sequence length | 151 |
%GC | 40 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
ATCGGAAGAGCACACGTCTGAACTCCAGTCACGTGGCCTTATCTCGTATG | 412201 | 0.10169285805265955 | TruSeq Adapter, Index 20 (97% over 43bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
ATCGGAA | 164470 | 0.0 | 44.451733 | 1 |
TCGGAAG | 176540 | 0.0 | 41.398552 | 2 |
CGGAAGA | 195265 | 0.0 | 37.399128 | 3 |
AGAGCAC | 422670 | 0.0 | 17.862455 | 7 |
TATGCCG | 88195 | 0.0 | 17.378872 | 45-49 |
AGCACAC | 447085 | 0.0 | 17.02478 | 9 |
GAGCACA | 458150 | 0.0 | 16.569317 | 8 |
GAAGAGC | 481190 | 0.0 | 15.8105955 | 5 |
CGTATGC | 100155 | 0.0 | 15.261591 | 45-49 |
CTCGTAT | 94200 | 0.0 | 14.811188 | 40-44 |
TCTCGTA | 98760 | 0.0 | 14.17283 | 40-44 |
ATGCCGT | 112720 | 0.0 | 13.690286 | 45-49 |
TATCTCG | 104660 | 0.0 | 13.4431305 | 40-44 |
TGCCGTC | 109685 | 0.0 | 13.422686 | 45-49 |
GCCGTCT | 114005 | 0.0 | 13.407147 | 50-54 |
TCGTATG | 107825 | 0.0 | 12.563111 | 40-44 |
ATCTCGT | 119615 | 0.0 | 11.935721 | 40-44 |
GTCACGT | 136940 | 0.0 | 11.467594 | 25-29 |
AGTCACG | 137500 | 0.0 | 11.373444 | 25-29 |
CGTCTTC | 140080 | 0.0 | 11.123677 | 50-54 |