Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005002401 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 400678786 |
Sequences flagged as poor quality | 0 |
Sequence length | 151 |
%GC | 40 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
ATCGGAAGAGCACACGTCTGAACTCCAGTCACGAATTCGTATCTCGTATG | 1278917 | 0.3191875998146805 | TruSeq Adapter, Index 7 (97% over 34bp) |
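The Percentage value above appears to be the sequence's Count as a share of Total Sequences from Basic Statistics. A minimal Python check of that arithmetic, assuming that definition (the variable and function names are illustrative and not part of the report):

```python
# Values copied from the report; the names below are illustrative only.
total_sequences = 400_678_786  # "Total Sequences" in Basic Statistics
adapter_count = 1_278_917      # "Count" for the overrepresented sequence


def percent_of_total(count: int, total: int) -> float:
    """Return count as a percentage of total."""
    return 100.0 * count / total


pct = percent_of_total(adapter_count, total_sequences)
print(f"{pct:.4f}%")  # prints 0.3192%
```

Under this assumption the result agrees with the reported 0.3191875998146805 to within floating-point rounding.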
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
ATCGGAA | 252615 | 0.0 | 84.60901 | 1 |
TCGGAAG | 267585 | 0.0 | 79.777405 | 2 |
CGGAAGA | 286345 | 0.0 | 74.62347 | 3 |
AGAGCAC | 524930 | 0.0 | 41.06578 | 7 |
AGCACAC | 557720 | 0.0 | 38.627033 | 9 |
GAGCACA | 568010 | 0.0 | 37.967796 | 8 |
GAAGAGC | 570170 | 0.0 | 37.920467 | 5 |
AAGAGCA | 796290 | 0.0 | 27.369022 | 6 |
GGAAGAG | 887220 | 0.0 | 24.670195 | 4 |
TATGCCG | 211070 | 0.0 | 20.922844 | 45-49 |
CTCGTAT | 202225 | 0.0 | 20.223171 | 40-44 |
TCTCGTA | 205320 | 0.0 | 19.898556 | 40-44 |
TCGTATC | 225190 | 0.0 | 19.596273 | 35-39 |
CGTATGC | 223985 | 0.0 | 19.459448 | 45-49 |
TATCTCG | 210430 | 0.0 | 19.17971 | 40-44 |
ATGCCGT | 230990 | 0.0 | 19.08336 | 45-49 |
TGCCGTC | 232890 | 0.0 | 18.320675 | 45-49 |
GTCACGA | 242090 | 0.0 | 18.30395 | 25-29 |
GCCGTCT | 236595 | 0.0 | 18.261112 | 50-54 |
TCGTATG | 217545 | 0.0 | 18.232515 | 40-44 |
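Many of the enriched k-mers above look like overlapping 7-base windows of the overrepresented TruSeq adapter sequence. The sketch below locates a sample of them in that 50-base sequence and compares the start with the reported position or bin. It assumes that Max Obs/Exp Position is the 1-based start of the k-mer; the helper name and the selection of k-mers are illustrative.

```python
from typing import Optional

# The 50-base overrepresented sequence from the report.
ADAPTER_READ = "ATCGGAAGAGCACACGTCTGAACTCCAGTCACGAATTCGTATCTCGTATG"


def kmer_start(kmer: str, sequence: str) -> Optional[int]:
    """Return the 1-based start of the first occurrence of kmer, or None."""
    idx = sequence.find(kmer)
    return idx + 1 if idx != -1 else None


# A sample of k-mers with their reported position bins (low, high).
reported_bins = {
    "ATCGGAA": (1, 1),
    "GGAAGAG": (4, 4),
    "AGCACAC": (9, 9),
    "GTCACGA": (25, 29),
    "TCGTATC": (35, 39),
    "TCGTATG": (40, 44),
    "TATGCCG": (45, 49),
}

for kmer, (lo, hi) in reported_bins.items():
    start = kmer_start(kmer, ADAPTER_READ)
    if start is None:
        # The k-mer is not contained in the 50 bases listed in the report.
        print(f"{kmer}: not found within the 50-base sequence")
    else:
        where = "inside" if lo <= start <= hi else "outside"
        print(f"{kmer}: starts at {start}, {where} reported bin {lo}-{hi}")
```

K-mers such as TATGCCG are not found by this simple search because they would extend past the 50 bases listed in the Overrepresented sequences table.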