Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00004962572 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 457389295 |
Sequences flagged as poor quality | 0 |
Sequence length | 151 |
%GC | 40 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
ATCGGAAGAGCACACGTCTGAACTCCAGTCACGAGATTCCATCTCGTATG | 666829 | 0.1457902507316005 | TruSeq Adapter, Index 7 (97% over 37bp) |
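
The Percentage column appears to be the sequence count as a share of Total Sequences from Basic Statistics. A minimal sketch that reproduces the value above under that assumption (the variable names are illustrative, not part of the report):

```python
# Values copied from the report tables above.
total_sequences = 457389295   # "Total Sequences" in Basic Statistics
adapter_count = 666829        # "Count" for the overrepresented sequence

# Assumption: Percentage = Count / Total Sequences * 100.
percentage = adapter_count / total_sequences * 100

print(f"{percentage:.4f}% of reads carry the overrepresented sequence")
```

At roughly 0.15% of reads, the adapter-derived sequence is a small fraction of the library, though still enough to be reported.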
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
ATCGGAA | 261875 | 0.0 | 41.943718 | 1 |
TCGGAAG | 272505 | 0.0 | 40.37955 | 2 |
CGGAAGA | 295700 | 0.0 | 37.217846 | 3 |
AGAGCAC | 534245 | 0.0 | 21.058094 | 7 |
AGCACAC | 561585 | 0.0 | 19.973705 | 9 |
GAGCACA | 576470 | 0.0 | 19.513157 | 8 |
GAAGAGC | 604425 | 0.0 | 18.657406 | 5 |
TATGCCG | 130285 | 0.0 | 17.380545 | 45-49 |
CGTATGC | 143610 | 0.0 | 15.639654 | 45-49 |
CTCGTAT | 139090 | 0.0 | 15.544317 | 40-44 |
TCTCGTA | 145170 | 0.0 | 14.89529 | 40-44 |
ATGCCGT | 159450 | 0.0 | 14.347867 | 45-49 |
TGCCGTC | 160595 | 0.0 | 13.965696 | 45-49 |
TCGTATG | 154430 | 0.0 | 13.854731 | 40-44 |
GCCGTCT | 165045 | 0.0 | 13.692758 | 50-54 |
AAGAGCA | 857870 | 0.0 | 13.425058 | 6 |
GTCACGA | 170810 | 0.0 | 13.272255 | 25-29 |
ATCTCGT | 169985 | 0.0 | 12.828295 | 40-44 |
ACGAGAT | 184840 | 0.0 | 12.366893 | 30-34 |
GGAAGAG | 963415 | 0.0 | 12.099527 | 4 |
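
Many of the k-mers above are substrings of the overrepresented sequence, and their reported positions line up with their offsets inside it. The sketch below checks this for the rows in the table; the helper `kmer_position` is a hypothetical name introduced here for illustration, and the comparison assumes the overrepresented sequence starts at read position 1.

```python
# Overrepresented sequence copied from the report.
OVERREPRESENTED = "ATCGGAAGAGCACACGTCTGAACTCCAGTCACGAGATTCCATCTCGTATG"

def kmer_position(sequence, kmer):
    """Return the 1-based position of kmer in sequence, or None if absent."""
    index = sequence.find(kmer)
    return index + 1 if index >= 0 else None

# K-mers from the Kmer Content table, in table order.
kmers = [
    "ATCGGAA", "TCGGAAG", "CGGAAGA", "AGAGCAC", "AGCACAC",
    "GAGCACA", "GAAGAGC", "TATGCCG", "CGTATGC", "CTCGTAT",
    "TCTCGTA", "ATGCCGT", "TGCCGTC", "TCGTATG", "GCCGTCT",
    "AAGAGCA", "GTCACGA", "ATCTCGT", "ACGAGAT", "GGAAGAG",
]

# Map each k-mer to its position in the overrepresented sequence (or None).
positions = {kmer: kmer_position(OVERREPRESENTED, kmer) for kmer in kmers}

for kmer, pos in positions.items():
    where = f"position {pos}" if pos is not None else "not within the 50 bp shown"
    print(f"{kmer}: {where}")
```

K-mers that are not found (for example `TATGCCG`) would start at or beyond the end of the 50-base sequence shown in the Overrepresented sequences table, which is consistent with their reported positions of 45 and later.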