Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00002855899 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 362707184 |
Sequences flagged as poor quality | 0 |
Sequence length | 151 |
%GC | 40 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
ATCGGAAGAGCACACGTCTGAACTCCAGTCACTCTCGCGCATCTCGTATG | 425860 | 0.11741151506941203 | TruSeq Adapter, Index 8 (97% over 35bp) |
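The Percentage value appears to be the sequence's Count divided by Total Sequences from Basic Statistics, times 100. A minimal Python check under that assumption (the variable names are illustrative and not part of the FastQC output):

```python
# Values copied from the Basic Statistics and Overrepresented sequences tables.
total_sequences = 362707184
adapter_count = 425860

# Assumption: Percentage = Count / Total Sequences * 100.
percentage = adapter_count / total_sequences * 100
print(f"{percentage:.4f}%")
```

Under this assumption the computed value agrees with the reported 0.1174...%, which is roughly one read in 850.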
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
ATCGGAA | 143240 | 0.0 | 56.30609 | 1 |
TCGGAAG | 153560 | 0.0 | 52.396336 | 2 |
CGGAAGA | 167810 | 0.0 | 47.93194 | 3 |
AGAGCAC | 384935 | 0.0 | 21.388405 | 7 |
AGCACAC | 411560 | 0.0 | 20.055841 | 9 |
GAGCACA | 421405 | 0.0 | 19.626842 | 8 |
CGCGCAT | 80475 | 0.0 | 19.49048 | 35-39 |
GAAGAGC | 427000 | 0.0 | 19.376484 | 5 |
TCGCGCA | 83725 | 0.0 | 18.901869 | 35-39 |
TCTCGCG | 87740 | 0.0 | 18.402126 | 30-34 |
TATGCCG | 93945 | 0.0 | 17.19743 | 45-49 |
GCGCATC | 94580 | 0.0 | 16.757017 | 35-39 |
CTCGCGC | 99135 | 0.0 | 16.20355 | 30-34 |
CGTATGC | 101625 | 0.0 | 15.856417 | 45-49 |
CTCGTAT | 92880 | 0.0 | 15.354561 | 40-44 |
TCTCGTA | 94995 | 0.0 | 14.841773 | 40-44 |
ATGCCGT | 113815 | 0.0 | 14.211637 | 45-49 |
TGCCGTC | 113335 | 0.0 | 14.120883 | 45-49 |
ACTCTCG | 118260 | 0.0 | 13.807455 | 30-34 |
GCCGTCT | 115615 | 0.0 | 13.774826 | 50-54 |
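Several of the enriched k-mers look like fragments of the overrepresented TruSeq adapter sequence listed above. The sketch below checks this for the rows that report a single position, assuming that Max Obs/Exp Position is a 1-based offset within the read; the dictionary and variable names are illustrative only.

```python
# Overrepresented sequence copied from the table above.
adapter = "ATCGGAAGAGCACACGTCTGAACTCCAGTCACTCTCGCGCATCTCGTATG"

# K-mers with a single reported Max Obs/Exp Position (binned rows omitted).
reported = {
    "ATCGGAA": 1,
    "TCGGAAG": 2,
    "CGGAAGA": 3,
    "GAAGAGC": 5,
    "AGAGCAC": 7,
    "GAGCACA": 8,
    "AGCACAC": 9,
}

# str.find returns a 0-based index (or -1), so add 1 for a 1-based offset.
offsets = {kmer: adapter.find(kmer) + 1 for kmer in reported}

for kmer, position in reported.items():
    print(kmer, "reported:", position, "offset in adapter:", offsets[kmer])

all_match = offsets == reported
print("all offsets match:", all_match)
```

For these seven rows the offset within the adapter equals the reported read position, which would be consistent with a small fraction of reads beginning directly with adapter sequence. This is a sanity check on the table, not a statement of how FastQC computes the Kmer Content module.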