Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00002855285 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 361574948 |
Sequences flagged as poor quality | 0 |
Sequence length | 151 |
%GC | 40 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
ATCGGAAGAGCACACGTCTGAACTCCAGTCACAGCGATAGATCTCGTATG | 561998 | 0.1554305692660986 | TruSeq Adapter, Index 1 (97% over 35bp) |
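The Percentage column above appears to be the Count divided by Total Sequences from Basic Statistics, times 100. A minimal sketch that checks this against the reported row; the function name `percent_of_total` is hypothetical and not part of any tool's API:

```python
# Values copied from the tables in this report.
TOTAL_SEQUENCES = 361574948   # "Total Sequences" in Basic Statistics
ADAPTER_COUNT = 561998        # Count for the TruSeq Adapter, Index 1 hit


def percent_of_total(count: int, total: int) -> float:
    """Return count as a percentage of total (assumed definition of the column)."""
    if total <= 0:
        raise ValueError("total must be positive")
    return count / total * 100


pct = percent_of_total(ADAPTER_COUNT, TOTAL_SEQUENCES)
print(f"{pct:.4f}%")  # prints "0.1554%"
```

Under this assumption the result agrees with the reported 0.1554305692660986, so roughly 1 read in 640 begins with this adapter-derived sequence.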
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
ATCGGAA | 141690 | 0.0 | 77.461266 | 1 |
TCGGAAG | 150020 | 0.0 | 73.05525 | 2 |
CGGAAGA | 164650 | 0.0 | 66.62137 | 3 |
AGAGCAC | 389050 | 0.0 | 28.604946 | 7 |
AGCACAC | 423310 | 0.0 | 26.402058 | 9 |
GAGCACA | 429215 | 0.0 | 26.084457 | 8 |
GAAGAGC | 434210 | 0.0 | 25.792366 | 5 |
TATGCCG | 113445 | 0.0 | 19.439491 | 45-49 |
CGTATGC | 121040 | 0.0 | 18.099918 | 45-49 |
AAGAGCA | 645570 | 0.0 | 17.574574 | 6 |
CGATAGA | 125820 | 0.0 | 17.30137 | 35-39 |
AGCGATA | 127560 | 0.0 | 17.2621 | 30-34 |
GCGATAG | 127355 | 0.0 | 17.155556 | 30-34 |
CTCGTAT | 105225 | 0.0 | 17.066061 | 40-44 |
TCTCGTA | 106105 | 0.0 | 16.888994 | 40-44 |
TGCCGTC | 130625 | 0.0 | 16.718502 | 45-49 |
ATGCCGT | 132500 | 0.0 | 16.615421 | 45-49 |
GCCGTCT | 132220 | 0.0 | 16.359224 | 50-54 |
GGAAGAG | 709565 | 0.0 | 16.13697 | 4 |
TCGTATG | 111500 | 0.0 | 15.967786 | 40-44 |
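Many of the k-mers above look like substrings of the 50-base overrepresented sequence listed earlier. The sketch below tests that reading: it finds each k-mer in that sequence and compares its 1-based start with the reported Max Obs/Exp Position. All names (`locate`, `parse_bin`, `KMER_POSITIONS`) are hypothetical, and the data is copied from the tables in this report.

```python
from typing import Optional, Tuple

# Overrepresented sequence from this report (50 bases).
OVERREPRESENTED = "ATCGGAAGAGCACACGTCTGAACTCCAGTCACAGCGATAGATCTCGTATG"

# k-mer -> "Max Obs/Exp Position" as reported (single base or a binned range).
KMER_POSITIONS = {
    "ATCGGAA": "1", "TCGGAAG": "2", "CGGAAGA": "3", "AGAGCAC": "7",
    "AGCACAC": "9", "GAGCACA": "8", "GAAGAGC": "5", "TATGCCG": "45-49",
    "CGTATGC": "45-49", "AAGAGCA": "6", "CGATAGA": "35-39",
    "AGCGATA": "30-34", "GCGATAG": "30-34", "CTCGTAT": "40-44",
    "TCTCGTA": "40-44", "TGCCGTC": "45-49", "ATGCCGT": "45-49",
    "GCCGTCT": "50-54", "GGAAGAG": "4", "TCGTATG": "40-44",
}


def parse_bin(position: str) -> Tuple[int, int]:
    """Turn "7" into (7, 7) and "45-49" into (45, 49)."""
    parts = position.split("-")
    return int(parts[0]), int(parts[-1])


def locate(kmer: str, sequence: str) -> Optional[int]:
    """Return the 1-based start of the first occurrence of kmer, or None."""
    index = sequence.find(kmer)
    return index + 1 if index >= 0 else None


# k-mers found in the sequence whose start falls inside the reported bin.
consistent = []
# k-mers that do not occur in the 50-base sequence at all.
not_found = []
for kmer, reported in KMER_POSITIONS.items():
    start = locate(kmer, OVERREPRESENTED)
    low, high = parse_bin(reported)
    if start is None:
        not_found.append(kmer)
    elif low <= start <= high:
        consistent.append(kmer)

print(len(consistent), "k-mers match at the reported position")
print("not found in the 50-base sequence:", sorted(not_found))
```

With the data above, 15 of the 20 k-mers occur in the overrepresented sequence at a start position inside their reported bin. The remaining five are all reported at positions 45 and later, which would fit k-mers that run past the end of the 50 listed bases, though the report itself does not confirm that.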