Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00004973596 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 441487885 |
Sequences flagged as poor quality | 0 |
Sequence length | 151 |
%GC | 40 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
ATCGGAAGAGCACACGTCTGAACTCCAGTCACGAGATTCCATCTCGTATG | 578061 | 0.13093473674821224 | TruSeq Adapter, Index 7 (97% over 37bp) |
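
The Percentage value appears to be the Count expressed as a share of Total Sequences from Basic Statistics. A minimal Python check, assuming that definition (the variable names are illustrative and not taken from FastQC):

```python
# Values copied from the tables in this report.
total_sequences = 441487885   # Basic Statistics: Total Sequences
adapter_count = 578061        # Overrepresented sequences: Count

# Assumption: Percentage = Count / Total Sequences * 100.
percentage = adapter_count / total_sequences * 100
print(f"{percentage:.6f}")
```

Under that assumption the result agrees with the reported Percentage to within floating-point rounding.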
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
ATCGGAA | 266475 | 0.0 | 38.706383 | 1 |
TCGGAAG | 270930 | 0.0 | 37.888763 | 2 |
CGGAAGA | 292700 | 0.0 | 35.13463 | 3 |
AGAGCAC | 518150 | 0.0 | 20.215122 | 7 |
AGCACAC | 536340 | 0.0 | 19.49865 | 9 |
GAGCACA | 552130 | 0.0 | 18.994488 | 8 |
GAAGAGC | 587750 | 0.0 | 18.01486 | 5 |
TATGCCG | 137375 | 0.0 | 15.146596 | 45-49 |
CGTATGC | 152015 | 0.0 | 13.625354 | 45-49 |
AAGAGCA | 822080 | 0.0 | 13.133813 | 6 |
CTCGTAT | 146160 | 0.0 | 12.883405 | 40-44 |
TCTCGTA | 150790 | 0.0 | 12.48209 | 40-44 |
ATGCCGT | 171630 | 0.0 | 12.216692 | 45-49 |
GGAAGAG | 913965 | 0.0 | 11.921531 | 4 |
TGCCGTC | 172830 | 0.0 | 11.913531 | 45-49 |
GCCGTCT | 173350 | 0.0 | 11.88493 | 50-54 |
TCGTATG | 159190 | 0.0 | 11.673256 | 40-44 |
GTCACGA | 186310 | 0.0 | 11.354133 | 25-29 |
ACGAGAT | 197060 | 0.0 | 10.819198 | 30-34 |
ATCTCGT | 176510 | 0.0 | 10.765271 | 40-44 |
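
One way to relate the Kmer Content rows to the overrepresented sequence is to look up where each 7-mer occurs in that 50-base sequence. The sketch below does this for a handful of rows; `offset_in` is a hypothetical helper, and comparing offsets to the reported positions is an assumption about how the two tables relate, not something the report states.

```python
# Overrepresented sequence copied from this report (50 bases).
adapter_hit = "ATCGGAAGAGCACACGTCTGAACTCCAGTCACGAGATTCCATCTCGTATG"

# A few 7-mers from the Kmer Content table with their reported
# "Max Obs/Exp Position" values (kept as strings because some are ranges).
kmers = {
    "ATCGGAA": "1",
    "GGAAGAG": "4",
    "AGAGCAC": "7",
    "GTCACGA": "25-29",
    "ACGAGAT": "30-34",
    "TATGCCG": "45-49",
}

def offset_in(seq, kmer):
    """Return the 1-based start of the first occurrence of kmer in seq, or None."""
    idx = seq.find(kmer)
    return idx + 1 if idx >= 0 else None

offsets = {kmer: offset_in(adapter_hit, kmer) for kmer in kmers}
for kmer, reported in kmers.items():
    print(kmer, "reported position:", reported, "offset in sequence:", offsets[kmer])
```

For the sampled rows that are found, the offset equals the reported position or falls inside the reported range. TATGCCG is not found in the 50 bases listed, so it presumably extends past the end of the sequence as shown.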