Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00004963942 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 456322108 |
Sequences flagged as poor quality | 0 |
Sequence length | 151 |
%GC | 41 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
ATCGGAAGAGCACACGTCTGAACTCCAGTCACATTACTCGATCTCGTATG | 550843 | 0.12071363415072583 | TruSeq Adapter, Index 27 (97% over 38bp) |
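
The Percentage column appears to be Count divided by Total Sequences (from Basic Statistics), times 100. The following is a minimal sketch that checks this reading against the figures in the report; the variable names are illustrative and not part of FastQC.

```python
# Values copied from the report tables; the names are illustrative only.
total_sequences = 456322108   # "Total Sequences" in Basic Statistics
adapter_count = 550843        # "Count" for the overrepresented sequence

# Assumption: Percentage = Count / Total Sequences * 100
percentage = adapter_count / total_sequences * 100

print(f"{percentage:.4f}%")
```

Under that assumption the result agrees with the reported value of roughly 0.12%, i.e. about one read in 830 begins with this adapter sequence.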
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
ATCGGAA | 197090 | 0.0 | 44.84085 | 1 |
TCGGAAG | 207550 | 0.0 | 42.652206 | 2 |
CGGAAGA | 234645 | 0.0 | 37.829487 | 3 |
AGAGCAC | 480670 | 0.0 | 18.93364 | 7 |
AGCACAC | 513335 | 0.0 | 17.848856 | 9 |
GAGCACA | 527965 | 0.0 | 17.351515 | 8 |
GAAGAGC | 549125 | 0.0 | 16.71608 | 5 |
TATGCCG | 122670 | 0.0 | 15.011022 | 45-49 |
CTCGTAT | 127405 | 0.0 | 14.174368 | 40-44 |
CGTATGC | 132670 | 0.0 | 13.739687 | 45-49 |
TCTCGTA | 133690 | 0.0 | 13.4689665 | 40-44 |
TTACTCG | 138145 | 0.0 | 13.361233 | 30-34 |
TCGTATG | 142365 | 0.0 | 12.793863 | 40-44 |
TACTCGA | 144075 | 0.0 | 12.713577 | 35-39 |
AAGAGCA | 793245 | 0.0 | 11.847548 | 6 |
ATGCCGT | 161460 | 0.0 | 11.61122 | 45-49 |
TGCCGTC | 166445 | 0.0 | 11.320084 | 45-49 |
ATCTCGT | 162415 | 0.0 | 11.22697 | 40-44 |
GCCGTCT | 172580 | 0.0 | 10.804911 | 50-54 |
GGAAGAG | 889515 | 0.0 | 10.726892 | 4 |
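
Many of the k-mers listed above look like fragments of the overrepresented adapter sequence. The sketch below, using only strings copied from the two tables, checks which 7-mers occur in that 50-base sequence and at which 1-based offset. It is a plain substring search for illustration, not a reproduction of how FastQC computes its Kmer Content module.

```python
# Sequences copied from the report tables above.
overrepresented = "ATCGGAAGAGCACACGTCTGAACTCCAGTCACATTACTCGATCTCGTATG"
kmers = [
    "ATCGGAA", "TCGGAAG", "CGGAAGA", "AGAGCAC", "AGCACAC",
    "GAGCACA", "GAAGAGC", "TATGCCG", "CTCGTAT", "CGTATGC",
    "TCTCGTA", "TTACTCG", "TCGTATG", "TACTCGA", "AAGAGCA",
    "ATGCCGT", "TGCCGTC", "ATCTCGT", "GCCGTCT", "GGAAGAG",
]

# str.find returns -1 when the k-mer is absent; adding 1 gives a
# 1-based start position, with 0 meaning "not found".
positions = {k: overrepresented.find(k) + 1 for k in kmers}
inside = {k: p for k, p in positions.items() if p > 0}
outside = [k for k, p in positions.items() if p == 0]

print(len(inside), "of", len(kmers), "k-mers occur in the overrepresented sequence")
print("not found:", outside)
```

The k-mers that are not found are exactly those whose Max Obs/Exp Position is reported as 45-49 or 50-54, that is, at or past the end of the 50 bases shown in the Overrepresented sequences table.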