Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00002855020 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 402013370 |
Sequences flagged as poor quality | 0 |
Sequence length | 151 |
%GC | 40 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
ATCGGAAGAGCACACGTCTGAACTCCAGTCACCGCTCATTATCTCGTATG | 496559 | 0.12351803125353766 | TruSeq Adapter, Index 2 (97% over 36bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
ATCGGAA | 158585 | 0.0 | 62.965702 | 1 |
TCGGAAG | 170970 | 0.0 | 58.318363 | 2 |
CGGAAGA | 185055 | 0.0 | 53.821625 | 3 |
AGAGCAC | 434225 | 0.0 | 23.42792 | 7 |
AGCACAC | 468205 | 0.0 | 21.755337 | 9 |
GAGCACA | 480650 | 0.0 | 21.293386 | 8 |
GAAGAGC | 488005 | 0.0 | 21.008604 | 5 |
TATGCCG | 108530 | 0.0 | 18.76265 | 45-49 |
CGTATGC | 117610 | 0.0 | 17.157545 | 45-49 |
CTCGTAT | 101185 | 0.0 | 16.121973 | 40-44 |
CGCTCAT | 130355 | 0.0 | 15.565723 | 30-34 |
TCTCGTA | 104950 | 0.0 | 15.527031 | 40-44 |
ATGCCGT | 131145 | 0.0 | 15.50727 | 45-49 |
ACCGCTC | 131845 | 0.0 | 15.381017 | 30-34 |
TGCCGTC | 131420 | 0.0 | 15.105269 | 45-49 |
GCCGTCT | 132670 | 0.0 | 14.972712 | 50-54 |
TCGTATG | 107945 | 0.0 | 14.730907 | 40-44 |
TATCTCG | 110165 | 0.0 | 14.614349 | 40-44 |
GTCACCG | 141370 | 0.0 | 14.5894375 | 25-29 |
AAGAGCA | 716965 | 0.0 | 14.5826645 | 6 |