Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00004859069 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 18654323 |
Sequences flagged as poor quality | 0 |
Sequence length | 75 |
%GC | 46 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACGTTTCGGAATCTCGTATGCCGTCTTCTGCTTGAAAAAAAAAA | 67465 | 0.3616587961943191 | TruSeq Adapter, Index 21 (97% over 40bp) |
AGATCGGAAGAGCACACGTCTGAACTCCAGTCACGTTTCGGAATCTCGTATGCCGTCTTCTGCTTGAAAAAAAAA | 58784 | 0.3151226662045039 | TruSeq Adapter, Index 21 (97% over 40bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
AGATCGG | 18400 | 0.0 | 48.56251 | 1 |
TCGTATG | 27955 | 0.0 | 43.283417 | 45 |
CTCGTAT | 27980 | 0.0 | 41.9558 | 44 |
TCTCGTA | 28355 | 0.0 | 40.814995 | 43 |
AATCTCG | 28550 | 0.0 | 40.665657 | 41 |
CGTATGC | 30495 | 0.0 | 39.651657 | 46 |
ATCTCGT | 29195 | 0.0 | 39.559032 | 42 |
CGTTTCG | 34005 | 0.0 | 36.6465 | 33 |
TATGCCG | 33360 | 0.0 | 36.571938 | 48 |
ACACGTC | 35385 | 0.0 | 35.418083 | 13 |
AGTCACG | 34885 | 0.0 | 35.38167 | 28 |
GTCACGT | 35270 | 0.0 | 35.082024 | 29 |
GTTTCGG | 35925 | 0.0 | 35.009037 | 34 |
ATGCCGT | 34715 | 0.0 | 34.890358 | 49 |
GCCGTCT | 34735 | 0.0 | 34.840965 | 51 |
GATCGGA | 36425 | 0.0 | 34.82726 | 1 |
ACGTCTG | 36135 | 0.0 | 34.748703 | 15 |
CGGAATC | 34090 | 0.0 | 34.706028 | 38 |
TTTCGGA | 36090 | 0.0 | 34.64216 | 35 |
ACGTTTC | 36090 | 0.0 | 34.585754 | 32 |