Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005432106 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 168879955 |
Sequences flagged as poor quality | 0 |
Sequence length | 75 |
%GC | 50 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACGTCCGCACATCTCGTATGCCGTCTTCTGCTTGAAAAAAAAAA | 825715 | 0.488936061121049 | TruSeq Adapter, Index 18 (97% over 40bp) |
AGATCGGAAGAGCACACGTCTGAACTCCAGTCACGTCCGCACATCTCGTATGCCGTCTTCTGCTTGAAAAAAAAA | 535834 | 0.3172869154305495 | TruSeq Adapter, Index 18 (97% over 40bp) |
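
The Percentage column above appears to be each sequence's count divided by Total Sequences from Basic Statistics, times 100. The sketch below reproduces that arithmetic from the reported numbers; the helper name `percentage_of_total` is hypothetical, and the formula is an assumption that happens to match the table, not a statement of how the report was generated.

```python
# Values copied from the Basic Statistics and Overrepresented sequences tables.
TOTAL_SEQUENCES = 168_879_955


def percentage_of_total(count: int, total: int = TOTAL_SEQUENCES) -> float:
    """Return count as a percentage of the total number of reads.

    Assumed formula: 100 * count / total (hypothetical helper, for illustration).
    """
    return 100.0 * count / total


# Counts of the two adapter-derived overrepresented sequences.
overrepresented_counts = [825_715, 535_834]

percentages = [percentage_of_total(c) for c in overrepresented_counts]

# Share of all reads accounted for by the two listed sequences together.
combined = sum(percentages)

for count, pct in zip(overrepresented_counts, percentages):
    print(f"{count}\t{pct:.6f}%")
print(f"combined\t{combined:.6f}%")
```

Taken together, the two listed sequences account for a little under 1% of the reads.
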
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
AGATCGG | 113930 | 0.0 | 41.78617 | 1 |
TCGTATG | 190365 | 0.0 | 36.52742 | 45 |
CTCGTAT | 190630 | 0.0 | 36.24106 | 44 |
TATGCCG | 192095 | 0.0 | 36.06206 | 48 |
CGTATGC | 193270 | 0.0 | 35.961006 | 46 |
AGTCACG | 200710 | 0.0 | 34.828156 | 28 |
ACACGTC | 202455 | 0.0 | 34.79406 | 13 |
GTCACGT | 200825 | 0.0 | 34.77983 | 29 |
ACGTCCG | 202635 | 0.0 | 34.504845 | 32 |
GATCGGA | 207195 | 0.0 | 34.215183 | 1 |
ACGTCTG | 207000 | 0.0 | 33.980846 | 15 |
CGCACAT | 196585 | 0.0 | 33.86367 | 37 |
TCACGTC | 207815 | 0.0 | 33.61221 | 30 |
CACGTCT | 211335 | 0.0 | 33.307594 | 14 |
ATCTCGT | 200650 | 0.0 | 33.11979 | 42 |
GTCCGCA | 213680 | 0.0 | 32.63828 | 34 |
ATGCCGT | 211050 | 0.0 | 32.59597 | 49 |
TTGAACG | 50975 | 0.0 | 32.54135 | 7 |
GCCGTCT | 212550 | 0.0 | 32.21646 | 51 |
TCTCGTA | 206490 | 0.0 | 32.175106 | 43 |