Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005431858 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 70123442 |
Sequences flagged as poor quality | 0 |
Sequence length | 75 |
%GC | 51 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
ATCGGAAGAGCACACGTCTGAACTCCAGTCACTTACAGGAATCTCGTATGCCGTCTTCTGCTTGAAAAAAAAAAA | 917049 | 1.307763814560044 | TruSeq Adapter, Index 4 (97% over 36bp) |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACTTACAGGAATCTCGTATGCCGTCTTCTGCTTGAAAAAAAAAA | 912248 | 1.3009173166371382 | TruSeq Adapter, Index 4 (97% over 37bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GATCGGA | 136165 | 0.0 | 49.706676 | 1 |
TCGTATG | 200585 | 0.0 | 33.934277 | 44 |
TATGCCG | 200570 | 0.0 | 33.894657 | 47 |
CGTATGC | 203395 | 0.0 | 33.453632 | 45 |
CTCGTAT | 203190 | 0.0 | 33.45173 | 43 |
GTATGCC | 205910 | 0.0 | 33.044037 | 46 |
AATCTCG | 201225 | 0.0 | 32.818993 | 40 |
CACTTAC | 207995 | 0.0 | 32.73216 | 30 |
TCACTTA | 209835 | 0.0 | 32.4211 | 29 |
ACTTACA | 211135 | 0.0 | 32.250267 | 31 |
TCTCGTA | 206295 | 0.0 | 32.108727 | 42 |
CGTCTGA | 211040 | 0.0 | 32.09397 | 15 |
ACGTCTG | 211645 | 0.0 | 32.013916 | 14 |
ATCTCGT | 207425 | 0.0 | 31.944517 | 41 |
GCCGTCT | 212655 | 0.0 | 31.89225 | 50 |
ACACGTC | 212725 | 0.0 | 31.832464 | 12 |
ATGCCGT | 215385 | 0.0 | 31.588928 | 48 |
CACGTCT | 217065 | 0.0 | 31.265404 | 13 |
CTTACAG | 218345 | 0.0 | 31.251667 | 32 |
CCGTCTT | 216700 | 0.0 | 31.250523 | 51 |