Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00004859259 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 20222839 |
Sequences flagged as poor quality | 0 |
Sequence length | 75 |
%GC | 47 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
AGATCGGAAGAGCACACGTCTGAACTCCAGTCACGTGGCCTTATCTCGTATGCCGTCTTCTGCTTGAAAAAAAAA | 68827 | 0.34034291624435126 | TruSeq Adapter, Index 20 (97% over 44bp) |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACGTGGCCTTATCTCGTATGCCGTCTTCTGCTTGAAAAAAAAAA | 20438 | 0.10106395051654221 | TruSeq Adapter, Index 20 (97% over 44bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
AGATCGG | 21140 | 0.0 | 53.3031 | 1 |
TCGTATG | 24845 | 0.0 | 43.79678 | 46 |
TATGCCG | 25010 | 0.0 | 43.603157 | 49 |
CGTATGC | 25270 | 0.0 | 43.26235 | 47 |
GCACACG | 26085 | 0.0 | 43.184155 | 12 |
TATCTCG | 25045 | 0.0 | 42.821182 | 42 |
CTCGTAT | 25280 | 0.0 | 42.692883 | 45 |
ACACGTC | 26515 | 0.0 | 42.261475 | 14 |
AGTCACG | 26130 | 0.0 | 42.102596 | 29 |
GATCGGA | 26925 | 0.0 | 41.891003 | 2 |
ACGTCTG | 27165 | 0.0 | 41.31069 | 16 |
GTCACGT | 26370 | 0.0 | 41.151093 | 30 |
TCTCGTA | 26365 | 0.0 | 40.522575 | 44 |
CACACGT | 27750 | 0.0 | 40.49111 | 13 |
CACGTCT | 27790 | 0.0 | 40.37135 | 15 |
GCCGTCT | 26980 | 0.0 | 40.318897 | 52 |
ATGCCGT | 27020 | 0.0 | 40.246803 | 50 |
ATCGGAA | 27970 | 0.0 | 40.204334 | 3 |
ATCTCGT | 26665 | 0.0 | 40.01558 | 43 |
TCACGTG | 27535 | 0.0 | 39.54607 | 31 |