Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005727585 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 13750492 |
Sequences flagged as poor quality | 0 |
Sequence length | 51 |
%GC | 40 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACTTAGGCATCTCGTATGCC | 1329774 | 9.670737599789158 | TruSeq Adapter, Index 3 (100% over 51bp) |
GATTGGAAGAGCACACGTCTGAACTCCAGTCACTTAGGCATCTCGTATGCC | 27664 | 0.2011855284887261 | TruSeq Adapter, Index 3 (98% over 51bp) |
TATCGGAAGAGCACACGTCTGAACTCCAGTCACTTAGGCATCTCGTATGCC | 23090 | 0.16792126419912828 | TruSeq Adapter, Index 3 (98% over 51bp) |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACTTAGGCTCTCGTATGCCG | 14053 | 0.10219997946255305 | TruSeq Adapter, Index 3 (97% over 39bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GATCGGA | 159005 | 0.0 | 44.505413 | 1 |
ATCGGAA | 163465 | 0.0 | 44.03341 | 2 |
TCGGAAG | 163550 | 0.0 | 44.025658 | 3 |
CGGAAGA | 163750 | 0.0 | 43.945778 | 4 |
ACACGTC | 166940 | 0.0 | 43.676113 | 13 |
CGTCTGA | 168215 | 0.0 | 43.632076 | 16 |
ACGTCTG | 167325 | 0.0 | 43.59713 | 15 |
GCACACG | 166995 | 0.0 | 43.593014 | 11 |
CACGTCT | 167285 | 0.0 | 43.581997 | 14 |
CACACGT | 167225 | 0.0 | 43.554585 | 12 |
CATCTCG | 158525 | 0.0 | 43.513752 | 39 |
AGAGCAC | 170450 | 0.0 | 43.218887 | 8 |
CTTAGGC | 159305 | 0.0 | 43.19446 | 33 |
GAAGAGC | 170950 | 0.0 | 43.189873 | 6 |
ATCTCGT | 161640 | 0.0 | 43.170856 | 40 |
GAGCACA | 169770 | 0.0 | 43.134903 | 9 |
AGCACAC | 169375 | 0.0 | 43.103992 | 10 |
AGGCATC | 156480 | 0.0 | 42.992207 | 36 |
GCATCTC | 158465 | 0.0 | 42.980434 | 38 |
ACTTAGG | 161120 | 0.0 | 42.978794 | 32 |