Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005432050 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 65739634 |
Sequences flagged as poor quality | 0 |
Sequence length | 100 |
%GC | 49 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
AGATCGGAAGAGCACACGTCTGAACTCCAGTCACATGTCAGAATCTCGTA | 286108 | 0.4352138620059856 | TruSeq Adapter, Index 15 (97% over 40bp) |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACATGTCAGAATCTCGTAT | 199164 | 0.30295879043074686 | TruSeq Adapter, Index 15 (97% over 40bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CTCGTAT | 68740 | 0.0 | 36.01842 | 44-45 |
TATGCCG | 69850 | 0.0 | 35.596745 | 48-49 |
CGTATGC | 73615 | 0.0 | 33.86055 | 46-47 |
ATCTCGT | 73640 | 0.0 | 33.62803 | 42-43 |
AGATCGG | 92070 | 0.0 | 31.515848 | 1 |
CCGTCTT | 83820 | 0.0 | 29.750546 | 52-53 |
TGCCGTC | 85620 | 0.0 | 29.125172 | 50-51 |
ATCGGAA | 103585 | 0.0 | 28.078976 | 3 |
GATCGGA | 103945 | 0.0 | 27.99261 | 2 |
CGTCTGA | 92225 | 0.0 | 27.56675 | 16-17 |
CACGTCT | 92555 | 0.0 | 27.478573 | 14-15 |
CACACGT | 92740 | 0.0 | 27.436522 | 12-13 |
ACATGTC | 92820 | 0.0 | 27.346514 | 32-33 |
TCGGAAG | 111995 | 0.0 | 26.125435 | 4 |
CGGAAGA | 114500 | 0.0 | 25.758905 | 5 |
GAATCTC | 97695 | 0.0 | 25.619707 | 40-41 |
TCACATG | 102265 | 0.0 | 24.910421 | 30-31 |
ATGTCAG | 107400 | 0.0 | 23.740395 | 34-35 |
GTCAGAA | 109295 | 0.0 | 23.2878 | 36-37 |
AGAGCAC | 130980 | 0.0 | 22.784742 | 9 |