Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005330999 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 18302211 |
Sequences flagged as poor quality | 0 |
Sequence length | 101 |
%GC | 44 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACAGTCAACAATCTCGTAT | 110910 | 0.6059923579724876 | TruSeq Adapter, Index 13 (97% over 40bp) |
AGATCGGAAGAGCACACGTCTGAACTCCAGTCACAGTCAACAATCTCGTA | 64135 | 0.3504221429858939 | TruSeq Adapter, Index 13 (97% over 40bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
TATGCCG | 24970 | 0.0 | 38.892784 | 48-49 |
CGTATGC | 25520 | 0.0 | 38.073204 | 46-47 |
CTCGTAT | 24580 | 0.0 | 37.21504 | 44-45 |
TGCCGTC | 26870 | 0.0 | 35.9303 | 50-51 |
CCGTCTT | 26970 | 0.0 | 35.772877 | 52-53 |
ATCTCGT | 25780 | 0.0 | 35.10274 | 42-43 |
AGTCAAC | 34300 | 0.0 | 28.495348 | 34-35 |
AACAATC | 34025 | 0.0 | 26.910631 | 38-39 |
TGCTTGA | 35790 | 0.0 | 26.872423 | 60-61 |
CAATCTC | 34465 | 0.0 | 26.649755 | 40-41 |
CGTCTGA | 39785 | 0.0 | 24.936693 | 16-17 |
TCGTATG | 25855 | 0.0 | 24.24208 | 44-45 |
ATGCCGT | 26095 | 0.0 | 24.03328 | 48-49 |
GCCGTCT | 26190 | 0.0 | 23.864405 | 50-51 |
CTTGAAA | 40525 | 0.0 | 23.856712 | 62-63 |
CACGTCT | 41970 | 0.0 | 23.66103 | 14-15 |
ATCGGAA | 54030 | 0.0 | 23.660585 | 2 |
TCTCGTA | 24910 | 0.0 | 23.65991 | 42-43 |
GATCGGA | 54275 | 0.0 | 23.633577 | 1 |
TCGGAAG | 54350 | 0.0 | 23.57166 | 3 |