Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00004842322 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 21463787 |
Sequences flagged as poor quality | 0 |
Sequence length | 51 |
%GC | 40 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACGATCAGATCTCGTATGCC | 180350 | 0.8402524680290575 | TruSeq Adapter, Index 9 (100% over 51bp) |
GATCGGGAAGAGCACACGTCTGAACTCCAGTCACGATCAGATCTCGTATGC | 65004 | 0.3028542912767444 | TruSeq Adapter, Index 9 (100% over 46bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GATCGGG | 9165 | 0.0 | 42.73054 | 1 |
ATCGGGA | 9115 | 0.0 | 41.937393 | 2 |
TCGGGAA | 9315 | 0.0 | 40.758545 | 3 |
GATCGGA | 26475 | 0.0 | 40.101192 | 1 |
CGGGAAG | 9830 | 0.0 | 38.051155 | 4 |
ATCGGAA | 27830 | 0.0 | 38.029343 | 2 |
TCGGAAG | 26630 | 0.0 | 37.931587 | 3 |
CGGAAGA | 27105 | 0.0 | 37.034508 | 4 |
GTATGCC | 26825 | 0.0 | 35.391724 | 45 |
CGTATGC | 34630 | 0.0 | 27.395561 | 44 |
ACACGTC | 35910 | 0.0 | 27.28906 | 13 |
CGTCTGA | 35920 | 0.0 | 27.194279 | 16 |
GCACACG | 36275 | 0.0 | 27.06409 | 11 |
ACGTCTG | 36335 | 0.0 | 26.908442 | 15 |
CACGTCT | 36450 | 0.0 | 26.87291 | 14 |
CACACGT | 37015 | 0.0 | 26.565573 | 12 |
GTCACGA | 35670 | 0.0 | 26.515022 | 29 |
CTCGTAT | 35765 | 0.0 | 26.463272 | 42 |
TCTCGTA | 33890 | 0.0 | 26.460665 | 41 |
AGTCACG | 35905 | 0.0 | 26.447973 | 28 |