Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00004691440 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 21643352 |
Sequences flagged as poor quality | 0 |
Sequence length | 51 |
%GC | 46 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACTGACCAATCTCGTATGCC | 222161 | 1.0264629988922234 | TruSeq Adapter, Index 4 (100% over 51bp) |
AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGACGCTCTT | 36751 | 0.16980271817415343 | Illumina Single End PCR Primer 1 (100% over 51bp) |
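
The Percentage column appears to be each sequence's count as a share of Total Sequences (21643352). A minimal Python sketch under that assumption; the name `percent_of_total` is illustrative and is not part of FastQC:

```python
TOTAL_SEQUENCES = 21643352  # "Total Sequences" from Basic Statistics


def percent_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of the total number of reads."""
    return count / total * 100


# Counts copied from the Overrepresented sequences table.
overrepresented = {
    "TruSeq Adapter, Index 4": 222161,
    "Illumina Single End PCR Primer 1": 36751,
}

for source, count in overrepresented.items():
    print(f"{source}: {percent_of_total(count):.4f}%")
```

Both results agree with the reported percentages to rounding.
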
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CTCGTAT | 23990 | 0.0 | 43.10509 | 42 |
CGTATGC | 24320 | 0.0 | 42.95492 | 44 |
TCGTATG | 24150 | 0.0 | 42.78225 | 43 |
TCTCGTA | 24295 | 0.0 | 42.508396 | 41 |
ATCGGAA | 26110 | 0.0 | 42.424652 | 2 |
CGTCTGA | 26185 | 0.0 | 42.14784 | 16 |
GATCGGA | 26415 | 0.0 | 42.036213 | 1 |
ACACGTC | 26370 | 0.0 | 41.91885 | 13 |
TCGGAAG | 26475 | 0.0 | 41.83077 | 3 |
ACGTCTG | 26595 | 0.0 | 41.50528 | 15 |
CACGTCT | 26730 | 0.0 | 41.32081 | 14 |
CGGAAGA | 26955 | 0.0 | 41.06915 | 4 |
GTATGCC | 25550 | 0.0 | 40.90053 | 45 |
CACACGT | 27160 | 0.0 | 40.7741 | 12 |
GCACACG | 27430 | 0.0 | 40.389244 | 11 |
ATCTCGT | 25650 | 0.0 | 40.28913 | 40 |
AATCTCG | 26500 | 0.0 | 39.090218 | 39 |
GACCAAT | 27375 | 0.0 | 38.144794 | 35 |
ACCAATC | 27765 | 0.0 | 38.09511 | 36 |
ACTGACC | 27655 | 0.0 | 37.994476 | 32 |
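
Every k-mer listed above is a substring of the overrepresented TruSeq Adapter, Index 4 sequence, and its 1-based offset within that sequence equals its Max Obs/Exp Position. A short Python check of that observation; `ADAPTER`, `KMER_POSITIONS`, and `offset_in_adapter` are illustrative names, and treating the adapter offset as a read position assumes the adapter begins at read position 1, as it does in the overrepresented sequence:

```python
# Overrepresented sequence reported as TruSeq Adapter, Index 4.
ADAPTER = "GATCGGAAGAGCACACGTCTGAACTCCAGTCACTGACCAATCTCGTATGCC"

# K-mer -> Max Obs/Exp Position, copied from the Kmer Content table.
KMER_POSITIONS = {
    "CTCGTAT": 42, "CGTATGC": 44, "TCGTATG": 43, "TCTCGTA": 41,
    "ATCGGAA": 2, "CGTCTGA": 16, "GATCGGA": 1, "ACACGTC": 13,
    "TCGGAAG": 3, "ACGTCTG": 15, "CACGTCT": 14, "CGGAAGA": 4,
    "GTATGCC": 45, "CACACGT": 12, "GCACACG": 11, "ATCTCGT": 40,
    "AATCTCG": 39, "GACCAAT": 35, "ACCAATC": 36, "ACTGACC": 32,
}


def offset_in_adapter(kmer, adapter=ADAPTER):
    """Return the 1-based offset of kmer in adapter, or 0 if absent."""
    return adapter.find(kmer) + 1


# Collect any k-mer whose adapter offset differs from its reported position.
mismatches = {
    kmer: (pos, offset_in_adapter(kmer))
    for kmer, pos in KMER_POSITIONS.items()
    if offset_in_adapter(kmer) != pos
}
print("mismatches:", mismatches)
```

An empty `mismatches` dictionary is consistent with the enriched k-mers coming from adapter read-through rather than from the library itself.
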