Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00004691368 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 15786431 |
Sequences flagged as poor quality | 0 |
Sequence length | 51 |
%GC | 42 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
ATCGGAAGAGCACACGTCTGAACTCCAGTCACAACAACCAATCTCGTATGC | 869894 | 5.510390537291171 | TruSeq Adapter, Index 1 (97% over 36bp) |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACAACAACCAATCTCGTATG | 506532 | 3.2086543183826666 | TruSeq Adapter, Index 1 (97% over 37bp) |
ATCGGAAGAGCACACGTCTGAACTCCAGTCACAACAACCAAACTCGTATGC | 32501 | 0.2058793403018073 | TruSeq Adapter, Index 1 (97% over 36bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CGTATGC | 100585 | 0.0 | 43.92583 | 45 |
GATCGGA | 62720 | 0.0 | 39.227615 | 1 |
ACTCGTA | 4475 | 0.0 | 33.683056 | 42 |
AAACTCG | 4670 | 0.0 | 32.035618 | 40 |
AACTCGT | 4945 | 0.0 | 30.572525 | 41 |
TCGTATG | 154625 | 0.0 | 28.526148 | 44 |
CTCGTAT | 152705 | 0.0 | 28.477922 | 43 |
CGTCTGA | 156405 | 0.0 | 28.328083 | 15 |
ACACGTC | 156890 | 0.0 | 28.269192 | 12 |
ACGTCTG | 156970 | 0.0 | 28.237583 | 14 |
CACGTCT | 157290 | 0.0 | 28.194437 | 13 |
TCTCGTA | 145870 | 0.0 | 28.162067 | 42 |
ATCTCGT | 146195 | 0.0 | 28.103987 | 41 |
TCACAAC | 157470 | 0.0 | 28.04676 | 29 |
CACACGT | 158230 | 0.0 | 28.045427 | 11 |
GTCACAA | 157765 | 0.0 | 27.994314 | 28 |
GCACACG | 158530 | 0.0 | 27.985258 | 10 |
CCAGTCA | 158015 | 0.0 | 27.976986 | 25 |
CAACCAA | 157335 | 0.0 | 27.970013 | 35 |
AACAACC | 157665 | 0.0 | 27.95285 | 33 |