Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005729516 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 36766402 |
Sequences flagged as poor quality | 0 |
Sequence length | 101 |
%GC | 46 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA | 223573 | 0.608090506109355 | No Hit |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACGGACTTGGATCTCGTAT | 136016 | 0.36994645274237065 | TruSeq Adapter, Index 4 (97% over 37bp) |
TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 37077 | 0.10084478758623158 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
TCGTATG | 21115 | 0.0 | 41.050087 | 44-45 |
GCCGTCT | 25365 | 0.0 | 34.03258 | 50-51 |
GTATGCC | 25720 | 0.0 | 33.775093 | 46-47 |
TATGCCG | 21200 | 0.0 | 32.9674 | 48-49 |
CGGACTT | 28505 | 0.0 | 30.617304 | 32-33 |
GATCTCG | 27310 | 0.0 | 29.798618 | 40-41 |
CGTATGC | 23950 | 0.0 | 29.270803 | 46-47 |
CTCGTAT | 24420 | 0.0 | 28.453844 | 44-45 |
CACGGAC | 30755 | 0.0 | 28.198622 | 30-31 |
TCTCGTA | 28990 | 0.0 | 28.064121 | 42-43 |
ATGCCGT | 32310 | 0.0 | 27.165977 | 48-49 |
GTCACGG | 35010 | 0.0 | 25.307623 | 28-29 |
ATCTCGT | 26490 | 0.0 | 25.056025 | 42-43 |
CCGGTAA | 3610 | 0.0 | 23.794706 | 1 |
ACGGACT | 29595 | 0.0 | 23.615793 | 32-33 |
TTGGATC | 29255 | 0.0 | 22.494354 | 38-39 |
TGGATCT | 36935 | 0.0 | 22.272905 | 38-39 |
TGCCGTC | 32395 | 0.0 | 21.801596 | 50-51 |
GCTTGAA | 41965 | 0.0 | 20.893602 | 60-61 |
TCACGGA | 34160 | 0.0 | 20.841387 | 30-31 |