Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00004859433 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 18817594 |
Sequences flagged as poor quality | 0 |
Sequence length | 75 |
%GC | 47 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACCGTACGTAATCTCGTATGCCGTCTTCTGCTTGAAAAAAAAAA | 36818 | 0.19565731942138828 | TruSeq Adapter, Index 22 (97% over 40bp) |
AGATCGGAAGAGCACACGTCTGAACTCCAGTCACCGTACGTAATCTCGTATGCCGTCTTCTGCTTGAAAAAAAAA | 30219 | 0.16058907424615496 | TruSeq Adapter, Index 22 (97% over 40bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
AGATCGG | 11430 | 0.0 | 40.700085 | 1 |
TCGTATG | 15730 | 0.0 | 40.364376 | 45 |
CGTATGC | 17265 | 0.0 | 36.916374 | 46 |
CTCGTAT | 16865 | 0.0 | 36.759514 | 44 |
AATCTCG | 16920 | 0.0 | 36.520363 | 41 |
TCTCGTA | 17010 | 0.0 | 35.803413 | 43 |
CGTACGT | 18200 | 0.0 | 35.421616 | 34 |
ATCTCGT | 17315 | 0.0 | 35.219517 | 42 |
TACGTAA | 18480 | 0.0 | 34.903934 | 36 |
CCGTACG | 18860 | 0.0 | 34.775738 | 33 |
TATGCCG | 18735 | 0.0 | 34.736103 | 48 |
GTACGTA | 18670 | 0.0 | 34.602608 | 35 |
ACCGTAC | 19165 | 0.0 | 34.2223 | 32 |
ACGTAAT | 18790 | 0.0 | 33.87664 | 37 |
CGTAATC | 18415 | 0.0 | 33.79262 | 38 |
ACACGTC | 20265 | 0.0 | 32.741653 | 13 |
GATCGGA | 20685 | 0.0 | 32.53462 | 1 |
ATGCCGT | 20325 | 0.0 | 31.951965 | 49 |
CACCGTA | 20580 | 0.0 | 31.8858 | 31 |
ATCGGAA | 21350 | 0.0 | 31.586256 | 2 |