Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00004890215 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 19500381 |
Sequences flagged as poor quality | 0 |
Sequence length | 65 |
%GC | 48 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACACTGATATATCTCGTATGCCGTCTTCTGCTTG | 64195 | 0.32919869616906455 | TruSeq Adapter, Index 25 (97% over 44bp) |
ATCGGAAGAGCACACGTCTGAACTCCAGTCACACTGATATATCTCGTATGCCGTCTTCTGCTTGA | 27146 | 0.13920753650915846 | TruSeq Adapter, Index 25 (97% over 43bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
TCGTATG | 11910 | 0.0 | 39.186638 | 45 |
CTCGTAT | 11990 | 0.0 | 38.77516 | 44 |
TATCTCG | 12285 | 0.0 | 38.060852 | 41 |
TATGCCG | 12510 | 0.0 | 37.402473 | 48 |
CGTATGC | 12665 | 0.0 | 36.896904 | 46 |
GTATGCC | 13020 | 0.0 | 35.892082 | 47 |
GATCGGA | 13310 | 0.0 | 35.790764 | 1 |
ATATCTC | 13135 | 0.0 | 35.711044 | 40 |
TGATATA | 13325 | 0.0 | 35.401283 | 36 |
ACGTCTG | 13540 | 0.0 | 34.85629 | 15 |
CACGTCT | 13670 | 0.0 | 34.805325 | 14 |
ACACGTC | 13915 | 0.0 | 34.00286 | 13 |
TCTCGTA | 13735 | 0.0 | 33.97782 | 43 |
ATCTCGT | 13820 | 0.0 | 33.705147 | 42 |
CTGATAT | 14250 | 0.0 | 33.103302 | 35 |
CACACGT | 14660 | 0.0 | 32.516346 | 12 |
GCCGTCT | 13190 | 0.0 | 32.05312 | 51 |
ATATATC | 14730 | 0.0 | 31.904415 | 38 |
ACTGATA | 15140 | 0.0 | 31.391003 | 34 |
TCGGAAG | 15340 | 0.0 | 31.209309 | 3 |