Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005729510 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 65799705 |
Sequences flagged as poor quality | 0 |
Sequence length | 101 |
%GC | 50 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA | 1211680 | 1.8414672223834436 | No Hit |
CAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA | 178649 | 0.2715042567440082 | No Hit |
ATCGGAAGAGCACACGTCTGAACTCCAGTCACAGCGATAGATCTCGTATG | 125256 | 0.19035951604950205 | TruSeq Adapter, Index 1 (97% over 35bp) |
CCCGGGGCTCCCGCCGGCTTCTCCGGGATCGGTCGCGTTACCGCACTGGA | 90094 | 0.13692158650255348 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
TCGTATG | 19240 | 0.0 | 37.539017 | 44-45 |
TATGCCG | 20940 | 0.0 | 31.337976 | 46-47 |
GCGATAG | 24930 | 0.0 | 29.060339 | 34-35 |
CGATAGA | 22910 | 0.0 | 28.699783 | 34-35 |
CGTATGC | 24310 | 0.0 | 27.160099 | 44-45 |
GCCGTCT | 26635 | 0.0 | 26.897213 | 50-51 |
GTATGCC | 27025 | 0.0 | 26.821653 | 46-47 |
CTCGTAT | 26415 | 0.0 | 25.265461 | 42-43 |
AGCGATA | 26435 | 0.0 | 25.087507 | 32-33 |
ATGCCGT | 30315 | 0.0 | 24.165236 | 48-49 |
TGTTACG | 15310 | 0.0 | 22.649302 | 3 |
TGTCGAG | 13675 | 0.0 | 22.01603 | 5 |
GTTACGA | 15915 | 0.0 | 21.967398 | 4 |
CAGCGAT | 35865 | 0.0 | 20.775333 | 32-33 |
ACCGCGT | 5365 | 0.0 | 20.535028 | 6 |
TTACGAC | 16990 | 0.0 | 20.51539 | 5 |
GATCTCG | 34250 | 0.0 | 20.079208 | 40-41 |
TGCCGTC | 33580 | 0.0 | 20.061901 | 48-49 |
GCGCGTT | 6225 | 0.0 | 19.57216 | 90-91 |
TGTGTCG | 14140 | 0.0 | 19.349966 | 3 |