Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00004953892 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 31877130 |
Sequences flagged as poor quality | 0 |
Sequence length | 76-101 |
%GC | 52 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACCTGAAGCTATCTCGTAT | 236853 | 0.7430185841699048 | TruSeq Adapter, Index 19 (97% over 38bp) |
ATCGGAAGAGCACACGTCTGAACTCCAGTCACCTGAAGCTATCTCGTATG | 48456 | 0.15200866577386357 | TruSeq Adapter, Index 19 (97% over 37bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
TTGAAAA | 60195 | 0.0 | 22.52956 | 62-63 |
GCTTGAA | 63025 | 0.0 | 21.586067 | 60-61 |
CTTGAAA | 60340 | 0.0 | 19.238104 | 62-63 |
GAAAAAA | 71250 | 0.0 | 19.203444 | 64-65 |
GCCGTCT | 72300 | 0.0 | 18.75807 | 50-51 |
TGAAAAA | 62225 | 0.0 | 18.731606 | 64-65 |
TTACGAC | 2920 | 0.0 | 17.868126 | 5 |
ATGCCGT | 76045 | 0.0 | 17.85419 | 48-49 |
GTATGCC | 77620 | 0.0 | 17.444029 | 46-47 |
GTTACGA | 3075 | 0.0 | 17.24791 | 4 |
TGCTTGA | 69450 | 0.0 | 16.937885 | 60-61 |
CGTCTTC | 81355 | 0.0 | 16.845634 | 52-53 |
TCGTATG | 80525 | 0.0 | 16.76174 | 44-45 |
TATGCCG | 70935 | 0.0 | 16.255787 | 48-49 |
CTGCTTG | 86335 | 0.0 | 16.008099 | 58-59 |
CGTTCGA | 4720 | 0.0 | 15.347885 | 7 |
TACGACT | 3520 | 0.0 | 15.067423 | 6 |
TGCCGTC | 78035 | 0.0 | 14.964597 | 50-51 |
CGTATGC | 77880 | 0.0 | 14.802839 | 46-47 |
TGTTACG | 3685 | 0.0 | 14.392513 | 3 |