Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005441197 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 21479470 |
Sequences flagged as poor quality | 0 |
Sequence length | 151 |
%GC | 45 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA | 38884 | 0.18102867528854297 | No Hit |
CCCGTCCCGGGGGTTGGCCGCGCGGGCCCCGGTGGGGCGGCCACCCGGGG | 30473 | 0.14187035341188586 | No Hit |
GGCGTACGGAAGACCCGCTCCCCGGCGCCGCTCGTGGGGGGCCCAAGTCC | 25558 | 0.11898803834545266 | No Hit |
ATCGGAAGAGCACACGTCTGAACTCCAGTCACGCAGAATTATCTCGTATG | 23022 | 0.10718141555634286 | TruSeq Adapter, Index 5 (97% over 36bp) |
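
The Percentage column above is consistent with each Count divided by Total Sequences (21479470), multiplied by 100. A minimal Python sketch to check this against the reported values; `percent_of_total` is a hypothetical helper name, not part of FastQC:

```python
# Values copied from the Basic Statistics and Overrepresented sequences tables.
TOTAL_SEQUENCES = 21479470

# sequence -> (reported count, reported percentage)
OVERREPRESENTED = {
    "AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA": (38884, 0.18102867528854297),
    "CCCGTCCCGGGGGTTGGCCGCGCGGGCCCCGGTGGGGCGGCCACCCGGGG": (30473, 0.14187035341188586),
    "GGCGTACGGAAGACCCGCTCCCCGGCGCCGCTCGTGGGGGGCCCAAGTCC": (25558, 0.11898803834545266),
    "ATCGGAAGAGCACACGTCTGAACTCCAGTCACGCAGAATTATCTCGTATG": (23022, 0.10718141555634286),
}


def percent_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of total (hypothetical helper)."""
    return 100.0 * count / total


for sequence, (count, reported) in OVERREPRESENTED.items():
    computed = percent_of_total(count)
    # Print the first 12 bases only, to keep the output readable.
    print(f"{sequence[:12]}...  computed={computed:.6f}  reported={reported:.6f}")
```

Under this assumption the four computed percentages agree with the reported ones to within floating-point rounding.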
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GCGTACG | 5190 | 0.0 | 70.13197 | 2 |
CGTACGG | 5570 | 0.0 | 65.3375 | 3 |
GGCGTAC | 5765 | 0.0 | 63.448845 | 1 |
ACCGCGT | 2990 | 0.0 | 52.127888 | 6 |
CGCGTTC | 3345 | 0.0 | 47.02908 | 8 |
CCGCGTT | 3355 | 0.0 | 46.672825 | 7 |
GTACGGA | 8505 | 0.0 | 42.788914 | 4 |
TACGGAA | 9365 | 0.0 | 38.937054 | 5 |
AGACCGC | 4215 | 0.0 | 37.149933 | 4 |
GACCGCG | 4350 | 0.0 | 36.330395 | 5 |
CAGACCG | 4985 | 0.0 | 33.157646 | 3 |
ACGGAAG | 11830 | 0.0 | 31.068914 | 6 |
TCGACCC | 7980 | 0.0 | 30.710968 | 2 |
GACCCGT | 8270 | 0.0 | 29.453415 | 4 |
GCGTTCT | 5420 | 0.0 | 29.024405 | 9 |
CGTGCGG | 9240 | 0.0 | 26.753786 | 8 |
ACCCGTG | 9500 | 0.0 | 26.097889 | 5 |
GTCGCGC | 4675 | 0.0 | 25.87169 | 1 |
ATATCGC | 3115 | 0.0 | 24.668951 | 8 |
GCGCCGT | 4865 | 0.0 | 24.437843 | 4 |
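
Six of the k-mers above (GGCGTAC, GCGTACG, CGTACGG, GTACGGA, TACGGAA, ACGGAAG) peak at consecutive positions 1 to 6 and each overlaps the next by six bases, which suggests they come from one longer motif at the start of the reads. A minimal Python sketch of that check, assuming Max Obs/Exp Position is a 1-based offset into the read; `chain_kmers` is a hypothetical helper, not a FastQC function:

```python
# (k-mer, Max Obs/Exp Position) pairs copied from the Kmer Content table.
KMERS = [
    ("GCGTACG", 2),
    ("CGTACGG", 3),
    ("GGCGTAC", 1),
    ("GTACGGA", 4),
    ("TACGGAA", 5),
    ("ACGGAAG", 6),
]

# Third entry of the Overrepresented sequences table.
OVERREPRESENTED_3 = "GGCGTACGGAAGACCCGCTCCCCGGCGCCGCTCGTGGGGGGCCCAAGTCC"


def chain_kmers(kmers):
    """Merge k-mers at consecutive positions that overlap by k-1 bases.

    Raises ValueError if positions are not consecutive or overlaps disagree.
    """
    ordered = sorted(kmers, key=lambda pair: pair[1])
    merged, last_pos = ordered[0]
    for kmer, pos in ordered[1:]:
        # Each k-mer must start one base after the previous one and
        # share its first k-1 bases with the end of the merged string.
        if pos != last_pos + 1 or not merged.endswith(kmer[:-1]):
            raise ValueError(f"{kmer} at position {pos} does not extend {merged}")
        merged += kmer[-1]
        last_pos = pos
    return merged


motif = chain_kmers(KMERS)
print(motif)
print(OVERREPRESENTED_3.startswith(motif))
```

For these six rows the chain is GGCGTACGGAAG, which matches the first 12 bases of the third overrepresented sequence, so the k-mer enrichment at positions 1 to 6 is plausibly driven by that sequence.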