Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005709082 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 20884066 |
Sequences flagged as poor quality | 0 |
Sequence length | 25-358 |
%GC | 51 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GGCAATGAGCGGTTCCGCTGCCCTGAGGCACTCTTCCAGCCTTCCTTCCT | 32869 | 0.15738793393968398 | No Hit |
GTCCACGTCACACTTCATGATGGAGTTGAAGGTAGTTTCGTGGATGCCAC | 32052 | 0.1534758604957483 | No Hit |
CAATGGGGTACTTCAGGGTCAGGATGCCACGCTTGCTCTGGGCCTCGTCG | 27729 | 0.1327758684539687 | No Hit |
GGCAATGAGCGGTTCCGGTGTCCGGAGGCGCTGTTCCAGCCTTCCTTCCT | 22214 | 0.1063681756225057 | No Hit |
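
The Percentage column above appears consistent with each Count divided by the Total Sequences value from Basic Statistics (20884066), times 100. The following is a minimal sketch of that arithmetic, not FastQC's own code; the function name `percentage_of_total` and the constant `TOTAL_SEQUENCES` are hypothetical names introduced here for illustration.

```python
def percentage_of_total(count: int, total_sequences: int) -> float:
    """Return count expressed as a percentage of total_sequences."""
    if total_sequences <= 0:
        raise ValueError("total_sequences must be positive")
    return 100.0 * count / total_sequences


# "Total Sequences" value copied from the Basic Statistics table.
TOTAL_SEQUENCES = 20884066

# Counts copied from the Overrepresented sequences table, in row order.
counts = [32869, 32052, 27729, 22214]

for count in counts:
    # Print each count next to its recomputed percentage for comparison
    # with the Percentage column in the report.
    print(f"{count}\t{percentage_of_total(count, TOTAL_SEQUENCES):.6f}")
```

Comparing the printed values with the Percentage column is a quick check that the table and the Total Sequences figure come from the same run.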
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
ACCCCCC | 5120 | 8.636145E-6 | 21778.102 | 330-339 |
ACCCCGG | 7940 | 2.0769257E-5 | 14043.31 | 340-341 |
AAGGGGG | 9885 | 0.0 | 11280.109 | 340-341 |
GGTTTTT | 10455 | 3.601041E-5 | 10665.125 | 320-329 |
GGGTTTT | 16270 | 0.0 | 6853.3423 | 330-339 |
GGGGCCC | 18380 | 1.1129381E-4 | 6066.5874 | 310-319 |
CGTTATC | 1025 | 2.1803338E-4 | 6043.5713 | 280-289 |
GGGTTAA | 2470 | 2.4116275E-4 | 5642.909 | 290-299 |
GGTTTTA | 10045 | 1.9943668E-4 | 5550.2183 | 330-339 |
GTTTTTA | 11005 | 2.3937719E-4 | 5066.0557 | 320-329 |
GTTTTAA | 7425 | 0.0 | 5005.786 | 330-339 |
CCCCCCA | 11650 | 2.682581E-4 | 4785.574 | 330-339 |
CCCCGGG | 12225 | 2.95391E-4 | 4560.486 | 340-341 |
GTAACGT | 1420 | 0.0 | 4362.4365 | 280-289 |
GATATAC | 3350 | 4.4359837E-4 | 4160.593 | 290-299 |
CCCGCGA | 3395 | 0.0 | 4105.445 | 290-299 |
TTTGCAC | 9805 | 6.650141E-4 | 3249.184 | 300-309 |
ACGTATA | 510 | 7.7489146E-4 | 3239.0378 | 280-289 |
GGGCCCT | 17615 | 6.132687E-4 | 3165.0264 | 310-319 |
CCCCCAA | 12130 | 7.269924E-4 | 3064.1353 | 330-339 |