Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005331588 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 51956069 |
Sequences flagged as poor quality | 0 |
Sequence length | 51 |
%GC | 43 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA | 1021769 | 1.966601822782243 | No Hit |
GTTAGGGTTAGGGTTAGGGTTAGGGTTAGGGTTAGGGTTAGGGTTAGGGTT | 104316 | 0.20077731438843072 | No Hit |
CTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTA | 101956 | 0.1962350153934856 | No Hit |
CCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCT | 83297 | 0.16032198278895965 | No Hit |
GAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAA | 76972 | 0.14814823654191392 | No Hit |
AAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAA | 76107 | 0.14648336847808868 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
TCGTCCG | 1170 | 0.0 | 16.732153 | 3 |
GTTCGTC | 1820 | 0.0 | 14.980589 | 1 |
TACGGGC | 1055 | 0.0 | 14.077156 | 2 |
CGGGCTA | 1240 | 0.0 | 12.882766 | 4 |
TACGCGA | 180 | 1.6956892E-5 | 12.500697 | 9 |
CTACGGG | 1260 | 0.0 | 12.160552 | 1 |
GTCCCGT | 2845 | 0.0 | 11.325794 | 1 |
CCGCGTC | 7195 | 0.0 | 11.225832 | 42 |
CTCGATT | 16080 | 0.0 | 11.140278 | 1 |
CGTCGCG | 245 | 2.900897E-6 | 11.021319 | 3 |
ACCGCCG | 8380 | 0.0 | 10.605363 | 4 |
ACGCGGC | 1655 | 0.0 | 10.603953 | 5 |
CGCGTCC | 7690 | 0.0 | 10.532491 | 43 |
CTACGCG | 260 | 5.9079484E-6 | 10.385195 | 8 |
GGGGGCC | 19980 | 0.0 | 10.36418 | 1 |
CGTATAG | 655 | 0.0 | 10.320361 | 1 |
CGGGAAT | 2500 | 0.0 | 10.274951 | 1 |
TCGGGGG | 8935 | 0.0 | 10.149249 | 2 |
ACGGGCT | 1670 | 0.0 | 9.836143 | 3 |
CGATTGG | 6325 | 0.0 | 9.8189945 | 3 |