Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005406528 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 5933648 |
Sequences flagged as poor quality | 0 |
Sequence length | 89-101 |
%GC | 55 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
CACACACACACACACACACACACACACACACACACACACACACACACACA | 56858 | 0.9582300803822539 | No Hit |
GTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGT | 45637 | 0.7691221319498561 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CGCGTGT | 2065 | 0.0 | 27.960606 | 1 |
TAGGCCC | 755 | 0.0 | 23.81909 | 4 |
TAGCCCC | 1460 | 0.0 | 23.338242 | 5 |
TGTGCGT | 4000 | 0.0 | 23.072771 | 8 |
TAGATAG | 985 | 0.0 | 23.061817 | 5 |
AGGACGC | 5355 | 0.0 | 21.742428 | 9 |
GCCGTCT | 30570 | 0.0 | 20.642498 | 94-95 |
GTGTGCG | 4570 | 0.0 | 20.400393 | 7 |
GCGTAGC | 1950 | 0.0 | 20.386913 | 2 |
GCTTGAA | 9125 | 0.0 | 20.090343 | 94-95 |
CGTAGCC | 2020 | 0.0 | 19.913889 | 3 |
CGTGTGG | 4730 | 0.0 | 19.711147 | 1 |
TACACTA | 890 | 0.0 | 19.674343 | 5 |
CTTGAAA | 6670 | 0.0 | 19.033875 | 94-95 |
TGGTGTG | 5955 | 0.0 | 18.35772 | 5 |
CGGGACG | 1995 | 0.0 | 18.265701 | 5 |
GTATTAG | 940 | 0.0 | 18.125145 | 1 |
GTGCGTC | 5585 | 0.0 | 17.880997 | 9 |
GGTGTGC | 5625 | 0.0 | 17.415516 | 6 |
GTGTGGT | 5445 | 0.0 | 17.209732 | 2 |