Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00004693161 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 55007417 |
Sequences flagged as poor quality | 0 |
Sequence length | 75 |
%GC | 50 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACACAGTGGTATCTCGTATGCCGTCTTCTGCTTGAAAAAAAAAA | 114979 | 0.2090245393634826 | TruSeq Adapter, Index 5 (97% over 41bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
TATGCCG | 19315 | 0.0 | 49.422077 | 48 |
TCGTATG | 19600 | 0.0 | 48.77384 | 45 |
CGTATGC | 20895 | 0.0 | 45.96565 | 46 |
CTCGTAT | 21270 | 0.0 | 44.81533 | 44 |
GTATGCC | 23185 | 0.0 | 41.30655 | 47 |
TATCTCG | 21010 | 0.0 | 40.410927 | 41 |
ATGCCGT | 27605 | 0.0 | 34.36778 | 49 |
TCTCGTA | 25635 | 0.0 | 33.456543 | 43 |
ATCTCGT | 25890 | 0.0 | 33.20697 | 42 |
CGTCTGA | 31620 | 0.0 | 30.636742 | 16 |
GTATCTC | 27850 | 0.0 | 30.634594 | 40 |
CCGTCTT | 31235 | 0.0 | 30.163847 | 52 |
TGGTATC | 29825 | 0.0 | 28.733305 | 38 |
TGCCGTC | 33350 | 0.0 | 28.664688 | 50 |
GTGGTAT | 30330 | 0.0 | 28.208979 | 37 |
GCCGTCT | 35260 | 0.0 | 27.102165 | 51 |
ACGTCTG | 35890 | 0.0 | 26.97252 | 15 |
GGTATCT | 32315 | 0.0 | 26.455233 | 39 |
CACGTCT | 36885 | 0.0 | 26.33845 | 14 |
CGTCTTC | 36040 | 0.0 | 26.285866 | 53 |