Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00004962340 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 21119008 |
Sequences flagged as poor quality | 0 |
Sequence length | 101 |
%GC | 53 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACCGTACGTAATCTCGTAT | 27664 | 0.13099100109247555 | TruSeq Adapter, Index 22 (97% over 40bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GATCGGA | 5165 | 0.0 | 50.84453 | 1 |
ATCGGAA | 5240 | 0.0 | 50.12793 | 2 |
CGTACGT | 3650 | 0.0 | 35.91777 | 34-35 |
CCGTACG | 3875 | 0.0 | 34.138508 | 32-33 |
TCGTATG | 3905 | 0.0 | 33.32737 | 44-45 |
GTACGTA | 4075 | 0.0 | 32.696285 | 34-35 |
CTCGTAT | 4005 | 0.0 | 32.613823 | 44-45 |
TCGGAAG | 8620 | 0.0 | 30.968128 | 3 |
ACCGTAC | 4380 | 0.0 | 30.094002 | 32-33 |
TATGCCG | 4320 | 0.0 | 30.015871 | 48-49 |
ACGTAAT | 4445 | 0.0 | 29.653545 | 36-37 |
TCTCGTA | 4460 | 0.0 | 29.180235 | 42-43 |
CGTATGC | 4445 | 0.0 | 29.118387 | 46-47 |
TACGTAA | 4495 | 0.0 | 29.112352 | 36-37 |
CGGAAGA | 9960 | 0.0 | 27.040184 | 4 |
CGTAATC | 5220 | 0.0 | 25.204956 | 38-39 |
CACCGTA | 5595 | 0.0 | 24.322857 | 30-31 |
ATGCCGT | 5570 | 0.0 | 23.791458 | 48-49 |
ATCTCGT | 5650 | 0.0 | 23.16041 | 42-43 |
ACACGTC | 6415 | 0.0 | 20.731455 | 12-13 |