Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00004912731 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 1049004 |
Sequences flagged as poor quality | 0 |
Sequence length | 125 |
%GC | 49 |
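
The "Sanger / Illumina 1.9" encoding reported above means quality characters use the Phred+33 convention: a base's quality score is the character's ASCII code minus 33. A minimal sketch of that decoding follows; the function name and the example quality string are illustrative and are not taken from this file.

```python
# Phred+33 offset used by the Sanger / Illumina 1.8+ quality encoding.
PHRED_OFFSET = 33

def decode_qualities(quality_string, offset=PHRED_OFFSET):
    """Convert a FASTQ quality string into a list of integer Phred scores."""
    return [ord(ch) - offset for ch in quality_string]

# Illustrative quality string (hypothetical, not read from EGAF00004912731).
print(decode_qualities("II5!"))  # [40, 40, 20, 0]
```
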
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA | 4912 | 0.46825369588676496 | No Hit |
TCCTAGGTCAGCCCAAGGCTGCCCCCTCGGTCACTCTGTTCCCGCCCTCC | 1269 | 0.12097189333882426 | No Hit |
GTGTGGTGGTCTCCACTCCCGCCTTGACGGGGCTGCTATCTGCCTTCCAG | 1081 | 0.10305013136270214 | No Hit |
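
The Percentage column is consistent with each sequence's count expressed as a share of Total Sequences (1049004 in Basic Statistics). A minimal sketch that reproduces the column under that assumption; the function name and the short row labels are hypothetical.

```python
# "Total Sequences" value from the Basic Statistics table.
TOTAL_SEQUENCES = 1049004

def percent_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of the total number of reads."""
    return 100.0 * count / total

# Counts copied from the Overrepresented sequences table
# (keys are shortened labels, not the full 50-base sequences).
counts = {"poly-A": 4912, "TCCTAGG...": 1269, "GTGTGGT...": 1081}

for label, count in counts.items():
    print(f"{label}\t{count}\t{percent_of_total(count):.4f}%")
```

All three rows stay well under 1% of reads, and none matched a known contaminant ("No Hit").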
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
TCTGTCG | 110 | 1.7644197E-10 | 48.69962 | 8 |
ATCAACG | 2875 | 0.0 | 47.453396 | 3 |
CAACGCA | 3040 | 0.0 | 44.681824 | 5 |
TCAACGC | 3035 | 0.0 | 44.55914 | 4 |
AACGCAG | 3165 | 0.0 | 43.47146 | 6 |
GTATCAA | 3320 | 0.0 | 42.34904 | 1 |
CGCAGAG | 3310 | 0.0 | 40.82006 | 8 |
GCAGAGT | 3645 | 0.0 | 37.080803 | 9 |
ACGCAGA | 3710 | 0.0 | 36.27585 | 7 |
TATCAAC | 4055 | 0.0 | 34.67295 | 2 |
GTCTCGC | 195 | 3.8198777E-11 | 33.606842 | 1 |
GGTATCA | 2485 | 0.0 | 33.08433 | 1 |
TATACCG | 75 | 0.003886043 | 31.773745 | 5 |
AGACCTT | 475 | 0.0 | 30.094255 | 6 |
ATACCGA | 100 | 4.4115778E-4 | 29.780773 | 6 |
ACAACGG | 270 | 1.8189894E-12 | 28.68463 | 3 |
GACTCTC | 640 | 0.0 | 27.91414 | 7 |
GTACTGG | 280 | 3.6379788E-12 | 27.660177 | 1 |
AGAGTAC | 3640 | 0.0 | 27.39897 | 10-11 |
CAGAGTA | 3900 | 0.0 | 26.946411 | 10-11 |