@HD VN:1.4 SO:coordinate @RG ID:1#30 PL:ILLUMINA PU:140417_HS31_12675_A_H8M0LADXX_1#30 LB:9920458 DS:EGAS00001000764: We are developing a protocol to differentiate mouse and human induced pluripotent stem (IPS) and embryonic stem (ES) cells towards the haematopoietic pathway to generate erythrocytes in vitro. This system has many applications such as the study of the role of specific genes and human polymorphisms in infectious diseases such as malaria, as well as haematological diseases such as myelodysplastic syndrome. The nature of the in vitro differentiation process means that a heterogeneous population of cells is generated. In order to understand the types of cells produced with our protocol, we have performed a single cell analysis, which has the power to reveal the different populations of cells and their characteristics. For this, a cDNA library has been made that needs to be sequenced to obtain the gene expression profiles of the different cells. With this information we will be able to assess the quality of the differentiation protocol and improve it in order to produce better cells for the downstream applications. This data is part of a pre-publication release. For information on the proper use of pre-publication data shared by the Wellcome Trust Sanger Institute (including details of any publication moratoria), please see http://www.sanger.ac.uk/datasharing/ DT:2014-04-17T00:00:00+0100 SM:APRBC-30 PG:BamIndexDecoder CN:SC @PG ID:SCS PN:HiSeq Control Software DS:Controlling software on instrument VN:2.0.12.0 @PG ID:basecalling PN:RTA PP:SCS DS:Basecalling Package VN:1.17.21.3 @PG ID:Illumina2bam PN:Illumina2bam PP:basecalling DS:Convert Illumina BCL to BAM or SAM file VN:V1.13 CL:uk.ac.sanger.npg.illumina.Illumina2bam INTENSITY_DIR=/nfs/sf32/ILorHSany_sf32/analysis/140417_HS31_12675_A_H8M0LADXX/Data/Intensities BASECALLS_DIR=/nfs/sf32/ILorHSany_sf32/analysis/140417_HS31_12675_A_H8M0LADXX/Data/Intensities/BaseCalls LANE=1 OUTPUT=/dev/stdout SAMPLE_ALIAS=APRBC-1,APRBC-2,APRBC-3,APRBC-4,APRBC-5,APRBC-6,APRBC-7,APRBC-8,APRBC-9,APRBC-10,APRBC-11,APRBC-12,APRBC-13,APRBC-14,APRBC-15,APRBC-16,APRBC-17,APRBC-18,APRBC-19,APRBC-20,APRBC-21,APRBC-22,APRBC-23,APRBC-24,APRBC-25,APRBC-26,APRBC-27,APRBC-28,APRBC-29,APRBC-30,APRBC-31,APRBC-32,APRBC-33,APRBC-34,APRBC-35,APRBC-36,APRBC-37,APRBC-38,APRBC-39,APRBC-40,APRBC-41,APRBC-42,APRBC-43,APRBC-44,APRBC-45,APRBC-46,APRBC-47,APRBC-48,APRBC-49,APRBC-50,APRBC-51,APRBC-52,APRBC-53,APRBC-54,APRBC-55,APRBC-56,APRBC-57,APRBC-58,APRBC-59,APRBC-60,APRBC-61,APRBC-62,APRBC-63,APRBC-64,APRBC-65,APRBC-66,APRBC-67,APRBC-68,APRBC-69,APRBC-70,APRBC-71,APRBC-72,APRBC-73,APRBC-74,APRBC-75,APRBC-76,APRBC-77,APRBC-78,APRBC-79,APRBC-80,APRBC-81,APRBC-82,APRBC-83,APRBC-84,APRBC-85,APRBC-86,APRBC-87,APRBC-88,APRBC-89,APRBC-90,APRBC-91,APRBC-92,APRBC-93,APRBC-94,APRBC-95,APRBC-96,phiX_for_spiked_buffers STUDY_NAME=EGAS00001000764: We are developing a protocol to differentiate mouse and human induced pluripotent stem (IPS) and embryonic stem (ES) cells towards the haematopoietic pathway to generate erythrocytes in vitro. This system has many applications such as the study of the role of specific genes and human polymorphisms in infectious diseases such as malaria, as well as haematological diseases such as myelodysplastic syndrome. The nature of the in vitro differentiation process means that a heterogeneous population of cells is generated. In order to understand the types of cells produced with our protocol, we have performed a single cell analysis, which has the power to reveal the different populations of cells and their characteristics. For this, a cDNA library has been made that needs to be sequenced to obtain the gene expression profiles of the different cells. With this information we will be able to assess the quality of the differentiation protocol and improve it in order to produce better cells for the downstream applications. This data is part of a pre-publication release. For information on the proper use of pre-publication data shared by the Wellcome Trust Sanger Institute (including details of any publication moratoria), please see http://www.sanger.ac.uk/datasharing/,Illumina Controls: SPIKED_CONTROL COMPRESSION_LEVEL=0 GENERATE_SECONDARY_BASE_CALLS=false PF_FILTER=true READ_GROUP_ID=1 LIBRARY_NAME=unknown SEQUENCING_CENTER=SC PLATFORM=ILLUMINA BARCODE_SEQUENCE_TAG_NAME=BC BARCODE_QUALITY_TAG_NAME=QT VERBOSITY=INFO QUIET=false VALIDATION_STRINGENCY=STRICT MAX_RECORDS_IN_RAM=500000 CREATE_INDEX=false CREATE_MD5_FILE=false @PG ID:bamadapterfind PN:bamadapterfind PP:Illumina2bam VN:0.0.129 CL:bamadapterfind level=0 @PG ID:BamIndexDecoder PN:BamIndexDecoder PP:bamadapterfind DS:A command-line tool to decode multiplexed bam file VN:V1.13 CL:uk.ac.sanger.npg.picard.BamIndexDecoder INPUT=/dev/stdin OUTPUT=/nfs/sf32/ILorHSany_sf32/analysis/140417_HS31_12675_A_H8M0LADXX/Data/Intensities/BAM_basecalls_20140418-183310/12675_1.bam BARCODE_FILE=/nfs/sf32/ILorHSany_sf32/analysis/140417_HS31_12675_A_H8M0LADXX/lane_1.taglist METRICS_FILE=/nfs/sf32/ILorHSany_sf32/analysis/140417_HS31_12675_A_H8M0LADXX/Data/Intensities/BAM_basecalls_20140418-183310/12675_1.bam.tag_decode.metrics VALIDATION_STRINGENCY=SILENT CREATE_MD5_FILE=true BARCODE_TAG_NAME=BC BARCODE_QUALITY_TAG_NAME=QT MAX_MISMATCHES=1 MIN_MISMATCH_DELTA=1 MAX_NO_CALLS=2 CONVERT_LOW_QUALITY_TO_NO_CALL=false MAX_LOW_QUALITY_TO_CONVERT=15 VERBOSITY=INFO QUIET=false COMPRESSION_LEVEL=5 MAX_RECORDS_IN_RAM=500000 CREATE_INDEX=false @PG ID:spf PN:spatial_filter PP:BamIndexDecoder DS:A program to apply a spatial filter VN:v10.14 CL:/software/solexa/pkg/pb_calibration/v10.14/bin/spatial_filter -c -F pb_align_12675_1.bam.filter -t /nfs/sf32/ILorHSany_sf32/analysis/140417_HS31_12675_A_H8M0LADXX/Data/Intensities/BAM_basecalls_20140418-183310/no_cal/archive/qc/tileviz/12675_1 --region_size 200 --region_mismatch_threshold 0.0160 --region_insertion_threshold 0.0160 --region_deletion_threshold 0.0160 pb_align_12675_1.bam ; /software/solexa/pkg/pb_calibration/v10.14/bin/spatial_filter -a -u -F pb_align_12675_1.bam.filter - @PG ID:bwa PN:bwa PP:spf VN:0.5.10-tpx @PG ID:BamMerger PN:BamMerger PP:bwa DS:A command-line tool to merge BAM/SAM alignment info in the first input file with the data in an unmapped BAM file, producing a third BAM file that has alignment data and all the additional data from the unmapped BAM VN:V1.13 CL:uk.ac.sanger.npg.picard.BamMerger ALIGNED_BAM=pb_align_12675_1.bam INPUT=/dev/stdin OUTPUT=12675_1.bam KEEP_EXTRA_UNMAPPED_READS=true REPLACE_ALIGNED_BASE_QUALITY=true VALIDATION_STRINGENCY=SILENT CREATE_MD5_FILE=true ALIGNMENT_PROGRAM_ID=bwa KEEP_ALL_PG=false VERBOSITY=INFO QUIET=false COMPRESSION_LEVEL=5 MAX_RECORDS_IN_RAM=500000 CREATE_INDEX=false @PG ID:SplitBamByReadGroup PN:SplitBamByReadGroup PP:BamMerger DS:Split a BAM file into multiple BAM files based on ReadGroup. Headers are a copy of the original file, removing @RGs where IDs match with the other ReadGroup IDs VN:V1.13 CL:uk.ac.sanger.npg.picard.SplitBamByReadGroup INPUT=/nfs/sf32/ILorHSany_sf32/analysis/140417_HS31_12675_A_H8M0LADXX/Data/Intensities/BAM_basecalls_20140418-183310/no_cal/12675_1.bam OUTPUT_PREFIX=/nfs/sf32/ILorHSany_sf32/analysis/140417_HS31_12675_A_H8M0LADXX/Data/Intensities/BAM_basecalls_20140418-183310/no_cal/lane1/12675_1 OUTPUT_COMMON_RG_HEAD_TO_TRIM=1 VALIDATION_STRINGENCY=SILENT CREATE_MD5_FILE=true VERBOSITY=INFO QUIET=false COMPRESSION_LEVEL=5 MAX_RECORDS_IN_RAM=500000 CREATE_INDEX=false @PG ID:bwa_aln PN:bwa PP:SplitBamByReadGroup VN:0.5.10-tpx CL:/software/solexa/pkg/bwa/bwa-0.5.10-mt_fixes/bwa aln -q 15 -t 10 /lustre/scratch110/srpipe/references/Homo_sapiens/NCBI36/all/bwa/Homo_sapiens.NCBI36.48.dna.all.fa -b1 /nfs/sf32/ILorHSany_sf32/analysis/140417_HS31_12675_A_H8M0LADXX/Data/Intensities/BAM_basecalls_20140418-183310/no_cal/lane1/12675_1#30.bam > /tmp/pBSeVbERzB/1.sai @PG ID:bwa_aln_1 PN:bwa PP:bwa_aln VN:0.5.10-tpx CL:/software/solexa/pkg/bwa/bwa-0.5.10-mt_fixes/bwa aln -q 15 -t 10 /lustre/scratch110/srpipe/references/Homo_sapiens/NCBI36/all/bwa/Homo_sapiens.NCBI36.48.dna.all.fa -b2 /nfs/sf32/ILorHSany_sf32/analysis/140417_HS31_12675_A_H8M0LADXX/Data/Intensities/BAM_basecalls_20140418-183310/no_cal/lane1/12675_1#30.bam > /tmp/pBSeVbERzB/2.sai @PG ID:bwa_sam PN:bwa PP:bwa_aln_1 VN:0.5.10-tpx CL:/software/solexa/pkg/bwa/bwa-0.5.10-mt_fixes/bwa sampe -t 6 /lustre/scratch110/srpipe/references/Homo_sapiens/NCBI36/all/bwa/Homo_sapiens.NCBI36.48.dna.all.fa /tmp/pBSeVbERzB/1.sai /tmp/pBSeVbERzB/2.sai /nfs/sf32/ILorHSany_sf32/analysis/140417_HS31_12675_A_H8M0LADXX/Data/Intensities/BAM_basecalls_20140418-183310/no_cal/lane1/12675_1#30.bam /nfs/sf32/ILorHSany_sf32/analysis/140417_HS31_12675_A_H8M0LADXX/Data/Intensities/BAM_basecalls_20140418-183310/no_cal/lane1/12675_1#30.bam @PG ID:Picard_SamFormatConverter PN:SamFormatConverter PP:bwa_sam VN:1.72(1230) CL:/software/jdk1.7.0_25/bin/java -Xmx1000m -jar /software/solexa/pkg/picard/picard-tools-1.72/SamFormatConverter.jar VALIDATION_STRINGENCY=SILENT INPUT=/dev/stdin OUTPUT=/dev/stdout COMPRESSION_LEVEL=0 @PG ID:samtools_fixmate PN:samtools PP:Picard_SamFormatConverter VN:0.1.18 (r982:295) CL:/software/solexa/pkg/samtools/samtools-0.1.18/samtools fixmate - - @PG ID:BamMerger_1 PN:BamMerger PP:samtools_fixmate VN:V1.13 DS:A command-line tool to merge BAM/SAM alignment info in the first input file with the data in an unmapped BAM file, producing a third BAM file that has alignment data and all the additional data from the unmapped BAM CL:uk.ac.sanger.npg.picard.BamMerger ALIGNED_BAM=/dev/stdin ALIGNMENT_PROGRAM_ID=NULL KEEP_ALL_PG=true INPUT=/nfs/sf32/ILorHSany_sf32/analysis/140417_HS31_12675_A_H8M0LADXX/Data/Intensities/BAM_basecalls_20140418-183310/no_cal/lane1/12675_1#30.bam OUTPUT=/tmp/pBSeVbERzB/output_fifo.bam KEEP_EXTRA_UNMAPPED_READS=true VALIDATION_STRINGENCY=SILENT CREATE_MD5_FILE=true REPLACE_ALIGNED_BASE_QUALITY=false VERBOSITY=INFO QUIET=false COMPRESSION_LEVEL=5 MAX_RECORDS_IN_RAM=500000 CREATE_INDEX=false @PG ID:AlignmentFilter PN:AlignmentFilter PP:BamMerger_1 DS:Give a list of SAM/BAM files with the same set of records and in the same order but aligned with different references, split reads into different files according to alignments. You have option to put unaligned reads into one of output files or a separate file VN:V1.13 CL:uk.ac.sanger.npg.picard.AlignmentFilter INPUT_ALIGNMENT=[/nfs/sf32/ILorHSany_sf32/analysis/140417_HS31_12675_A_H8M0LADXX/Data/Intensities/BAM_basecalls_20140418-183310/no_cal/lane1/12675_1#30.bam, /tmp/pBSeVbERzB/output_fifo.bam] OUTPUT_ALIGNMENT=[/nfs/sf32/ILorHSany_sf32/analysis/140417_HS31_12675_A_H8M0LADXX/Data/Intensities/BAM_basecalls_20140418-183310/no_cal/archive/lane1/12675_1#30_phix.bam, /nfs/sf32/ILorHSany_sf32/analysis/140417_HS31_12675_A_H8M0LADXX/Data/Intensities/BAM_basecalls_20140418-183310/no_cal/archive/lane1/12675_1#30.bam] METRICS_FILE=/nfs/sf32/ILorHSany_sf32/analysis/140417_HS31_12675_A_H8M0LADXX/Data/Intensities/BAM_basecalls_20140418-183310/no_cal/archive/lane1/12675_1#30.bam_alignment_filter_metrics.json VALIDATION_STRINGENCY=SILENT CREATE_MD5_FILE=true VERBOSITY=INFO QUIET=false COMPRESSION_LEVEL=5 MAX_RECORDS_IN_RAM=500000 CREATE_INDEX=false @PG ID:bamsort PN:bamsort PP:AlignmentFilter VN:0.0.129 CL:/software/hpag/biobambam/0.0.129/bin/bamsort tmpfile=/nfs/sf32/ILorHSany_sf32/analysis/140417_HS31_12675_A_H8M0LADXX/Data/Intensities/BAM_basecalls_20140418-183310/no_cal/archive/lane1/L69794B1S8/ @PG ID:bammarkduplicates PN:bammarkduplicates PP:bamsort VN:0.0.129 CL:/software/hpag/biobambam/0.0.129/bin/bammarkduplicates I=/nfs/sf32/ILorHSany_sf32/analysis/140417_HS31_12675_A_H8M0LADXX/Data/Intensities/BAM_basecalls_20140418-183310/no_cal/archive/lane1/L69794B1S8/sorted.bam O=/dev/stdout tmpfile=/nfs/sf32/ILorHSany_sf32/analysis/140417_HS31_12675_A_H8M0LADXX/Data/Intensities/BAM_basecalls_20140418-183310/no_cal/archive/lane1/L69794B1S8/ M=/tmp/sHEI8FjtCA level=0 @PG ID:BamTagStripper PN:BamTagStripper PP:bammarkduplicates DS:Strip a list of tags in bam/sam record. By default, any tags containing lowercase letters will be stripped and other tags will be kept. A list of tags can be given to keep or strip VN:V1.13 CL:uk.ac.sanger.npg.picard.BamTagStripper INPUT=/dev/stdin OUTPUT=/dev/stdout TAG_TO_KEEP=[a3, aa, af, ah, as, br, qr, tq, tr] TAG_TO_STRIP=[OQ] TMP_DIR=[/nfs/sf32/ILorHSany_sf32/analysis/140417_HS31_12675_A_H8M0LADXX/Data/Intensities/BAM_basecalls_20140418-183310/no_cal/archive/lane1/L69794B1S8] VERBOSITY=INFO VALIDATION_STRINGENCY=SILENT CREATE_INDEX=false CREATE_MD5_FILE=false QUIET=false COMPRESSION_LEVEL=5 MAX_RECORDS_IN_RAM=500000 @PG ID:scramble PN:scramble PP:BamTagStripper VN:1.13.5 CL:/software/badger/bin/scramble -I bam -O cram -r /lustre/scratch110/srpipe/references/Homo_sapiens/NCBI36/all/fasta/Homo_sapiens.NCBI36.48.dna.all.fa @SQ SN:1 LN:247249719 UR:/lustre/scratch110/srpipe/references/Homo_sapiens/NCBI36/all/fasta/Homo_sapiens.NCBI36.48.dna.all.fa AS:NCBI36 M5:9ebc6df9496613f373e73396d5b3b6b6 SP:Homo sapiens @SQ SN:2 LN:242951149 UR:/lustre/scratch110/srpipe/references/Homo_sapiens/NCBI36/all/fasta/Homo_sapiens.NCBI36.48.dna.all.fa AS:NCBI36 M5:b12c7373e3882120332983be99aeb18d SP:Homo sapiens @SQ SN:3 LN:199501827 UR:/lustre/scratch110/srpipe/references/Homo_sapiens/NCBI36/all/fasta/Homo_sapiens.NCBI36.48.dna.all.fa AS:NCBI36 M5:0e48ed7f305877f66e6fd4addbae2b9a SP:Homo sapiens @SQ SN:4 LN:191273063 UR:/lustre/scratch110/srpipe/references/Homo_sapiens/NCBI36/all/fasta/Homo_sapiens.NCBI36.48.dna.all.fa AS:NCBI36 M5:cf37020337904229dca8401907b626c2 SP:Homo sapiens @SQ SN:5 LN:180857866 UR:/lustre/scratch110/srpipe/references/Homo_sapiens/NCBI36/all/fasta/Homo_sapiens.NCBI36.48.dna.all.fa AS:NCBI36 M5:031c851664e31b2c17337fd6f9004858 SP:Homo sapiens @SQ SN:6 LN:170899992 UR:/lustre/scratch110/srpipe/references/Homo_sapiens/NCBI36/all/fasta/Homo_sapiens.NCBI36.48.dna.all.fa AS:NCBI36 M5:bfe8005c536131276d448ead33f1b583 SP:Homo sapiens @SQ SN:7 LN:158821424 UR:/lustre/scratch110/srpipe/references/Homo_sapiens/NCBI36/all/fasta/Homo_sapiens.NCBI36.48.dna.all.fa AS:NCBI36 M5:74239c5ceee3b28f0038123d958114cb SP:Homo sapiens @SQ SN:8 LN:146274826 UR:/lustre/scratch110/srpipe/references/Homo_sapiens/NCBI36/all/fasta/Homo_sapiens.NCBI36.48.dna.all.fa AS:NCBI36 M5:1eb00fe1ce26ce6701d2cd75c35b5ccb SP:Homo sapiens @SQ SN:9 LN:140273252 UR:/lustre/scratch110/srpipe/references/Homo_sapiens/NCBI36/all/fasta/Homo_sapiens.NCBI36.48.dna.all.fa AS:NCBI36 M5:ea244473e525dde0393d353ef94f974b SP:Homo sapiens @SQ SN:10 LN:135374737 UR:/lustre/scratch110/srpipe/references/Homo_sapiens/NCBI36/all/fasta/Homo_sapiens.NCBI36.48.dna.all.fa AS:NCBI36 M5:4ca41bf2d7d33578d2cd7ee9411e1533 SP:Homo sapiens @SQ SN:11 LN:134452384 UR:/lustre/scratch110/srpipe/references/Homo_sapiens/NCBI36/all/fasta/Homo_sapiens.NCBI36.48.dna.all.fa AS:NCBI36 M5:425ba5eb6c95b60bafbf2874493a56c3 SP:Homo sapiens @SQ SN:12 LN:132349534 UR:/lustre/scratch110/srpipe/references/Homo_sapiens/NCBI36/all/fasta/Homo_sapiens.NCBI36.48.dna.all.fa AS:NCBI36 M5:d17d70060c56b4578fa570117bf19716 SP:Homo sapiens @SQ SN:13 LN:114142980 UR:/lustre/scratch110/srpipe/references/Homo_sapiens/NCBI36/all/fasta/Homo_sapiens.NCBI36.48.dna.all.fa AS:NCBI36 M5:c4f3084a20380a373bbbdb9ae30da587 SP:Homo sapiens @SQ SN:14 LN:106368585 UR:/lustre/scratch110/srpipe/references/Homo_sapiens/NCBI36/all/fasta/Homo_sapiens.NCBI36.48.dna.all.fa AS:NCBI36 M5:c1ff5d44683831e9c7c1db23f93fbb45 SP:Homo sapiens @SQ SN:15 LN:100338915 UR:/lustre/scratch110/srpipe/references/Homo_sapiens/NCBI36/all/fasta/Homo_sapiens.NCBI36.48.dna.all.fa AS:NCBI36 M5:5cd9622c459fe0a276b27f6ac06116d8 SP:Homo sapiens @SQ SN:16 LN:88827254 UR:/lustre/scratch110/srpipe/references/Homo_sapiens/NCBI36/all/fasta/Homo_sapiens.NCBI36.48.dna.all.fa AS:NCBI36 M5:3e81884229e8dc6b7f258169ec8da246 SP:Homo sapiens @SQ SN:17 LN:78774742 UR:/lustre/scratch110/srpipe/references/Homo_sapiens/NCBI36/all/fasta/Homo_sapiens.NCBI36.48.dna.all.fa AS:NCBI36 M5:2a5c95ed99c5298bb107f313c7044588 SP:Homo sapiens @SQ SN:18 LN:76117153 UR:/lustre/scratch110/srpipe/references/Homo_sapiens/NCBI36/all/fasta/Homo_sapiens.NCBI36.48.dna.all.fa AS:NCBI36 M5:3d11df432bcdc1407835d5ef2ce62634 SP:Homo sapiens @SQ SN:19 LN:63811651 UR:/lustre/scratch110/srpipe/references/Homo_sapiens/NCBI36/all/fasta/Homo_sapiens.NCBI36.48.dna.all.fa AS:NCBI36 M5:2f1a59077cfad51df907ac25723bff28 SP:Homo sapiens @SQ SN:20 LN:62435964 UR:/lustre/scratch110/srpipe/references/Homo_sapiens/NCBI36/all/fasta/Homo_sapiens.NCBI36.48.dna.all.fa AS:NCBI36 M5:f126cdf8a6e0c7f379d618ff66beb2da SP:Homo sapiens @SQ SN:21 LN:46944323 UR:/lustre/scratch110/srpipe/references/Homo_sapiens/NCBI36/all/fasta/Homo_sapiens.NCBI36.48.dna.all.fa AS:NCBI36 M5:f1b74b7f9f4cdbaeb6832ee86cb426c6 SP:Homo sapiens @SQ SN:22 LN:49691432 UR:/lustre/scratch110/srpipe/references/Homo_sapiens/NCBI36/all/fasta/Homo_sapiens.NCBI36.48.dna.all.fa AS:NCBI36 M5:2041e6a0c914b48dd537922cca63acb8 SP:Homo sapiens @SQ SN:X LN:154913754 UR:/lustre/scratch110/srpipe/references/Homo_sapiens/NCBI36/all/fasta/Homo_sapiens.NCBI36.48.dna.all.fa AS:NCBI36 M5:d7e626c80ad172a4d7c95aadb94d9040 SP:Homo sapiens @SQ SN:Y LN:57772954 UR:/lustre/scratch110/srpipe/references/Homo_sapiens/NCBI36/all/fasta/Homo_sapiens.NCBI36.48.dna.all.fa AS:NCBI36 M5:0b9f6a275ae64a04d4de48324adfbad0 SP:Homo sapiens @SQ SN:MT LN:16571 UR:/lustre/scratch110/srpipe/references/Homo_sapiens/NCBI36/all/fasta/Homo_sapiens.NCBI36.48.dna.all.fa AS:NCBI36 M5:d2ed829b8a1628d16cbeee88e88e39eb SP:Homo sapiens