Seqanswers Leaderboard Ad

**danielsbrewer** · 09-28-2011, 07:46 AM

http://seqanswers.com/forums/showthread.php?t=4994

Discussion of next-gen sequencing related bioinformatics: resources, algorithms, open source efforts, etc

**rskr** · 09-28-2011, 02:25 PM

I define the term read-group as a set of reads that are the result of the exact same wet lab protocol, from isolation of samples to attached barcodes and indexing. Informally I use the term read-group when referring to a GaIIx lane or HiSeq index.

Do they use that same convention?

**weeseda** · 04-02-2014, 09:39 AM

I've checked out the samtools man and other documents, but I'm still confussed on the read group line.

I am looking at 3 populations each of 8 individuals and want to do some SNP calling and downstream analysis. All 24 individuls were barcoded and run in one illumina lane.

To keep track of individuals and populations how would I set up the readgroup line? RG= population name and SM= individual (as below)? or is it even possible to keep track of both?

Thanks in advance for any insight!

rg.txt file for use with following command: samtools merge -rh rg.txt merged.bam *.bam

@RG ID:Pop1 SM:27861 PL:Illumina
@RG ID:Pop1 SM:27862 PL:Illumina
@RG ID:Pop2 SM:27863 PL:Illumina
@RG ID:Pop2 SM:27864 PL:Illumina
@RG ID:Pop3 SM:27865 PL:Illumina
@RG ID:Pop3 SM:27866 PL:Illumina
.
.
.

**kmcarr** · 04-03-2014, 05:46 AM

Originally posted by weeseda View Post

I've checked out the samtools man and other documents, but I'm still confussed on the read group line.

I am looking at 3 populations each of 8 individuals and want to do some SNP calling and downstream analysis. All 24 individuls were barcoded and run in one illumina lane.

To keep track of individuals and populations how would I set up the readgroup line? RG= population name and SM= individual (as below)? or is it even possible to keep track of both?

Thanks in advance for any insight!

rg.txt file for use with following command: samtools merge -rh rg.txt merged.bam *.bam

Code:

@RG	ID:Pop1	SM:27861	PL:Illumina
@RG	ID:Pop1	SM:27862	PL:Illumina
@RG	ID:Pop2	SM:27863	PL:Illumina
@RG	ID:Pop2	SM:27864	PL:Illumina
@RG	ID:Pop3	SM:27865	PL:Illumina
@RG	ID:Pop3	SM:27866	PL:Illumina

.
.
.

Each read group you define must have a unique ID. Your example does not, it uses each ID (Pop1, Pop2, Pop3) twice. Try something like this:

Code:

@RG	ID:1	SM:Pop1-27861	PL:Illumina
@RG	ID:2	SM:Pop1-27862	PL:Illumina
@RG	ID:3	SM:Pop2-27863	PL:Illumina
@RG	ID:4	SM:Pop2-27864	PL:Illumina
@RG	ID:5	SM:Pop3-27865	PL:Illumina
@RG	ID:6	SM:Pop3-27866	PL:Illumina

**fhtyert** · 06-01-2022, 08:04 PM

Great work!

basket random

Topics	Statistics	Last Post
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, Yesterday, 08:47 AM	0 responses 12 views 0 likes	Last Post by seqadmin Yesterday, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 54 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM

Seqanswers Leaderboard Ad

Announcement

what is a read group?

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News