Seqanswers Leaderboard Ad

**Torst** · 07-13-2008, 08:51 PM

It appears there is one or more problems with the sequencing quality. I believe with Solexa that "A"s will be called when the flow cell is over illuminated or too oily and so on (User ScottC on this forum can probably explain this better!). That is why you have a large proportion of poly-A reads. You probably expect these in some proportion, but too many of them suggest a quality control issue.

The repetitive dimers look very suspicious too. I don't have any explanations for those, though.

**bioinfosm** · 07-14-2008, 07:46 AM

We saw similar behavior from a recent ChipSEQ. Less than 10% aligned to the actual reference, while 1% or so aligned to human dna, 1% or so formed contigs that align to some bacteria, etc., 1% aligned to adapter/primer sequence, but no clue of the leftovers!

Using 25bp reads for ChipSEQ data sounded more reasonable in our case, but still a huge amount of reads are not accounted for yet..

elisa*_* · 08-07-2008, 12:49 PM

poly A artefact near the edge of a tile

I came across this info in the release note of Maq 0.6.4.: "It is important to note that Illumina/Solexa sequencing may produce many false polyA at the edges of a tile. These polyA artefacts may greatly increase the running time of maq. Users are advised to remove these artefacts with their own scripts before alignment. For the moment maq does not provide a general functionality for filtering polyA."

Does anyone know about the source of this artefact?

**jlli** · 08-07-2008, 05:46 PM

Does SOLiD sequencing generate these artificial repetitive reads near the edge of a slide? We observed similar behavior from our SOLiD ChIPSeq project. We have no clue where went wrong.

elisa*_* · 08-07-2008, 05:58 PM

Hi jlli, I don't know the reason for Solid, but I did find an explanation for Solexa given by SillyPoint in the following thread. Maybe you have similar problems.

Sample tile images across a lane - SEQanswers

http://seqanswers.com/forums/showthread.php?t=358

Bridged amplification & clustering followed by sequencing by synthesis. (Genome Analyzer / HiSeq / MiSeq)

**acnoll** · 08-08-2008, 05:46 AM

Assuming you had a control lane on the FC how did it look - if you saw the same polyA problem this would argue for the oil theory. Was there one or more libraries constructed for the original sample? If the same result appeared in more than one library this would eliminate amplification bias.

Topics	Statistics	Last Post
Expanding the Horizons of Cellular Research with the Single Cell Atlas by seqadmin Started by seqadmin, Yesterday, 11:49 AM	0 responses 15 views 0 likes	Last Post by seqadmin Yesterday, 11:49 AM
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, 04-24-2024, 08:47 AM	0 responses 16 views 0 likes	Last Post by seqadmin 04-24-2024, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 61 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM

Seqanswers Leaderboard Ad

Announcement

Repetitive reads

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News