hi all,
I have a dataset with 4 columns, inculde chromosome, position, read count and direction (F/R), extracted from original Illumina solexa sequencing dataset, but the dataset have highly redanduncy. The dataset is about ~130Mb. However, the dataset reduced to ~30M dramatically when I removed the redanduncy datas.
Whether the original dataset is wrong or nothing?
I have a dataset with 4 columns, inculde chromosome, position, read count and direction (F/R), extracted from original Illumina solexa sequencing dataset, but the dataset have highly redanduncy. The dataset is about ~130Mb. However, the dataset reduced to ~30M dramatically when I removed the redanduncy datas.
Whether the original dataset is wrong or nothing?
Comment