# HG changeset patch # User Steve Losh # Date 1579841984 18000 # Node ID ff3116e5eac28a9e645c01613b982b51ca88e437 # Parent 008a451ebd8e48bc9057338da8d9e43c53f06eeb Update diff -r 008a451ebd8e -r ff3116e5eac2 README.markdown --- a/README.markdown Thu Jan 23 23:59:18 2020 -0500 +++ b/README.markdown Thu Jan 23 23:59:44 2020 -0500 @@ -174,12 +174,12 @@ 1. Average density would be `(/ 3r4 3r9) = 1/100000` or 1 gene every ~100kb. 2. Parts: - A. Two humans would have roughly 250k differences. - B. A human and a chimpanzee would have roughly 3m differences. + A. Two humans would have roughly 250k differences. + B. A human and a chimpanzee would have roughly 3m differences. 3. Parts: - A. Average density would be 2 SNPs per 1000 base pairs. - B. Is this a trick question? I think 1.1% of the differences would be in protein-coding regions. + A. Average density would be 2 SNPs per 1000 base pairs. + B. Is this a trick question? I think 1.1% of the differences would be in protein-coding regions. 4. Parts: - A. I am confused. The glossary defines a "haplotype" as a group of closely-linked genes that are typically inherited together. So… 1 haplotype? But the *text* talks about a haplotype as being a combination of SNPs in a recombination-poor region. So if "haplotype" means the combination of SNPs, not a set of genes, then I think this would be `4^10 = 1048576`. - B. On a diploid chromosome, you'd have two separate sets of SNPs to combine (and order doesn't matter), so I think it's `4^10 * 4^10 / 2`. - C. `(1 SNP / 5 kb) * (100 kb) = 20 SNPs = 2^20 sequences = 1048576 sequences` + A. I am confused. The glossary defines a "haplotype" as a group of closely-linked genes that are typically inherited together. So… 1 haplotype? But the *text* talks about a haplotype as being a combination of SNPs in a recombination-poor region. So if "haplotype" means the combination of SNPs, not a set of genes, then I think this would be `4^10 = 1048576`. + B. On a diploid chromosome, you'd have two separate sets of SNPs to combine (and order doesn't matter), so I think it's `4^10 * 4^10 / 2`. + C. `(1 SNP / 5 kb) * (100 kb) = 20 SNPs = 2^20 sequences = 1048576 sequences`