Day 1 (7/27)
John Novembre – Recombination rates in admixed individuals revealed by ancestry-based inference
The basic idea is to utilize unrelated individual for estimating recombination rate. Here “unrelated” is relative to family data. It’s just like GWAS vs. Linkage mapping. Essentially what the method does is to take admixed individuals between two ethnic groups and inferring the recombination events in their genome by an HMM model utilizing two reference genome from both parental population. Here it is easy to collect large amounts of individuals, unlike in family based studies. But the inference is indirect and more model-based.
– My session –
Chao Lin — He talked about an experimental work designed to test the relationship between population fitness and compensatory evolution, although the details are blurred in my mind now. needs to read further.
Hideki Innan — he talked about one of his earlier work on the rate of compensatory evolution, modeled after Kimura’s 1985 work, but under a different weak selection parameter regime. His conclusion is that the rate of compensatory evolution is very limited.
Deepa Agashe — she works with Allan Drummand and Christ Marx. The most striking in her work is that the strain of bacteria carrying a genetically modified enzyme protein, for which she changed the original 50% frequent codon and 5% rare codon composition to 100% frequent codon, had nearly as poor fitness as the knockout strain for that enzyme. A number of other constructs also go in various unexpected directions. She has tried to look at mRNA secondary structure for her modified enzyme and said she didn’t find any clue there. I don’t have a good idea of what has gone wrong. But I would try to do all kinds of experiments related to the engineered gene product at both mRNA and protein level, to discover the potential mechanism.
My presentation went reasonably well. Met Alan Moses, who acknowledged the contribution of this work and we chatted about the importance of pursuing the question about selection on weak binding sites.
– end of my session –
(the rest are not ordered)
Alan Moses — His recent work modeled after his previous studies on TFBS turnover, but now turned to post-translational modifications, namely phosphorylation. He first established that phosphorylation sites have specific motifs and something I didn’t know before — these sites are preferentially located in “disordered regions” in a protein, where protein-protein interaction happens. The challenge here is alignment, since protein sequences in these disordered regions evolve very fast. But knowing that phosphorylation sites occur here reduce the target for searching for them, therefore leading to a lower false positive rates. His result so far suggest a deep conservation of the phosphorylation code. He also found evidence for turnover of these sites, but no knowledge about the potential selective values yet. He also suggest that the post-translational modification system, like enhancers, are also combinatorial, as there are usually multiple kinds of sites in those disordered regions, in addition to phosphorylation sites, there are motifs responsible for protein localization and degradation, which often depend on the phosphorylation state.
Andy Clark — He gave a talk on the same topic as his last talk in Chicago — effect of deleterious mutations in an exploding human population. However, I feel this time I understand more of the talk and can start to see the value of it. Essentially there are two issues underlying what he talked about: (1) the super-exponential growth in human population; (2) a sample size that is larger than the estimated Ne of the population. These two can lead to violation of the assumptions used in standard coalescence calculations. So the effect is mainly that many commonly used formula and methods no longer apply to the current day human dataset, where mis-use of them might lead to wrong and confusing results. Another effect, which is the theme of his last talk, is that in our growing population, lineages may experience little loss so that although selection should be more efficient in a larger population, the growth, i.e. non-equilibrium state, implies accumulation of large amount of deleterious mutations in the tip of the coalescence tree. The effect on GWAS could be attributed to common disease rare variants hypothesis. Since current GWAS still rely on incomplete genotyping data, they could miss all these rare variants (singletons).
Charles Robin — from Australia. He presented a super cool study on the evolution of resistance to DDT in drosophila melanogaster. He and other people have previously found alleles that confer DDT resistance in the species. Using stocks collected before or after DDT use, he clearly demonstrated the allele frequency change, with the resistant allele rising from barely detectable to almost fixed within a span of about 50 years, remarkable! The underlying genetics is more complicated than previously thought. A “beagle” element is further improved in terms of the resistance effect from a randomly inserted p-element! And the later allele is on the rise against the original resistant allele. They are currently using the DGRP lines to do GWAS, in order to discover modifier locis for DDT 3hr knockdown and 24hr mortality traits.
Zhu Yuan — from Dimitri Petrov’s lab. As part of her thesis work, she wants to assess the quality of using next generation sequencing method on pooled fly samples in order to estimate the allele frequency. This is directly related to the current trend of combining sequencing with long term artificial selection in order to reveal the rich dynamics of evolution. However, she showed that three biases could be introduced : (1) amount of DNA difference between individuals — this error is not severe, usually the amount of DNA contributed from different individuals vary within 20%. for this she suggests pooling at least 100 individuals in order to get a reliable estimate of the allele frequency. yet it seems that if this is a serious concern, the proposed pooling more individuals could only work for non-rare frequency alleles. (2) sequencing errors — this is the predominant source of errors. for this she suggests sequencing to at least 50X genome-wide coverage and filtering out low coverage regions.
Chris Illingworth — He works with Ville Mustonen. They have developed an analysis method for treating artificial selection sequencing data. See abstract below
Quantifying selection acting on a trait from allele frequency time-series
Chris Illingworth1, Leopold Parts1, Stephan Schi!els2
, Gianni Liti3, Ville Mustonen1
1Wellcome Trust Sanger Institute, UK,
2Universität zu Köln, Germany,
3University of Nottingham, U
We present a population genetic method to analyse time-series data of allele frequencies, illustrated here using data from an artificial selection experiment in a yeast population. Our method measures the consistency of a range of proposed evolutionary scenarios with allele frequency changes observed over time. Population genetic theory is utilized to formulate equations of motion for the allele frequencies under each scenario, following which likelihoods for having observed the sequencing data under each scenario are derived. Comparison of these likelihoods gives an insight into the prevailing dynamics for the system under study. Using our method we discover that about 10% of polymorphic sites evolve non-neutrally. We further identify 37 genomic regions containing one or more driver alleles, quantify their selective advantage, get estimates of local recombination rates within the regions, and show that the dynamics of the drivers display a strong signature of fitness effects going beyond additive models of selection. The combined experimental and analytical approach we present offers a paradigm for understanding evolution in a range of systems under many different evolutionary pressures
Plenary talks
Ken Wolfe’s talk is very impressive. His group characterized genome evolution in yeast, and has worked in detail the evolution of the mating locus. Gene gain and loss after WGD is their topic.
Daniel Hartle — Y chromosome variations could influence quantitative traits, perhaps not through genes on Y but epigenetically.