|
|
View Full Version : Gene families and evolution
QuarkHead 04-12-05, 03:42 AM The following thoughts are probably not original, although I have never seen them put down in this way.
It is a fact that, in general, more complex organisms have more DNA that simpler ones. It is also a fact that more complex forms have more genes than simpler ones. Let's assume that these two facts are connected, as seems reasonable.
Let us also assume that there is some sort of connection between complexity and the numbers of different genes. Again, that seems fair.
Finally let us accept that complex forms have evolved from simpler ones. Not controversial, I hope.
Now let us ask what is the origin of the additional DNA that results in additional genes and increased complexity. Let's restrict ouselves to nuclear DNA. The additional DNA can only be the result of some sort of amplification process during the course of evolution.
Let us assume that this amplification applied equally across the whole genome. The result is, of course, that in the early stages of this process, there will exist multiple copies of identical genes. Let's call these gene families.
Now the progenitor organism managed along quite nicely with one (two if you like) copy of each gene, so its evolutionary descendent has many copies surplus to requirements.
It seems reasonable then to suggest that these "spare" copies are free to undergo any sort of change we care to imagine, without presenting themselves as a negative selection target. But these evolutionary "experiments" can be expected, from time to time, to confer selective advantage.
Moreover, it is known that gene families are capable of undergoing much more dramatic and sudden sequence alteration than are singletons. For example, single genes mostly suffer point mutaion, deletion or insertion. Gene families, on the other hand readily exchange blocks of sequence, and this is a relatively error-prone process.
So the suggestion is that a mechanism of this sort could quite easily result in sudden and dramatic evolutionary change with no evidence of intermediate forms
spuriousmonkey 04-12-05, 05:03 AM http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Search&db=books&doptcmdl=GenBookHL&term=gene+families+evolution+AND+dbio%5Bbook%5D+AN D+132476%5Buid%5D&rid=dbio.section.5481
QuarkHead 04-14-05, 03:40 PM http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Search&db=books&doptcmdl=GenBookHL&term=gene+families+evolution+AND+dbio%5Bbook%5D+AN D+132476%5Buid%5D&rid=dbio.section.5481Yes, well - I can search the net too. I was looking for a discussion.
Imperfectionist 04-14-05, 03:55 PM Humans have fewer genes than rice, so that part of your assumption is not so certain. DNA may also be incorporated from other life forms, such as when unnucleated cells merged with bacterium to form nucleated cells. One fact to remember is that the complexity of the genome does not correlate with the complexity of the organism, since the genome does not define every individual structure. Rather, it calls up certain predetermined routines, and defines how often or for how long they should grow, so the difference between a compact animal and a centipede might just be the number of times the central segment is repeated. It certainly seems possible that there are repeated segments of the genome to "experiment" with, and that there is a correction mechanism that could fix it in later generations if it doesn't work. But, the changes must fit in with the previous body, environment, and lifestyle in such a way that it really is an advantage, and that seems highly unlikely unless it is accomplished in small steps.
Ophiolite 04-14-05, 05:24 PM It is a fact that, in general, more complex organisms have more DNA that simpler ones.This seems a generalisation too far, as Imperfectionist has noted. As such it seems to invalidate the basis of your later speculations.
Imperfectionist 04-14-05, 05:51 PM animals also can incorporate the genes of plants and visa-versa. gene swapping is common in nature maybe that can account for sudden increase's in the length of dna sequences.
Hercules Rockefeller 04-14-05, 06:17 PM animals also can incorporate the genes of plants and visa-versa.
They can? I don't think that's correct. Can you give some specific examples?
gene swapping is common in nature
It is? I don't think that's correct either, except for bacteria. 'Horizontal transfer' of genes is a mechanism that has been out of favor for a while now.
maybe that can account for sudden increase's in the length of dna sequences.
I don't think so. Increases in genome size is attributed to partial/whole genome duplications and mobile elements (such as transposons and retroviruses) that copy portions of the genome in the process of jumping around. Duplications of chromosomal segments, whole chromosomes or whole genomes creates extra copies of genes which can then undergo neo- or sub-functionalization to produce related families of genes that were referred to in the original post.<P>
That more complex organisms have more DNA is to some extents true but is, as noted above, very much an overgeneralization. In fact, this is called the "C-value paradox", where, at least for eukaryotic genomes, the size of the genome has little relationship with the evolutionary complexity of the organism. For the most part, this is due to repetitive elements that tend to replicate within a host's genome (transposons, etc.), or the tendency of an organism toward polyploidy (more than one genome - mostly applies to plants).
Gene swapping between species is not so common in eukaryotes. The vast majority of increases in the amount of DNA in eukaryotes (plants and animals) comes from mistakes during genome replication (one example would be Down's Syndrome). But "duplication and divergence" of gene families is a major player in evolutionary advancement - the biggest player in plant evolution. In fact, plants are forming thousands of new species every second through the magic of polyploidy or genome duplication.
QuarkHead 04-15-05, 07:44 AM Humans have fewer genes than rice, so that part of your assumption is not so certain. Yes, admittedly it was a rash generalization. I was thinking about long range vertical comparison (in the sense that rice is not an ancestral type to Man). One fact to remember is that the complexity of the genome does not correlate with the complexity of the organism, That isn't what I said. It is true that much of eukaryotic DNA is of low sequence complexity - that is why I couched my assumptions in the way that I did. It certainly seems possible that there are repeated segments of the genome to "experiment" with, and that there is a correction mechanism that could fix it in later generations if it doesn't work.But that's my whole point! If "iit doesn't work" as you put it (and mostly it won't) there is no harm done and therefore no need to "fix" it. But, the changes must fit in with the previous body, environment, and lifestyle in such a way that it really is an advantage, and that seems highly unlikely unless it is accomplished in small steps.Again, my whole point was that sequence divergence within superfluous copies of genes would only be targets for positive selection, and so large steps may well represent the accumulation of small sequence change, but these small changes would be phenotypically silent, unless dominant (which is rather unlikely)
QuarkHead 04-15-05, 07:46 AM This seems a generalisation too far, as Imperfectionist has noted. As such it seems to invalidate the basis of your later speculations.Well no - my point is not critically dependent on the absolute amount of DNA, only on the number of genes.
QuarkHead 04-15-05, 07:53 AM ...or the tendency of an organism toward polyploidy which of course is a form of amplification!
The vast majority of increases in the amount of DNA in eukaryotes (plants and animals) comes from mistakes during genome replication (one example would be Down's Syndrome). I don't think this is true, certainly if by "replication" you mean DNA replication like the rest of us. Downs is a constellation of chromosomal aberrations, some of which certainly are the result of disjunctional failure. But "duplication and divergence" of gene families is a major player in evolutionary advancement - the biggest player in plant evolution. In fact, plants are forming thousands of new species every second through the magic of polyploidy or genome duplication.Well, there you go!
spuriousmonkey 04-15-05, 10:02 AM Some complex animals actually have streamlined their genome, i.e. it is smaller than expected for their complexity.
Hercules Rockefeller 04-15-05, 11:33 AM Some complex animals actually have streamlined their genome, i.e. it is smaller than expected for their complexity.
Yes indeed. <I>Takifugu rubripes</I> (puffer fish) is an example. This is the precise reason for it being one of the first vertebrates whose genome has been sequenced completely (31,059 genes coding for 33,609 proteins, Aparicio et al. 2002). Its compact genome and lack of highly repetitive DNA makes the puffer fish genome a valuable comparison point when looking at syntenic relationships between vertebrate genomes.<P>
Actually, the cause of Downs syndrome in most patients is trisomy of chromosome 21 (three copies instead of two). I should have said aneuploidy instead of polyploidy but I was trying to simplify. And nondisjunction IS a replicational mistake.
As far as genome simplification, you might want to look at nested genes within viruses. Essentially, taking a genome as a "book" that you might read, for many viruses the "book" would actually make sense if you read it forward or backward, which I find quite cool. The fact that eukaryotes have not tended toward this simplification is that the energy requirements for multicellular organisms are so high that the additional energy required to synthesize a few thousand, million, or even billion base pairs are negligible when compared to the total energy requirements.
QuarkHead 04-15-05, 04:49 PM Actually, the cause of Downs syndrome in most patients is trisomy of chromosome 21 (three copies instead of two). I should have said aneuploidy instead of polyploidy but I was trying to simplify. And nondisjunction IS a replicational mistake.Non-disjunction has nothing to do with replication, it is an error of seperation of chromosomes during meiosis. Replication, as most people use the word, refers to DNA replication. Nothing to do with chromosome disjunction.
As far as genome simplification, you might want to look at nested genes within viruses.Actually I know all about these. But I was heading in the opposite direction, i.e. towards larger genomes encoding more complex life forms
invert_nexus 04-15-05, 05:06 PM Yes indeed. Takifugu rubripes (puffer fish) is an example. This is the precise reason for it being one of the first vertebrates whose genome has been sequenced completely (31,059 genes coding for 33,609 proteins, Aparicio et al. 2002). Its compact genome and lack of highly repetitive DNA makes the puffer fish genome a valuable comparison point when looking at syntenic relationships between vertebrate genomes.
Interesting. So the amount of alternative splicing in the pufferfish is almost nil then?
Humans, for example, have between 20 to 25 thousand genes at the latest estimates which code for 90,000 or so proteins.
It seems to me that complexity might very well be inversely proportional to the number of genes. More and more research is leading towards non-gene functions in cellular development. Epigenetic factors. And also the tons of 'junk' DNA that have turned out to not always be so junky after all.
It is in the number of different ways that any one gene code to a variety of proteins that seems to me to speak of complexity in the host. I have no figures on prokaryotes and the average number of alternative splicings for their genes, but I do know that introns are practically negligible in most single-cells. It is only when the animals start to gain complexity and live in multicelled beings that the 'junk' DNA starts to build. Doesn't this scream significance?
Junk is not junk at all. It is one of the factors of complexity. One of the means of complexity. And genes are not the whole story except in simple organisms.
animals also can incorporate the genes of plants and visa-versa. gene swapping is common in nature maybe that can account for sudden increase's in the length of dna sequences.It is? I don't think that's correct either, except for bacteria. 'Horizontal transfer' of genes is a mechanism that has been out of favor for a while now.It is. One example is that viruses will inject their own DNA into yours, after which you carry the code necessary to release new copies of the virus. It may stay latent for years or even never show up in you, but it might show up in your descendants.
Hercules Rockefeller 04-16-05, 10:41 AM It is.
No, it isn’t.
The original statement was “…animals also can incorporate the genes of plants and visa-versa…” and I was responding to this statement. Plants and animals do not freely incorporate each other’s genes.
One example is that viruses will inject their own DNA into yours, after which you carry the code necessary to release new copies of the virus.
Do you think that all viral infections lead to integration of the viral DNA into the genome of the infected cells? Retroviruses are the only type of virus that does this, and they represent only a tiny fraction of all the human viruses that exist. (Besides, retroviruses have RNA genomes, not DNA.) So yes, I suppose that retroviruses are one example of how an exogenous gene can insert into the human genome, but it’s a very limited example and doesn’t pertain to the original statement from Imperfectionist.
It may stay latent for years or even never show up in you, but it might show up in your descendants.
Descendants? The only way for that to happen is for a retrovirus to infect germline cells rather than just somatic cells. I am not aware of any such retroviruses. Do you have any examples? Yes, I know that HIV-positive mothers can transmit it to their babies, but that’s due to plain old infection and not because HIV has integrated into their germline (ie. the genome of their oogonia). HIV infects T cells (ie. somatic cells). So any retrovirus that integrates into somatic cell genomes is of no relevance to evolution, which is the whole thrust of this thread.<P>
Hercules Rockefeller 04-16-05, 10:59 AM So the amount of alternative splicing in the pufferfish is almost nil then?
I don’t know about that. Simply knowing a genome sequence does not indicate the extent of alternative splicing. This has to be determined empirically by assaying the transcriptome and proteome of an organism. I would suggest that the protein number indiacted does not take alternative splicing into account.
There is an accepted correlation between organism complexity and the extent of alternative splicing. Yeast barely does any, whereas the human genome exhibits at least 75%. I remember one seminar speaker I heard (whose research was on alternative splicing) you claimed that as many as 90-95% of all human genes are alternatively spliced!
Humans, for example, have between 20 to 25 thousand genes at the latest estimates which code for 90,000 or so proteins.
Whoa, hold on there! There was <I><B>one</B></I> report that suggested that there may be as little as 20,000 genes in the human genome, and this was based solely on computational analysis of the genome sequence and not any experimental evidence. The fact is, you can come up with any figure you want simply by tweaking the gene-finding algorithms that you use. My understanding is that the current majority opinion on the number of human genes is currently around 60,000 genes, but probably more in reality.
If it’s one thing that discussions like these indicate, it’s that the issue of “organism complexity” can be a very subjective thing.<P>
spuriousmonkey 04-16-05, 11:20 AM 60.000 was the consent before the human genome project.
The number went down quickly after that. 25.000 to 30.000 sounds reasonable.
QuarkHead 04-16-05, 12:03 PM 25.000 to 30.000 sounds reasonable.Based on what reasoning? Here's an example of how it might go.....
Human genome = 4 x 10<sup>9</sup>bp
Fraction coding = 0.1
Average processed transcript = 4 x 10<sup>3</sup>bp
Answer = (4 x 10<sup>9</sup>)(4 x 10<sup>-3</sup>)(0.1) = 10<sup>5</sup>
....which is the figure we used to use as an estimate for the number of human genes.
Hercules Rockefeller 04-16-05, 01:51 PM 60.000 was the consent before the human genome project.
Consensus opinion before the HGP was more like 80,000 to 100,000 genes. Original estimates coming from the HGP were 30,000 - 35,000 genes, and the count (as far as I am aware) has been climbing ever since as the gene scan algorithims become more sophisticated. As I said, there was one report that the count is more like 20,000 based on a modified gene scan algorithim but I don't think anyone is taking that seriously.
25.000 to 30.000 sounds reasonable.
Not to me it doesn't. That's only a few more thousand genes than <I>C.elegans</I>.<P>
spuriousmonkey 04-16-05, 05:24 PM it doesn't really take that many more genes to make a c. elegans than a human.
Especially since basically the same gene families are used to make every different part in an animal. In humans we might have more than 20 FGFs for instance. But many can be knocked out without any problem for most organs. It is not so much that humans need more FGFs because they are that much more complex. It is a historic result. One that facilitated evolution, by duplication of genes and modification of function, sometimes, in some some organs. Sometimes with major effects in most or all organs.
Hercules Rockefeller 04-16-05, 07:16 PM it doesn't really take that many more genes to make a c. elegans than a human.
That statement is debatable.
But many can be knocked out without any problem for most organs.
There's a lot of subjectivity and generalizations in there. "...many <I>(but not all)</I> can be knocked out without any problem for most <I>(but not all)</I> organs.
But I see where you’re coming from… :) <P>
CharonZ 04-16-05, 07:39 PM The latest number I read (end of 2004) were around 25k genes, too.
About gene families (or rather paralogues, the term gene families is actually reserved for another classification. I know it is not really important for this discussion, but at least for me it is easier...):
It is indeed known that duplicated genes (or paralogues) present a "playing ground" for modifications and as such, provide a pool for higher variability. I am not quite convinced that block modifications occur more often, though (AlthoughI have no data regarding this). But yes, basically I agree with the notion that duplication is an important factor of evolution.
Now, I am not sure if this has any impact on the basic reasoning above:
It is a fact that, in general, more complex organisms have more DNA that simpler ones. It is also a fact that more complex forms have more genes than simpler ones. Let's assume that these two facts are connected, as seems reasonable.
It appears to me that size of DNA, or the complexity of the organism is not reallly relevant for the occurence of gene duplication. In fact, in bacteria the amount of gene duplication is rather high (despite the fact that in contrast to multicellular organisms there is a stronger selection for maintaining a small chromosome). So the mechanism of gene duplication and its impact on evolutionary change is common for basically all organisms.
QuarkHead 04-17-05, 04:50 AM About gene families (or rather paralogues, the term gene families is actually reserved for another classification. I know it is not really important for this discussion, but at least for me it is easier...): Paralogies would certainly fit in with my scheme, but these, if we are using the term in the same way, involve massive chromosomal regions. I admit that to call the descendents of an amplification process families now might seem strange, as there isn't much evidence of widespread sequence homology between genes with unrelated functions.
But the point I was trying to make was that, those genes that we have in excess of those posessed by our anscestors, must have been in families at some point in evolution. If their kinship is no longer detectable, that must be due to rather rapid divergence.
I am not quite convinced that block modifications occur more often, though (AlthoughI have no data regarding this). There is abundant evidence, in Man at least, that genuine gene families participate in sequence exchange, both reciprocal and non-reciprocal. Not only does this often involve many tens of kilobases, but appears to be relatively error-prone. .
It appears to me that size of DNA, or the complexity of the organism is not reallly relevant for the occurence of gene duplication. .Yes, everybody wants to beat me with that particular stick! I have conceded that it was a rash generalization. As you say, it is not critical to my central point.
invert_nexus 05-27-05, 10:20 PM I don't have time to devote to an in-depth post here, but I would like to make a quick post and hopefully Hercules and QuarkHead have email alerts on their threads.
I'm kinda surprised by the response to the given number of genes. All I can say is that this is the number of genes given by the Human Genome Project (http://www.ornl.gov/sci/techresources/Human_Genome/faq/genenumber.shtml) site as the latest officially recognized number. The number is still an estimate and is likely to increase some, but apparently not much.
It was while I was doing research for a response to this thread that I came up with the idea that I posted in my new thread in this forum. The Genome as Evolution.
Unfortunately, this was a while ago and the information that I had dug up is now somewhat vague once more as I've been delving into other things in the interim.
I will dig once more and make a more detailed post on this subject.
I'll leave with this.
The question is how can such a surprisingly small number of genes code a human? It's a shock to think it. But, it's even more beautiful in that we make so much more use of the genes we have. The various genes have a variety of configurations to code for different proteins. Epigenetic factors also must be considered in the equation. Methylation and Histone tagging to name just a couple.
The depths of coding to make a complicated organism is so much more than we ever thought it was in the past. Not just genes anymore. Genes are only the tip of the iceberg.
We have a long way to go.
(Crossing fingers on email notification.)
QuarkHead 05-28-05, 02:14 PM I don't have time to devote to an in-depth post here, Neither do I, hopefully laterI'm kinda surprised by the response to the given number of genes. All I can say is that this is the number of genes given by the Human Genome Project (http://www.ornl.gov/sci/techresources/Human_Genome/faq/genenumber.shtml) site as the latest officially recognized number. The number is still an estimate and is likely to increase some, but apparently not much.Yes. The surprise comes from the fact that, the original, higher, estimates were theoretical, and theoroticians seldom like it when nast things like real data prove them wrong!!
However, it is worth remembering that the HGP estimate is also, in its own way a model, worked with an algorithm I think I described before (although they may have changed it since I last looked in detail)
It was while I was doing research for a response to this thread that I came up with the idea that I posted in my new thread in this forum. The Genome as Evolution. Well, let's take a peek.
Oh - and good to talk to you again
pranks085 06-02-05, 04:48 AM I think you may have a point with these gene repeats. Isn't that how families such as the globin family are thought to have arisen?
According to the Federal Genome Institue report as of August 31st, 2005, humans have about 25,000 genes, but many animals have a lot more than that. The report stated that they were quite surprised that the number was lower than the original 20-40,000 estimate. Chimps are 95% to almost 99% related to humans and they have currently identified the functional characteristic of the billions of building blocks of their DNA, which will greatly benefit humans in identifying defects and disease. Even though Chimp gene relation is so similar to ours, they state that the physical (phenotype) difference between chimps and humans is ten times as great as the difference between two individual people. Food for thought.
|