“...Another claim often heard is that there is a ’life sequence' of 400 proteins, and that the amino acid sequences of these proteins cannot be changed, for organisms to be alive.
“This, however, is nonsense. The 400 protein claim seems to come from the protein coding genome of Mycobacterium genetalium, which has the smallest genome currently known of any modern organism. However, inspection of the genome suggests that this could be reduced further to a minimal gene set of 256 proteins. Note again that this is a modern organism. The first protobiont/progenote would have been smaller still, and preceded by even simpler chemical systems.
“As to the claim that the sequences of proteins cannot be changed, again this is nonsense. There are in most proteins regions where almost any amino acid can be substituted, and other regions where conservative substitutions (where charged amino acids can be swapped with other charged amino acids, neutral for other neutral amino acids and hydrophobic amino acids for other hydrophobic amino acids) can be made. Some functionally equivalent molecules can have between 30-50% of their amino acids different. In fact it is possible to substitute structurally non-identical bacterial proteins for yeast proteins, and worm proteins for human proteins, and the organisms live quite happily...
“Yes,
one kilogram of the amino acid arginine has 2.85 x 10
24 molecules in it (that’s well over a billion billion); a tonne of arginine has 2.85 x 10
27 molecules. If you took a semi-trailer load of each amino acid and dumped it into a medium size lake, you would have enough molecules to generate our particular replicator in a few tens of years, given that you can make 55 amino acid long proteins in 1 to 2 weeks...
“...Another misunderstanding is that most people feel that the number of enzymes/ribozymes, let alone the ribozymal RNA polymerases or any form of self-replicator, represent a very unlikely configuration and that the chance of a single enzyme/ribozyme forming, let alone a number of them, from random addition of amino acids/nucleotides is very small.
“However, an analysis by Ekland suggests that in the sequence space of 220 nucleotide long RNA sequences, a staggering 2.5 x 10
112sequences are efficent [sic] ligases. Not bad for a compound previously thought to be only structural. Going back to our primitive ocean of 1 x 10
24 litres and assuming a nucleotide concentration of 1 x 10
-7 M, then there are roughly 1 x 10
49 potential nucleotide chains, so that a fair number of efficent [sic] RNA ligases (about 1 x 10
34) could be produced in a year, let alone a million years. The potential number of RNA polymerases is high also; about 1 in every 10
20 sequences is an RNA polymerase. Similar considerations apply for ribosomal acyl transferases (about1 in every 10
15 sequences), and ribozymal nucleotide synthesis.
“Similarly, of the 1 x 10
130 possible 100 unit proteins, 3.8 x 10
61 represent cytochrome C alone! There’s lots of functional enyzmes [sic] in the peptide/nucleotide search space, so it would seem likely that a functioning ensemble of enzymes could be brewed up in an early Earth’s prebiotic soup.
“So, even with more realistic (if somewhat mind beggaring) figures, random assemblage of amino acids into ’life-supporting’ systems (whether you go for protein enzyme based hypercycles, RNA world systems, or RNA ribozyme-protein enzyme coevolution) would seem to be entirely feasible, even with pessimistic figures for the original monomer concentrations and synthesis times.”
— Ian F. Musgrave, PhD, “
Lies, Damned Lies, Statistics, and Probability of Abiogenesis Calculations,”
Talk Origins FAQ, 1998