SciELO - Scientific Electronic Library Online

 
vol.45 número2Evaluación de la preferencia de Liriomyza spp. en genotipos diferentes de garbanzo y efecto del extracto de Melia azedarach L.: resultados preliminaresUtilización de marcadores morfológicos y moleculares AFLP en la identificacíon de germo plasma nativo y cultivado de Elymus scabrifolius (Poaceae) índice de autoresíndice de materiabúsqueda de artículos
Home Pagelista alfabética de revistas  

Servicios Personalizados

Revista

Articulo

Indicadores

  • No hay articulos citadosCitado por SciELO

Links relacionados

  • No hay articulos similaresSimilares en SciELO

Compartir


Revista de la Facultad de Ciencias Agrarias. Universidad Nacional de Cuyo

versión On-line ISSN 1853-8665

Rev. Fac. Cienc. Agrar., Univ. Nac. Cuyo vol.45 no.2 Mendoza dic. 2013

 

ARTÍCULOS ORIGINALES

Yield prediction and inbreeding of maize synthetics generated with lines and single crosses.
Classic probability

Predicción de rendimiento y endogamia de sintéticos de maíz generados con líneas y cruzas simples.
Probabilidad clásica

 

Jaime Sahagún-Castellanos, Juan E. Rodríguez-Pérez, José L. Escalante-González

Dpto. de Fitotecnia. Universidad Autónoma Chapingo. km 38,5 Carretera México-Texcoco. C. P. 56230 Chapingo, Estado de México. México. sahagun@correo.chapingo.mx, jsahagunc@yahoo.com.mx

Originales: Recepción: 01/10/2012 - Aceptación: 23/07/2013

 

Abstract

To reduce costs and labor associated with predicting the genotypic mean (GM) of a synthetic variety (SV) of maize (Zea mays L.), breeders can develop SVs from L lines and s single crosses (SynL,SC) instead of L+2s lines (SynL). The objective of this work was to derive and study formulae for the inbreeding coefficient (IC) and GM of SynL,SC, SynL, and the SV derived from (L+2s)/2 single crosses (SynSC). All SVs were derived from the same L+2s unrelated lines whose IC is FL, and each parent of a SV was represented by m plants. An a priori probability equation for the IC was used. Important results were: 1) the largest and smallest GMs correspond to SynL and SynL,SC, respectively; 2) the GM predictors with the largest and intermediate precision are those for SynL and SynL,SC, respectively; 3) only when FL=1, or m is large, SynL and SynSC are the same population, but only with SynSC prediction costs and labor undergo the maximum decrease, although its prediction precision is the lowest. To determine the SV to be developed, breeders should also consider the availability of lines, single crosses, manpower and land area; besides budget, target farmers, target environments, etc.

Keywords: low-input agriculture; Plant breeding; Genotypic stability; Random mating population; Identical by descent genes

Resumen

Para reducir costos y trabajo para predecir la media genotípica (GM) de una variedad sintética (SV) de maíz (Zea mays L.), se puede desarrollar SVs con L líneas y s cruzas simples (SynL,SC) en lugar de L+2s líneas (SynL). El objetivo de este trabajo fue derivar y estudiar fórmulas para el coeficiente de endogamia (IC) y la GM del SynL,SC, SynL y de la SV derivada con (L+2s)/2 cruzas simples (SynSC). Las SVs fueron generadas con las mismas L+2s líneas no emparentadas, con IC igual a FL. Cada progenitor se representó por m plantas. Se usó el concepto de probabilidad a priori para derivar fórmulas para IC. Resultados importantes fueron: 1) las GMs mayor y menor corresponden a SynL y SynL,SC, respectivamente, 2) la mayor mayor e intermedia precisiones para estimar GM, respectivamente, se obtienen con SynL y SynSC, y 3) solo cuando FL=1 o m es grande, SynL y SynSC son la misma población, pero SynSC, requiere menos trabajo y costos para la predicción aunque esta es menor. Para determinar qué SV desarrollar, se debe considerar también la disponibilidad de líneas, cruzas simples, mano de obra y área experimental; además de presupuesto, y ambientes y usuarios potenciales, etc.

Palabras clave: Agricultura marginal; Mejoramiento genético vegetal; Estabilidad genotípica; Población en apareamiento aleatorio; Genes idénticos por descendencia

Introduction

A synthetic variety (SV) of a crop species as maize (Zea mays L.) is the population resulting from the random mating of m plants from each of p selected parents. It has the advantages of exploiting the natural system of this cross-fertilizing species and having large genetic variability and an important proportion of heterozygosity that accounts for grain yield and a broad stability across environments, including low-input agriculture. Further, this mating system must produce stability in its genotypic array across the subsequent generations, which means that: 1) the farmers do not have to buy seed of the SV each cycle, except for the first one, and 2) the SV does not undergo inbreeding depression. Besides that these two advantages of a SV are not shown by hybrids, these varieties have been vulnerable to destructive diseases such as Helminthosporium maydis (11). Hybrids, however, show large heterotic effects.
Normally, the development of a synthetic variety (SV) of a crop species such as maize (Zea mays L.) includes the field evaluation of p pure lines and their p(p - 1)/2 direct crosses. The data produced is then used according to a prediction equation to estimate the genotypic mean of each of the 2p - (p + 1) SVs that can be derived.
Furthermore, a prediction equation (1) and evidence from reality imply an inverse relationship between genotypic mean (GM) and inbreeding coefficient (IC). This equation, however, also requires field evaluation of the p parents and their p(p - 1)/2 crosses. Moreover, when p is large, the available resources might not be enough, and an alternative approach should be applied instead.
A partial solution to the problem of limited resources to derive SVs that is more flexible than the use of parents that are exclusively single crosses (5), double crosses (6) or a mixture of single and double crosses (4), is the use of L lines and s single crosses as potential parents of a SV. It is based on L + 2s unrelated initial lines whose IC is FL (the s single crosses are derived from 2s lines).
Besides more flexibility, this synthetic variety (SynL,SC) relative to the conventional SV derived from L+2s lines, implies a decrease in s parents thereby reducing the number of possible SVs. This SynL,SC was already studied (7), but it was restricted to pure initial lines (FL=1) and did not include the estimation of the genotypic mean.
The hypothesis is that when the initial lines are not pure (FL< 1), the GM, IC, and the precision of the estimation of the genotypic mean of the SynL,SC may undergo a change due to the decrease in the number of parents.
Moreover, the derivation of the IC of a SV has been based on the known coancestries of the individuals produced by randomly mating the parents, usually a set of pure and unrelated lines (3), or on the genotypic array of the SV where each genotype is substituted for the probability of identity by descent of the two genes forming the substituted genotype (5).
The procedure proposed here to derive the IC considers that the two genes of each genotype of a SV can be visualized as the result of a random sample with replacement of size two from the set whose elements are the genes of the gametic array of the parents. This consideration in turn makes it possible to visualize the derivation of the IC of the SV as a random experiment where the calculation of the probability of the occurrence of an event can be based on an a priori model. Thus, it would be interesting to see how useful this approach could be to derive the IC of SynL,SC.
Under this context for SynL,SC, a work was planned to study the SVs derived from L lines and s single crosses, including the cases: 1) L > 0 and s = 0 (SynL), and 2) L = 0 and s>0 (SynSC). The objectives were to determine and analyze general (FL ≤ 1) formulae for the IC and GM of the SynL,SC, SynL, and SynSC.

Methods and theoretical framework

In this study, the concepts of gametic array (GA) and genotypic array (GEA) of a diploid population under random mating were used in the context of the Hardy- Weinberg law. If in a population the frequency of the gene Ai is pi (i=1, 2, ..., a), these concepts are defined as:

Since a SV is the progeny generated by randomly mating its parents, the inbreeding coefficient of the variety can be visualized as the probability that two random genes taken with replacement from the set whose elements are the genes of the gametic array of the parents are identical by descent (IBD).
In particular, if the parents of a SV are L lines and s single crosses and each parent is represented by m individuals, according to the one-locus model of a diploid species, the genes of the gametic array of the SV can be visualized as a set whose elements are the 2mL + 2ms genes contributed by the mL and ms individuals that represent the L lines and s single crosses, respectively.
Thus, the calculation of the probability that two random genes taken with replacement are IBD can be explicitly made based on the classical definition of probability.
Since the initial L+2s lines were assumed to be unrelated (without parentage) and their inbreeding coefficient was FL (0≤ FL≤1), the contributions to the inbreeding coefficient of the SynL,SC can occur only when two sampled genes trace back to the same individual (as a selfpollination) and to two individuals from the same line or single cross (as an intraparental cross).
The prediction equation, based on a decomposition of the genotypic array of the SV into subpopulations that, once evaluated in the field, produce data that enable the breeder to develop unbiased estimators of the genotypic mean of the SV, was considered as well.
With respect to the prediction of the genotypic mean of the SynL,SC based on experimental data only, let Api1Api2 be the genotype of the p-th individual that represents parent i.
Consider a random sample of size two with replacement taken from the 2m (L + s) genes of the gametic array, the probability that the obtained genes are those that integrate the ordered genotype ApikAqjl is 1/[2m(L+s)]2 (p, q = 1, 2, 3, ..., m; i, j = 1, 2, 3, ..., L + s; k, l = 1, 2). In addition, if Ypik,qjl is the genotypic value of ApikAqjl, the expected genotypic mean of the SV derived from L
lines and s single crosses (MSynL,SC) must be expressable as:

It should be noted that if in equation 1 Ypik,qjl is substituted for ApikAqjl, the equation becomes the genotypic array of the SynL,SC, whose gametic array is:

Results

Inbreeding coefficients
Each genotype of the genotypic array of the SynL,SC can be visualized as formed by two random genes taken with replacement from the set of the 2m(L+s) genes of the gametic array of the parents of the SV. Two sources of the inbreeding coefficient of the SynL,SC (FSynL,SC) are the Lm selfpollinations of the L lines and their Lm(m-1) intraparental crosses; in each case, the frequency of the genotypes generated by two IBD genes is expected to average (1+FL)/2. Thus, the contribution of the L linesb to the FSynL,SC is:

Similarly, the two remaining contributions to the FSynL,SC are from the sm selfpollinations of the parental single-cross hybrids and their sm(m-1) intraparental crosses. Since the frequencies of genotypes formed with two IBD genes are expected to average 1/2 and (1+FL)/2, respectively, the two contributions of the s single crosses to FSynL,SC were:

In summary, FSynL,SC, the sum of all three previous contributions, reduces to:

When all parents of the SV are only lines (L > 0 and s = 0), the inbreeding coefficient (FSynL) reduces to:

For FL = 1, equation 3 becomes FSynL = 1/L.
When L = 0 and s > 0, according to equation 2, the inbreeding coefficient of the resulting SV (FSynSC) is:

For FL=1, Equation 4 becomes FSynSC = 1/(2s), as showed in other study (7).
Table (page 80) shows the inbreeding coefficients of several SynL,SC classified into four sets. All SVs in a set were derived from a particular group of lines, each participating in a SV only once, either as a parent of the SV, or as a parent of a parental single cross.
The numbers of lines in a set were 4, 6, 8, and 10, determined according to results found for the optimum number of parents of synthetic maize varieties (2).
The differences among the SVs in a set were the number of lines used to derive the single crosses that, in addition to the remaining lines of the set, were parents of a SV. For each SV, 15 inbreeding coefficients were calculated (all possible combinations between 5 values of m and 3 of FL).

Table. Inbreeding coefficients (X1000) of the synthetic varieties (SVs) of four sets, each developed from a particular set of unrelated parental lines whose inbreeding coefficient is FL. Each SV is derived from L lines and s single crosses, and each parent is represented by m individuals. In a set each line participates only once in each SV, either as a parent itself or as a parent of a single cross that is a parent of the SV.
Tabla. Coeficientes de endogamia (x1000) de las variedades sintéticas (SVs) de cuatro conjuntos, cada uno desarrollado a partir de un grupo particular de líneas cuyo coeficiente de endogamia es FL. Cada SV tiene como progenitores L líneas y s cruzas simples, cada uno representado por m individuos. En cada conjunto cada línea participa una vez en cada SV, como progenitora o para formar una cruza simple progenitora de la SV.

Genotypic mean
Let MRMP and MCP stand for the mean of the L + s progenies generated by randomly mating the m individuals that represent each parent in isolation (RMP populations), and the progenies produced by all direct crosses among the L + s parents, respectively. From a decomposition into two sets of progenies related with MRMP and MCP, equation [1] can be rewritten as:

Equation 5 (page 80) is a particular case of a prediction formula (10). Clearly, the same approach can be used to derive the GM of SynL (MSynL) and SynSC (MSyn SC). For example, analogically:

Where M'CP is the mean of the (L+2s)(L+2s-1)/2 direct crosses of the lines and M'MRP stands for the mean of the L+2s populations produced by the random mating of the m plants of each of the parental lines.
Regarding the precision of the estimation of MSynL,SC, the variance (Var) of the experimental mean of the SynL,SC ( ) for a single replication, if s2 is the experimental error variance, is Wricke & Weber (9):

Discussion

According to table (page 80) and equations 2, 3 and 4 (page 79), the inbreeding values FSynL, FSynSC, and FSynL,SC are always directly related with FL. In addition, when L > 0 and s > 0, then: 1) the relationship between number of parents (s + L) and inbreeding coefficient of the SynL,SC is inverse. This is so because an increase in the number of unrelated parents implies an increase in interparental matings whose progeny do not contribute to inbreeding, and 2) the FSynL,SC values are larger than those of the SynL and SynSC that belong to the same set. Furthermore, equations 3 and 4 (page 79) imply that when m is large, or when the initial lines are fully inbred (FL = 1), FSynL = FSynSC. This result was also found with the approach to derive formulae for IC based on coancestries (7).
An explanation of this is as follows: when FL = 1, the formation of the single crosses does not imply losses of non-identical by descent genes, the gene frequencies in the set of single crosses and in the set of the lines are the same, and thus the inbreeding coefficients and the genotypic means of SynL and SynSC are the same as well. In fact, FL = 1 the genotypic arrays of SynL and SynSC are the same. Otherwise, if the initial lines were not pure, it would be expected that FSynSC>FSynL because the single cross parents would lose more NIBD genes (8).
It should be noted that whenever in the set of parents of a SV all or some have heterozygous individuals the gene frequencies that these parents contribute to their SV behave as random variables, and the magnitude of their variance may presumably be an indicator of the genetic stability of the SV. The truthfulness and utility of this assumption, however, deserves study.
For 0 ≤ FL < 1, the two smallest inbreeding coefficients, although not necessarily equal, are FSynL and FSynSC (equations 2-3-4, page 79; table, page 80). This is at least partly because the gene frequencies in the initial lines are expected to be balanced in SynL and SynSC, whereas in SynL,SC the frequency of a gene contributed directly from a line will probably double that of a gene of a line contributed via a single cross.
With balanced gene frequencies the random mating of the parents in each of SynL and SynSC may produce more heterozygous genotypes relative to those of the SynL,SC, and therefore FSynL,SC will more likely be the largest. Furthermore, it has been found that if the parents of a SV were only lines the contribution of non-identical by descent genes to the SynL would be the largest (8), and consequently FSynL would be the smallest, results that are consistent with those obtained in this article. Thus, independently of the FL value, in terms of inbreeding and hence of genotypic mean, the two best SVs are SynL and SynSC.
The magnitude of the decrease in the GM, of SynL,SC however, also depends on how intense the inbreeding depression is in the genetical materials used. This disadvantage of SynL,SC, however, is ameliorated because this SV is more flexible, and reduces labor and costs. As already mentioned, relative to SynL, the use of SynL,SC implies a decrease in the number of parents and this causes a decrease in the number of entries required for prediction. The resulting decrease is as follows: if the number of initial potential parents of a SV were p, with a decrease to p-1, the decrease in the number of entries to be prepared and evaluated in the field would be p [i.e., 1 RMP+(p-1) crosses].
Equation 6 (page 81) provides useful information to study the precision of the estimators of the genotypic means of the synthetics. For example, if L + 2s = 12 and FL < 1, to estimate the mean of a SynL, 12 random mating populations (RMPs) must be generated and evaluated in the field (if FL = 1, the formation and evaluation of RMPs would be substituted for the evaluation of the lines), in addition to the formation and experimental evaluation of the 66 direct crosses between the parents.
Alternatively, with 12 initial lines, 4 lines and 4 single crosses could be the parents of a SynL,SC. In this case the entries to be developed and then evaluated would be 36 (8 RMPs and 28 crosses between parents). But, although costs and labor are decreased, the precision of the estimation decreases as well. In the example, the variances of the estimations of the genotypic means of the SVs derived, one from 12 lines, and the other from 4 lines and 4 single crosses, are 0.0133σ2 and 0.0293σ2, respectively.
Theoretically, these results imply that in order for the two cases to have the same precision, the field evaluation of the entries for the SynL,SC must use (0.0293)/ (0.0133) = 2.2 replicates per replicate used for SynL, although in terms of required experimental units these two cases would need about the same number: 79.2 (36x2.2) and 78 (12 RMPs and 66 direct crosses between parents, respectively).
Thus, although the genotypic mean decreases as the inbreeding coefficientb becomes larger, the use of a mixture of lines and single crosses as parents of ab synthetic variety enables the estimation of the genotypic means which, with limitedb resources, could not otherwise be possible (with SynL, for example).
Generalizing, GM and precision of estimation of GM (PE) undergo a reduction when a SynL,SC is used instead of SynL. But whereas the labor and costs due to the use of SynL,SC can be quantitatively measured, the magnitude of the effects on GM and PE due to the use of SynL,SC are difficult to be assessed before making a decision relative to the choice of SV to be developed.
The effect of the increase in IC on GM depends on the size of the increase and on how sensitive the genetic material used is to inbreeding, whereas the magnitude of the PE depends on the number of entries under evaluation in the field, the number of replications, the experimental technique and the experimental materials. In addition to the previous considerations, the breeders should take into account the particularities of their breeding program (budget, number of single crosses and lines available, land area, labor capacity, target environments, target farmers, etc.).

Conclusions

From the equations derived for inbreeding coefficients (IC), genotypic means (GM) and precision of estimation of GM (PE), it is known that if a change from p to p-1 parents of a SV is made (when two lines are substituted for their single cross), then: 1) the number of entries that require field evaluation to predict performance of all possible SVs derived from the p-1 parents decreases by p units, 2) there is a loss of precision estimation of GM (PE), and 3) the IC increases and GM decreases.
While labor and cost reductions can be calculated, the effects on GM and PE depend, apart from a change of p entries and the number of replications (PE), on the genetic background of the parents (GM) and the experimental technique and materials used (PE).
In addition, maize breeders should consider the particularities of the breeding program to decide the type of SV to be developed (budget, flexibility of SynL,SC, labor capacity, experimental technique, target environments, target farmers, etc.).

References

1. Busbice, T. H. 1970. Predicting yields of synthetic varieties. Crop Science. 10: 260-269.         [ Links ]

2. Kutka, F. J.; Smith, M. E. 2007. How many parents give the highest yield in predicted synthetic and composite populations of maize? Crop Science. 47: 1907-1913.         [ Links ]

3. Márquez-Sánchez, F. 2008. Endogamia y predicción de sintéticos de maíz de cruzas dobles. Revista Fitotecnia Mexicana. 31: 1-4.         [ Links ]

4. Sahagún-Castellanos, J. 2011. Inbreeding and yield of synthetic varieties derived from single and double cross hybrids. Maydica. 56(3): 265-271.         [ Links ]

5. Sahagún-Castellanos, J.; Villanueva-Verduzco, C. 1997. Teoría de las variedades sintéticas formadas con cruzas simples. Revista Fitotecnia Mexicana. 20: 69-80.         [ Links ]

6. Sahagún-Castellanos, J.; Rodríguez, J. E.; Peña, A. 2005. Predicting yield of synthetic varieties derived from double crosses. Maydica. 50: 129-136.         [ Links ]

7. Sahagún-Castellanos, J.; Rodríguez-Pérez, J. E. 2011. Inbreeding of synthetic varieties derived from lines and single crosses. Revista Chapingo Serie Horticultura. 17(3): 107-115.         [ Links ]

8. Sahagún-Castellanos, J.; Villanueva-Verduzco, C. 2012. ¿Variedades sintéticas derivadas de cruzas simples o de cruzas dobles? Revista Chapingo Serie Horticultura. 18(3): 279-289

9. Wricke, G.; Weber, W. E. 1986. Quantitative Genetics and Selection in Plant Breeding. Walter de Gruyter. Berlin. New York. USA. 406 p.

10. Wright, A. J. 1981. The quantitative genetics of diploid synthetic varieties. Proc. Fourth meeting of the section biometrics in plant breeding. Poitiers, France, Sept. 2-4. p. 137-157.

11. Zuliani, P.; Lavalle, A.; Bramardi, S. J.; Defacio, R. 2012. Caracterización de poblaciones nativas de maíz mediante Análisis de Procrustes Generalizado y Análisis Factorial Múltiple. Rev. FCA UNCUYO. 44(1):49-64.

Creative Commons License Todo el contenido de esta revista, excepto dónde está identificado, está bajo una Licencia Creative Commons