Schühli, G. S. et al., 371
Vol. 4, N.4: pp. 371-377, November, 2013 ISSN: 2179-4804
Journal of Biotechnology and Biodiversity
Genetic selection of Calophyllum brasiliense for seed orchards
Guilherme Schnell Schühli1,*, Thiago Wendling Gonçalves de Oliveira2, Maria do Socorro Padilha de Oliveira3, João Antonio Pereira Fowler 1
ABSTRACT
Calophyllum brasiliense populations are under severe depletion and criteria to improve production and quality of propagative material are therefore necessary. Genetic measures have the potential to reduce consanguinity and maximize allelic representation within target populations. Here, we explored genetic values for this species in a small relic of natural forest in Rio de Janeiro State (Brazil). The objective was to evaluate the potential of some genetic measures for seed orchard establishment. As genomic information of native trees is still scarce, we opted to use a dominant marker: RAPD. DNA from 17 phenotypically superior trees was obtained through the CTAB method and submitted for amplification by PCR. Electrophoresis and electronic documentation was then conducted. We calculated the percentage of polymorphic bands (PPB), gene diversity (Ht), Shannon’s information index (i), genetic distance (UPGMA) and parsimony analysis. Six primers were evaluated generating 34 loci. We found high genetic diversity PPB=70.6% with Ht=0.28 and i=0.41. Genetic relationships were reported in dendrograms (maximum parsimony and distance). Simulated sampling within and among clusters suggests that inter cluster sampling is more effective to capture the genetic diversity.
Key-words: RAPD, propagative material, MAS, guanandi.
Seleção genética de Calophyllum brasiliense para a formação de pomares de sementes
RESUMO
As populações de Calophyllum brasiliense encontram-se sob severa depleção e, portanto, são necessários critérios para melhorar a produção e a qualidade do material propagativo da espécie. Índices genéticos têm o potencial para orientar a redução de consangüinidade e de maximizar a representação alélica dentro das populações de interesse. Neste artigo exploramos valores genéticos para esta espécie em um pequeno relicto de floresta natural no Estado do Rio de Janeiro (Brasil). O objetivo foi o de avaliar o potencial de algumas medidas genéticas para o estabelecimento de pomares de sementes. Desde que informações genômicas de árvores nativas ainda são escassas optamos pelo uso de um marcador dominante: RAPD. O DNA de 17 árvores fenotipicamente superiores foi obtido através do método CTAB e encaminhado para amplificação por PCR. Eletroforese e documentação eletrônica foram então conduzidas. Calculamos a porcentagem de bandas polimórficas (PPB), diversidade genética (Ht), índice de informação de Shannon (i), distância genética (UPGMA) e análise de parcimônia. Seis iniciadores foram avaliados gerando 34 loci. Encontramos alta diversidade genética PPB=70,6% com Ht=0,28 e i=0,41. As relações genéticas foram apresentadas em dendrogramas (Máxima parcimônia e distância). Amostragem simulada dentro e entre agrupamentos sugerem que a amostragem dentro de grupamentos é mais eficiente para melhor capturar a diversidade genética.
Palavras-chave: RAPD, material de propagação, MAS, guanandi
*Author for correspondence.
1,*Embrapa Forestry - Estrada da Ribeira, km 111, Po-Box 319 - Colombo, PR - Brazil - 83411-000 Phone: 55 (41) 3675-5789 - Fax: 55 (41) 3675-5601. guilherme.schuhli@embrapa.br * , joao-antonio.fowler@embrapa.br
2Federal University of Paraná grad. student. thiagowendling@yahoo.com.br
3Embrapa Eastern Amazon - Trav. Dr. Enéas Pinheiro s/nº Po-Box, 48 Belém, PA - Brazil CEP 66095-100 spadilha@cpatu.embrapa.br
J. Biotec. Biodivers. v.4, N.4: pp.371-377, Nov . 2013
https://doi.org/10.20873/jbb.uft.cemaf.v4n4.schuhli
Schühli, G. S. et al., 372
INTRODUCTION
Calophyllum brasiliense Cambess. (Clusiaceae)
is found in many of Brazil’s most representative biomes, including the Amazon, Atlantic forest and Cerrado (Marques and Joly 2000). Its
geographical distribution spans from Central America to southern coast of Brazil (Santa
Catarina State). C. brasiliense is adapted to high soil humidity and is typically found in waterlogged forests (Reitz et al. 1978; Oliveira -
Filho and Ratter 1995). Natural populations of C. brasiliense (popularly known as guanandi) are under severe depletion due to illegal
harvesting and deforestation driven by the rapid expansion of the agricultural frontier ( Marques and Joly 2000).
Recent reformulations of the Brazilian laws controlling agricultural activities and natural conservation areas have led to renewed interest in the restoration of natural forests. Indeed,
rehabilitation of forests is a requirement of the
new Brazilian Forestry Code. Effective restoration heavily depends on the wide availability of propagative material for a range
of native species. However, seedlings and even
seeds of native tree species are commercially scarce in Brazil (Caldas 2006). Moreover, the
seedlings and seeds that are available are not necessarily of the highest quality (e. g.
phytossanitary and genetic quality) .
Improving the production of propagative material is especially important for the effective rehabilitation of native forests. Genetic measures
are an especially effective strategy to increase
production (Sebbenn 2006). For example, genetic information can be used to direct
sampling efforts to reduce inbreeding and the loss of diversity through genetic drift, thereby
improving the quality of the future seed orchard
composition. Thus, with a basic knowledge of genetic parameters, trees are selected for propagation not only based on their productivity
and health characteristics, but also considering consanguinity and broad allelic representation
within target populations (Sebbenn 2002) .
The objective of this study was to assess the potential genetic measures to improve the production of Calophyllum brasiliense. A small
fragment of natural forest in Rio de Janeiro State
(Brazil) was selected as source of matrix trees to provide seeds for propagation. After phenoty pic
selection of ‘superior’ trees, we calculated genetic diversity parameters to facilitate the
development of optimal sampling strategies for
the creation of seed orchards. This is the first population study of this species based on DNA
markers and provides important baseline information for future research to assure genetic
quality in seed and seedling production of guanandi.
MATERIAL AND METHODS
Genetic assessment was through a simple
molecular marker. As genomic information of native Brazilian tree species is still scarce and there is the need for a fast processing time, we
used dominant markers. Moreover, to ensure the methods adopted could, if necessary, be scaled -
up, it was also important to use a transferable marker with low costs and with a simple procedure. Random amplification of
polymorphic DNA – RAPD fulfills these characteristics, providing estimation of minimal
population parameters with high transferab ility within native species. RAPD is also fast ,
technically simple and allows for a high
frequency of identification of polymorphisms (Newbury and Ford-Lloyd 1993). However, a possible disadvantage of RAPD is low
reproducibility when researchers lack tight
experimental control of the PCR conditions (Muchugi et al. 2009).
Material for determining genetic variability was collected from 17 phenotypically ‘superior’ trees (=apparently healthy and with a high diameter at breast height) located in a small conservation area in Casemiro de Abreu municipality in the northeast of the Brazilian State of Rio de Janeiro
(Figure 1) .
J. Biotec. Biodivers. v.4, N.4: pp.371-377, Nov . 2013
Schühli, G. S. et al., 373

Figure 1. Location of Rio de Janeiro State and Casimiro de Abreu municipality (1) and the experimental area (2).
Young and healthy leaves from each individual
were collected and were stored in plastic bags with silica gel for transportation to the laboratory. Leaf samples were stored in a freezer
at -20°C .
Genomic DNA was obtained from cold stored
leaves using a CTAB (cetyltrimethylammonium bromide) based protocol (Cheung et al. 1993).
After extraction, the genomic extract was eluted in bi-distilled water and stored at -80°C. DNA samples were submitted to the polymerase chain
reaction (PCR)-RAPD and exsiccate and genomic DNA were deposited in Embrapa ’s Florestas Herbarium and Molecular Biology
Collection as vouchers .
Amplifications were replicated to ensure
reproducibility. All samples were submitted to 3 PCRs (at least) for each primer set. Only bands
reproducible in two independent amplification
reactions were included in the data analyses. The amplification reactions were carried out in a final volume of 25 uL. PCR reactions consisted
of 1x PCR buffer 2mM MgCl2, 0.2mM dNTPs (each), µM 0.5 desired oligonucleotide (primer),
1U Taq polymerase, and water to complete 25 µL. Template DNA was adjusted to 2 ng/ul in the PCR reaction tube. We tested 10
oligonucleotides (Operon Technologies Inc.) . The reactions were amplified in an Applied
Biosystems thermal cycler model Veriti 96 Well . Amplifications were performed on a thermal cycler with 4 minutes of initial denaturation at
92°C; 40-cycles consisting of 1 minute at 92°C,
1m and 30s at 40°C, and 2m at 72°C; and final extension of 5m at 72°C for the complete
extension of amplified products. Default electrophoresis procedures (1,5% agarose;
ethidium bromide staining; 80 V) and electronic
documentation was conducted to register PCR results.
RAPD data are dominant, and each band therefore represented the phenotype at a single
bi-allelic locus. To minimize biased interpretation of the electrophoresis gels, we
implemented automatic gel reading with a Gel
Analyzer v. 2010a (Lazar 2010). After background subtraction (manual baseline tool - Gel Analyzer), the multiple gels from same
primer set (replicates) were visually compared to confirm band patterns. A binary matrix was then
composed based on this evaluation.
We examined sampling variance to determine
how many markers would be required for a given level of precision in the estimate of
genetic distances (Tivang et al.1994; Manly
1997). The procedure was done within the software GENES v. 2009.7.0 (Cruz 2006) ,
considering 1 to 32 markers and similarity under
Nei and Li (1979). The ideal number of polymorphic bands was based on the suggested
< 0.05 stress value (Kruskal 1964). T he percentage of polymorphic bands, Nei’s gene diversity, Shannon’s information index, and
genetic distance were calculated using
POPGENE 1.32 (Yeh et al., 1997) and were based on the Hardy–Weinberg equilibrium .
Pairwise genetic distances were calculated using the Nei coefficient (Nei, 1972). Phenograms were prepared based on UPGMA (unweighted pair group method using arithmetic averages)
(algorithm FIND activated with maximum number of tied trees set to 25). The genetic
distance matrix was also subjected to a principal coordinate analysis (PCA). New independent
axial coordinates were calculated from the
genetic distances, which represent most of the variability of the original data. The taxa were
then plotted as points in three dimensional continuous space defined by the first three
coordinates. These calculations were done using the NTSYS-PC program (Rohlf 1993 ).
Neighbour-joining (NJ) search was computed
J. Biotec. Biodivers. v.4, N.4: pp.371-377, Nov . 2013
Schühli, G. S. et al., 374
with PAUP*4b10 (Swofford 2003). As distance methods sometimes perform poorly to identify evolutionary patterns, we also included a
heuristic search for an optimal tree. Maximum parsimony analyses (MP) of the binary data
matrix were performed with PAUP* v. 4b10
using the heuristic search (MULPARS, ACCTRAN, random sequence addition, 10,000
replicates, and TBR branch swapping). Negative branch lengths were allowed, but set to zero for tree-score calculation. Steepest descent options
were not in effect. Starting tree(s) were obtained via neighbour joining. Data was treated as unrooted. Bootstrap analysis (Felsenstein 1985)
was also performed to assess statistical support for branches in the most parsimonious trees and
neighbour-joining trees. Branch support was evaluated for 10,000 replicates with neighbour
joining. Bremer support (Bremer 1994) was calculated for parsimony trees .
Based on the results of the parsimony and distance analysis, we selected individual trees
based on their genetic relationships. Genetic
diversity indexes within groups of close ly related individuals were compared with indexes
obtained from groups of distantly related individuals. Simulated diversity indexes obtained from both sampling strategies were considered based in Nei’s diversity index.

RESULTS AND DISCUSSION
Six primers resulted in successful amplification.
Selected oligonucleotide sequences (provided below) may be useful for further investigation within the genus (Table 1). Unsuccessful PCR
resulted from the primers OPN02, OPN03, OPP19, and OPM02.
Table 1. Oligonucleotide used in PCR, respective sequences and loci number.
Loci Oligonucleotide Sequence (5’- 3’)
number
1.7 observed and 1.5 effective alleles per loci. Nei’s gene diversity was 0.28 and Shannon’s Information index was 0.41 indicating that the
assessed C. brasiliense population displayed a high diversity. OPT13 amplified the largest
number of loci (11) while OPE20 amplified the least (3).
Estimations of precision with 34 loci revealed
high correlation values, reaching 0.8 with 14 loci, 0.9 with 20 and 1 with 33. The stress value with 34 loci was less than 0.05 (Figure ).
Figure 2. Average bootstrapped correlation values and
evaluated loci number. Square deviation and stress values for the 33 loci were 0.03 and 0.02, respectively.
Assuming the Hardy-Weinberg equilibrium, the
expected diversity at species level was 0.2030. Shannon’s index of diversity was 0.4144. A
simple heuristic search under by parsimony
yielded two unrooted topologies (consistency index 0.35; homoplasy index 0.64; and retention index 0.54). The conflict was restricted to the
arrangement of the samples F, G and H.
We opted for one of the 2 topologic resolutions instead of the strict consensus to keep branch length values (Figure A). The groups are topologically congruent with the definition
OPA01
OPC10 OPE20 OPJ19 OPP13
CAGGCCCTTC
TGTCTGGGTG AACGGTGACC GGACACCACT GGAGTGCCTC
8
7
3
5
11
found in classical UPGMA analysis (at 20% of genetic distances). Group E, K, and M and F, G, and H (first structured under parsimony
analysis) were grouped in UPGMA with A, B, L, M, P and Q. Bootstrap support is shown in Figure (A and B) and Bremer support values are
Phenotypic data was organized in a binary matrix with 34 loci for 17 individuals. We found 24 (70.6%) polymorphic loci with an average of
shown in Figure (A).
J. Biotec. Biodivers. v.4, N.4: pp.371-377, Nov . 2013
Schühli, G. S. et al., 375

Figure 3. Genetic relationships as represented by unrooted dendrograms. A Parsimony topology (IC=0.35; HI=0.64; RI=0.54), branch length represent character changes; B Distance topology (UPGMA), branch length represents distances. Bootstrap values (10,000 replicates), when > 50% are presented in boxes, Bremer support (for parsimony only), when more than 0 were near respective branch.
The low consistency and retention index indicate genetic relationships specifically for
considerable misidentification of loci as homologous. The lower bootstrap values (< 50%) for some branches, under both searches
strategies (UPGMA and Parsimony), demonstrate incongruence derived from non - homology loci. The unsolved position for these branches (including alternatives topologies for F,G and H) are directly related to the low consistency and retention index. Nonetheless , branches with higher bootstrap values ( and
confirmed in both analyses) are robust and not affected by non-homologous loci data .
Even taking into consideration our limited geographical sampling area, both genetic
diversity measures were high (Nei’s and
Shannon’s diversity index). This suggests that dominant markers are still able to recover recent
undomesticated tree species. These species are generally reported as having high genetic diversity, which can be observed in the easy
amplification of primers and in the polymorphic loci. It should be noted that the unsuccess ful primers were immediately discarded to allow fast processing. We believe that a careful optimization could raise the viability of some of these discarded markers.
A reasonable precision for this scale of evaluation was obtained with the 34 obtained
loci. The correlation and stress values support the observed topologies. Although it is possible to examine genetic differences based on RAPD
data, support values suggest that the method still
has limitations. Some resolutions were supported by bootstrap analysis both in
J. Biotec. Biodivers. v.4, N.4: pp.371-377, Nov . 2013
Schühli, G. S. et al., 376
parsimony and distance analysis (C, D and O; F and H; A and B). Bremer support values also suggest that, while there is sufficient information
to resolve the genetic relationships, the topology
requires stronger support. For general aspects of genetic distances, this level of precision is
acceptable as an expedite criteria. Therefore, it is viable to gather basic genetic information for
Calophyllum brasiliense using dominant
markers, particularly with RAPD technique and even at small geographic scales. More precise
surveys would require larger datasets or markers with higher discriminatory power .
The parsimony analysis indicated a north- south relationship. Group A joins G (north) with F
(intermediate) and H (south). The same pattern
is apparent for the other groups. The natural dispersion of C. brasiliense is mainly through bats (Marques and Joly 2000). However, the
observed pattern may also be a consequence of a major road (BR101) that runs is near the
sampling site (ca. 200m) in a nort h/south pattern. Such a large road may have influenced bat ecology, limiting or restricting the pattern of
seed dispersal .
A strategy of sampling to compose seed orchard
can be proposed based in the unrooted parsimony tree. This is far from being the only
option, but has the advantage of minimiz ing
costs and maximizing genetic diversity. We propose sampling most divergent genotypes
detected in tree topology. Obviously, the small scale sampling allows inclusion of all 17 samples to a seed orchard. This would be
satisfactory to capture the inter-clade diversity
and most of the population diversity. Nonetheless, a larger number of individuals would be needed to adequately represent
diversity within clades and maximize the probability of inclusion of rare alleles .
CONCLUSIONS
Using dominant markers, particularly with the RAPD technique, seems a viable strategy to
collect basic genetic information for Calophyllum brasiliense, even at small geographic scales. This method revealed high genetic diversity in the studied population of C. brasiliense, a typical feature of native uncultivated tree species.
Although lacking strong support, it is possible to explore genetic distances and recognize patterns of distribution of the studied genotypes with
RAPD. This information allows the development of a sampling strategy to ensure maximum genetic representation when selecting
representatives for a seed orchard .
REFERENCES
Bremer K. Branch support and tree stability. Cladistics. 1994; 10: 295- 304.
Caldas L. Pomares de sementes de esp écies nativas – As funções das redes de sementes. In: Higa AR. and Silva LD. Pomar de sementes de
espécies florestais nativas. 1st ed. FUPEF; 2006. 227- 242.
Cheung WY, Hubert N, Landry BS. A simple and rapid DNA microextration method for plant,
animal, and insect suitable for RAPD and other PCR analyses. Genome Res. 1993; 3: 69- 70.
Cruz CD. Programa Genes: Biometria. Viçosa (MG): Editora UFV; 2006 .
Kruskal JB. Multidimensional scaling by optimizing goodness of fit to a no metric hypothesis. Psychometrika. , 1964; 29(1): 1-27 .
Lazar I. [homepage on the internet]. Gel
Analyzer. Debrecen, Hungary: University of Debrecen; updated in 2010; accessed in 20 13
Jan 11. Available from: http://www.gelanalyzer.com/index.html
Manly BFJ. Randomization, bootstrap and Monte Carlo methods in Biology. London: Chapman & Hall, 1997.
Marques MCM, Joly CA. Estrutura e dinâmica
de uma população de Calophyllum brasiliense Camb. em floresta higrófila do sudeste do Brasil.
Rev Bras Bot. 2000; 23(1):107- 12.
Muchugi A, Kadu C, Kindt R, Kipruto H, Lemurt, S, Olale K, Nyadoi P, Dawson I,
Jamnadass, R. Molecular markers for tropical trees: A Practical guide to principles and procedures. ICRAF Technical Manual no. 9.
World Agroforestry Centre Nairobi; 2009.
Nei M, Li W-H. Mathematical model for studying genetic variation in terms of restriction
J. Biotec. Biodivers. v.4, N.4: pp.371-377, Nov . 2013
Schühli, G. S. et al., 377
endonucleases. Proc. Natl. Acad. Sci. USA . 1979; 76(10): 5269- 73.
Nei M. Genetic distance between populations. Am. Nat. 1972; 106:283- 92.
Newbury HJ, Ford-Lloyd BV. The use of RAPD for assessing variation in plants. Plant Growth
Regul. 1993; 12:43-51, 1993 .
Oliveira-Filho AT, Ratter JA. A study of the
origin of central brazilian forests by the analysis of plant species distribution patterns. Edinb J
Bot. 1995; 52: 141- 94.
Reitz R, Klein RM, Reis A. Projeto madeira de Santa Catarina. Sellowia. 1978; 28(30): 218- 24.
Rohlf FJ. NTSYS-pc v. 1.8. Numerical
taxonomy and multivariate analysis system. Setauket, NY: Applied Biostatistics Inc. 1993.
Sebbenn, AM. Número de árvores matrizes e conceitos genéticos na coleta de sementes para reflorestamentos com espécies nativas. Rev Ins t Flor. 2002; 14: 115- 132.
Sebbenn, AM. Sistemas de reprodução em espécies arbóreas tropicais e suas implicações para a seleção de árvores matrizes para reflorestamentos ambientais. In: Higa AR. and Silva LD. Pomar de sementes de espécies florestais nativas. 1st ed. FUPEF; 2006. 93- 138.
Swofford, D. L. PAUP*. Phylogenetic Analysis
Using Parsimony (*and Other Methods). Version 4b10. Sunderland, Massachusetts : Sinauer Associates. 2003.
Tivang JG, Nienhuis J, Smith OS. Estimation of sampling variance of molecular marker data using the bootstrap procedure. Theor Appl Genet. 1994; 89(25):264- 9.
Yeh F, Yang RC, Mao J, Ye Z, Boyle TJB. The user-friendly shareware for Population Genetic Analysis. Molecular Biology and Biotechnology
Centre, University of Alberta, Edmonton, 1999.
Recebido: 04/05/201 3 Received: 05/04/201 3
Aprovado: 19/09 /2013 Approved: 09/19 /2013
J. Biotec. Biodivers. v.4, N.4: pp.371-377, Nov . 2013