Most popular articles
Everything About Peaches. Clemson University Cooperative Extension Service Everything About Peaches Website: whether you are a professional or backyard peach...
Mission Statement. For the sake of mankind and the world as a whole a further increase of the sustainability...
Newsletter 9: July 2013 - Temperate Fruits in the Tropics and Subtropics. Download your copy of the Working Group Temperate...
USA Walnut varieties. The Walnut Germplasm Collection of the University of California, Davis (USA). A description of the Collection and a History...
China Walnut varieties.

Articles

A strategy for aggregating multi-source historical phenotypic and genotypic data sets containing homonyms for global genomic prediction in apple (Malus domestica)

Article number
1362_17
Pages
123 – 130
Language
English
Abstract
Genomic prediction can be used to combine historical phenotypic and genotypic data sets from multiple sources to match novel germplasm with new production environments.
Implementation of genomic prediction in apple (Malus domestica Borkh.) benefits from accurate matching of identities of genetic treatments (i.e., accessions) and SNP marker loci across large data sets from multiple experiments and trials.
However, data collection and formatting methods differ among data sources.
Thesauri can be used to integrate data sets so that they can be aggregated.
For apple, we developed scripts to produce thesauri that standardize accession names and SNP locus identifiers across the RosBREED, FruitBreedomics, and Australian Grove genomic data sets generated from three SNP genotyping platforms.
One challenge of aggregation is the presence of errors in the data which lead to homonyms (non-uniqueness in a name used to refer to a specific accession or its clone). To correctly label homonyms in these data sets, the thesauri were primed with historical data (“training” data sets) labeled with Malus UNiQue identifiers (MUNQ IDs) and SNP marker information.
The resultant scripts revealed these training data sets also contained homonyms caused by: 1) misassigned MUNQ IDs to accession name; 2) misassigned accession identifier to MUNQ ID/accession name; 3) misassigned SNP identifiers to 48 markers; and 4) potentially incorrect records in international databases leading to ostensible inferences about the accession name.
To resolve these homonyms, the scripts were extended to identify potential errors in published historical data sets, correct for resolvable data processing errors, and append accession ID or source to the accession name where the source error of the homonym could not be determined.
Correcting these homonyms has facilitated the aggregation of approximately 2184 unique accessions across 259,850 SNP loci from the three genotyping platforms.

Publication
Authors
D. Edge-Garza, K. Evans, E.M. Ross, S. Jung, D. Main, C. Hardner
Keywords
globally unique identifier, passport identifier, data curation
Full text
Online Articles (85)
L. Hamama | J. Bosselut | L. Voisine | J. Chameau | S. Foucrier | S. Pierre | J. Jeauffre | L. Ogé | T. Thouroude | F. Foucher | L. Hibrand-Saint Oyant
A. Ciacciulli | H.D. Pappalardo | M. Caruso | M. Pindo | S. Piazza | M. Malnoy | C. Licciardello
T. Lallemand | S. Aubourg | J.-M. Celton | C. Landès
Y. Kamiya | S. Shiraki | H. Mehraj | M.A. Akter | S. Takahashi | M. Seki | E.S. Dennis | K. Osabe | R. Fujimoto
M. Iorizzo | M.A. Lila | P. Perkins-Veazie | C. Luby | N. Vorsa | P. Edger | N. Bassil | P. Munoz | J. Zalapa | R.K. Gallardo | A. Atucha | D. Main | L. Giongo | C. Li | J. Polashock | C. Sims | E. Canales | L. DeVetter | M. Coe | D. Chagné | A. Colonna | R. Espley
Y. Bal Krishna | S.N. Vyavahare | S.I. Patil | P.V. Sane
S. Shiraki | Y. Kamiya | H. Mehraj | S. Takahashi | M. Seki | E.S. Dennis | R. Fujimoto
H. Muranty | M. Jung | M. Roth | X. Cazenave | A. Patocchi | F. Laurens | C.-E. Durel
M. Jung | S. Bühlmann-Schütz | M. Hodel | M. Kellerhals | N. Bolliger | M. Köhle | M. Kobelt | H. Muranty | B. Studer | G.A.L. Broggini | A. Patocchi
J. Bénéjam | E. Bineau | M. Brault | J. Zhao | Y. Carretero | E. Pelpoir | K. Pellegrino | F. Bitton | M. Causse
M. Vukosavljev | I. Stranjanac | B.W.P. van Dongen | R.E. Voorrips | M. Miric | B. Bozanic Tanjga | P. Arens | M.J.M. Smulders
N. Munyengwa | C. Peace | N.L. Dillon | D. Ortiz-Barrientos | N. Christie | A.A. Myburg | C. Hardner
T. Jaingulueam | P. Suwor | K. Saetiew | W.S. Tsai | S. Techawongstien | T. Tarinta | S. Kumar | N. Jeeartid | O. Chatchawankanphanich | S. Kramchote
C. Domenichini | P. Negri | M. Defrancesco | S. Alessandri | L. Bergonzoni | I. Verde | M. Malnoy | G.A.L. Broggini | A. Patocchi | A. Peil | O.F. Emeriewen | L. Dondini | S. Tartarini
R.K. Volz | N. Proffit | C. Marshall | B. Orcheski | D. Bowatte | D. Chagné | E. López-Girona | V.G.M. Bus
A. Petiteau | C. Denancé | H. Muranty | C.-E. Durel | B.E. García-Gómez | M.J. Aranzana | F. Lebreton | P. Guérif | M. Cournol | B. Petit | A. Guyader | F. Laurens
M. Di Guardo | M. Moretto | M. Moser | C. Catalano | M. Troggio | Z. Deng | A. Cestaro | M. Caruso | G. Distefano | R. Russo | S. di Silvestro | C. Arlotta | D.P. Paolo | G. Russo | S. La Malfa | L. Bianco | A. Gentile
K. Ziane | L. Ghaouti | N. Chtaina | A. Zahid | M. Arbaoui
P. Sangarun | P. Suwor | K. Saetiew | W.S. Tsai | S. Techawongstien | T. Tarinta | S. Kumar | N. Jeeartid | O. Chatchawankanphanich | N. Phironrit | S. Kramchote
S. Bühlmann-Schütz | M. Hodel | E. Dorfmann | M. Jung | G.A.L. Broggini | A. Patocchi | M. Kellerhals
C. Catalano | G. Licciardello | S. Seminara | G. Tropea Garzia | A. Biondi | S. La Malfa | A. Gentile | G. Distefano
P.C.S. Angelo | L.F.P. Pereira | G.H. Sera | E.T. Caixeta
G. Pasev | V. Radeva-Ivanova | V. Pashkoulova | A. Nankar | D. Kostova
M.A. Akter | N. Miyaji | M. Shimizu | T. Takasaki-Yasuda | E.S. Dennis | R. Fujimoto
M.-L. Ramaroson | J.-J. Helesbeux | L. Hamama | L. Ogé | D. Breard | S. Huet | A. Suel | P. Hugueney | R. Baltenweck | P. Claudel | V. Le Clerc | M. Briard
L. Garkava-Gustavsson | J. Skytte af Sätra | F. Odilbekov | I. Abreu | A.I. Johansson | E. van de Weg | T. Zhebentyayeva
J. Corbacho | C. Inês | J. Labrador | A. Cordeiro | M.C. Gomez-Jimenez
A. Paulino | R.C. Pires | I. Fernandes | J. Santos | T. Brás | D. Rosa | O.S. Paulo | M.F. Duarte | L. Marum
M. Ish-Shalom | A. Doron-Faigenboim | S. Tsaidi | H. Zemach | A. Sherman | Y. Cohen
M. Di Guardo | B. Farneti | I. Khomenko | L. Luca | G. Modica | A. Mosca | G. Distefano | L. Bianco | M. Troggio | F. Sottile | S. La Malfa | F. Biasioli | A. Gentile
R. Sánchez | L. Arroyo | C. Sanz | A.G. Pérez
G. Arnau | A.E. Ehounou | E. Maledon | E Nudol | H. Vignes | M.C. Gravillon | A.S.P. N’guetta | P. Mournet | A.M. Kouakou | H. Chaïr | F. Cormier
G. Almeida | A. Faustino | R.C. Pires | D. Soldado | L. Cachucho | M.M. Oliveira | E. Jerónimo | L. Marum
R.R. Rodríguez-Domínguez | R. Rosas-Quijano | M. Salvador-Figueroa | A. Vázquez-Ovando | D. Gálvez-López
X. Chen | S. Kumar | C. Deng | B. van Hooijdonk | E. Varkonyi-Gasic | C. Wiedow | J. Millner | S. Sofkova-Bobcheva | J. Lempe | A. Peil | H. Flachowsky | V.G.M. Bus
B. Orcheski | E. López-Girona | A. Tattersall | D. Chagné | F. Elliott | D. Hunter | A. Karlstrom | R.K. Volz | J. Johnston
C. Miranda | P. Irisarri | J. Arellano | F.J. Bielsa | A. Valencia | J. Urrestarazu | A. Pina | L.G. Santesteban | L. Castel | P. Errea
L. Bergonzoni | L. Dondini | S. Alessandri | C. Domenichini | V. Ancarani | G. Caracciolo | M. Pietrella | G. Baruzzi | S. Tartarini
S. Guerrero-Garibay | F. Olvera-Martínez | D. Aceves-Monreal | P.L. López de Alba | A. Cruz-Hernández
J.C. Puthiyaparambil | M. Pagie | S. Teressita | P.M. Jay | N. Bongani | F. Paul | M. Candy | M. Mark | M. Marion | M. Ian | L. Sanskruti | M. Nitin
A. Muhammad | S. Noor | I. Hussain | K. Ali | A. Shahzad | M. Numan | K. Adil | M. Aqeel | H. Hafeez | M. Zeshan | G.M. Ali
S.A. Mehlenbacher | B.J. Heilsnis | R.T. Mooneyham | J.W. Snelling
D. Ray | C. Auvinet | F. Lebreton | C. Pitiot | A. Petiteau | B. Petit | F. Laurens
J.H. Guo | S.L. Chen | K.D. Chiou | Z.Z. Xu | W.L. Lee | S. Nontajak
F.F. Ramahavalisoa | E. Rafitoharson | V. Rakotoarimanana | J.M. Bouvet | J.M. Leong Pock Tsy | J. Queste | P. Danthu
M.C. Vergneaud | R. Bauduin | Y. Gilles | B. Petit | F. Laurens
F. Córdoba López | M. Moreno Verdú | M. Rabadán Mínguez | C. Rodríguez Sánchez | M. Pérez-Jiménez | O. Pérez-Tornero
F. Córdoba López | M. Moreno Verdú | M. Rabadán Mínguez | C. Rodríguez Sánchez | M. Pérez-Jiménez | O. Pérez-Tornero