Genotype and phenotype data standardization, utilization and integration in the big data era for agricultural sciences

Yang

Feng

Zhang

et al. (

2020

)

Crop phenomics and high-throughput phenotyping: past decades, current challenges, and future perspectives

Mol. Plant

187

–

214

Borgman

C.L.

(

2015

)

Big Data, Little Data, No Data: Scholarship in the Networked World

. The MIT Press, Cambridge, MA.

Mosconi

Randall

et al. (

2019

)

Three gaps in opening science

Comput. Support Coop. Work (CSCW)

749

–

789

Federer

L.M.

(

2019

)

Who, what, when, where, and why? Quantifying and understanding biomedical data reuse

University of Maryland

Wallis

J.C.

Rolando

and

Borgman

C.L.

(

2013

)

If we share data, will anyone use them? Data sharing and reuse in the long tail of science and technology

PLoS One

, e67332.

Pasquetto

I.V.

Randles

B.M.

and

Borgman

C.L.

(

2017

)

On the reuse of scientific data

Data Sci. J.

–

Culina

Crowther

T.W.

Ramakers

J.J.C.

et al. (

2018

)

How to do meta-analysis of open datasets

Nat. Ecol. Evol.

1053

–

1056

and

Nahar

(

2016

)

Reuse of scientific data in academic publications: an investigation of Dryad digital repository

J. Inf. Manag.

478

–

494

10.

Pasquetto

I.V.

Borgman

C.L.

and

Wofford

M.F.

(

2019

)

Uses and reuses of scientific data: the data creators’ advantage

Harv. Data Sci. Rev.

11.

Rung

and

Brazma

(

2013

)

Reuse of public genome-wide gene expression data

Nat. Rev. Genet.

–

12.

Karasti

and

Blomberg

(

2018

)

Studying infrastructuring ethnographically

Comput. Support. Coop. Work (CSCW)

233

–

265

13.

Hanson

Sugden

and

Alberts

(

2011

)

Making data maximally available

Science

331

, 649.

14.

Leonelli

(

2013

)

Integrating data to acquire new knowledge: three modes of integration in plant science

Stud. Hist. Philos. Sci. Part C

503

–

514

15.

Kattge

Bonisch

Diaz

et al. (

2020

)

TRY plant trait database – enhanced coverage and open access

Glob. Chang. Biol.

119

–

188

16.

Harper

Campbell

Cannon

E.K.S.

et al. (

2018

)

AgBioData consortium recommendations for sustainable genomics and genetics databases for agriculture

Database

2018

, bay088.

17.

Adam-Blondon

A.F.

Alaux

Pommier

et al. (

2016

)

Towards an open grapevine information system

Hortic. Res.

, 16056.

18.

Dempsey

and

Heery

(

1998

)

Metadata: a current view of practice and issues

J. Doc.

145

–

172

19.

Mayernik

M.S.

and

Acker

(

2018

)

Tracing the traces: the critical role of metadata within networked communications

J. Assoc. Inf. Sci. Technol.

177

–

180

20.

Edwards

(

2016

) The impact of genomics technology on adapting plants to climate change. In:

Edwards

Batley

(eds)

Plant Genomics and Climate Change

Springer

New York, NY

, pp.

173

–

178

21.

Chitnis

Monos

et al. (

2021

)

Next-generation sequencing technologies: an overview

Hum. Immunol.

–

811

22.

Smith

L.M.

Fung

Hunkapiller

M.W.

et al. (

1985

)

The synthesis of oligonucleotides containing an aliphatic amino group at the 5ʹ terminus: synthesis of fluorescent DNA primers for use in DNA sequence analysis

Nucleic Acids Res.

2399

–

2412

23.

Sanger

Nicklen

and

Coulson

A.R.

(

1977

)

DNA sequencing with chain-terminating inhibitors

Proc. Natl. Acad. Sci. U.S.A.

5463

–

5467

24.

Crossley

B.M.

Bai

Glaser

et al. (

2020

)

Guidelines for Sanger sequencing and molecular assay monitoring

J. Vet. Diagn. Invest.

767

–

775

25.

Mardis

E.R.

(

2008

)

The impact of next-generation sequencing technology on genetics

Trends Genet.

133

–

141

26.

van Dijk

E.L.

Auger

Jaszczyszyn

et al. (

2014

)

Ten years of next-generation sequencing technology

Trends Genet.

418

–

426

27.

Buermans

H.P.

and

den Dunnen

J.T.

(

2014

)

Next generation sequencing technology: advances and applications

Biochim. Biophys. Acta

1842

1932

–

1941

28.

Slatko

B.E.

Gardner

A.F.

and

Ausubel

F.M.

(

2018

)

Overview of next-generation sequencing technologies

Curr. Protoc. Mol. Biol.

122

, e59.

29.

Ekblom

and

Wolf

J.B.

(

2014

)

A field guide to whole-genome sequencing, assembly and annotation

Evol. Appl.

1026

–

1042

30.

English

A.C.

Richards

Han

et al. (

2012

)

Mind the gap: upgrading genomes with pacific biosciences RS long-read sequencing technology

PLoS One

, e47768.

31.

Huddleston

Ranade

Malig

et al. (

2014

)

Reconstructing complex regions of genomes using long-read sequencing technology

Genome Res.

688

–

696

32.

Wang

Zhao

Bollas

et al. (

2021

)

Nanopore sequencing technology, bioinformatics and applications

Nat. Biotechnol.

1348

–

1365

33.

Marx

(

2023

)

Method of the year: long-read sequencing

Nat. Methods

–

34.

Chen

Sun

Wang

et al. (

2023

)

Portable nanopore-sequencing technology: trends in development and applications

Front Microbiol.

, 1043967.

35.

Wick

R.R.

Judd

L.M.

and

Holt

K.E.

(

2019

)

Performance of neural network basecalling tools for Oxford Nanopore sequencing

Genome Biol.

, 129.

36.

Grodzicker

Williams

Sharp

et al. (

1975

)

Physical mapping of temperature-sensitive mutations of adenoviruses

Cold Spring Harb. Symp. Quant. Biol.

439

–

446

37.

Yang

Kang

Yang

et al. (

2013

)

Review on the development of genotyping methods for assessing farm animal diversity

J. Anim. Sci. Biotechnol.

, 2.

38.

Carvalho

Bengtsson

Speed

T.P.

et al. (

2007

)

Exploration, normalization, and genotype calls of high-density oligonucleotide SNP array data

Biostatistics

485

–

499

39.

Chagne

Crowhurst

R.N.

Troggio

et al. (

2012

)

Genome-wide SNP detection, validation, and development of an 8K SNP array for apple

PLoS One

, e31745.

40.

Bayer

M.M.

Rapazote-Flores

Ganal

et al. (

2017

)

Development and evaluation of a barley 50k iSelect SNP Array

Front. Plant Sci.

, 1792.

41.

Verde

Jenkins

Dondini

et al. (

2017

)

The Peach v2.0 release: high-resolution linkage mapping and deep resequencing improve chromosome-scale assembly and contiguity

BMC Genomics

, 225.

42.

Ganal

M.W.

Polley

Graner

E.M.

et al. (

2012

)

Large SNP arrays for genotyping in crop plants

J. Biosci.

821

–

828

43.

McKain

M.R.

Johnson

M.G.

Uribe-Convers

et al. (

2018

)

Practical considerations for plant phylogenomics

Appl. Plant Sci.

, e1038.

44.

Kumar

Choudhary

Jat

B.S.

et al. (

2021

)

Skim sequencing: an advanced NGS technology for crop improvement

J. Genet.

100

–

45.

Schmickl

Liston

Zeisek

et al. (

2016

)

Phylogenetic marker development for target enrichment from transcriptome and genome skim data: the pipeline and its application in southern African Oxalis (Oxalidaceae)

Mol. Ecol. Resour.

1124

–

1135

46.

Head

S.R.

Komori

H.K.

LaMere

S.A.

et al. (

2014

)

Library construction for next-generation sequencing: overviews and challenges

Biotechniques

–

64, 66, 68, passim

47.

Deschamps

Llaca

and

May

G.D.

(

2012

)

Genotyping-by-Sequencing in Plants

Biology

460

–

483

48.

Elshire

R.J.

Glaubitz

J.C.

Sun

et al. (

2011

)

A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species

PLoS One

, e19379.

49.

Andrews

K.R.

Good

J.M.

Miller

M.R.

et al. (

2016

)

Harnessing the power of RADseq for ecological and evolutionary genomics

Nat. Rev. Genet.

–

50.

Miller

M.R.

Dunham

J.P.

Amores

et al. (

2007

)

Rapid and cost-effective polymorphism identification and genotyping using restriction site associated DNA (RAD) markers

Genome Res.

240

–

248

51.

Danecek

Auton

Abecasis

et al. (

2011

)

The variant call format and VCFtools

Bioinformatics

2156

–

2158

52.

Lyon

M.S.

Andrews

S.J.

Elsworth

et al. (

2021

)

The variant call format provides efficient and robust storage of GWAS summary statistics

Genome Biol.

, 32.

53.

Leinonen

Sugawara

Shumway

et al. (

2011

)

The sequence read archive

Nucleic Acids Res.

D19

–

54.

Kodama

Shumway

Leinonen

et al. (

2012

)

The sequence read archive: explosive growth of sequencing data

Nucleic Acids Res.

D54

–

55.

Edgar

Domrachev

and

Lash

A.E.

(

2002

)

Gene expression omnibus: NCBI gene expression and hybridization array data repository

Nucleic Acids Res.

207

–

210

56.

Barrett

and

Edgar

(

2006

)

Gene expression omnibus: microarray data storage, submission, retrieval, and analysis

Methods Enzymol.

411

352

–

369

57.

Clough

and

Barrett

(

2016

) The gene expression omnibus database. In:

Statistical Genomics: Methods and Protocols

, pp.

–

110

58.

Tateno

Imanishi

Miyazaki

et al. (

2002

)

DNA Data Bank of Japan (DDBJ) for genome scale research in life science

Nucleic Acids Res.

–

59.

Miyazaki

Sugawara

Ikeo

et al. (

2004

)

DDBJ in the stream of various biological data

Nucleic Acids Res.

D31

–

60.

Ogasawara

Kodama

Mashima

et al. (

2020

)

DDBJ Database updates and computational infrastructure enhancement

Nucleic Acids Res.

D45

–

D50

61.

Cochrane

Karsch-Mizrachi

Nakamura

et al. (

2011

)

The international nucleotide sequence database collaboration

Nucleic Acids Res.

D15

–

62.

Cochrane

Karsch-Mizrachi

Takagi

et al. (

2016

)

The International Nucleotide Sequence Database Collaboration

Nucleic Acids Res.

D48

–

63.

(

2020

)

Promoting best practice in nucleotide sequence data sharing

Sci. Data

, 152.

64.

Nordberg

Cantor

Dusheyko

et al. (

2014

)

The genome portal of the department of energy joint genome institute: 2014 updates

Nucleic Acids Res.

D26

–

65.

Sreedasyam

Plott

Hossain

M.S.

et al. (

2023

)

JGI Plant Gene Atlas: an updateable transcriptome resource to improve functional gene descriptions across the plant kingdom

Nucleic Acids Res.

8383

–

8401

66.

Goodstein

D.M.

Shu

Howson

et al. (

2012

)

Phytozome: a comparative platform for green plant genomics

Nucleic Acids Res.

D1178

–

1186

67.

Members

C.-N.

and

Partners

(

2021

)

Database Resources of the National Genomics Data Center, China National Center for Bioinformation in 2021

Nucleic Acids Res.

D18

–

D28

68.

Cezard

Cunningham

Hunt

S.E.

et al. (

2022

)

The European Variation Archive: a FAIR resource of genomic variation for all species

Nucleic Acids Res.

D1216

–

D1220

69.

Song

Tian

et al. (

2018

)

Genome Variation Map: a data repository of genome variations in BIG Data Center

Nucleic Acids Res.

D944

–

D949

70.

Chang

Song

Zhang

et al. (

2022

)

Robust CRISPR/Cas9 mediated gene editing of JrWOX11 manipulated adventitious rooting and vegetative growth in a nut tree species of walnut

Sci. Hortic.

303

, 111199.

71.

International Hapmap

(

2003

)

The International HapMap Project

Nature

426

789

–

796

72.

Jung

Jesudurai

Staton

et al. (

2004

)

GDR (Genome Database for Rosaceae): integrated web resources for Rosaceae genomics and genetics research

BMC Bioinf.

, 130.

73.

Jung

Lee

Cheng

C.H.

et al. (

2019

)

15 years of GDR: new data and functionality in the Genome Database for Rosaceae

Nucleic Acids Res.

D1137

–

D1145

74.

Jung

Cheng

C.H.

et al. (

2014

)

CottonGen: a genomics, genetics and breeding database for cotton research

Nucleic Acids Res.

D1229

–

1236

75.

Jung

Cheng

C.H.

et al. (

2021

)

CottonGen: the community database for cotton genomics, genetics, and breeding research

Plants

, 2805.

76.

Grant

Nelson

R.T.

Cannon

S.B.

et al. (

2010

)

SoyBase, the USDA-ARS soybean genetics and genomics database

Nucleic Acids Res.

D843

–

846

77.

Brown

A.V.

Conners

S.I.

Huang

et al. (

2021

)

A new decade and new data at SoyBase, the USDA-ARS soybean genetics and genomics database

Nucleic Acids Res.

D1496

–

D1501

78.

Gonzales

M.D.

Archuleta

Farmer

et al. (

2005

)

The Legume Information System (LIS): an integrated information resource for comparative legume biology

Nucleic Acids Res.

D660

–

665

79.

Dash

Campbell

J.D.

Cannon

E.K.

et al. (

2016

)

Legume information system (LegumeInfo.org): a key component of a set of federated data resources for the legume family

Nucleic Acids Res.

D1181

–

1188

80.

Fernandez-Pozo

Menda

Edwards

J.D.

et al. (

2015

)

The Sol Genomics Network (SGN)—from genotype to phenotype to breeding

Nucleic Acids Res.

D1036

–

1041

81.

Foerster

Bombarely

Battey

J.N.D.

et al. (

2018

)

SolCyc: a database hub at the Sol Genomics Network (SGN) for the manual curation of metabolic networks in Solanum and Nicotiana specific databases

Database (Oxford)

2018

, bay035.

82.

Lawrence

C.J.

(

2007

)

MaizeGDB

Methods Mol. Biol.

406

331

–

345

83.

Portwood

J.L.

2nd,

Woodhouse

M.R.

Cannon

E.K.

et al. (

2019

)

MaizeGDB 2018: the maize multi-genome genetics and genomics database

Nucleic Acids Res.

D1146

–

D1154

84.

Wegrzyn

J.L.

Lee

J.M.

Tearse

B.R.

et al. (

2008

)

TreeGenes: a forest tree genome database

Int. J. Plant Genomics

2008

, 412875.

85.

Falk

Herndon

Grau

et al. (

2019

)

Growing and cultivating the forest genomics database, TreeGenes

Database

2019

, bay084.

86.

Garcia-Hernandez

Berardini

T.Z.

Chen

et al. (

2002

)

TAIR: a resource for integrated Arabidopsis data

Funct. Integr. Genomics

239

–

253

87.

Poole

R.L.

(

2007

)

The TAIR database

Methods Mol. Biol.

406

179

–

212

88.

Sanderson

L.A.

Caron

C.T.

Tan

et al. (

2019

)

KnowPulse: A web-resource focused on diversity data for pulse crop improvement

Front. Plant Sci.

, 965.

89.

Smith

R.N.

Aleksic

Butano

et al. (

2012

)

InterMine: a flexible data warehouse system for the integration and analysis of heterogeneous biological data

Bioinformatics

3163

–

3165

90.

Kalderimis

Lyne

Butano

et al. (

2014

)

InterMine: extensive web services for modern biology

Nucleic Acids Res.

W468

–

472

91.

Tello-Ruiz

M.K.

Jaiswal

and

Ware

(

2022

)

Gramene: a resource for comparative analysis of plants genomes and pathways

Methods Mol. Biol.

2443

101

–

131

92.

Ware

(

2007

)

Gramene

Methods Mol. Biol.

406

315

–

329

93.

Ware

D.H.

Jaiswal

et al. (

2002

)

Gramene, a tool for grass genomics

Plant Physiol.

130

1606

–

1613

94.

Gladman

Olson

Wei

et al. (

2022

)

SorghumBase: a web-based portal for sorghum genetic information and community advancement

Planta

255

, 35.

95.

Lyne

Sullivan

Butano

et al. (

2015

)

Cross-organism analysis using InterMine

Genesis

547

–

560

96.

Paajanen

Kettleborough

Lopez-Girona

et al. (

2019

)

A critical comparison of technologies for a plant genome sequencing project

Gigascience

, giy163.

97.

Sun

Shang

Zhu

Q.H.

et al. (

2022

)

Twenty years of plant genome sequencing: achievements and challenges

Trends Plant Sci.

391

–

401

98.

Pucker

Irisarri

de Vries

et al. (

2022

)

Plant genome sequence assembly in the era of long reads: Progress, challenges and future directions

Quant. Plant Biol.

, e5.

99.

Shi

Tian

Lai

et al. (

2023

)

Plant pan-genomics and its applications

Mol. Plant

168

–

186

100.

S.S.

Urban

A.E.

and

Mills

R.E.

(

2020

)

Structural variation in the sequencing era

Nat. Rev. Genet.

171

–

189

101.

Quan

et al. (

2022

)

Population-scale genotyping of structural variation in the era of long-read sequencing

Comput. Struct. Biotechnol. J.

2639

–

2647

102.

Sun

Wang

et al. (

2020

)

Dissection of complex traits of tomato in the post-genome era

Theor. Appl. Genet.

133

1763

–

1776

103.

Lye

Z.N.

and

Purugganan

M.D.

(

2019

)

Copy number variation in domestication

Trends Plant Sci.

352

–

365

104.

Hovhannisyan

Harutyunyan

Aroutiounian

et al. (

2019

)

DNA copy number variations as markers of mutagenic impact

Int. J. Mol. Sci.

, 4723.

105.

Dolatabadian

Patel

D.A.

Edwards

et al. (

2017

)

Copy number variation and disease resistance in plants

Theor. Appl. Genet.

130

2479

–

2490

106.

Yuan

Bayer

P.E.

Batley

et al. (

2021

)

Current status of structural variation studies in plants

Plant Biotechnol. J.

2153

–

2163

107.

Alonge

Wang

Benoit

et al. (

2020

)

Major impacts of widespread structural variation on gene expression and crop improvement in tomato

Cell

182

145

–

161 e123

108.

Chawla

H.S.

Lee

Gabur

et al. (

2021

)

Long-read sequencing reveals widespread intragenic structural variants in a recent allopolyploid crop plant

Plant Biotechnol. J.

240

–

250

109.

Xia

Zhang

et al. (

2019

)

Plant editosome database: a curated database of RNA editosome in plants

Nucleic Acids Res.

D170

–

D174

110.

Thao

N.P.

and

Tran

L.S.

(

2016

)

Enhancement of plant productivity in the post-genomics era

Curr. Genomics

295

–

296

111.

Pan

Wei

Guo

et al. (

2019

)

Trait ontology analysis based on association mapping studies bridges the gap between crop genomics and Phenomics

BMC Genomics

, 443.

112.

Danecek

Bonfield

J.K.

Liddle

et al. (

2021

)

Twelve years of SAMtools and BCFtools

Gigascience

, giab008.

113.

Brachi

Morris

G.P.

and

Borevitz

J.O.

(

2011

)

Genome-wide association studies in plants: the missing heritability is in the field

Genome Biol.

, 232.

114.

Gali

K.K.

Sackville

Tafesse

E.G.

et al. (

2019

)

Genome-wide association mapping for agronomic and seed quality traits of field pea (Pisum sativum L.)

Front. Plant Sci.

, 1538.

115.

Khan

S.U.

Saeed

Khan

M.H.U.

et al. (

2021

)

Advances and challenges for QTL analysis and GWAS in the plant-breeding of high-yielding: a focus on rapeseed

Biomolecules

, 1516.

116.

Tibbs Cortes

Zhang

and

(

2021

)

Status and prospects of genome-wide association studies in plants

Plant Genome.

, e20077.

117.

Liu

Hua

et al. (

2015

)

Natural variation in ARF18 gene simultaneously affects seed weight and silique length in polyploid rapeseed

Proc. Natl. Acad. Sci. U.S.A.

112

E5123

–

5132

118.

Christeller

J.T.

McGhie

T.K.

Johnston

J.W.

et al. (

2019

)

Quantitative trait loci influencing pentacyclic triterpene composition in apple fruit peel

Sci. Rep.

, 18501.

119.

Chagné

Ryan

Saeed

et al. (

2019

)

A high density linkage map and quantitative trait loci for tree growth for New Zealand mānuka (Leptospermum scoparium)

N. Z. J. Crop Hortic. Sci.

261

–

272

120.

Budhlakoti

Kushwaha

A.K.

Rai

et al. (

2022

)

Genomic selection: a tool for accelerating the efficiency of molecular breeding for development of climate-resilient crops

Front. Genet.

, 832153.

121.

Bhat

J.A.

Ali

Salgotra

R.K.

et al. (

2016

)

Genomic selection in the era of next generation sequencing for complex traits in plant breeding

Front. Genet.

, 221.

122.

Crossa

Perez-Rodriguez

Cuevas

et al. (

2017

)

Genomic selection in plant breeding: methods, models, and perspectives

Trends Plant Sci.

961

–

975

123.

Fasoula

D.A.

Ioannides

I.M.

and

Omirou

(

2019

)

Phenotyping and plant breeding: overcoming the barriers

Front. Plant Sci.

, 1713.

124.

Akiyama

Kurotani

Iida

et al. (

2014

)

RARGE II: an integrated phenotype database of Arabidopsis mutant traits using a controlled vocabulary

Plant Cell Physiol.

, e4.

125.

Miroslaw

(

2001

)

Officially Released Mutant Varieties – The FAO/IAEA Database

Plant Cell Tissue Organ. Cult.

175

–

177

126.

Zheng

Zhang

Martin

G.B.

et al. (

2019

)

Plant Genome Editing Database (PGED): a call for submission of information about genome-edited plant Mutants

Mol. Plant

127

–

129

127.

Shikata

Hoshikawa

Ariizumi

et al. (

2016

)

TOMATOMA update: phenotypic and metabolite information in the micro-tom mutant resource

Plant Cell Physiol.

, e11.

128.

McGill

B.J.

Enquist

B.J.

Weiher

et al. (

2006

)

Rebuilding community ecology from functional traits

Trends Ecol. Evol.

178

–

185

129.

Violle

Navas

Vile

et al. (

2007

)

Let the concept of trait be functional!

Oikos

116

882

–

892

130.

Schneider

F.D.

Fichtmueller

Gossner

M.M.

et al. (

2019

)

Towards an ecological trait‐data standard

Meth. Ecol. Evolut

2006

–

2019

131.

Allan

Manning

Alt

et al. (

2015

)

Land use intensification alters ecosystem multifunctionality via loss of biodiversity and changes to functional composition

Ecol. Lett.

834

–

843

132.

Diaz

Quetier

Caceres

D.M.

et al. (

2011

)

Linking functional diversity and social actor strategies in a framework for interdisciplinary analysis of nature’s benefits to society

Proc. Natl. Acad. Sci. U.S.A.

108

895

–

902

133.

Lavorel

and

Grigulis

(

2012

)

How fundamental plant functional trait relationships scale-up to trade-offs and synergies in ecosystem services

J. Ecol.

100

128

–

140

134.

Pujar

Youens-Clark

et al. (

2009

)

Gramene QTL database: development, content and applications

Database (Oxford)

2009

, bap005.

135.

Singh

Batra

Sharma

et al. (

2021

)

WheatQTLdb: a QTL database for wheat

Mol. Genet. Genomics

296

1051

–

1056

136.

Reich

P.B.

Wright

I.J.

and

Lusk

C.H.

(

2007

)

Predicting leaf physiology from simple plant and climate attributes: a global GLOPNET analysis

Ecol. Appl.

1982

–

1988

137.

Kissling

W.D.

Walls

Bowser

et al. (

2018

)

Towards global data products of Essential Biodiversity Variables on species traits

Nat. Ecol. Evol.

1531

–

1540

138.

Peat

H.J.

and

Fitter

A.H.

(

1994

)

A comparative study of the distribution and density of stomata in the British flora

Biol. J. Linn. Soc. Lond.

377

–

393

139.

Poschlod

Kleyer

Jackel

A.-K.

et al. (

2003

)

BIOPOP — A database of plant traits and internet application for nature conservation

Folia Geobot.

263

–

271

140.

Garcia-Recio

Santos-Gomez

Soto

et al. (

2021

)

GRIN database: a unified and manually curated repertoire of GRIN variants

Hum. Mutat.

–

141.

Kühn

Durka

and

Klotz

(

2004

)

BiolFlor: a new plant-trait database as a tool for plant invasion ecology

Divers. Distrib.

363

–

365

142.

Kleyer

Bekker

R.M.

Knevel

I.C.

et al. (

2008

)

The LEDA Traitbase: a database of life history traits of the Northwest European flora

J. Ecol.

1266

–

1274

143.

Tavsanoglu

and

Pausas

J.G.

(

2018

)

A functional trait database for Mediterranean Basin plants

Sci. Data

, 180135.

144.

Falster

Gallagher

Wenk

E.H.

et al. (

2021

)

AusTraits, a curated plant trait database for the Australian flora

Sci. Data

, 254.

145.

Houle

Govindaraju

D.R.

and

Omholt

(

2010

)

Phenomics: the next challenge

Nat. Rev. Genet.

855

–

866

146.

Hati

A.J.

and

Singh

R.R.

(

2021

)

Artificial intelligence in smart farms: plant phenotyping for species recognition and health condition identification using deep learning

274

–

289

147.

Saleem

M.H.

Potgieter

and

Mahmood Arif

(

2019

)

Plant disease detection and classification by deep learning

Plants

, 468.

148.

Zhang

Zhou

Xiao

et al. (

2022

)

End-to-end fusion of hyperspectral and chlorophyll fluorescence imaging to identify rice stresses

Plant Phenomics

2022

, 9851096.

149.

Sandhu

K.S.

Mihalyov

P.D.

Lewien

M.J.

et al. (

2021

)

Combining genomic and phenomic information for predicting grain protein content and grain yield in spring wheat

Front. Plant Sci.

, 613300.

150.

Araus

J.L.

Kefauver

S.C.

Zaman-Allah

et al. (

2018

)

Translating high-throughput phenotyping into genetic gain

Trends Plant Sci.

451

–

466

151.

Steinbach

Alaux

Amselem

et al. (

2013

)

GnpIS: an information system to integrate genetic and genomic data from plants and fungi

Database

2013

, bat058.

152.

Pommier

Michotey

Cornut

et al. (

2019

)

Applying FAIR Principles to Plant Phenotypic Data Management in GnpIS

Plant Phenomics

2019

, 1671403.

153.

Brookes

A.J.

and

Robinson

P.N.

(

2015

)

Human genotype-phenotype databases: aims, challenges and opportunities

Nat. Rev. Genet.

702

–

715

154.

Cobo-Simón

(

2022

)

Cartograplant: cyberinfrastructure to improve forest health and productivity in the context of a changing climate

. In Plant and Animal Genome XXIX Conference,

San Diego (CA)

155.

Sansone

S.A.

McQuilton

Rocca-Serra

et al. (

2019

)

FAIRsharing as a community approach to standards, repositories and policies

Nat. Biotechnol.

358

–

367

156.

Bulow

Schindler

Choi

et al. (

2004

)

PathoPlant: a database on plant-pathogen interactions

Silico. Biol.

529

–

536

157.

Bulow

Schindler

and

Hehl

(

2007

)

PathoPlant: a platform for microarray expression data to analyze co-regulated genes involved in plant defense responses

Nucleic Acids Res.

D841

–

845

158.

et al. (

2020

)

PncStress: a manually curated database of experimentally validated stress-responsive non-coding RNAs in plants

Database

2020

, baaa001.

159.

Global Burden Of Disease Cancer

Fitzmaurice

Abate

et al. (

2019

)

Global, regional, and national cancer incidence, mortality, years of life lost, years lived with disability, and disability-adjusted life-years for 29 cancer groups, 1990 to 2017: a systematic analysis for the global burden of disease study

JAMA Oncol.

1749

–

1768

160.

Dhondt

Wuyts

and

Inze

(

2013

)

Cell to whole-plant phenotyping: the best is yet to come

Trends Plant Sci.

428

–

439

161.

Diaz

B.P.

Knowles

Johns

C.T.

et al. (

2021

)

Seasonal mixed layer depth shapes phytoplankton physiology, viral production, and accumulation in the North Atlantic

Nat. Commun.

, 6634.

162.

Adak

Murray

S.C.

Calderon

C.I.

et al. (

2023

)

Genetic mapping and prediction for novel lesion mimic in maize demonstrates quantitative effects from genetic background, environment and epistasis

Theor. Appl. Genet.

136

, 155.

163.

Hill

D.P.

D’Eustachio

Berardini

T.Z.

et al. (

2016

)

Modeling biochemical pathways in the gene ontology

Database

2016

, baw126.

164.

Poux

and

Gaudet

(

2017

)

Best practices in manual annotation with the gene ontology

Methods Mol. Biol.

1446

–

165.

Chibucos

M.C.

and

Tyler

B.M.

(

2009

)

Common themes in nutrient acquisition by plant symbiotic microbes, described by the Gene Ontology

BMC Microbiol.

, S6.

166.

Fox

S.E.

Geniza

Hanumappa

et al. (

2014

)

De novo transcriptome assembly and analyses of gene expression during photomorphogenesis in diploid wheat Triticum monococcum

PLoS One

, e96855.

167.

Vining

K.J.

Romanel

Jones

R.C.

et al. (

2015

)

The floral transcriptome of Eucalyptus grandis

New Phytol.

206

1406

–

1422

168.

Fennell

A.Y.

Schlauch

K.A.

Gouthu

et al. (

2015

)

Short day transcriptomic programming during induction of dormancy in grapevine

Front. Plant Sci.

, 834.

169.

Gupta

Geniza

Naithani

et al. (

2021

)

Chia (Salvia hispanica) gene expression atlas elucidates dynamic spatio-temporal changes associated with plant growth and development

Front. Plant Sci.

, 667678.

170.

Godoy

Kuhn

Munoz

et al. (

2021

)

The role of auxin during early berry development in grapevine as revealed by transcript profiling from pollination to fruit set

Hortic. Res.

, 140.

171.

Perez-Riverol

Q.W.

Wang

et al. (

2016

)

PRIDE Inspector Toolsuite: moving toward a universal visualization tool for proteomics data standard formats and quality assessment of ProteomeXchange datasets

Mol. Cell. Proteomics

305

–

317

172.

Kosova

Vitamvas

Urban

M.O.

et al. (

2018

)

Plant abiotic stress proteomics: the major factors determining alterations in cellular proteome

Front. Plant Sci.

, 122.

173.

Jarnuczak

A.F.

and

Vizcaino

J.A.

(

2017

)

Using the PRIDE Database and ProteomeXchange for submitting and accessing public proteomics datasets

Curr. Protoc. Bioinfor.

13 31 11

–

13 31 12

174.

Okuda

Watanabe

Moriya

et al. (

2017

)

jPOSTrepo: an international standard data repository for proteomes

Nucleic Acids Res.

D1107

–

D1111

175.

Moriya

Kawano

Okuda

et al. (

2019

)

The jPOST environment: an integrated proteomics data repository and database

Nucleic Acids Res.

D1218

–

D1224

176.

Chen

Liu

et al. (

2022

)

iProX in 2021: connecting proteomics data sharing with big data

Nucleic Acids Res.

D1522

–

D1527

177.

Chen

et al. (

2019

)

iProX: an integrated proteome resource

Nucleic Acids Res.

D1211

–

D1217

178.

Sharma

Eckels

Taylor

G.K.

et al. (

2014

)

Panorama: a targeted proteomics knowledge base

J. Proteome. Res.

4205

–

4210

179.

Desiere

Deutsch

E.W.

King

N.L.

et al. (

2006

)

The Peptide Atlas project

Nucleic Acids Res.

D655

–

658

180.

Deutsch

E.W.

(

2010

)

The PeptideAtlas Project

Methods Mol. Biol.

604

285

–

296

181.

Tsugawa

Rai

Saito

et al. (

2021

)

Metabolomics and complementary techniques to investigate the plant phytochemical cosmos

Nat. Prod. Rep.

1729

–

1759

182.

Members

M.S.I.B.

Sansone

S.A.

Fan

et al. (

2007

)

The metabolomics standards initiative

Nat. Biotechnol.

846

–

848

183.

Sumner

L.W.

Amberg

Barrett

et al. (

2007

)

Proposed minimum reporting standards for chemical analysis Chemical Analysis Working Group (CAWG) Metabolomics Standards Initiative (MSI)

Metabolomics

211

–

221

184.

Vinaixa

Schymanski

E.L.

Neumann

et al. (

2016

)

Mass spectral databases for LC/MS- and GC/MS-based metabolomics: state of the field and future prospects

TrAC

–

185.

Salek

R.M.

Neumann

Schober

et al. (

2015

)

COordination of Standards in MetabOlomicS (COSMOS): facilitating integrated metabolomics data access

Metabolomics

1587

–

1597

186.

Steinbeck

Conesa

Haug

et al. (

2012

)

MetaboLights: towards a new COSMOS of metabolomics data management

Metabolomics

757

–

760

187.

Considine

E.C.

and

Salek

R.M.

(

2019

)

A tool to encourage minimum reporting guideline uptake for data analysis in metabolomics

Metabolites

: 43.

188.

Schorn

M.A.

Verhoeven

Ridder

et al. (

2021

)

A community resource for paired genomic and metabolomic data mining

Nat. Chem. Biol.

363

–

368

189.

Cooper

and

Jaiswal

(

2016

)

The Plant Ontology: a tool for plant genomics

Methods Mol. Biol.

1374

–

114

190.

Cooper

Walls

R.L.

Elser

et al. (

2013

)

The plant ontology as a tool for comparative plant anatomy and genomic analyses

Plant Cell Physiol

, e1.

191.

Avraham

Tung

C.W.

Ilic

et al. (

2008

)

The Plant Ontology Database: a community resource for plant structure and developmental stages controlled vocabulary and annotations

Nucleic Acids Res.

D449

–

454

192.

Warman

Sullivan

C.M.

Preece

et al. (

2021

)

A cost-effective maize ear phenotyping platform enables rapid categorization and quantification of kernels

Plant J.

106

566

–

579

193.

Oellrich

Walls

R.L.

Cannon

E.K.

et al. (

2015

)

An ontology approach to comparative phenomics in plants

Plant Methods

, 10.

194.

Cooper

Meier

Laporte

M.A.

et al. (

2018

)

The Planteome database: an integrated resource for reference ontologies, plant genomics and phenomics

Nucleic Acids Res.

D1168

–

D1180

195.

Tello-Ruiz

M.K.

Naithani

Gupta

et al. (

2021

)

Gramene 2021: harnessing the power of comparative genomics and pathways for plant research

Nucleic Acids Res.

D1452

–

D1463

196.

Naithani

and

Jaiswal

(

2017

)

Pathway analysis and omics data visualization using pathway genome databases: FragariaCyc, a case study

Methods Mol. Biol.

1533

241

–

256

197.

Naithani

Raja

Waddell

E.N.

et al. (

2014

)

VitisCyc: a metabolic pathway knowledgebase for grapevine (Vitis vinifera)

Front. Plant Sci.

, 644.

198.

Gupta

Naithani

Preece

et al. (

2022

)

Plant reactome and PubChem: the plant pathway and (Bio)Chemical Entity Knowledgebases

Methods Mol. Biol.

2443

511

–

525

199.

Naithani

Gupta

Preece

et al. (

2020

)

Plant Reactome: a knowledgebase and resource for comparative pathway analysis

Nucleic Acids Res.

D1093

–

D1103

200.

Jaiswal

and

Usadel

(

2016

)

Plant Pathway Databases

Methods Mol. Biol.

1374

–

201.

Kattge

Ogle

Bönisch

et al. (

2011

)

A generic structure for plant trait databases

Meth. Ecol. Evolut.

202

–

213

202.

van Kleunen

Pysek

Dawson

et al. (

2019

)

The Global Naturalized Alien Flora (GloNAF) database

Ecology

100

, e02542.

203.

Manolio

T.A.

Collins

F.S.

Cox

N.J.

et al. (

2009

)

Finding the missing heritability of complex diseases

Nature

461

747

–

753

204.

Visscher

P.M.

Brown

M.A.

McCarthy

M.I.

et al. (

2012

)

Five years of GWAS discovery

Am. J. Hum. Genet.

–

205.

Visscher

P.M.

Wray

N.R.

Zhang

et al. (

2017

)

10 years of GWAS discovery: biology, function, and translation

Am. J. Hum. Genet.

101

–

206.

Uffelmann

Huang

Q.Q.

Munung

N.S.

et al. (

2021

)

Genome-wide association studies

Nat. Rev. Methods Primers

–

207.

Falconer

D.S.

and

Mackay

T.F.C.

(

1996

)

Introduction to Quantitative Genetics

Longmans Green, Harlow

Essex, UK

Google Preview

208.

Kearsey

M.J.

(

1998

)

The principles of QTL analysis (a minimal mathematics approach)

J. Exp. Bot.

1619

–

1623

209.

Lynch

and

Walsh

B.V.

(

1998

)

Genetics and Analysis of Quantitative Traits

. Sinauer Associates,

Sunderland, MA

Google Preview

210.

Sallam

Eltaher

Alqudah

A.M.

et al. (

2022

)

Combined GWAS and QTL mapping revealed candidate genes and SNP network controlling recovery and tolerance traits associated with drought tolerance in seedling winter wheat

Genomics

114

, 110358.

211.

Hayes

B.J.

Gjuvsland

and

Omholt

(

2006

)

Power of QTL mapping experiments in commercial Atlantic salmon populations, exploiting linkage and linkage disequilibrium and effect of limited recombination in males

Heredity

–

212.

Joiret

Mahachie John

J.M.

Gusareva

E.S.

et al. (

2019

)

Confounding of linkage disequilibrium patterns in large scale DNA based gene-gene interaction studies

BioData Min.

, 11.

213.

Hartl

D.L.

Clark

A.G.

and

Clark

A.G.

(

1997

)

Principles of Population Genetics

. Sinauer Associates,

Sunderland, MA

Google Preview

214.

Lee

Y.H.

(

2015

)

Meta-analysis of genetic association studies

Ann. Lab. Med.

283

–

287

215.

Dehghan

(

2018

)

Genome-wide association studies

Methods Mol. Biol.

1793

–

216.

Buniello

MacArthur

J.A.L.

Cerezo

et al. (

2019

)

The NHGRI-EBI GWAS Catalog of published genome-wide association studies, targeted arrays and summary statistics 2019

Nucleic Acids Res.

D1005

–

D1012

217.

Togninalli

Seren

Freudenthal

J.A.

et al. (

2020

)

AraPheno and the AraGWAS Catalog 2020: a major database update including RNA-Seq and knockout mutation data for Arabidopsis thaliana

Nucleic Acids Res.

D1063

–

D1068

218.

Zeggini

and

Ioannis

J.P.A.

(

2009

)

Meta-analysis in genome-wide association studies

Pharmacogenomics

191

–

201

219.

Soriano

J.M.

Colasuonno

Marcotuli

et al. (

2021

)

Meta-QTL analysis and identification of candidate genes for quality, abiotic and biotic stress in durum wheat

Sci. Rep.

, 11877.

220.

Kraft

Zeggini

and

Ioannidis

J.P.

(

2009

)

Replication in genome-wide association studies

Stat Sci.

561

–

573

221.

Zhang

Yin

et al. (

2018

)

QTL-by-environment interaction in the response of maize root and shoot traits to different water regimes

Front. Plant Sci.

, 229.

222.

Lowry

D.B.

Lovell

J.T.

Zhang

et al. (

2019

)

QTL × environment interactions underlie adaptive divergence in switchgrass across a large latitudinal gradient

Proc. Natl. Acad. Sci.

116

12933

–

12941

223.

Pinu

F.R.

Beale

D.J.

Paten

A.M.

et al. (

2019

)

Systems biology and multi-omics integration: viewpoints from the metabolomics research community

Metabolites

, 76.

224.

Pacheco

A.R.

Pauvert

Kishore

et al. (

2022

)

Toward FAIR Representations of Microbial Interactions

mSystems

, e0065922.

225.

Sumner

L.W.

Styczynski

McLean

et al. (

2015

)

Introducing the USA plant, algae and microbial metabolomics research coordination network (PAMM-NET)

Metabolomics

–

226.

Kodra

Pousinis

Vorkas

P.A.

et al. (

2022

)

Is current practice adhering to guidelines proposed for metabolite identification in LC-MS untargeted metabolomics? A meta-analysis of the literature

J. Proteome Res.

590

–

598

227.

Schroeder

Meyer

S.W.

Heyman

H.M.

et al. (

2019

)

Generation of a collision cross section library for multi-dimensional plant metabolomics using UHPLC-Trapped Ion Mobility-MS/MS

Metabolites

, 13.

228.

Wilkinson

M.D.

Dumontier

Aalbersberg

I.J.

et al. (

2016

)

The FAIR Guiding Principles for scientific data management and stewardship

Sci. Data

, 160018.

229.

Jeliazkova

Apostolova

M.D.

Andreoli

et al. (

2021

)

Towards FAIR nanosafety data

Nat. Nanotechnol.

644

–

654

230.

Iturbide

Fernandez

Gutierrez

J.M.

et al. (

2022

)

Implementation of FAIR principles in the IPCC: the WGI AR6 Atlas repository

Sci. Data

, 629.

231.

Mons

Neylon

Velterop

et al. (

2017

)

Cloudy, increasingly FAIR; revisiting the FAIR data guiding principles for the European open science cloud

Inform. Serv. Use

–