Maize Feature Store: A centralized resource to manage and analyze curated maize multi-omics features for machine learning applications
