Abstract

FlyVar is a publicly and freely available platform that addresses the increasing need of next generation sequencing data analysis in the Drosophila research community. It is composed of three parts. First, a database that contains 5.94 million DNA polymorphisms found in Drosophila melanogaster derived from whole genome shotgun sequencing of 612 genomes of D . melanogaster . In addition, a list of 1094 dispensable genes has been identified. Second, a graphical user interface (GUI) has been implemented to allow easy and flexible queries of the database. Third, a set of interactive online tools enables filtering and annotation of genomic sequences obtained from individual D . melanogaster strains to identify candidate mutations. FlyVar permits the analysis of next generation sequencing data without the need of extensive computational training or resources.

Database URL : www.iipl.fudan.edu.cn/FlyVar .

INTRODUCTION

With the rapid progress of next generation sequencing (NGS) technology, analysis and especially interpretation of large amounts of sequencing data is complex. One of the most common applications of NGS technology is to identify causative mutations. Unfortunately, the vast majority of genomic variants in any individual are polymorphisms that do not affect gene function in an obvious fashion ( 1 ). One of the most effective ways to distinguish a damaging mutation from a polymorphism is by comparing affected and control genomes. In general, variations that show no enrichment in affected individuals, and/or are shared by a large number of controls, are likely to be benign polymorphisms and can often be excluded from further investigation ( 2 , 3 ). The efficiency of this filtering step largely depends on the size of the control cohort. In humans, several large scale projects have been carried out to categorize variants of the human genome, such as the 1000 genomes project ( 4 ). Results obtained from these projects are essential resources for human genetics research. Similar NGS based methods have also been shown to be practical for mutation identification in model organisms, including D. melanogaster ( 5 , 6 ). However, the key challenge is that the density of polymorphisms is high among fly strains. Recent studies of natural populations indicate the presence of more than 1 polymorphism per 500 base pairs in the fly genome, more than two times higher than that in the human genome ( 6–8 ). As a result, identification of a causative mutation among such a large number of polymorphisms is challenging. The best approach to solve this issue is to sequence parental strains from which a mutant is derived and use it as the control to identify variants that are unique to the mutant strain ( 9 ). However, this approach cannot be applied to the vast majority of the existing mutants that have been accumulated over the past 100 years as their parental strains do not exist anymore. Alternatively, similar to the human genetics field, a database containing a large number of variants observed from control individuals could be used to filter out polymorphisms. Existing databases usually contain a relatively small number of variants, ranging from several hundreds to tens of thousands ( 10–13 ). Recently dbSNP reports 5.2 million sequence variations for D. melanogaster , in which these variants could be queried. While, none of these databases offer variant filtering tools that remove out benign variants from raw sequence data to help narrow down mutation candidates.

To resolve this issue, we have constructed a database, FlyVar ( www.iipl.fudan.edu.cn/FlyVar ), by collecting data from both our own and public whole genome sequencing (WGS) projects. Currently, FlyVar contains 4.86 million single nucleotide polymorphisms (SNPs) and 1.08 million indels. A set of web based interactive tools has been developed to enable easy filtering of each variant to evaluate and prioritize variants. Through the GUI, users will be able to efficiently analyse individual fruit fly genome data without the requirement of extensive bioinformatics skills or computational resources.

DATA SOURCE AND PROCESSING

Variants were collected from both our internal data and public data. Most of the internal data comes from the sequencing of a collection of X chromosome lethal mutants ( 9 ). In brief, EMS mutagenesis was performed and individual fly strains carrying lethal mutations on the X chromosome were established. Since the parental fly strains used for mutagenesis were isogenized for the X chromosome only, WGS of individual mutant strains allowed us to identify autosomal variants existing in the population of parental and balancer stocks. As a result, 0.92 million variants were identified. To further increase the size of the database, we have also deposited variants obtained from public sequencing data. Most of the public data are derived from the Drosophila Genome Research Project (DGRP) ( 14 ). For the DGRP, flies were collected at a food market in Raleigh, NC, USA and were used to establish many inbred strains, which were extensively phenotyped. WGS was performed for 205 inbred strains, from which 6.14 million variants were identified ( 15 ). As all these inbred strains are viable and free of apparent developmental defects, homozygous variants in these strains are likely to have minimal effects on development. Hence, a total of 4.7 million homozygous variants were identified and added into the FlyVar database. In addition, we have also included variants identified in four studies ( 7 , 14 , 16–18 ). All of these projects consisted of WGS of natural populations of D . melanogaster . From 49 WGS data sets, we were able to include an additional 0.32 million variants into the database. In total, 612 WGS data of D. melanogaster strains was collected into FlyVar, representing the most comprehensive data set.

SUMMARY OF VARIANTS

A summary of variants by chromosome and their annotation is shown in Table 1 for SNPs and Table 2 for small insertion/deletions. To avoid potential mutations generated by EMS on X chromosome, only variants from the autosomes are included in our database for the X chromosome lethal screen project.

Table 1.

Summary of SNPs in fly

SNPs/MNPsChr2LChr2RChr3LChr3RChrXChr4
No. sites overall1 109 349916 178110 51121 079 279638 52613552
Variant density(per kb)44.253943.437445.089238.759128.696210.4878
No. in Splicing10131130962113040320
No. in UTR326 35428 09227 87530 86217 759469
No. in UTR520 77421 90622 02923 41311 741286
No. in upstream a74 08370 4177320574 49639 628822
No. in downstream b59 8595512860 26159 65234 243744
No. in intergenic351 864238 941349 703302 888206 4433484
No. in intron437 754370 587439 610450 121258 3696271
No. in exon137 657129 977131 467136 71769 9401456
 Synonymous89 03283 26283 27484 09446 512604
 Nonsynonymous48 05846 15947 68852 01723 224842
 Stop loss65755783260
 Stop gain50248144852317810
SNPs/MNPsChr2LChr2RChr3LChr3RChrXChr4
No. sites overall1 109 349916 178110 51121 079 279638 52613552
Variant density(per kb)44.253943.437445.089238.759128.696210.4878
No. in Splicing10131130962113040320
No. in UTR326 35428 09227 87530 86217 759469
No. in UTR520 77421 90622 02923 41311 741286
No. in upstream a74 08370 4177320574 49639 628822
No. in downstream b59 8595512860 26159 65234 243744
No. in intergenic351 864238 941349 703302 888206 4433484
No. in intron437 754370 587439 610450 121258 3696271
No. in exon137 657129 977131 467136 71769 9401456
 Synonymous89 03283 26283 27484 09446 512604
 Nonsynonymous48 05846 15947 68852 01723 224842
 Stop loss65755783260
 Stop gain50248144852317810

a Upstream: variant overlaps 1-kb region upstream of transcription start site.

b Downstream: variant overlaps 1-kb region downstream of transcription end site.

Table 1.

Summary of SNPs in fly

SNPs/MNPsChr2LChr2RChr3LChr3RChrXChr4
No. sites overall1 109 349916 178110 51121 079 279638 52613552
Variant density(per kb)44.253943.437445.089238.759128.696210.4878
No. in Splicing10131130962113040320
No. in UTR326 35428 09227 87530 86217 759469
No. in UTR520 77421 90622 02923 41311 741286
No. in upstream a74 08370 4177320574 49639 628822
No. in downstream b59 8595512860 26159 65234 243744
No. in intergenic351 864238 941349 703302 888206 4433484
No. in intron437 754370 587439 610450 121258 3696271
No. in exon137 657129 977131 467136 71769 9401456
 Synonymous89 03283 26283 27484 09446 512604
 Nonsynonymous48 05846 15947 68852 01723 224842
 Stop loss65755783260
 Stop gain50248144852317810
SNPs/MNPsChr2LChr2RChr3LChr3RChrXChr4
No. sites overall1 109 349916 178110 51121 079 279638 52613552
Variant density(per kb)44.253943.437445.089238.759128.696210.4878
No. in Splicing10131130962113040320
No. in UTR326 35428 09227 87530 86217 759469
No. in UTR520 77421 90622 02923 41311 741286
No. in upstream a74 08370 4177320574 49639 628822
No. in downstream b59 8595512860 26159 65234 243744
No. in intergenic351 864238 941349 703302 888206 4433484
No. in intron437 754370 587439 610450 121258 3696271
No. in exon137 657129 977131 467136 71769 9401456
 Synonymous89 03283 26283 27484 09446 512604
 Nonsynonymous48 05846 15947 68852 01723 224842
 Stop loss65755783260
 Stop gain50248144852317810

a Upstream: variant overlaps 1-kb region upstream of transcription start site.

b Downstream: variant overlaps 1-kb region downstream of transcription end site.

Table 2.

Summary of indels in fly

INDELsChr2LChr2RChr3LChr3RChrXChr4
No. sites overall236 300195 139243 431225 411177 6973927
Variant density (per kb)10.27859.25188.74218.09507.98593.0391
No. in splicing155153140172826
No. in UTR367117318737180637135134
No. in UTR54293443247124613359679
No. in upstream a16 87716 36816 84116 41710 128241
No. in downstream b14 75913 53014 95314 39810 528274
No. in intergetic81 54557 30585 08968 85461 7991137
No. in intron108 09892432110 465108 68481 3611991
No. in exon3862360138604210306865
 Synonymous135312191103135266635
 Nonsynonymous2485235127212832241425
 Stop loss2361030
 Stop gain57857578228
INDELsChr2LChr2RChr3LChr3RChrXChr4
No. sites overall236 300195 139243 431225 411177 6973927
Variant density (per kb)10.27859.25188.74218.09507.98593.0391
No. in splicing155153140172826
No. in UTR367117318737180637135134
No. in UTR54293443247124613359679
No. in upstream a16 87716 36816 84116 41710 128241
No. in downstream b14 75913 53014 95314 39810 528274
No. in intergetic81 54557 30585 08968 85461 7991137
No. in intron108 09892432110 465108 68481 3611991
No. in exon3862360138604210306865
 Synonymous135312191103135266635
 Nonsynonymous2485235127212832241425
 Stop loss2361030
 Stop gain57857578228

a Upstream: Variant overlaps 1-kb region upstream of transcription start site.

b Downstream: Variant overlaps 1-kb region downstream of transcription end site.

Table 2.

Summary of indels in fly

INDELsChr2LChr2RChr3LChr3RChrXChr4
No. sites overall236 300195 139243 431225 411177 6973927
Variant density (per kb)10.27859.25188.74218.09507.98593.0391
No. in splicing155153140172826
No. in UTR367117318737180637135134
No. in UTR54293443247124613359679
No. in upstream a16 87716 36816 84116 41710 128241
No. in downstream b14 75913 53014 95314 39810 528274
No. in intergetic81 54557 30585 08968 85461 7991137
No. in intron108 09892432110 465108 68481 3611991
No. in exon3862360138604210306865
 Synonymous135312191103135266635
 Nonsynonymous2485235127212832241425
 Stop loss2361030
 Stop gain57857578228
INDELsChr2LChr2RChr3LChr3RChrXChr4
No. sites overall236 300195 139243 431225 411177 6973927
Variant density (per kb)10.27859.25188.74218.09507.98593.0391
No. in splicing155153140172826
No. in UTR367117318737180637135134
No. in UTR54293443247124613359679
No. in upstream a16 87716 36816 84116 41710 128241
No. in downstream b14 75913 53014 95314 39810 528274
No. in intergetic81 54557 30585 08968 85461 7991137
No. in intron108 09892432110 465108 68481 3611991
No. in exon3862360138604210306865
 Synonymous135312191103135266635
 Nonsynonymous2485235127212832241425
 Stop loss2361030
 Stop gain57857578228

a Upstream: Variant overlaps 1-kb region upstream of transcription start site.

b Downstream: Variant overlaps 1-kb region downstream of transcription end site.

Both Tables 1 and 2 show that, the variant density, which is defined as the total number of variants per kb, is similar for the four arms of chromosome 2 and 3 at about 1 polymorphism per 20 base pairs. Variant density for the X chromosome is markedly lower. This is true even when we take the fact that variants on the X chromosome from the X chromosome lethal project are excluded. This is consistent with the idea that the X chromosome is under higher purify selection as males only have a single copy ( 19 , 20 ).

As expected, the vast majority of the selected variants are likely to be benign polymorphisms for the following reasons. First, variants are enriched in noncoding region as about 70% of them are located in intergenic or intronic regions. Second, for variants that map to exons, 65% are synonymous, not affecting the amino acid sequence of the final protein product. Third, these variants tend to be less conserved bases. As shown in Figure 1 , the average level of evolutionary conservation for the polymorphism bases is lower than that across the genome. It is worth noting that though many polymorphism sites are highly conserved, evolutionary conservation score alone is not sufficient to predict the functional importance of a given base.

Comparison of evolutionary conservation scores between a whole chromosome and its polymorphism sites. In each plot, the left boxplot depicts conservation scores of a whole chromosome, such as ‘2L’ for chromosome 2L; the right boxplot displays conservation scores of which polymorphisms exists, for example ‘2L_SNP’ representing polymorphism sites of chromosome 2L.
Figure 1.

Comparison of evolutionary conservation scores between a whole chromosome and its polymorphism sites. In each plot, the left boxplot depicts conservation scores of a whole chromosome, such as ‘2L’ for chromosome 2L; the right boxplot displays conservation scores of which polymorphisms exists, for example ‘2L_SNP’ representing polymorphism sites of chromosome 2L.

Interestingly, we identified a total of 150 000 variants that corresponds to non-sense mutations, frame shifts and splice site mutations. These represent potential loss of function (LOF) mutations. These variants may affect genes that do not play an essential role in viability or cause an obvious phenotype ( 21 ). Second, a detrimental variant may be compensated for by a different variant, present elsewhere in the genome. Third, it is possible that a variant only affects a subset of transcript, isoforms or functional domains of a gene. This is particularly likely when a variant is located within alternatively spliced exons. Finally, nonsense mutations towards the end of a protein coding region may lead to a minor truncation of the protein with subtle functional consequences ( 22 ). To exclude the last two possibilities, we applied stringent criteria by which only LOF mutations that fall within constitutive exons and not within the last coding exons are considered. As a result, a total of 1094 genes that have clear LOF mutations when in a homozygous state in at least one strain is identified and these genes are considered dispensable for obvious developmental defects. When applied, this dispensable gene list allows exclusion of variants from these genes during the filtering process, further improving the selectivity in classifying deleterious mutations. We noticed that some dispensable genes are known lethal genes. Therefore, we provided an option for users whether to filter out variants within dispensable gene regions.

PLATFORM FUNCTIONS

To facilitate the usage of the database by individual researcher, we have implemented several web based functionalities to the database, allowing querying, annotating and filtering.

The criteria based query module allows extraction of variance information from the database. The interface of the query module is shown in Figure 2 . To make it easy for all users, we keep the interface of FlyVar very simple. Data can be queried based on fly strain ID, genomic coordinate, a given genomic interval, and gene ID. When users have chosen way of querying, corresponding input format and an example could be shown in the right input position of webpage. For example, as shown in Figure 2 , the querying way was chosen as ‘by variant’, corresponding input format and an example ‘The format of input:chr pos ref var. example:chr2L 22025 T G’ is presented immediately in the box of‘input or choose a file’part. The query output includes chromosome, genomic coordinate, reference sequence, and variant sequence. In addition, the ID of the fly strain(s) in which the variant was identified is included. We also designed a query by sample function, in case the specific strain ID in a dataset is meaningful. Furthermore, to facilitate user access to the DGRP collection, all variants from individual DGRP strains can be obtained by querying the corresponding strain ID. Finally, variant frequency in the DGRP collection is also provided with links to the corresponding strains at the FlyBase website ( 23 ).

Steps of query model of FlyVar database.
Figure 2.

Steps of query model of FlyVar database.

The annotation module provides a user-friendly interface to annotate variants input by the users. Currently, Drosophila melanogaster Release 5.1 reference ( 23 , 24 ) and an annotation tool ANNOVAR ( 25 ) are used to perform functional annotation. Webpage of annotation module is shown in Figure 3 . Input with standard vcf format or user defined tab-separated plain text are both accepted. For the custom plain text file, only essential information to define a variant, including chromosome, coordinate, reference base, and variant base are required. This module will generate two output files: the first file contains annotation for all the variants while the second file contains annotation for only variants that affect protein coding.

Steps of annotation model of FlyVar database.
Figure 3.

Steps of annotation model of FlyVar database.

We also developed a module that permits filtering of raw sequence data to eliminate potentially benign variants. The module removes homozygous polymorphisms stored in the database resulting in a narrower list of candidate mutations. To avoid potential deleterious mutations, only homozygous variants were considered benign and filtered out. Same style of filtering interface as query module and annotation module is shown in Figure 4 . Similar to the annotation module, both vcf and user defined tab delimited files can be used as input. The output file has the same format as the input file and only contains variants that are not observed in the database.

Steps of filtering model of FlyVar database.
Figure 4.

Steps of filtering model of FlyVar database.

Since some dispensable genes are known lethal genes, an option, whether variants in dispensable genes are filtered out or not, is provided. Based on the annotation file of collected LOF mutations which have been stored in our database and could be downloaded from ‘dispensable gene’ webpage, users could make their own decision about how to utilize dispensable genes. If dispensable gene regions were included into filtering process, the rest of variants after filtering would be separated into two parts, variants not in dispensable gene regions and variants within dispensable gene regions. Input variants within dispensable genes could also be annotated to ease decision making.

Considering that the number of variants that need to be processed can be large, we provide support for batch file input processing as well as a standard interactive interface. This interactive input feature allows processing of up to 200 000 variants at once. For a larger number of variants, a batch file containing the variants can be uploaded and results will be sent back to users through email.

Besides querying or annotating or filtering online, users also could download the whole database into their local disk.

Data downloading module organizes data according to its project source to ease data selection.

By now, FlyVar is a publicly and freely available for all potential users including profit and non-profit organizations. After logging in, users could give comments or suggestion.

DISCUSSION

Here, we present a set of integrative web-based tools that allow individual research labs to process NGS data directly. This is the first of its kind that specifically addresses an increasing need for processing NGS data within the Drosophila research community. The FlyVar database contains data derived from the latest public WGS data. As a result, the number of variants collected in our database is several orders of magnitude larger than existing databases, greatly enhancing its utility. Upon filtering, the number of remaining variants will be dramatically reduced, resulting in a smaller, more accurate list of candidate mutations. The chrX mutation sequencing project delivered proof-of-principle of the usefulness of this database ( 9 ). Finally, to allow easy usage and access of the database, a set of interactive web based tools have been implemented that enable straightforward data processing without the requirement for programming or special computational resources.

There are several aspects that we are currently improving. First, copy number variations will be included in the near future. Second, we will further improve our annotation of variants. For example, annotation from the modENCODE ( 26 ) data can be included to annotate the potential effects a variant might have on a gene regulatory element. Third, as it is important to keep the database growing, we are developing tools that allow the community to easily contribute their private data. Finally, we plan to adopt cloud computing to the database and tools to make big data sets more accessible for the researchers.

ACKNOWLEDGEMENTS

The authors thank Tinghe Ding, Dr Hua Zhong and Dr Stephen Richards for helpful discussion. We also thank Jason Scott Salvo for editing the manuscript.

Funding

NIH RC4-grant (1RC4GM096355-01 to H.J.B. and R.C.); NSFC grant (61472086) to F.W.

Conflict of interest . None declared.

REFERENCES

1

Harris
H.
(
1971
)
Polymorphism and protein evolution. The neutral mutation-random drift hypothesis
.
J. Med. Genet.
,
8
,
444
452
.

2

Yang
Y.
Muzny
D.M.
Reid
J.G.
et al.  . (
2013
)
Clinical whole-exome sequencing for the diagnosis of mendelian disorders
.
N. Engl. J. Med.
,
369
,
1502
1511
.

3

Adams
D.R.
Sincan
M.
Fuentes Fajardo
K.
et al.  . (
2012
)
Analysis of DNA sequence variants detected by high-throughput sequencing
.
Hum. Mutat.
,
33
,
599
608
.

4

1000 Genomes Project Consortium.
(
2012
)
An integrated map of genetic variation from 1,092 human genomes
.
Nature
,
491
,
56
65
.

5

Blumenstiel
J.P.
Noll
A.C.
Griffiths
J.A.
et al.  . (
2009
)
Identification of EMS-induced mutations in Drosophila melanogaster by whole-genome sequencing
.
Genetics
,
182
,
25
32
.

6

Mackay
T.F.C.
Richards
S.
Stone
E.A.
et al.  . (
2012
)
The Drosophila melanogaster Genetic Reference Panel
.
Nature
,
482
,
173
178
.

7

Langley
C.H.
Stevens
K.
Cardeno
C.
et al.  . (
2012
)
Genomic variation in natural populations of Drosophila melanogaster
.
Genetics
,
192
,
533
598
.

8

Berger
J.
Suzuki
T.
Senti
K.A.
et al.  . (
2001
)
Genetic mapping with SNP markers in Drosophila
.
Nat. Genet.
,
29
,
475
481
.

9

Nele
A.
Haelterman
L.J.
Li
Y.
et al.  . (
2014
)
Large-scale identification of chemically induced mutations in Drosophila melanogaster
.
Genome Res.
,
24
,
1707
1718
.

10

Chen
D.
Ahlford
A.
Schnorrer
F.
et al.  . (
2008
)
High-resolution, high-throughput SNP mapping in Drosophila melanogaster
.
Nat. Methods
,
5
,
323
329
.

11

Chen
D.
Berger
J.
Fellner
M.
et al.  . (
2009
)
FLYSNPdb: a high-density SNP database of Drosophila melanogaster
.
Nucleic Acids Res
,
37
(
Database issue
),
D567
D570
.

12

Hoskins
R.A.
Phan
A.C.
Naeemuddin
M.
et al.  . (
2001
)
Single nucleotide polymorphism markers for genetic mapping in Drosophila melanogaster
.
Genome Res.
,
11
,
1100
1113
.

13

Casillas
S.
Petit
N.
Barbadilla
A.
(
2005
)
DPDB: a database for the storage, representation and analysis of polymorphism in the Drosophila genus
.
Bioinformatics
,
21
(
Suppl.2
),
ii26
ii30
.

14

Huang
W.
Massouras
A.
Inoue
Y.
et al.  . (
2014
)
Natural variation in genome architecture among 205 Drosophila melanogaster Genetic Reference Panel lines
.
Genome Res.
,
24
,
1193
1208
.

15

Huang
W.
Massouras
A.
Inoue
Y.
et al.  . (
2014
)
Genome Res
., .

16

Fabian
D.K.
Kapun
M.
Nolte
V.
et al.  . (
2012
)
Genome-wide patterns of latitudinal differentiation among populations of Drosophila melanogaster from North America
.
Mol. Ecol.
,
21
,
4748
4769
.

17

Remolina
S.C.
Chang
P.L.
Leips
J.
et al.  . (
2012
)
Genomic basis of aging and life-history evolution in Drosophila melanogaster
.
Evolution
,
66
,
3390
3403
.

18

Kolaczkowski
B.
Kern
A.D.
Holloway
A.K.
et al.  . (
2011
)
Genomic differentiation between temperate and tropical Australian populations of Drosophila melanogaster
.
Genetics
,
187
,
245
260
.

19

Begun
D.J.
Aquadro
C.F.
(
1992
)
Levels of naturally occurring DNA polymorphism correlate with recombination rates in D. melanogaster
.
Nature
,
356
,
519
520
.

20

Singh
N.D.
Petrov
D.A.
(
2007
)
Evolution of gene function on the X chromosome versus the autosomes
.
Genome Dyn.
,
3
,
101
118
.

21

Young
M.W.
Judd
B.H.
(
1978
)
Nonessential sequences, genes, and the polytene chromosome bands of Drosophila melanogaster
.
Genetics
,
88
,
723
742
.

22

MacArthur
D.G.
Balasubramanian
S.
Frankish
A.
et al.  . (
2012
)
A systematic survey of loss-of-function variants in human protein-coding genes
.
Science
,
335
,
823
828
.

23

dos Santos
G.
Schroeder
A.J.
Goodman
J.L.
et al.  . (
2015
)
FlyBase: introduction of the Drosophila melanogaster Release 6 reference genome assembly and large-scale migration of genome annotations
.
Nucleic Acids Res.
,
doi: 10.1093/nar/gku1099
.

24

Smith
C.D.
Shu
S.Q.
Mungall
C.J.
et al.  . (
2007
)
The Release 5.1 annotation of Drosophila melanogaster heterochromatin
.
Science
,
316
,
1586
1591
.

25

Wang
K.
Li
M.
Hakonarson
H.
(
2010
)
ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data
.
Nucleic Acids Res.
,
38
,
e164
.

26

Gerstein
M.B.
Lu
Z.J.
Van Nostrand
E.L.
et al.  . (
2010
)
Integrative analysis of the Caenorhabditis elegans genome by the modENCODE project
.
Science
,
330
,
1775
1787
.

Author notes

Citation details: Wang,F., Jiang,L., Chen,Y., et al. FlyVar: a database for genetic variation in Drosophila melanogaster . Database (2015) Vol. 2015: article ID bav079; doi:10.1093/database/bav079

This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/4.0/ ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.