Abstract

A Leishmania Microsatellite Database (LeishMicrosatDB) is reported for genome wise mining of microsatellites in six Leishmania species, using in silico techniques. This was created to provide parasitologists a platform to understand the genome characterization, mapping, phylogeny and evolutionary analysis. The present version of the database contains 1 738 669 simple sequence repeats of which 181 s756 repeats are present in compound form. The repeats can be sought in a chromosome using input parameters such as repeat type (mono- hexa), coding status, repeat unit length and repeat sequence motif. The genic repeats have been further hyperlinked with their corresponding locus id, and the database is appended with primer3 plus for primer designing of selected repeats with left and right flanking sequences up to 250 bp. Information on clustering and polymorphic repeats can also be retrieved. This database may also be adopted as a tool to study the relative occurrence and distribution of microsatellites across the parasitic genome. The database can enable a biologist to select markers at desired intervals over the chromosomes, and can be accessed as an open source repository at http://biomedinformri.com/leishmicrosat .

Database URL:http://biomedinformri.com/leishmicrosat

Introduction

Leishmania is a genus of protozoan parasites that infect macrophages causing a broad spectrum of diseases, ranging from self-limiting cutaneous leishmaniasis to severe mucocutaneous leishmaniasis with fatal spontaneous evolution. Leishmaniasis comprises a group of diseases having extensive morbidity and mortality in most developing countries. Infection with pathogenic Leishmania results an annual incidence of 2 million cases in 88 countries ( www.who.int/tdr/disease/leish ). Molecular markers are highly necessary to identify different strain through human populations, and identify animal reservoirs of the strains circulating in humans. One of the most powerful and discriminative DNA-based methods for strain typing and population dynamics is the analysis of highly variable co-dominant microsatellite markers. Microsatellites or simple sequence repeats (SSRs) are short, hypervariable, tandemly repeated sequence motifs (1–6 bp), which has evolved and expanded by DNA replication slippage. These may be perfect repeats (consisting of one type of repeat) or might contain single or few base-pair interruptions ( 1 ). The genomic regions where microsatellite density (loci/Mbp) is markedly higher than the average density in the genome are called repeat clusters, and those repeating unit which have two or more runs of different repeat motif [e.g. (GTG)8(AT)16] are called compound repeat ( 2 ). The rate of mutation of microsatellite rich region is five to six times higher than that of neutral regions of DNA. Tandem repeats (direct or inverted) involved in rearrangements of DNA, alteration of gene copy number (deletion or amplification), formation of extra chromosomal amplicons (circular or linear), and the presence of supernumerary chromosomes have been described in Leishmania ( 3 , 4 ). It has already been reported that Leishmania are relatively rich in microsatellites ( 5 ). Duhagon et al. described non-uniformity of repeat patterns in the intergenic regions, and asymmetrical strand distribution of dinucleotide repeats favoring TT and GT repeats in the coding strands which may control genome structure and gene expression ( 6 ). Current analyses of length polymorphism of repeats containing regions shed some light on the population structures and genetic studies of many different species. Recently, multilocus microsatellite typing (MLMT) has been used successfully in Leishmania throughout the world to track down different strains and to investigate its population dynamics ( 7–11 ). Several studies have discussed the variability of various species of Leishmania ( 12–17 ). Microsatellite markers designed for Leishmania species (13 markers for L. major , 16 markers for L. tropica and 20 markers for L. donovani ) have shown high level of polymorphism ( 18–20 ). Several microsatellite markers identified in L. infantum and others have shown to discriminate between some Leishmania populations ( 21–23 ). All such studies mainly describe the repeat polymorphism within the same or different species. Despite the medical importance of this parasite, its population genetics is poorly understood. In this respect, the use of molecular markers can provide very useful information for the targeted organisms ( 24 ). Moreover, a number of additional applications for the genotype data are possible if the mapped microsatellites with known positions in the genome are used. For example, it is possible to undertake association studies to identify correlations between the frequency of marker alleles and different parasite phenotypes. It is also be possible to search for evidence of recombination within a chromosome ( 25 ). But very little is known about the length polymorphism of repeat containing regions. Availability of complete and annotated genome sequences of different Leishmania species has provided an excellent opportunity to analyze microsatellites in great detail for their genomic locations, distributions and frequencies. in silico mining of microsatellites repeats may provide a useful basis for carrying out further investigation of its structural and functional characteristics. For eukaryotic genome, few such databases for microsatellite searching has been reported in recent years ( 26–31 ).

In this article, we describe the development of a microsatellite database (LeishMicrosatDB) using LAMPP (Linux-Apache-MySQL-PHP-Perl) technology, and GenBank of NCBI as a data source to extract the microsatellite data. LeishMicrosatDB is a unique database of microsatellite repeats for diverse Leishmania species. The database currently contains 213 chromosomes of six species, and provides information of microsatellite type (simple perfect or compound perfect), repeat unit length (mono- to hexa-nucleotide), repeat number, repeat motif, microsatellite length and chromosomal location in the genome. Furthermore, the information about clustering of different microsatellites and polymorphic repeats (different repeat units of particular loci of different species/strains) can also be retrieved.

Materials and Methods

Data source

The chromosome wise genome sequences of six Leishmania species and their respective annotation files (.ptt or .gff), available in public domain ( ftp.ncbi.nlm.nih.gov/genomes/Protozoa/ , http://tritrypdb.org/common/downloads/release-4.1/ ), were downloaded. The details of each Leishmania species are described in Table 1 . All possible non-overlapping simple repeats were searched by repeat mining tool called MISA ( 32 ). We applied the following criteria (mono—5 repeat unit; di—4 repeat unit; tri to tetra—3 repeat unit and penta to hexa—2 repeat unit) to define each SSR as true repeat. Rationale for choosing the small cutoff value was that, the microsatellites are often disrupted by single base substitution. These simple repeats were mapped on to the genomic annotations from the .ptt file using a customized Perl script, ANNOTATE . The repeats present within the start and end position of a gene were assigned as coding SSR, and those found in the intergenic regions were considered as non-coding SSR. Left flanking and right flanking sequences (≤250 bp) of each repeat coordinates were extracted by using a perl program called XTRACT . For extracting polymorphic repeats, we applied the method described by Pankaj Kumar et al. with certain modifications ( 33 ), Orthologous parts among the chromosomes were searched using BLASTn using following set of parameters: E -value ≤ 0.001; X drop-off value for final gapped alignment = 200; and repeat masking filter = off. Genic and intergenic sequences were screened out by using in-house developed perl script. The repeats were considered as putative Polymorphic Simple Sequence Repeats (PSSR) if a pair of orthologous sequence contains essentially same repeat of different length. To reduce false positives PSSR, left flanking and right flanking sequences of each putative PSSR were compared, and the final PSSR were screened out when identity in corresponding flanking sequences is >60%.

Table 1.

The details about sequenced Leishmania strains, the version of sequenced genomes, annotation status for each genome, number of chromosomes

Serial numberParasite nameStrainRefSeq assembly IDNumber of chromosome
1L. donovaniMHOM/NP/2003/BPK282A1GCF_000227135.136
2L. infantumMCAN/ES/98/LLM-724(JPCM5)GCF_000002875.236
3L. braziliensisMHOM/BR/75/M2904GCF_000002845.135
4L. majorMHOM/IL/1980/FriedlinGCF_000002725.136
5L. tarentolaeParrot-TarII2011-06-2236
6L. mexicanaMHOM/GT/2001/U11032013-01-1634
Serial numberParasite nameStrainRefSeq assembly IDNumber of chromosome
1L. donovaniMHOM/NP/2003/BPK282A1GCF_000227135.136
2L. infantumMCAN/ES/98/LLM-724(JPCM5)GCF_000002875.236
3L. braziliensisMHOM/BR/75/M2904GCF_000002845.135
4L. majorMHOM/IL/1980/FriedlinGCF_000002725.136
5L. tarentolaeParrot-TarII2011-06-2236
6L. mexicanaMHOM/GT/2001/U11032013-01-1634
Table 1.

The details about sequenced Leishmania strains, the version of sequenced genomes, annotation status for each genome, number of chromosomes

Serial numberParasite nameStrainRefSeq assembly IDNumber of chromosome
1L. donovaniMHOM/NP/2003/BPK282A1GCF_000227135.136
2L. infantumMCAN/ES/98/LLM-724(JPCM5)GCF_000002875.236
3L. braziliensisMHOM/BR/75/M2904GCF_000002845.135
4L. majorMHOM/IL/1980/FriedlinGCF_000002725.136
5L. tarentolaeParrot-TarII2011-06-2236
6L. mexicanaMHOM/GT/2001/U11032013-01-1634
Serial numberParasite nameStrainRefSeq assembly IDNumber of chromosome
1L. donovaniMHOM/NP/2003/BPK282A1GCF_000227135.136
2L. infantumMCAN/ES/98/LLM-724(JPCM5)GCF_000002875.236
3L. braziliensisMHOM/BR/75/M2904GCF_000002845.135
4L. majorMHOM/IL/1980/FriedlinGCF_000002725.136
5L. tarentolaeParrot-TarII2011-06-2236
6L. mexicanaMHOM/GT/2001/U11032013-01-1634

Results and Discussion

Construction and content of LeishMicrosatDB

In order to manage the data, MySQL, a relational database management system, was used for building the database. A front-end web interface was developed using web technologies like HTML, CSS, JavaScript, DBI (Database Interface), GD (Graphics Design), CGI (Common Gateway Interface) and PERL that communicate with the relational table for data retrieval. The overall architecture of the database is a ‘three-tier architecture’ with a client/presentation tier, middle /application tier and database tier which is outlined in Figure 1 . In database tier, tables were designed, and relationships among tables were created using unique, primary and foreign keys. The SSRs identified using MISA from different Leishmania species were stored into separate tables. Each species specific table contains field like chromosome, SSR_type, SSR_motif, Rep_no, Length, Start, End, Left_flank_seq, Right_flank_seq, Gene_id and PSSR_ID ( Table 2 ). The PSSR-ID is available for those repeats that are polymorphic. The unique PSSR_ID present in ‘PSSR’ table works as a bridge between individual SSR tables. The Gene table stores genomic coordinate of each gene from each species and its orthologous gene id. This explains the overall schema of the database for efficient data storage and retrieval ( Figure 2 ).

Figure 1.

Three tier architecture of LeishMicrosatDB.

Figure 2.

Architecture and data flow representation in LeishMicrosatDB.

Table 2.

Structure of the table used in the construction of the LeishMicrosatDB

Field informationFiled nameData typeKeyExample
Serial numberSnInt(20)PRI203
Chromosome numberChromosomeVarchar(2)11
Repeat typeTypeVarchar(1)1,2,3,4,5,6
SSR motifSsrVarchar(15)ACG, GA, AGGCTGA
Repeat numberRepNoInt(11)12,10
Total length of the repeated sequenceLengthInt(11)30,22
Start coordinate of the SSRStartInt(10)10 223, 331 201
End coordinate of the SSREndInt(10)208 871,345 129
Left flanking sequenceUpstreamVarchar(250)AGGCTAG … AGGTAGC
Right flanking sequenceDownstreamVarchar(250)AGCtTAG … AGTAGCAA
Gene information if found with in a geneCodingStatusVarchar(15)LTR1234.2, nonCoding,
Polymorphic SSR table Serial Number (if polymorphic)PSSR_IDInt(20)102, 203
Field informationFiled nameData typeKeyExample
Serial numberSnInt(20)PRI203
Chromosome numberChromosomeVarchar(2)11
Repeat typeTypeVarchar(1)1,2,3,4,5,6
SSR motifSsrVarchar(15)ACG, GA, AGGCTGA
Repeat numberRepNoInt(11)12,10
Total length of the repeated sequenceLengthInt(11)30,22
Start coordinate of the SSRStartInt(10)10 223, 331 201
End coordinate of the SSREndInt(10)208 871,345 129
Left flanking sequenceUpstreamVarchar(250)AGGCTAG … AGGTAGC
Right flanking sequenceDownstreamVarchar(250)AGCtTAG … AGTAGCAA
Gene information if found with in a geneCodingStatusVarchar(15)LTR1234.2, nonCoding,
Polymorphic SSR table Serial Number (if polymorphic)PSSR_IDInt(20)102, 203
Table 2.

Structure of the table used in the construction of the LeishMicrosatDB

Field informationFiled nameData typeKeyExample
Serial numberSnInt(20)PRI203
Chromosome numberChromosomeVarchar(2)11
Repeat typeTypeVarchar(1)1,2,3,4,5,6
SSR motifSsrVarchar(15)ACG, GA, AGGCTGA
Repeat numberRepNoInt(11)12,10
Total length of the repeated sequenceLengthInt(11)30,22
Start coordinate of the SSRStartInt(10)10 223, 331 201
End coordinate of the SSREndInt(10)208 871,345 129
Left flanking sequenceUpstreamVarchar(250)AGGCTAG … AGGTAGC
Right flanking sequenceDownstreamVarchar(250)AGCtTAG … AGTAGCAA
Gene information if found with in a geneCodingStatusVarchar(15)LTR1234.2, nonCoding,
Polymorphic SSR table Serial Number (if polymorphic)PSSR_IDInt(20)102, 203
Field informationFiled nameData typeKeyExample
Serial numberSnInt(20)PRI203
Chromosome numberChromosomeVarchar(2)11
Repeat typeTypeVarchar(1)1,2,3,4,5,6
SSR motifSsrVarchar(15)ACG, GA, AGGCTGA
Repeat numberRepNoInt(11)12,10
Total length of the repeated sequenceLengthInt(11)30,22
Start coordinate of the SSRStartInt(10)10 223, 331 201
End coordinate of the SSREndInt(10)208 871,345 129
Left flanking sequenceUpstreamVarchar(250)AGGCTAG … AGGTAGC
Right flanking sequenceDownstreamVarchar(250)AGCtTAG … AGTAGCAA
Gene information if found with in a geneCodingStatusVarchar(15)LTR1234.2, nonCoding,
Polymorphic SSR table Serial Number (if polymorphic)PSSR_IDInt(20)102, 203

Web visualization of LeishMicrosatDB

LeishMicrosatDB is likely to be accessed by biologist in broad objectives, primarily to develop molecular markers, and also to understand the role of microsatellites in regulating gene expression and genome evolution. The LeishMicrosatDB allows mining of different microsatellites along with their physical location in the chromosomes in six fully sequenced Leishmania species. At present, the LeishMicrosatDB has over 1.73 million repeats covering six Leishmania genomes. More related genomes will be considered when their whole genome sequences and .ptt file be made available in the public domain.

The web interface of LeishMicrosatDB provides a brief description and links to the page that enables user to select the genome and repeat class of interest. The database can be accessed by perfect repeats, compound repeats, repeat cluster and polymorphic repeats. The perfect repeats can be searched in a chromosome using following need based input parameters likerepeat type (mono- hexa), coding status, repeat unit length and repeat sequence motif. A specific region on the chromosome can be searched by providing input parameters (start and end position). Once species and chromosome options are selected, rest of the fields is set ‘ALL’ by default. The output is primarily a list of microsatellite annotated for all option of the query sheet and the output is generated as a hierarchical pre-sorted list. Each repeat carries its genomic location and corresponding indices. The result page gives complete information of SSR motif, 250 bp left and right flanking sequences that allows user to design locus specific primers. This is facilitated by automatic uploading of repeat and flanking sequences of the selected microsatellite into Primer3 query form ( Figure 3 ). At the bottom of the result page, repeat density map shows the distribution of repeats throughout the chromosome. Apart from the simple sequence repeats or perfect repeats, the database can be accessed for compound microsatellites (two or more microsatellites being found in close proximity) and microsatellite cluster (compound microsatellites interrupted by few nucleotides). Compound repeats can be sought by user’s customized repeat combination. For example, if a user wants to screen compound microsatellites from chromosome 36 of L. donovani which has repeat and combination of di- and tri-nucleotide repeat number greater than three unit, search can be made using the parameter specified in Figure 4 . Similarly, by specifying the interruption value, the repeat cluster can be accessed. The polymorphic tab contains a drop-down menu comprising the name of all six species. After selecting the target species, rest species were automatically updated in ‘species to consider’ field. A separate option is provided to screen out polymorphic repeats in genic and intergenic regions. The result page contains the number of polymorphic repeats found in the selected species, and gives the detailed information of the particular repeat motif, repeat unit, chromosome number, coding status and genomic location. The output shows information on the corresponding polymorphic repeats ( Figure 5 ). In this page, hyperlinks are also provided to each of the listed polymorphic repeats to design the primers using Primer3. All the detail search methods for perfect repeat, compound repeat, repeat cluster and polymorphic repeats are described in the database tutorial.

Figure 3.

Results displaying repeat information along with left and right flanking sequences and primer3plus primer generation tool.

Figure 4.

Result displaying compound repeats of any dinucleotide and trinucleotide repeat combination in 36 th chromosome of L. donovani .

Figure 5.

Overview of the retrieving of polymorphic repeats using screen-shots of various pages. ( A ) Main page containing species name which can be selected; ( B ) Overall information of the polymorphic repeats; ( C ) Detail information of the polymorphic repeats.

Leishmania genomes are varying greatly in microsatellite repeat compositions, diversity and distribution. In order to determine the frequency and composition of different type of repeat motifs available in database, a dedicated section ‘statistics’ has been incorporated in the database which comprises of (i) over all statistics, (ii) a polymorphic SSR statistics and (iii) a comparative statistics, and each statistics can be accessed by a separate ‘tab’. The overall statistics displays chromosome wise over-all repeat statistics of each genome, whereas polymorphic SSR statistics tab displays only the distribution of polymorphic repeats. The comparative statistics tab directs to a repeat summary page giving a detailed illustration of the repeat distribution. The repeat occurrence graph and table are generated dynamically based on the repeat information using GD module ( Figure 6 ). Several microsatellite databases ( Table 3 ) of various organisms have appeared in recent years that provide important data for the comparative analysis of microsatellite distribution in eukaryotic genomes; however, none of these databases provide length variation of SSR across genomes. The LeishMicrosatDB gives useful information such as comparative statistics and length variation across genomes. The identification of polymorphic repeats and its comparative study can exhibit different potential application.

Figure 6.

Tabular and graphical representation of microsatellite repeats comparison.

Table 3.

Comparison of various eukaryotic microsatellite databases, available in public domain

DatabaseDetails on
Coverage
Simple repeatsCompound repeatsClustering informationFlanking sequencesPolymorphic informationGenomic repeatsPrimer designComparative statistics
MMDBJ ( 17 ) YNNNYYNNMouse
InsatDB ( 18 ) YYNYNYYN5 Insect genome
MRD ( 19 ) YNNYNNNN8 eukaryotic genome
SSRD ( 20 ) YNNYNNNNHuman
EuMicrosat db ( 21 ) YYYYNYYN31 eukaryotic genome
FishMicrosat ( 22 ) YYYNNYYN36 fish genome
LeishMicrosatDBYYYYYYYY 6 L. genome
DatabaseDetails on
Coverage
Simple repeatsCompound repeatsClustering informationFlanking sequencesPolymorphic informationGenomic repeatsPrimer designComparative statistics
MMDBJ ( 17 ) YNNNYYNNMouse
InsatDB ( 18 ) YYNYNYYN5 Insect genome
MRD ( 19 ) YNNYNNNN8 eukaryotic genome
SSRD ( 20 ) YNNYNNNNHuman
EuMicrosat db ( 21 ) YYYYNYYN31 eukaryotic genome
FishMicrosat ( 22 ) YYYNNYYN36 fish genome
LeishMicrosatDBYYYYYYYY 6 L. genome
Table 3.

Comparison of various eukaryotic microsatellite databases, available in public domain

DatabaseDetails on
Coverage
Simple repeatsCompound repeatsClustering informationFlanking sequencesPolymorphic informationGenomic repeatsPrimer designComparative statistics
MMDBJ ( 17 ) YNNNYYNNMouse
InsatDB ( 18 ) YYNYNYYN5 Insect genome
MRD ( 19 ) YNNYNNNN8 eukaryotic genome
SSRD ( 20 ) YNNYNNNNHuman
EuMicrosat db ( 21 ) YYYYNYYN31 eukaryotic genome
FishMicrosat ( 22 ) YYYNNYYN36 fish genome
LeishMicrosatDBYYYYYYYY 6 L. genome
DatabaseDetails on
Coverage
Simple repeatsCompound repeatsClustering informationFlanking sequencesPolymorphic informationGenomic repeatsPrimer designComparative statistics
MMDBJ ( 17 ) YNNNYYNNMouse
InsatDB ( 18 ) YYNYNYYN5 Insect genome
MRD ( 19 ) YNNYNNNN8 eukaryotic genome
SSRD ( 20 ) YNNYNNNNHuman
EuMicrosat db ( 21 ) YYYYNYYN31 eukaryotic genome
FishMicrosat ( 22 ) YYYNNYYN36 fish genome
LeishMicrosatDBYYYYYYYY 6 L. genome

Conclusion

LeishMicrosatDB has been worked out as a complete curated web-oriented relational database of perfect, compound, cluster and polymorphic repeats in six-sequenced Leishmania genome. The database can provide parasitologists a platform to understand the diseases by considering the immense utility of the repeats. Various input parameters can be used for comprehensive search of simple, compound, polymorphic and cluster of repeats. This database may also be adopted as a useful tool to study relative occurrence and distribution of microsatellite across the parasitic genome. The repeats in the coding region of the gene may hopefully prove to be more useful for gene tagging and to study its functional role in evolutionary analysis, and all of these information may serve as an important input in designing experiments in new direction, elucidating novel role and function of different kinds of repeats. We anticipate that, the main application of this database will be the development of mapped markers for specific application such as association studies and the search for recombination with in chromosomes.

Availability

LeishMicrosatDB can be accessed freely at http://biomedinformri.com/leishmicrosat

Acknowledgements

The authors wish to express their gratitude towards F arheen W azri, Dr. Sindhuprava Rana and Md. Yusuf Ansari for their support in the development of database and revising the manuscript. The authors thank Dr Harpreet Singh, Scientist D, ICMR, New Delhi for helping us in setting up our biomedical informatics Department in RMRIMS, Patna, India.

Funding

The work is funded by Indian Council of Medical Research (ICMR), India. Funding for open access charges: Indian Council of Medical Research (ICMR)

Conflict of interest . None declared.

References

1

Katti
M.V.
Ranjekar
P.K.
Gupta
V.S.
(
2001
)
Differential distribution of simple sequence repeats in eukaryotic genome sequences
.
Mol. Biol. Evol.
,
18
,
1161
1167
.

2

Sharma
P.C.
Grover
A.
Kahl
G.
(
2007
)
Mining microsatellites in eukaryotic Genomes
.
Trend Biotechnol.
,
25
,
490
498
.

3

Mukherjee
A.
Langston
L.D.
Ouellette
M.
(
2011
)
Intra chromosomal tandem duplication and repeat expansion during attempts to inactivate the subtelomeric essential gene GSH1 in Leishmania
.
Nucleic Acids Res.
,
39
,
7499
7511
.

4

Ubeda
J.M.
Légaré
D.
Raymond
F.
et al.  . (
2008
)
Modulation of gene expression in drug resistant Leishmania is associated with gene amplification, gene deletion and chromosome aneuploidy
.
Genome Biol.
,
9
,
R115
.

5

Zhang
W.
Yo
Y.
Shen
Y.
et al.  . (
2004
)
Preliminary study on applicability of microsatellite DNA primers from parasite protozoa Trypanosoma cruzi in free-living protozoa
.
J. Ocean Univ. China
,
3
,
80
84
.

6

Duhagon
M.A.
Smircich
P.
Forteza
D.
et al.  . (
2011
)
Comparative genomic analysis of dinucleotide repeats in Tritryps
.
Gene
,
487
,
29
37
.

7

Ochsenreither
S.
Kuhls
K.
Schaar
M.
et al.  . (
2006
)
Multilocus microsatellite typing as a new tool for discrimination of Leishmania infantum MON-1 strains
.
J. Clin. Microbiol.
,
44
,
495
503
.

8

Kebede
N.
Oghumu
S.
Worku
A.
et al.  . (
2013
)
Multilocus microsatellite signature and identification of specific molecular markers for Leishmania aethiopica
.
Parasit Vectors
,
6
,
160

9

Gouzelou
E.
Haralambous
C.
Amro
A.
et al.  . (
2012
)
Multilocus microsatellite typing (MLMT) of strains from Turkey and Cyprus reveals a novel monophyletic L. donovani sensu lato group
.
PLoS Negl. Trop. Dis.
,
6
,
e1507

10

Kuhls
K.
Alam
M.Z.
Cupolillo
E.
et al.  . (
2011
)
Comparative microsatellite typing of new world Leishmania infantum reveals low heterogeneity among populations and its recent old world origin
.
PLoS Negl. Trop. Dis.
,
5
,
e1155
.

11

Kuhls
K.
Cupolillo
E.
Silva
S.O.
et al.  . (
2013
)
Population structure and evidence for both clonality and recombination among Brazilian strains of the subgenus Leishmania ( Viannia )
.
PLoS Negl. Trop. Dis.
,
7
,
e2490
.

12

Bulle
B.
Millon
L.
Bart
J.M.
et al.  . (
2002
)
Practical approach for typing strains of Leishmania infantum by microsatellite analysis
.
J. Clin. Microbiol.
,
40
,
3391
3397
.

13

Seridi
N.
Amro
A.
Kuhls
K.
et al.  . (
2008
)
Genetic polymorphism of Algerian Leishmania infantum strains revealed by multilocus microsatellite analysis
.
Microbes Infect.
,
10
,
1309
1315
.

14

Mauricio
I.L.
Howard
M.K.
Stothard
Jr
Miles
M.A.
(
1999
)
Genomic diversity in the Leishmania donovani complex
.
Parasitology
,
119
,
237
246
.

15

Hide
M.
Banuls
A.L.
Tibayrenc
M.
(
2001
)
Genetic heterogeneity and phylogenetic status of Leishmania (Leishmania) infantum zymodeme MOn-1: epidemiological implications
.
Parasitology
,
123
,
425
432
.

16

Ishikawa
E.A.
Silveira
F.T.
Magalhães
A.L.
et al.  . (
2002
).
Genetic variation in popu- genetic variation in populations of Leishmania species in Brazil
.
Trans. R Soc. Trop. Med. Hyg.
,
96
,
111
121
.

17

Cupolillo
E.
Brahim
Lr
Toaldo
C.B.
et al.  . (
2003
)
Genetic polymorphism and molecular epidemiology of Leishmania ( viannia) braziliensis from different hosts and geographic areas in Brazil
.
J. Clin. Microbiol.
,
41
,
3126
3132
.

18

Jamjoom
M.B.
Ashford
R.W.B.
Bates
P.A.
et al.  . (
2002
)
Polymorphic microsatellite repeats are not conserved between Leishmania donovani and Leishmania major
.
Mol. Ecol. Notes
,
2
,
104
106
.

19

Schwenkenbecher
J.M.
Fröhlich
C.
Gehre
F.
et al.  . (
2004
)
Evolution and conservation of microsatellite markers for Leishmania tropica
.
Infect. Genet. Evol.
,
4
,
99
105
.

20

Jamjoom
M.B.
Ashford
R.W.
Bates
P.A.
et al.  . (
2002
)
Towards a standard battery of microsatellite markers for the analysis of the Leishmania donovani complex
.
Ann. Trop. Med. Parasitol.
,
96
,
265
70
.

21

Rossi
V.
Wincker
P.
Ravel
C.
et al.  . (
1994
)
Structural organization of microsatellite families in the Leishmania genome and polymorphisms at two (CA) n loci
.
Mol. Biochem. Parasitol.
,
65
,
271
282
.

22

Rodriguez
N.
De Lima
H.
Rodriguez
A.
et al.  . (
1997
)
Genomic DNA repeat from Leishmania(Viannia) braziliensis (Venezuelan strain) containing simplerepeats and microsatellites
.
Parasitology
,
115
,
349
358
.

23

Russell
R.
Iribar
M.P.
Lambson
B.
et al.  . (
1999
)
Intra and interspecific microsatellite variation in the Leishmania subgenus Viannia
.
Mol. Biochem. Parasitol.
,
103
,
71
77
.

24

Rougeron
V.
Waleckx
E.
Hide
M.
et al.  . (
2008
)
A set of 12 microsatellite loci for genetic studies of Leishmaniabraziliensis
.
Mol. Ecol. Resources
,
8
,
351
353
.

25

Fakhar
M.
Motazedian
M.H.
Daly
D.
et al.  . (
2008
)
An integrated pipeline for the development of novel panels of mapped microsatellite markers for Leishmania donovani complex, Leishmania braziliensis and Leishmania major
.
Parasitology
,
135
,
567
574
.

26

Mouse Microsatellite Data Base of Japan (MMDBJ) http://www.shigen.nig.ac.jp/mouse/mmdbj

27

Archak
S.
Meduri
E.
Kumar
P.S.
Nagaraju
J.
(
2007
)
InSatDb: a microsatellite database of fully sequenced insect genomes
.
Nucleic Acids Res.
,
35
,
D36
D39
.

28

Subramanian
S.
Madgula
V.M.
George
R.
et al.  . (
2002
)
MRD: a microsatellite repeats database for prokaryotic and eukaryotic genomes
.
Genome Biol.
,
3
,
PREPRINT0011
.

29

Subramanian
S.
Madgula
V.M.
George
R.
et al.  . (
2003
)
SSRD: simple sequence repeats database of the human genome
.
Comp. Funct. Genomics
,
4
,
342
345
.

30

Aishwarya
V.
Grover
A.
Sharma
P.C.
(
2007
)
EuMicroSatdb: a database for microsatellites in the sequenced genomes of eukaryotes
.
BMC Genomics
,
10
,
225
.

31

Nagpure
N.S.
Rashid
I.
Pati
R.
et al.  . (
2013
)
FishMicrosat: a microsatellite database of commercially important fishes and shellfishes of the Indian subcontinent
.
BMC Genomics
,
14
,
630
.

33

Kumar
P.
Chaitanya
P.S.
Nagarajaram
H.A.
(
2011
)
PSSRdb: a relational database of polymorphic simple sequence repeats extracted from prokaryotic genomes
.
Nucleic Acids Res.
,
39
,
D601
D605
.

Author notes

These authors contributed equally to this work.

Citation details: Dikhit,M.R., Moharana,K.C., Sahoo,B.R., et al . LeishMicrosatDB: open source database of repeat sequences detected in six fully sequenced Leishmania genomes. Database (2014) Vol. 2014: article ID bau078; doi:10.1093/database/bau078

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.