Review of databases for experimentally validated human microRNA–mRNA interactions

Table 1.

Databases with experimental validation of human MTIs

Number	Database	Number of miRs	Number of target genes	Number of MTIs	Number of citations (WoS)	Experimental validation methods	Features	Database site
1.	miRTarBase (10, 16–20)	2599	15 064	380 639	3217	CLIP-Seq, Luciferase assay, microarray, NGS, pSILAC, western blot	Offers different category options for browsing, e.g. by miR, by disease and by KEGG Pathway; downloadable data (from version 8.0, though the database site is version 9.0); the number of MTIs in version 9.0 as reflected on the database site is 2 200 449	https://mirtarbase.cuhk.edu.cn/~miRTarBase/miRTarBase_2022/php/index.php
2.	starBase/ENCORI (11, 21, 38)	155	687	1286	2605	CLIP-Seq	Downloadable data; only uses high-throughput datasets	https://rna.sysu.edu.cn/encori/index.php
3.	DIANA-TarBase (3, 22, 23–25)	1084	20 820	422 662	1894	AGO-IP, biotin miRNA tagging, CLASH, CLEAR-CLIP, CLIP-Seq, ELISA, IMPACT-seq, microarrays, PAR-CLIP, proteomics, qPCR, Reporter genes, RIP-seq, RNA-seq, RPF-seq, TRAP, western blot, other	Offers various options for filtering results; downloadable data; integrates other DIANA tools for miR analysis	https://dianalab.e-ce.uth.gr/html/diana/web/index.php?r=tarbasev8
4.	miRWalk (26–28)	2656	19 128	31 235 408	1737	CLIP-Seq, Luciferase assay, microarray, NGS, pSILAC, western blot	Updated twice a year; downloadable data for computationally predicted targets; experimentally validated entries from miRTarBase (# of MTIs reflected on miRWalk site)	http://mirwalk.umm.uni-heidelberg.de/
5.	miRecords (29)	303	1112	1748	986	ELISA, immunocytochemistry, northern blot, qRT-PCR, reporter assay, western blot, other	Downloadable data; not up to date (last updated in 2013)	http://c1.accurascience.com/miRecords/download.php
6.	miRGator (30–32)	N/A	N/A	N/A	266	qRT-PCR, reporter assay, western blot	Experimentally validated data from miRecords, miRTarBase and TarBase; not available for download; not up to date	http://mirgator.kobic.re.kr/index.html
7.	miRSystem (33)	N/A	N/A	N/A	186	Reporter assay, western blot, qRT-PCR; TarBase methods: CLIP-Seq	Integration of different databases; not available for download	http://mirsystem.cgm.ntu.edu.tw/
8.	miRGate (34, 35)	N/A	N/A	N/A	44	Not specified	Experimentally validated data from four databases: TarBase, miRTarBase, miRecords and OncomirDB; shows validation methodology, database name and PubMed ID; includes all isoforms of each gene; search results can be downloaded, though no download is available for all human miRs; query only by organism, gene and/or miRNA	http://mirgate.bioinfo.cnio.es/
9.	miRSel (36)	N/A	N/A	N/A	33 (based on PubMed results)	Not specified	Uses text-mining of biomedical literature (PubMed abstracts) to extract miR–target interactions; query via miR identifiers, gene and/or protein names, PubMed IDs, gene ontology (GO) terms; integrates information from TarBase, miRecords and miR2Disease; outdated links to TarBase	https://services.bio.ifi.lmu.de:1047/mirsel/
10.	targetHub (37)	N/A	N/A	N/A	10	Not specified	Experimentally validated information from miRTarBase	https://app1.bioinformatics.mdanderson.org/tarhub/_design/basic/index.html

The databases are listed in the order of the number of citations, with the most cited tools at the top.

AGO-IP, Argonaute immunoprecipitation; CLASH, cross-linking ligation and sequencing of hybrids; CLEAR-CLIP, covalent ligation of endogenous Argonaute-bound RNAs-cross-linking and immunoprecipitation; CLIP-Seq, cross-linking and immunoprecipitation followed by sequencing; ELISA, enzyme-linked immunosorbent assay; IMPACT-seq, identification of miRNA recognition elements by pull-down and alignment of captive transcripts-sequencing; MTIs, miRNA–mRNA target interactions; NGS, next-generation sequencing; PAR-CLIP, photoactivatable ribonucleoside-enhanced cross-linking and immunoprecipitation; pSILAC, pulsed stable isotope labeling by amino acids in cell culture; qPCR, quantitative polymerase chain reaction; RIP-seq, RNA/ribonucleic acid immunoprecipitation sequencing; RNA-seq, ribonucleic acid sequencing; RPF-seq, ribosome profiling sequencing; TRAP, trapping by RNA in vitro affinity purification.

Table 2.

Useful features of databases for validated human MTIs

Database	Downloadable data of all validated MTIs?	Constant updates/updated within last 5 years (2017–21)?	Includes MTIs validated via low-throughput methods?	Includes MTIs validated via high-throughput methods?	Allows queries through multiple methods (e.g. method, disease and KEGG Pathway)	MTI network visualization tool?
MiRTarBase (10, 16–20)	✔	✔	✔	✔	✔	✔
starBase/ENCORI (11, 21, 38)	✔	✔		✔
DIANA-TarBase (3, 22, 23–25)	✔	✔	✔	✔	✔
miRWalk (26–28)		✔	✔	✔	✔
miRecords (29)	✔		✔	✔
miRGator (30–32)			✔	✔
miRSystem (33)			✔	✔
miRGate (34, 35)			✔	✔
miRSel (36)			✔	✔
targetHub (37)			✔	✔

Top databases feature frequent updates

The databases that indicate their most recent updates include miRTarBase, whose current version (version 9.0) was released in September 2021 (10), starBase/ENCORI, with an update in November 2021 (38), TarBase, with its latest version released in 2017 (22), and miRWalk, which is updated twice a year (28). miRecords explicitly mentions that its last update was in 2013. Other databases such as miRGator, miRSystem, miRGate, miRSel and targetHub do not show evidence of regular updates; as a result, those that derive their MTI data from the frequently updated top databases may hold less MTI information than their sources.

Databases catalog MTIs validated by low- and/or high-throughput techniques

Most databases report a combination of low- and high-throughput techniques applied to the validation of MTIs. A notable exception is starBase, which primarily uses high-throughput methods.

Supporting information is frequently provided

Among the highly cited and recently updated databases, starBase/ENCORI has the most descriptive help section with both graphical instructions and a glossary of terms (38). DIANA-TarBase also has a helpful graphical instruction page, but without additional text details (22). miRTarBase has limited user support/instructions but does include a graphical and text glossary of validation methods (10). miRWalk offers a ‘Frequently Asked Questions’ page to guide users on how to use the database including information on how to perform target searches, obtain information on only validated MTIs and interpret the ‘score’ of an MTI based on the algorithm used (28).

Overlap between database findings is variable

We assessed the overlap of miRs between databases and found that only 16 miRs were identified in all of the top five databases (Figure 2). Several challenges were identified, including the incorrect inclusion of non-human miRs and the specificity with which the -3p and -5p miR species are annotated. miRWalk and miRTarBase share the greatest overlap in web search-produced miRs (2707 in total), which is attributable to the fact that miRWalk derives its validated information from miRTarBase. miRecords had the greatest number of unique miRs among the top five databases; however, the database is not frequently updated and the site is not always accessible. No miRs were found to be unique to TarBase or ENCORI.
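The two annotation issues noted above (non-human entries and -3p/-5p arm suffixes) can change the apparent overlap considerably. The following is a minimal Python sketch of how such an effect might be checked; the database labels (`db_a`, `db_b`, `db_c`), the example identifiers and the helper names `normalize` and `shared_mirs` are hypothetical and are not taken from any of the reviewed tools.

```python
import re

# Hypothetical miR identifiers as they might appear in three database exports.
exports = {
    "db_a": ["hsa-miR-21-5p", "hsa-miR-155-5p", "mmu-miR-21a-5p", "hsa-let-7a-5p"],
    "db_b": ["hsa-miR-21", "hsa-miR-155-5p", "hsa-let-7a-5p"],
    "db_c": ["hsa-miR-21-5p", "hsa-miR-155-3p", "hsa-let-7a-5p"],
}

ARM_SUFFIX = re.compile(r"-(3p|5p)$")


def normalize(name, keep_arm):
    """Return a comparable human miR name, or None for non-human entries."""
    name = name.strip()
    if not name.startswith("hsa-"):
        return None  # drop entries with a non-human species prefix (e.g. mmu-)
    if not keep_arm:
        name = ARM_SUFFIX.sub("", name)  # collapse -3p/-5p arms to one name
    return name


def shared_mirs(exports_by_db, keep_arm):
    """Intersect the normalized miR names across all databases."""
    sets = []
    for names in exports_by_db.values():
        cleaned = (normalize(n, keep_arm) for n in names)
        sets.append({n for n in cleaned if n is not None})
    return set.intersection(*sets)


strict = shared_mirs(exports, keep_arm=True)    # arm-specific comparison
relaxed = shared_mirs(exports, keep_arm=False)  # arm-agnostic comparison
print(len(strict), len(relaxed))
```

In this toy example the arm-specific comparison leaves a single shared miR, whereas ignoring the arm suffix yields three, which illustrates why the chosen level of annotation should be reported alongside any overlap count.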

Figure 2.

Panel A is a Venn diagram showing the total number of overlapping MTIs across the top five databases. Panel B is a table showing, for each pair of databases, the total number of MTIs in the two databases combined (Union), the number of MTIs shared by both (Intersection) and the number unique to each database in the pair.
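The pairwise quantities described for Panel B (Union, Intersection and the counts unique to each database) are ordinary set operations. The sketch below shows one way they might be computed in Python. The assignment of interactions to databases is a toy example for illustration only and does not reflect the actual content of any database, and `pairwise_overlap` is a hypothetical helper name.

```python
from itertools import combinations

# Toy MTIs as (miRNA, target gene) pairs; database membership is illustrative only.
mtis = {
    "miRTarBase": {
        ("hsa-miR-21-5p", "PTEN"),
        ("hsa-miR-21-5p", "PDCD4"),
        ("hsa-miR-155-5p", "SOCS1"),
    },
    "TarBase": {
        ("hsa-miR-21-5p", "PTEN"),
        ("hsa-miR-155-5p", "SOCS1"),
        ("hsa-let-7a-5p", "KRAS"),
    },
    "miRecords": {
        ("hsa-miR-21-5p", "PTEN"),
        ("hsa-let-7a-5p", "HMGA2"),
    },
}


def pairwise_overlap(sets_by_db):
    """Return union, intersection and unique counts for every pair of databases."""
    rows = []
    for a, b in combinations(sets_by_db, 2):
        set_a, set_b = sets_by_db[a], sets_by_db[b]
        rows.append({
            "pair": (a, b),
            "union": len(set_a | set_b),
            "intersection": len(set_a & set_b),
            "unique_to_first": len(set_a - set_b),
            "unique_to_second": len(set_b - set_a),
        })
    return rows


rows = pairwise_overlap(mtis)
for row in rows:
    print(row)
```

Representing each MTI as a (miRNA, gene) tuple means that two databases agree only when both names match exactly, so any name normalization should be applied before the sets are built.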

Discussion

The purpose of this review was to evaluate the web-based tools available for the organization of experimentally validated mRNA targets of human miRNAs. With the increasingly recognized biological relevance of miRNAs to disease development, experimental validation of MTIs is necessary for the assessment of miRNA function. The utility of tools that curate validated evidence extends beyond the data cataloged because they can also be used to inform machine-learning methods for target prediction, especially those that use the validated data as a training set (23). Since the available databases are developed using different techniques and a range of purposes, it is necessary to comprehensively evaluate each tool’s attributes in order to determine which is most useful for a given application.

Ease of navigation is a key feature shared among some of the top databases. Although each database might define a user-friendly layout differently, flexibility in how users can perform queries is the most useful feature. The databases listed in Table 1 all allow users to search by miRNA name and/or target gene name. A strength of miRTarBase is that it also offers searching by Kyoto Encyclopedia of Genes and Genomes (KEGG) Pathway, validation method, disease and PubMed ID, as well as an advanced search option that accepts a list of miRs or target genes (20). Similarly, TarBase gives users multiple options to filter results by validation method, regulation type (i.e. up, down or unknown), cell type and tissue type (22). Another useful feature of the most cited databases is the option to download the validated data, making it easy to manipulate further in programs such as R. We used the download feature in this review, which allowed us to identify with certainty the total numbers of miRNAs and genes included. Regarding citations, frequent use by the scientific community suggests that those tools are user-friendly and intuitive (39, 40).
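As an illustration of how downloaded data allow the numbers of miRNAs, target genes and MTIs to be counted directly, the sketch below tallies distinct values from a small tab-separated table. It is written in Python rather than R purely for illustration, and it is not the procedure used in this review. The column names (`mirna`, `target_gene`, `species`, `method`) and the sample rows are assumptions; real exports use their own headers, which would need to be substituted.

```python
import csv
import io

# Hypothetical tab-separated export; real database downloads use different headers.
SAMPLE = "\n".join([
    "mirna\ttarget_gene\tspecies\tmethod",
    "hsa-miR-21-5p\tPTEN\tHomo sapiens\tLuciferase assay",
    "hsa-miR-21-5p\tPTEN\tHomo sapiens\tWestern blot",
    "hsa-miR-21-5p\tPDCD4\tHomo sapiens\tCLIP-Seq",
    "hsa-miR-155-5p\tSOCS1\tHomo sapiens\tLuciferase assay",
    "mmu-miR-155-5p\tSocs1\tMus musculus\tLuciferase assay",
])


def summarize(handle, species="Homo sapiens"):
    """Count distinct miRNAs, target genes and MTIs for one species."""
    mirnas, genes, pairs = set(), set(), set()
    for row in csv.DictReader(handle, delimiter="\t"):
        if row["species"] != species:
            continue  # skip rows from other organisms
        mirnas.add(row["mirna"])
        genes.add(row["target_gene"])
        # One MTI may be supported by several rows (one per method), so use a set.
        pairs.add((row["mirna"], row["target_gene"]))
    return {"mirnas": len(mirnas), "genes": len(genes), "mtis": len(pairs)}


summary = summarize(io.StringIO(SAMPLE))
print(summary)
```

For a real file, `io.StringIO(SAMPLE)` would be replaced by an open file handle; counting pairs in a set avoids inflating the MTI total when the same interaction is listed once per validation method.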

We evaluated the number of miRs across the top five databases and found very little overlap. This lack of consistency poses a major challenge for cross-validation between studies that used different databases to find MTI information. In some cases, the validated data from one database (e.g. miRTarBase) are integrated into another database (e.g. miRWalk). While this might seem to provide validation, it is actually the replication of the same information across two tools. Overall, the lack of overlap between the top databases demonstrates a need for a more transparent and rigorous approach to cataloging validated human MTIs.
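The distinction between independent support and replicated information can be made explicit when tallying how many databases report an MTI. The Python sketch below is a simplified illustration under stated assumptions: the `DERIVED_FROM` mapping encodes only source relationships described in this review, and `direct_sources` is a hypothetical helper, not a feature of any of the tools.

```python
# Aggregating tools and the databases they draw validated MTIs from,
# as described in this review (a simplified, illustrative mapping).
DERIVED_FROM = {
    "miRWalk": {"miRTarBase"},
    "targetHub": {"miRTarBase"},
    "miRGator": {"miRecords", "miRTarBase", "TarBase"},
}


def direct_sources(reporting_dbs):
    """Keep only databases that are not aggregators of other databases' data."""
    return {db for db in reporting_dbs if db not in DERIVED_FROM}


# An MTI listed in miRTarBase and miRWalk has one underlying source, not two.
replicated = direct_sources({"miRTarBase", "miRWalk"})
# An MTI listed in miRTarBase and TarBase is reported by two curating databases.
corroborated = direct_sources({"miRTarBase", "TarBase", "miRGator"})
print(len(replicated), len(corroborated))
```

An empty result means that the MTI was seen only in aggregating tools and should be traced back to its upstream database. Even two curating databases may cite the same publication, so comparing the underlying PubMed IDs would be a stricter test of independence.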

Among the tools included in this review, five were developed using text mining or manual curation to collect experimentally validated targets: miRTarBase, TarBase, miRecords, miRWalk and miRSel. However, miRWalk has since changed the way users access validated targets by incorporating a filter that links to miRTarBase (28). Similarly, databases such as miRGator, miRSystem, miRGate and targetHub integrate data from one or a combination of miRTarBase, TarBase and miRecords. These three were among the earliest tools developed to catalog validated targets (TarBase in 2006 (22), miRecords in 2008 (29) and miRTarBase in 2010 (19)) and were therefore foundational for subsequent tool development. While TarBase and miRTarBase continue to be updated regularly, with their latest versions released within the last 5 years (2017–21), miRecords is no longer up to date, with its last update in 2013. Other databases that have been updated within the last 5 years include starBase and miRWalk. This upkeep is needed in order to catalog new MTIs as they are validated, making it an important consideration for researchers using these tools.

A key criterion in this review was that each tool has a stated purpose of cataloging MTIs with experimentally validated evidence. The lower-throughput methods for experimental validation provide a stronger level of evidence for a functional MTI and are less likely to yield false positives, whereas the high-throughput methods provide a weaker level of evidence but generate a larger number of potential MTIs, with a greater likelihood of false positives, although they are less likely to miss true interactions (false negatives). Further inspection of the tools revealed that while all are designed to assess experimental evidence of an MTI, some offer additional functionality. Of our top five tools, miRTarBase and TarBase are explicit and intentional in their role of curating experimentally validated MTIs. However, starBase, the former version of ENCORI, was the second most cited tool according to WoS records, and besides describing MTIs, it also includes miRNA interactions with long non-coding RNAs, pseudogenes and circular RNAs, as well as protein–RNA interactions (21). miRWalk and miRecords include both predicted and validated MTIs, and miRWalk additionally includes miRNA target site prediction using diverse algorithms (28). While an older version of miRWalk (27) hosted a ‘validated target module’ that allowed users to search by target gene, miRNA, BioCarta or KEGG Pathway, disease, cell line or proteins involved in miRNA processing, the option is no longer available and has been replaced by the miRTarBase filter for validated targets mentioned above (28). Within this review, we looked at the total number of times the database and/or primary database papers were cited, but this metric does not reveal how the tool was used. Thus, understanding the information that each tool harbors might be a better way for scientists to assess which database to use.
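To make the distinction between levels of evidence concrete, the Python sketch below labels an MTI according to the validation methods reported for it. The grouping of methods into low- and high-throughput sets is our assumption for illustration, using method names that appear in Table 1. It is not a classification published by any of the databases, and `evidence_level` is a hypothetical helper.

```python
# Assumed grouping of validation methods named in Table 1 (illustrative only).
LOW_THROUGHPUT = {
    "luciferase assay", "reporter assay", "western blot",
    "qpcr", "qrt-pcr", "northern blot", "elisa",
}
HIGH_THROUGHPUT = {
    "clip-seq", "par-clip", "clash", "microarray", "ngs", "psilac", "rna-seq",
}


def evidence_level(methods):
    """Label an MTI by the strongest class of validation method reported."""
    normalized = {m.strip().lower() for m in methods}
    if normalized & LOW_THROUGHPUT:
        return "low-throughput supported"
    if normalized & HIGH_THROUGHPUT:
        return "high-throughput only"
    return "unclassified"


print(evidence_level(["CLIP-Seq", "Western blot"]))
print(evidence_level(["CLIP-Seq", "NGS"]))
print(evidence_level(["other"]))
```

A filter of this kind could be applied to a downloaded table to separate interactions supported by at least one low-throughput assay from those reported only by high-throughput experiments.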

While some of the information that can be gleaned across databases is redundant (e.g. predicted and/or validated MTIs), some databases offer unique features. In order to attain the most comprehensive information about MTIs, some databases integrate other tools beyond target identification. Table 3 lists tools linked to each database, excluding those for target prediction. For example, MiRTarBase, with the highest number of integrated tools, includes 10 databases that provide additional information on miR regulation, disease associations and gene and miR expression profiles (20). An additional feature unique to miRTarBase is an option to visualize the regulatory network among miRs, regulators and gene targets. As part of DIANA tools, TarBase is interconnected with other tools such as DIANA-miRPath, for information on miR regulation in physiological molecular pathways, DIANA-miRGen for miR regulatory information and DIANA-plasmiR for information on circulating miR biomarkers (22). These integrated tools can make it easy for researchers to obtain a lot of information about an MTI from a single database and provide linkage to other tools for further miR analysis. Information obtained from one database with more limited applications can be entered into a more comprehensive database (e.g. MiRTarBase) to access linked tools and related information. In this way, the relative strengths and complementary functions of multiple databases can be leveraged in order to increase the overall utility of these tools for complex research questions.

Table 3.

Tools integrated by the 10 databases for validated human MTIs

Target validation tool	Databases integrated by the validation tool
miRTarBase (10, 20)	CMEP, GEO, HMDD, miRBase, miRSponge, NCBI Entrez Gene, NCBI RefSeq, SomamiR, TCGA, TransmiR
starBase (21)	Ensembl genome browser, GEO, miRBase, UCSC genome browser
DIANA-TarBase (25)	Ensembl genome browser, miRBase, miRGen, miRPath, plasmiR
miRWalk (28)	Disease Ontology, KEGG Pathway, miRBase, Reactome Pathways
miRecords (29)	NCBI Entrez Gene, NCBI RefSeq
miRGator (30)	GEO, Gene Ontology, KEGG Pathway, miRBase, SRA, TCGA
miRSystem (33)	BioCarta, Gene Ontology, KEGG Pathway, Interaction Database, Reactome
miRGate (34)	miRBase, Ensembl
miRSel (36)	HUGO Gene Nomenclature Committee, miRGen, miRBase, NCBI Entrez Gene, Swiss-Prot Protein Database
targetHub (37)	miRBase, NCBI Entrez Gene, UCSC Table Browser

Information about these tools was found via the most current database papers and database sites. Not included in this table are target prediction databases that are included in some of the target validation tools.

CMEP, Circulating MicroRNA Expression Profiling; GEO, Gene Expression Omnibus; HMDD, Human MicroRNA Disease Database; KEGG, Kyoto Encyclopedia of Genes and Genomes; NCBI, National Center for Biotechnology Information; SRA, Sequence Read Archive; TCGA, The Cancer Genome Atlas; UCSC, University of California, Santa Cruz.

There are challenges in this field, including infrastructure to sustain and support maintenance, updates and accessibility of tools. Funding to support the development of these tools may wax and wane or be limited in duration, which may impact perpetuity. A researcher may move to a new institution with a different web hosting service that can impact accessibility. Similarly, a researcher may retire with no succession plan for continuing to support a tool. In order to support the sustainability and continuous improvement of these tools, more centralized support may be needed (e.g. National Institutes of Health or other government-funded organizations’ initiatives or programs).

Conclusion

The increasingly appreciated relevance of miRNAs in human diseases has led to a need for understanding their gene targets, and as a result, many databases have emerged to keep track of these interactions as they are discovered. Without a standardized approach to cataloging MTIs, databases differ in scope, organization and attributes; therefore, the aim of this review was to describe and compare them with the intention that it may provide researchers with a better understanding of the tool that best fits their needs. Databases with the option to download the data, which also happen to be the top cited validation tools, offer users flexibility in further manipulation of the data in programs such as R. Additionally, ongoing database updates are needed to ensure that the most comprehensive information on MTIs is discoverable. Features present in top cited databases like miRTarBase and TarBase that enhance the value of the database to users include providing various browsing options beyond searching by miR or gene target as well as integrated tools that further the analysis of MTIs. Tools can change over time in scope and purpose; thus, it is important to not only reference the primary papers and updates but also visit the database sites for a better understanding of what they can provide. As high-throughput sequencing data continues to accrue, the number of MTIs will increase apace, but low-throughput methods ensure strong evidence of direct miRNA–mRNA interactions. Scientists who seek to use these validation tools must then decide whether to use a tool like starBase, with only high-throughput methods, or miRTarBase and TarBase that are more frequently updated repositories with both high- and low-throughput methods. 
By providing a general overview of miR target validation tools with applications for Homo sapiens, this review aims to aid researchers, especially those new to miR bioinformatic analysis, in choosing which database to use and to inform the future development of tools through the considerations offered in this paper.
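The choice between evidence types described above can be made concrete once a database's data have been downloaded. The following is a minimal Python sketch, assuming a tab-separated export with hypothetical column names (`mirna`, `target_gene`, `method`) and placeholder rows. Neither the schema nor the grouping of methods into low-throughput assays is taken from any specific database, and the same logic could equally be written in R.

```python
import csv
import io
from collections import defaultdict

# Placeholder rows standing in for a downloaded MTI table. The column names,
# identifiers and rows are illustrative assumptions, not real database content.
SAMPLE_TSV = (
    "mirna\ttarget_gene\tmethod\n"
    "miR-X\tGENE_A\tLuciferase assay\n"
    "miR-X\tGENE_A\tCLIP-Seq\n"
    "miR-X\tGENE_B\tWestern blot\n"
    "miR-Y\tGENE_C\tCLIP-Seq\n"
    "miR-Y\tGENE_C\tMicroarray\n"
)

# Assumed grouping: methods treated here as low-throughput assays.
# Any method not listed is counted as high-throughput.
LOW_THROUGHPUT = {
    "luciferase assay", "reporter assay", "western blot",
    "qrt-pcr", "elisa", "northern blot",
}


def summarize_evidence(tsv_text):
    """Map each (miRNA, gene) pair to the set of evidence tiers supporting it."""
    evidence = defaultdict(set)
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    for row in reader:
        method = row["method"].strip().lower()
        tier = "low" if method in LOW_THROUGHPUT else "high"
        evidence[(row["mirna"], row["target_gene"])].add(tier)
    return dict(evidence)


def with_low_throughput_support(evidence):
    """Return the sorted pairs backed by at least one low-throughput assay."""
    return sorted(pair for pair, tiers in evidence.items() if "low" in tiers)


if __name__ == "__main__":
    evidence = summarize_evidence(SAMPLE_TSV)
    for pair in with_low_throughput_support(evidence):
        print(pair, sorted(evidence[pair]))
```

Keeping the evidence tiers as a set per interaction, rather than filtering rows up front, lets a user apply a stricter or looser rule afterwards (for example, requiring both tiers) without re-reading the file.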

Expanding the evaluation beyond web-based tools (e.g. to R packages such as multiMiR) could reveal additional features useful to researchers seeking further information on MTIs of interest. Future reviews could also provide a comprehensive assessment of tool utilization. For tools that contain both experimentally validated and predicted data, this could include distinguishing which type of data was used in a given published study. A few databases (e.g. TarBase and miRTarBase) have been consistently updated and offer new tools and features with each release. In some cases, less commonly cited databases now link to these highly used and regularly updated tools. Looking forward, a small number of databases may emerge as the optimal tools across attributes and become the gold standard for MTI prediction and annotation.

Data availability

miRTarBase is available at http://mirtarbase.cuhk.edu.cn/ (10)

starBase/The Encyclopedia of RNA Interactomes (ENCORI) is available at https://starbase.sysu.edu.cn/ or https://rna.sysu.edu.cn/encori/index.php (38)

DIANA-TarBase is available at https://dianalab.e-ce.uth.gr/html/diana/web/index.php?r=tarbasev8%2Findex (22)

miRWalk is available at http://mirwalk.umm.uni-heidelberg.de/ (28)

miRecords is available at http://c1.accurascience.com/miRecords/download.php (29)

miRGator is available at http://mirgator.kobic.re.kr/index.html (31)

miRSystem is available at http://mirsystem.cgm.ntu.edu.tw/ (33)

miRGate is available at http://mirgate.bioinfo.cnio.es/ (34)

miRSel is available at https://services.bio.ifi.lmu.de:1047/mirsel/ (36)

targetHub is available at https://app1.bioinformatics.mdanderson.org/tarhub/_design/basic/index.html (37)

Author contributions

D.K. performed data collection, analysis and primary drafting of the manuscript. K.A. contributed to the methods, data analysis, interpretation of results and approving the final draft. B.E.A. contributed to the methods, interpretation of results and approving the final draft. K.A.L. contributed to drafting the introduction and approving the final draft. J.C.F. contributed to the overall study design and approving the final draft. E.F. contributed to the overall development of the study design, interpretation of results, drafting the manuscript and approving the final draft.

Funding

This study was supported by the National Institute of Diabetes and Digestive and Kidney Diseases (grant number R01DK124228). Biospecimens used in this study were provided under approval X01DK115999. The Diabetes Prevention Program (DPP) was conducted by the DPP Research Group and supported by the National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK), the General Clinical Research Center Program, the National Institute of Child Health and Human Development (NICHD), the National Institute on Aging (NIA), the Office of Research on Women’s Health, the Office of Research on Minority Health, the Centers for Disease Control and Prevention (CDC) and the American Diabetes Association. The data and biospecimens from the DPP were supplied by the NIDDK Central Repository. This manuscript was not prepared under the auspices of the DPP and does not represent analyses or conclusions of the DPP Research Group, the NIDDK Central Repository or the National Institutes of Health.

Conflict of interest

None declared.

Acknowledgements

We acknowledge Xingyue Gong for further support in the use of the R programming language.

References

1. Pirrò, Matic, Colizzi et al. (2021) The microRNA analysis portal is a next-generation tool for exploring and analyzing miRNA-focused data in the literature. Sci. Rep., 9007.

2. Tafrihi and Hasheminasab (2019) MiRNAs: biology, biogenesis, their web-based tools, and databases. Microrna.

3. Papadopoulos, G.L., Reczko, Simossis, V.A. et al. (2009) The database of experimentally supported targets: a functional update of TarBase. Nucleic Acids Res., D155–D158.

4. Mortazavi, S.S., Bahmanpour, Daneshmandpour et al. (2021) An updated overview and classification of bioinformatics tools for microRNA analysis, which one to choose? Comput. Biol. Med., 134, 104544.

5. Paul, Chakraborty, Sarkar et al. (2018) Interplay between miRNAs and human diseases. J. Cell. Physiol., 233, 2007–2018.

6. Chen, Heikkinen, Wang et al. (2019) Trends in the development of miRNA bioinformatics tools. Brief. Bioinformatics, 1836–1852.

7. Hamzeiy, Allmer, Yousef (2014) Computational methods for microRNA target prediction. In: Yousef and Allmer (eds.) miRNomics: MicroRNA Biology and Computational Analysis. Totowa, NJ: Humana Press, pp. 207–221.

8. Riolo, Cantara, Marzocchi et al. (2020) miRNA targets: from prediction tools to experimental validation. Methods Protoc., 1.

9. Kumar, Wong, A.K., Tizard, M.L. et al. (2012) miRNA Targets: a database for miRNA target predictions in coding and non-coding regions of mRNAs. Genomics, 100, 352–356.

10. Huang, H.-Y., Lin, Y.-C.-D., Cui et al. (2022) miRTarBase update 2022: an informative resource for experimentally validated miRNA–target interactions. Nucleic Acids Res., D222–D230.

11. Yang, J.-H., Shao et al. (2011) starBase: a database for exploring microRNA–mRNA interaction maps from Argonaute CLIP-Seq and Degradome-Seq data. Nucleic Acids Res., D202–D209.

12. Monga and Kumar (2019) Computational Biology of Non-Coding RNA. Springer, New York, pp. 215–250.

13. Ji Diana Lee, Kim, Muth, D.C. et al. (2015) Validated microRNA target databases: an evaluation. Drug Dev. Res., 389–396.

14. Lukasik, Wójcikowski and Zielenkiewicz (2016) Tools4miRs - one place to gather all the tools for miRNA analysis. Bioinformatics, 2722–2724.

15. Page, M.J., McKenzie, J.E., Bossuyt, P.M. et al. (2021) The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. BMJ, 372, n71.

16. Chou, C.-H., Chang, N.-W., Shrestha et al. (2016) miRTarBase 2016: updates to the experimentally validated miRNA-target interactions database. Nucleic Acids Res., D239–D247.

17. Chou, C.-H., Shrestha, Yang, C.-D. et al. (2018) miRTarBase update 2018: a resource for experimentally validated microRNA-target interactions. Nucleic Acids Res., D296–D302.

18. Hsu, S.-D., Lin, F.-M., Wu, W.-Y. et al. (2011) miRTarBase: a database curates experimentally validated microRNA–target interactions. Nucleic Acids Res., D163–D169.

19. Hsu, S.-D., Tseng, Y.-T., Shrestha et al. (2014) miRTarBase update 2014: an information resource for experimentally validated miRNA-target interactions. Nucleic Acids Res., D78–D85.

20. Huang, H.-Y., Lin, Y.-C.-D. et al. (2019) miRTarBase 2020: updates to the experimentally validated microRNA–target interaction database. Nucleic Acids Res., D148–D154.

21. Li, J.-H., Liu, Zhou et al. (2014) starBase v2.0: decoding miRNA-ceRNA, miRNA-ncRNA and protein–RNA interaction networks from large-scale CLIP-Seq data. Nucleic Acids Res., D92–D97.

22. Karagkouni, Paraskevopoulou, M.D., Chatzopoulos et al. (2018) DIANA-TarBase v8: a decade-long collection of experimentally supported miRNA-gene interactions. Nucleic Acids Res., D239–D245.

23. Sethupathy, Corda and Hatzigeorgiou, A.G. (2006) TarBase: A comprehensive database of experimentally supported animal microRNA targets. RNA, 192–197.

24. Vergoulis, Vlachos, I.S., Alexiou et al. (2012) TarBase 6.0: capturing the exponential growth of miRNA targets with experimental support. Nucleic Acids Res., D222–229.

25. Vlachos, I.S., Paraskevopoulou, M.D., Karagkouni et al. (2015) DIANA-TarBase v7.0: indexing more than half a million experimentally supported miRNA:mRNA interactions. Nucleic Acids Res., D153–159.

26. Dweep, Sticht, Pandey et al. (2011) miRWalk—database: prediction of possible miRNA binding sites by “walking” the genes of three genomes. J. Biomed. Inform., 839–847.

27. Dweep, Gretz and Sticht (2014) miRWalk database for miRNA-target interactions. Methods Mol. Biol., 1182, 289–305.

28. Sticht, De La Torre, Parveen et al. (2018) miRWalk: an online resource for prediction of microRNA binding sites. PLoS One, e0206239.

29. Xiao, Zuo, Cai et al. (2009) miRecords: an integrated resource for microRNA-target interactions. Nucleic Acids Res., D105–D110.

30. Cho, Jun, Lee et al. (2011) miRGator v2.0: an integrated system for functional investigation of microRNAs. Nucleic Acids Res., D158–D162.

31. Cho, Jang, Jun et al. (2013) miRGator v3.0: a microRNA portal for deep sequencing, expression profiling and mRNA targeting. Nucleic Acids Res., D252–D257.

32. Nam, Kim, Shin et al. (2008) miRGator: an integrated system for functional annotation of microRNAs. Nucleic Acids Res., D159–D164.

33. Lu, T.P., Lee, C.Y., Tsai, M.H. et al. (2012) miRSystem: an integrated system for characterizing enriched functions and pathways of microRNA targets. PLoS One, e42390.

34. Andrés-León, Gómez-López and Pisano, D.G. (2017) Prediction of miRNA-mRNA Interactions Using miRGate. Methods Mol. Biol., 1580, 225–237.

35. Andrés-León, González Peña, Gómez-López et al. (2015) miRGate: a curated database of human, mouse and rat miRNA-mRNA targets. Database (Oxford), 2015, bav035.

36. Naeem, Küffner, Csaba et al. (2010) miRSel: automated extraction of associations between microRNAs and genes from the biomedical literature. BMC Bioinform., 135.

37. Manyam, Ivan, Calin, G.A. et al. (2013) targetHub: a programmable interface for miRNA-gene interactions. Bioinformatics, 2657–2658.

38. Zhou, K.R., Liu, Cai et al. ENCORI: The Encyclopedia of RNA Interactomes. https://starbase.sysu.edu.cn/index.php (15 November 2021, date last accessed).

39. Flowers, Asam, Allen et al. (2022) Coexpressed microRNAs, target genes and pathways related to metabolism, inflammation and endocrine function in individuals at risk for type 2 diabetes. Mol. Med. Rep., 156.

40. Flowers, Aouizerat, B.E., Kanaya, A.M. et al. (2022) MicroRNAs associated with incident diabetes in the diabetes prevention program. J. Clin. Endocrinol. Metab., dgac714.