Abstract

Medicinal plants are anticipated to be one of the most valuable resources for the remedial usage in the treatment of various ailments. The data on key medicinal plants and their therapeutic efficacy against various ailments are quite scattered and not available on a single platform. Moreover, currently there is no means/mechanism of finding the best medicinal plant(s) from numerous plants known to cure any disease. DISPEL (Diseases Plants Eliminate) is a compendium of medicinal plants available across the world that are used to cure infectious as well as non-infectious diseases in humans. The association of a medicinal plant with a disease it cures is hereby referred to as ‘medicinal plant–disease cured’ linkage. The DISPEL database hosts ∼60 000 ‘medicinal plant–disease cured’ linkages encompassing ∼5500 medicinal plants and ∼1000 diseases. This platform provides interactive and detailed visualization of medicinal plants, diseases and their relations using comprehensible network graph representation. The user has the freedom to search the database by specifying the name of disease(s) as well as the scientific/common name(s) of plant. Each ‘medicinal plant–disease cured’ relation is scored based on the availability of any medicine/product derived from that medicinal plant, information about active compound(s), knowledge regarding the part of plant that is effective and number of distinct articles/books/websites confirming the effectiveness of the medicinal plant. The user can find the best plant(s) that can be used to cure any desired disease(s). The DISPEL database is the first step towards generating the ‘most-effective’ combination of plants to cure a disease since it delineates as well as ranks all the therapeutic medicinal plants for that disease. The combination of best medicinal plants can then be used to conduct clinical trials and thus pave the way for their use in clinics for treatment of diseases.

Database URLhttps://compbio.iitr.ac.in/dispel

Introduction

Since prehistoric times, therapeutic plants have played a vital role in the treatment of diseases, thus, sustaining population health (1, 2). Plants synthesize various significant phytochemicals that operate as a defence mechanism against pathogens, insects, fungi, bacteria and other threats (3–8). Medicinal plants are therapeutically significant for curing various ailments such as hypertension, diabetes, malaria, flu, diarrhea, cancer, etc. (9–14). Towards this end, plant extracts are employed in raw, unprocessed or modified forms. Active components isolated from therapeutic plants have been utilized to make synthetic medicines, such as quinine and reserpine (15, 16).

The information regarding medicinal plants is scattered amongst several research publications and books. Hence, the usage of medicinal plants for therapeutic applications becomes empirically challenging (17, 18). A comprehensive resource of medicinal plants, their therapeutic use, part of the plant involved and active ingredients would be quite useful for drug discovery as well as clinical applications. The databases that have been developed till date are typically country-specific (for instance, TCMID for Chinese medicinal plant (19), MPDB for Bangladeshi medicinal plants (20)], phytochemical-specific [such as Phytochemica (21), IMPPAT (22), TCM-Mesh (23)]) or disease-specific (for instance, CVDHD (24), NPACT (25)). However, the most challenging task is how to choose the best medicinal plant(s) from the myriad of plants known to play a role in curing any specified disease.

Hence, we have built DISPEL (Diseases Plants Eliminate) database which is a comprehensive resource of medicinal plants available across the globe that are used for the treatment of infectious as well as non-infectious diseases in humans. The database hosts >60 000 ‘medicinal plantdisease cured’ linkages encompassing ∼5500 medicinal plants and ∼1000 diseases. The DISPEL database can be utilized in diverse cases and scenarios viz. a single-faceted study of a particular plant/disease as well as comparative analysis pertaining to heterogeneous assortment of multiple plants/diseases. The server uses a network graph model to provide dynamic and detailed visualization of medicinal plants, diseases and their relationships. The user has the option to search the database by entering the scientific/common name of the plant(s) or the name of the disease(s). Most significantly, the user can find the most effective plants that can be utilized to treat any desired diseases.

Methodology

Data collection, preprocessing and extraction

DISPEL database collates a variety of information on medicinal plants, including their scientific/common name, classification, habitat, country, the part of the plant that is used to treat the ailment, the compound/medicine (if available) and the therapeutic properties (Figure 1). Towards this end, we have compiled information from numerous publications/books/websites and assembled a comprehensive list of more than 5500 medicinal plants (Figure 2). Additional searches for the plant’s scientific/common names were conducted on PubMed (http://www.ncbi.nlm.nih.gov/pubmed) in order to find relevant data on the therapeutic herbs.

Schematic representation of overall strategy used in the construction of DISPEL database.
Figure 1.

Schematic representation of overall strategy used in the construction of DISPEL database.

Flow chart of data preprocessing and extraction. Preprocessing involves three main steps: (i) Tokenization, (ii) Stop word removal and (iii) Stemming. For data extraction, Python and Natural Language Toolkit were used.
Figure 2.

Flow chart of data preprocessing and extraction. Preprocessing involves three main steps: (i) Tokenization, (ii) Stop word removal and (iii) Stemming. For data extraction, Python and Natural Language Toolkit were used.

Database architecture and web interface

At the core of the DISPEL database lies the Neo4j software, which is a graph database management system (26) and it has been preferred over other database engines as the data to be stored/retrieved by the application is inherently graphical, i.e. plant nodes, disease nodes and their relationships (Figure 1). Hence, the data entities and relationships get perfectly modeled with Neo4j graph database. Neo4j provides constant time traversals for both depth and breadth due to the efficient representation of nodes and relationships. This ensures efficient scaling for a large number of nodes for the DISPEL database. Cypher Query Language has been employed as the query language for Neo4j database which aids in generating complex queries requesting graphs for multiple plants and diseases. Appropriate cypher query has been used to load the data into the Neo4j server without introducing any redundancies.

A plant entity consists of its name and a number of additional attributes such as its classification, habitat and country. The only component of a disease entity is its name, while a relationship entity consists of the score factor which is calculated based on the plant part (extract) that effectively treats the disease, compound/medicine (if available) and article/book/website references. The DISPEL server interface permits full-text searching with tolerance for typographical errors, for which full-text indices have been created for both plant and disease attributes. The full-text search is a fuzzy string search that uses the Levenshtein algorithm for closeness matching (27). A full-text query attempts to find five plants/diseases with closely matching names.

The backend server has been written in JavaScript using the Express.js library for REST API development (28). JSON strings have been used for bidirectional message transfer between backend and frontend (29). The Node.js application sends cypher queries to the Neo4j server using the Neo4j-driver library and obtains the results as JavaScript objects (30). The results are further processed and sent to the frontend where they are appropriately rendered. Two types of queries, viz. graph data queries and search queries can be performed. The graph query fetches graph data from Neo4j server and sends the serialized object to the frontend client. The graph query accepts plant identifiers, disease identifiers and mode (center of interest: plant/disease) to return graph data. The search query fetches the names of plants/diseases and sends a list of search results to the client. The open-source Model-View-ViewModel (MVVM) JavaScript framework called Vue.js was used to create frontend user interfaces (31). The server has the ability to handle multiple requests concurrently.

Results and discussion

We have built a user-friendly interface DISPEL to access the enormous, untapped data on medicinal plants encompassing ∼60 000 linkages of plants and human maladies that can be cured. The DISPEL database can be utilized in diverse cases viz. single-faceted study of a particular plant/disease and comparative analysis of multiple plants/diseases. This platform provides interactive and detailed visualization of medicinal plants, diseases and their relations using comprehensible network graph representation (Figure 3). Each ‘medicinal plantdisease cured’ relation is scored based on the availability of any medicine/product based on that medicinal plant, information about active compound(s), knowledge regarding the part of the plant that is effective and number of distinct articles/books/websites confirming the effectiveness of the medicinal plant. A medicinal plant with a score of 1 is regarded as the best plant for treating a certain ailment.

(A) Representative output of ‘Plant Search’ using three medicinal plants (center nodes): Terminalia arjuna (Arjuna, light blue), Withania somnifera (Ashwagandha, purple) and Phyllanthus emblica (Amla, orange). The diseases are shown as leaf/edge nodes. (B) Representative output of ‘Disease Search’ using two diseases (center nodes): hypertension (light blue) and diabetes (purple).
Figure 3.

(A) Representative output of ‘Plant Search’ using three medicinal plants (center nodes): Terminalia arjuna (Arjuna, light blue), Withania somnifera (Ashwagandha, purple) and Phyllanthus emblica (Amla, orange). The diseases are shown as leaf/edge nodes. (B) Representative output of ‘Disease Search’ using two diseases (center nodes): hypertension (light blue) and diabetes (purple).

The user has the freedom to search the database by specifying the name of disease(s) as well as the scientific/common name(s) of the plant (Figure 3). While typing, the search engine will automatically provide a list of plants or diseases that fits the query. The resultant graph (Figure 3) depicts relationships between plants and their therapeutic usage (diseases). The graph is interactive and the nodes can be moved around as per the convenience of the user. The search bar present in the graphical network (in left hand side section) can be used to find any desired plants/diseases within the displayed network. User can search one or multiple plants/diseases simultaneously and the nodes searched will become apparent (blink) in the graphical network. Furthermore, a country search feature has also been incorporated, allowing users to search for one or more plants from a specific country. Hovering on the edges displays the score of the relation between the medicinal plant and the disease cured; the radius of the node is proportional to the score. Clicking on the edges will display all the relevant information about that ‘medicinal plantdisease cured’ link (Figure 4). Furthermore, hovering over plant nodes shows detailed information about that specific plant. Most importantly, during ‘Disease Search’, the best medicinal plant(s) (with the highest score) for each disease will be displayed in a tabular manner (Figure 3B). The user also has the option to view all the plants known to cure a desired disease. The graph panel also includes handy controls to pan across as well as zoom in and out of the graph.

Clicking on the ‘Terminalia arjuna-hypertension’ link/edge displays the relevant information about the extract (part of plant), medicine, compound, score and references.
Figure 4.

Clicking on the ‘Terminalia arjuna-hypertension’ link/edge displays the relevant information about the extract (part of plant), medicine, compound, score and references.

To exemplify the utility of DISPEL server, we searched for medicinal plants curing hypertension and diabetes (Figure 3B). The best medicinal plants for curing each of the diseases get retrieved/displayed. The best plants for treating hypertension are Terminalia arjuna (Arjuna), Withania somnifera (Ashwagandha), Bacopa monnieri (Brahmi) and Rauvolfia serpentina (Indian snakeroot) while those for curing diabetes are Phyllanthus emblica (Amla), Momordica charantia (Karela), Cinnamomum camphora (Kapur), Tinospora cordifolia (Giloy) and Syzygium cumini (Jamun). Furthermore, plants effective in curing both the diseases simultaneously are also delineated. Zingiber officinale (Black Ginger), Punica granatum (Pomegranate) and Terminalia arjuna (Arjuna) are the best plants for concurrent treatment of diabetes and hypertension (Figure 3B). On the other hand, we can also use the ‘Plant Search’ feature in DISPEL to explore the diseases cured by distinct medicinal plants. For instance, Phyllanthus emblica, Terminalia arjuna and Withania somnifera can be combined together to cure diabetes as well as snake bite (Figure 3A). Thus, DISPEL database enumerates the ‘most-effective’ combination of plants to cure any desired disease(s). The combination of best medicinal plants can then be used to conduct clinical trials, and thus lead to the development of new, efficacious natural products for the treatment of diseases.

Conclusion

The user can find the best plant(s) that can be used to cure any desired disease(s). The DISPEL database will be helpful in determining the ‘most-effective’ combination of plants to cure a disease. Subsequently, clinical trials may be conducted to pave the way for the use of medicinal plant therapies in clinics for treatment of diseases. This resource will be beneficial to plant/medical scientists for discovering novel, natural treatment regimens.

Contributions statement

K.S. collected/curated the data and wrote the manuscript. H.M. developed the web-interface. P.S., P.P., A.K.B., A.J., G.E. and S.M. helped in collecting the data. H.C. gave valuable inputs/suggestions. D.S. conceptualized, designed and supervised the study.

Conflict of interest

There are no competing interests.

Acknowledgements

D.S. acknowledges the financial support from Ministry of Human Resource Development (Supercomputing Facility Grant - BT/2014-15/Plan/P-955; Faculty Initiation Grant - BIO/FIG/100700), Science and Engineering Research Board (ECR/2016/001566) and Department of Biotechnology (BT/PR40141/BTIS/137/16/2021). K.S. is thankful to Ministry of Human Resource Development for research fellowship.

References

1.

Gurib-Fakim
A.
(
2006
)
Medicinal plants: traditions of yesterday and drugs of tomorrow
.
Mol. Asp. Med.
,
27
,
1
93
.

2.

Petrovska
B.B.
(
2012
)
Historical review of medicinal plants’ usage
.
Pharmacogn. Rev.
,
6
,
1
5
.

3.

Baetz
U.
and
Martinoia
E.
(
2014
)
Root exudates: the hidden part of plant defense
.
Trends Plant Sci.
,
19
,
90
98
.

4.

Desmedt
W.
,
Mangelinckx
S.
,
Kyndt
T.
et al.  (
2020
)
A phytochemical perspective on plant defense against nematodes
.
Front. Plant Sci.
,
11
, 602079.

5.

Koehn
F.E.
and
Carter
G.T.
(
2005
)
The evolving role of natural products in drug discovery
.
Nat. Rev. Drug. Discov.
,
4
,
206
220
.

6.

Mishra
B.B.
and
Tiwari
V.K.
(
2011
)
Natural products: an evolving role in future drug discovery
.
Eur. J. Med. Chem.
,
46
,
4769
4807
.

7.

Perumal Samy
R.
and
Gopalakrishnakone
P.
(
2010
)
Therapeutic potential of plants as anti-microbials for drug discovery
.
Evid. Based Complement Altern. Med.
,
7
,
283
294
.

8.

Aly
A.H.
,
Debbab
A.
,
Kjer
J.
et al.  (
2010
)
Fungal endophytes from higher plants: a prolific source of phytochemicals and other bioactive natural products
.
Fungal. Divers.
,
41
,
1
16
.

9.

Carrizzo
A.
,
Moltedo
O.
,
Damato
A.
et al.  (
2020
)
New nutraceutical combination reduces blood pressure and improves exercise capacity in hypertensive patients via a nitric oxide–dependent mechanism
.
J. Am. Heart Assoc.
,
9
, e014923.

10.

Greenwell
M.
and
Rahman
P.
(
2015
)
Medicinal plants: their use in anticancer treatment
.
Int. J. Pharm. Sci. Res.
,
6
,
4103
4112
.

11.

Gupta
R.
,
Thakur
B.
,
Singh
P.
et al.  (
2010
)
Anti-tuberculosis activity of selected medicinal plants against multi-drug resistant Mycobacterium tuberculosis isolates
.
Indian J. Med. Res.
,
131
,
809
813
.

12.

Kooti
W.
,
Farokhipour
M.
,
Asadzadeh
Z.
et al.  (
2016
)
The role of medicinal plants in the treatment of diabetes: a systematic review
.
Electron. Physician
,
8
,
1832
1842
.

13.

Mathabe
M.
,
Nikolova
R.
,
Lall
N.
et al.  (
2006
)
Antibacterial activities of medicinal plants used for the treatment of diarrhoea in Limpopo Province, South Africa
.
J. Ethnopharmacol.
,
105
,
286
293
.

14.

Titanji
V.P.
,
Zofou
D.
and
Ngemenya
M.N.
(
2008
)
The antimalarial potential of medicinal plants used for the treatment of malaria in Cameroonian folk medicine
.
Afr. J. Tradit. Complement Altern. Med.
,
5
,
302
321
.

15.

Hoareau
L.
and
DaSilva
E.J.
(
1999
)
Medicinal plants: a re-emerging health aid
.
Electron. J. Biotechnol.
,
2
,
3
4
.

16.

Tungmunnithum
D.
,
Thongboonyou
A.
,
Pholboon
A.
et al.  (
2018
)
Flavonoids and other phenolic compounds from medicinal plants for pharmaceutical and medical aspects: an overview
.
Medicines (Basel)
,
5
, 93.

17.

Ekor
M.
(
2014
)
The growing use of herbal medicines: issues relating to adverse reactions and challenges in monitoring safety
.
Front. Pharmacol.
,
4
, 177.

18.

Lagunin
A.A.
,
Goel
R.K.
,
Gawande
D.Y.
et al.  (
2014
)
Chemo-and bioinformatics resources for in silico drug discovery from medicinal plants beyond their traditional use: a critical review
.
Nat. Prod. Rep.
,
31
,
1585
1611
.

19.

Huang
K.
,
Zhang
P.
,
Zhang
Z.
et al.  (
2021
)
Traditional Chinese Medicine (TCM) in the treatment of COVID-19 and other viral infections: efficacies and mechanisms
.
Pharmacol. Ther.
,
225
, 107843.

20.

Hussain
N.
,
Chanda
R.
,
Abir
R.A.
et al.  (
2021
)
MPDB 2.0: a large scale and integrated medicinal plant database of Bangladesh
.
BMC Res. Notes
,
14
,
1
6
.

21.

Pathania
S.
,
Ramakrishnan
S.M.
and
Bagler
G.
(
2015
)
Phytochemica: a platform to explore phytochemicals of medicinal plants
.
Database (Oxford)
,
2015
, bav075.

22.

Mohanraj
K.
,
Karthikeyan
B.S.
,
Vivek-Ananth
R.
et al.  (
2018
)
IMPPAT: a curated database of Indian medicinal plants, phytochemistry and therapeutics
.
Sci. Rep.
,
8
,
1
17
.

23.

Zhang
R.-Z.
,
Yu
S.-J.
,
Bai
H.
et al.  (
2017
)
TCM-Mesh: the database and analytical system for network pharmacology analysis for TCM preparations
.
Sci. Rep.
,
7
,
1
14
.

24.

Gu
J.
,
Gui
Y.
,
Chen
L.
et al.  (
2013
)
CVDHD: a cardiovascular disease herbal database for drug discovery and network pharmacology
.
J. Cheminform.
,
5
,
1
6
.

25.

Mangal
M.
,
Sagar
P.
,
Singh
H.
et al.  (
2013
)
NPACT: naturally occurring plant-based anti-cancer compound-activity-target database
.
Nucleic Acids Res.
,
41
,
D1124
D1129
.

26.

Gong
F.
,
Ma
Y.
,
Gong
W.
et al.  (
2018
)
Neo4j graph database realizes efficient storage performance of oilfield ontology
.
PLoS One
,
13
, e0207595.

27.

Zhang
S.
,
Hu
Y.
and
Bian
G.
(
2017
)
Research on string similarity algorithm based on Levenshtein Distance
. In: 2017 IEEE 2nd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC), Chongqing, China, March 25-26, pp.
2247
2251
.

28.

Delcev
S.
and
Draskovic
D.
(
2018
)
Modern JavaScript frameworks: a survey study
. In: 2018 Zooming Innovation in Consumer Technologies Conference (ZINC), Novi Sad, Serbia, May 30-31, pp.
106
109
.

29.

Sun
C.
,
Zeng
X.
,
Sun
C.
et al.  (
2020
)
Research and application of data exchange based on JSON
. In: 2020 Asia-Pacific Conference on Image Processing, Electronics and Computers (IPEC), Dalian, China, April 14-16, pp.
349
355
.

30.

Huang
X.
(
2020
)
Research and application of Node. js Core Technology
. In: 2020 International Conference on Intelligent Computing and Human-Computer Interaction (ICHCI), Sanya, China, December 4-6, pp.
1
4
.

31.

Sheikh
W.
and
Sheikh
N.
(
2020
)
A Model-View-ViewModel (MVVM) application framework for hearing impairment diagnosis-class inheritance architecture
. In: 2020 Intermountain Engineering, Technology and Computing (IETC), Orem, UT, USA, October 2-3, pp.
1
6
.

Author notes

Present address: Somesh Mohapatra, Department of Materials Science and Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA.

Equal contribution.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.