PetCaseFinder

Peer-reviewed veterinary case report

XRepDDA: An Interpretable Drug-Disease Association Prediction Framework Leveraging Pretrained Chemical Language Models.

Year:
2026
Authors:
Zhang C et al.
Affiliation:
School of Artificial Intelligence and Computer Science · China

Abstract

Drug repositioning aims to identify new indications for existing drugs, offering a cost-effective and time-efficient strategy for therapeutic development. Its core challenge lies in accurately predicting potential drug-disease associations (DDAs). However, existing computational approaches often suffer from inadequate drug representation, insufficient modeling of disease semantics, and imbalanced data distributions, which collectively limit predictive accuracy and generalization ability. To address these challenges, we propose an innovative framework, termed XRepDDA, that integrates multimodal feature representation with deep metric learning to improve DDA prediction accuracy and robustness. For drug representation, the SMI-TED pretrained chemical language model encodes SMILES sequences into chemically informative molecular embeddings. For disease representation, a hierarchical semantic graph based on the MeSH ontology is constructed together with a semantic-enhanced graph embedding strategy to capture hierarchical and semantic relationships among diseases. To mitigate class imbalance, we applied the AllKNN adaptive undersampling strategy. The prediction module is built upon an improved ModernNCA architecture, which learns a discriminative embedding space through deep metric learning. Experiments on multiple public benchmark data sets demonstrate that XRepDDA consistently outperforms diverse baseline models, including traditional machine learning, tree-based ensemble, and deep learning methods, achieving AUC and AUPR values of up to 0.9990 and 0.9991, respectively. Furthermore, molecular docking experiments on top-ranked candidate drugs for Alzheimer's disease and stomach neoplasms provide <i>in silico</i> validation of predictive reliability. To enhance interpretability, a multilevel explainability framework is established, combining SHAP-based global feature attribution with attention mechanisms and molecular perturbation analyses to identify key features and pharmacophores at the local level. These results support the chemical interpretability and the biological plausibility of the predictions.

Find similar cases for your pet

PetCaseFinder finds other peer-reviewed reports of pets with the same symptoms, plus a plain-English summary of what was tried across them.

Search related cases →

Original publication: https://europepmc.org/article/MED/41617662