Curvature-Guided Geometric Representation for Protein-Ligand Binding Affinity Prediction

arXiv cs.LG Papers

Summary

This paper proposes RicciBind, a geometric representation framework that integrates Ricci curvature and optimal transport for protein-ligand binding affinity prediction, demonstrating superior accuracy and interpretability across benchmarks.

arXiv:2606.14159v1 Announce Type: new Abstract: Protein-ligand binding affinity (PLA) prediction is critical in drug discovery. Despite the notable advancements in machine learning-based approaches, existing methods struggle to jointly characterize local geometric organization and globally coordinated cross-molecular interactions, limiting their ability to model complex binding mechanisms. Here, we propose RicciBind, a geometric representation framework that integrates curvature-guided hierarchical structure learning with optimal transport (OT)-based cross-domain alignment to model molecular interactions. Specifically, RicciBind leverages Ricci curvature to capture local interaction tightness within molecular structures, enhancing structural awareness and organizing atomic interactions into curvature-aware hierarchical representations. An OT-based cluster matching mechanism then aligns protein and ligand clusters across heterogeneous domains under geometric constraints, enabling globally consistent correspondences and revealing higher-order interaction patterns beyond local neighborhoods. By coupling curvature-guided structure encoding with OT-driven cross-domain alignment, RicciBind effectively models complex interaction semantics and substantially improves both the accuracy and interpretability of binding affinity prediction. Extensive experiments demonstrate that RicciBind achieved superior predictive performance and generalization across PLA benchmarks and virtual screening tasks. Ablation studies further confirmed the essential role of Ricci curvature in enhancing molecular interaction representations.
Original Article
View Cached Full Text

Cached at: 06/15/26, 09:11 AM

# Curvature-Guided Geometric Representation for Protein–Ligand Binding Affinity Prediction
Source: [https://arxiv.org/html/2606.14159](https://arxiv.org/html/2606.14159)
Chuan\-Xian RenYuhao LiZiqi HuangYue PanMingzhe Tang and Hong Yan\\IEEEmembershipFellow, IEEEThis work is supported in part by National Key R&D Program of China \(2024YFA1011900\), National Natural Science Foundation of China \(62376291\), Guangdong Basic and Applied Basic Research Foundation \(2023B1515020004\), Science and Technology Program of Guangzhou \(2024A04J6413\), the Fundamental Research Funds for the Central Universities, Sun Yat\-sen University \(24xkjc013\), and in part by the Hong Kong Innovation and Technology Commission \(ITC\) \(InnoHK Project CIMDA\) and the Institute of Digital Medicine of City University of Hong Kong \(Project 9229503\)\.Shuai Li, Chuan\-Xian Ren, Yuhao Li, Ziqi Huang, Yue Pan and Mingzhe Tang are with the School of Mathematics, Sun Yat\-sen University, Guangzhou 510275, China; Hong Yan is with Department of Electrical Engineering, City University of Hong Kong, 83 Tat Chee Avenue, Kowloon, Hong Kong\. Chuan\-Xian Ren is the corresponding author \(rchuanx@mail\.sysu\.edu\.cn\)\.

###### Abstract

Protein–ligand binding affinity \(PLA\) prediction is critical in drug discovery\. Despite the notable advancements in machine learning\-based approaches, existing methods struggle to jointly characterize local geometric organization and globally coordinated cross\-molecular interactions, limiting their ability to model complex binding mechanisms\. Here, we propose RicciBind, a geometric representation framework that integrates curvature\-guided hierarchical structure learning with optimal transport \(OT\)\-based cross\-domain alignment to model molecular interactions\. Specifically, RicciBind leverages Ricci curvature to capture local interaction tightness within molecular structures, enhancing structural awareness and organizing atomic interactions into curvature\-aware hierarchical representations\. An OT\-based cluster matching mechanism then aligns protein and ligand clusters across heterogeneous domains under geometric constraints, enabling globally consistent correspondences and revealing higher\-order interaction patterns beyond local neighborhoods\. By coupling curvature\-guided structure encoding with OT\-driven cross\-domain alignment, RicciBind effectively models complex interaction semantics and substantially improves both the accuracy and interpretability of binding affinity prediction\. Extensive experiments demonstrate that RicciBind achieved superior predictive performance and generalization across PLA benchmarks and virtual screening tasks\. Ablation studies further confirmed the essential role of Ricci curvature in enhancing molecular interaction representations\.

\{IEEEkeywords\}

Protein\-ligand binding affinity, Ricci curvature, Graph neural networks, Optimal transport\.

## 1Introduction

\\IEEEPARstart

Predicting protein–ligand binding affinity \(PLA\) is a key problem in drug discovery, as PLA quantitatively measures the strength of interaction between a ligand \(a small molecule\) and a protein \(the target\)\. Accurate PLA prediction is critical for identifying promising therapeutic candidates, yet traditional experimental approaches are costly and time\-consuming, often requiring years of effort and substantial resources for a single protein–ligand pair\[[1](https://arxiv.org/html/2606.14159#bib.bib1)\],\[[2](https://arxiv.org/html/2606.14159#bib.bib2)\],\[[3](https://arxiv.org/html/2606.14159#bib.bib3)\]\. Rapid advances in this field have significant potential to enhance the efficiency of drug discovery by facilitating the identification of promising therapeutic candidates\.

In recent years, deep learning techniques have attracted considerable attention due to their strong representational capacity and ability to automatically learn relevant features from raw data\. By effectively identifying and leveraging task\-relevant information, these models enable efficient and accurate predictions, making them a central focus in PLA prediction\. To date, deep learning approaches have been developed for predicting PLA\[[3](https://arxiv.org/html/2606.14159#bib.bib3)\],\[[4](https://arxiv.org/html/2606.14159#bib.bib4)\],\[[5](https://arxiv.org/html/2606.14159#bib.bib5)\],\[[6](https://arxiv.org/html/2606.14159#bib.bib6)\],\[[7](https://arxiv.org/html/2606.14159#bib.bib7)\]\. These can be broadly classified into two categories: structure\-free models\[[4](https://arxiv.org/html/2606.14159#bib.bib4)\],\[[5](https://arxiv.org/html/2606.14159#bib.bib5)\]and structure\-based models\[[3](https://arxiv.org/html/2606.14159#bib.bib3)\],\[[6](https://arxiv.org/html/2606.14159#bib.bib6)\],\[[7](https://arxiv.org/html/2606.14159#bib.bib7)\]\. The fundamental distinction between these approaches lies in whether atomic interactions and three\-dimensional \(3D\) structural information are explicitly incorporated\.

![Refer to caption](https://arxiv.org/html/2606.14159v1/x1.png)Figure 1:Illustration of deep learning paradigms for protein–ligand interaction modeling\.Protein–ligand complexes are represented using structure\-free 1D/2D representations or structure\-based 3D representations and processed by deep learning networks to learn interaction patterns, supporting downstream tasks such as binding affinity prediction and virtual screening\.Structure\-free methods\[[4](https://arxiv.org/html/2606.14159#bib.bib4)\],\[[5](https://arxiv.org/html/2606.14159#bib.bib5)\],\[[8](https://arxiv.org/html/2606.14159#bib.bib8)\],\[[9](https://arxiv.org/html/2606.14159#bib.bib9)\],\[[10](https://arxiv.org/html/2606.14159#bib.bib10)\]do not rely on 3D structural information of proteins and ligands or explicit interatomic interactions\. As illustrated in Fig\.[1](https://arxiv.org/html/2606.14159#S1.F1), ligands are typically encoded as simplified molecular\-input line\-entry system \(SMILES\) strings, 2D molecular graphs, or 2D grids, while proteins are represented by amino acid sequences\. In contrast, structure\-based methods\[[11](https://arxiv.org/html/2606.14159#bib.bib11)\],\[[12](https://arxiv.org/html/2606.14159#bib.bib12)\],\[[13](https://arxiv.org/html/2606.14159#bib.bib13)\],\[[14](https://arxiv.org/html/2606.14159#bib.bib14)\],\[[15](https://arxiv.org/html/2606.14159#bib.bib15)\],\[[16](https://arxiv.org/html/2606.14159#bib.bib16)\],\[[17](https://arxiv.org/html/2606.14159#bib.bib17)\]for PLA prediction explicitly incorporate the 3D structures of protein–ligand complexes along with their physicochemical interaction, with 3D graph neural networks \(3D\-GNNs\) emerging as the dominant paradigm\. In these approaches, protein–ligand complexes are modeled as spatial interaction graphs, naturally integrating molecular geometry and spatial adjacency relationships to capture informative structural and interaction patterns\. Through message\-passing schemes defined on relative coordinates or interatomic distances, 3D\-GNNs are inherently invariant to global rotations and translations\. Owing to these advantages, 3D\-GNN\-based methods have attracted growing attention for PLA prediction\.

Building upon structure\-based PLA prediction methods, recent advances have shown that hierarchical representations of 3D molecular graphs, together with multi\-level feature extraction, can effectively reduce computational complexity while improving predictive performance\[[18](https://arxiv.org/html/2606.14159#bib.bib18)\],\[[19](https://arxiv.org/html/2606.14159#bib.bib19)\],\[[20](https://arxiv.org/html/2606.14159#bib.bib20)\]\. Notably, CheapNet\[[20](https://arxiv.org/html/2606.14159#bib.bib20)\]captures higher\-order molecular representations through differentiable pooling\[[21](https://arxiv.org/html/2606.14159#bib.bib21)\], achieving strong performance in PLA prediction tasks\. Despite these advances, fully characterizing molecular interactions still requires an understanding of the intrinsic geometric properties of molecular graphs\. However, existing hierarchical methods do not explicitly incorporate such properties, resulting in an incomplete representation of molecular geometry and limiting their ability to capture complex cross\-scale dependencies, thereby constraining model expressiveness and interpretability\.

To address this limitation, it is crucial to adopt geometric descriptors that can explicitly characterize the intrinsic structure of molecular graphs beyond conventional feature aggregation\. Among such descriptors, Ricci curvature, a fundamental concept in differential geometry, provides a principled framework for analyzing and enhancing the graph geometry through its evolution under Ricci flow\. It captures both local connectivity patterns and global structural organization of graphs, making it particularly well\-suited for graph\-based modeling\. Its discrete formulations, including Ollivier\-Ricci curvature \(ORC\)\[[22](https://arxiv.org/html/2606.14159#bib.bib22)\]and Forman\-Ricci curvature \(FRC\)\[[23](https://arxiv.org/html/2606.14159#bib.bib23)\], effectively capture structural characteristics of graphs and have been successfully applied across diverse domains such as community detection\[[24](https://arxiv.org/html/2606.14159#bib.bib24)\],\[[25](https://arxiv.org/html/2606.14159#bib.bib25)\], internet topology analysis\[[26](https://arxiv.org/html/2606.14159#bib.bib26)\], and biomolecular feature representation\[[27](https://arxiv.org/html/2606.14159#bib.bib27)\],\[[28](https://arxiv.org/html/2606.14159#bib.bib28)\]\. Notably, ORC has advantages in explaining the over\-smoothing and over\-squashing phenomena in message\-passing graph neural networks\[[29](https://arxiv.org/html/2606.14159#bib.bib29)\]\. In recent times, there has been considerable development in the field of curvature\-enhanced graph neural network\[[30](https://arxiv.org/html/2606.14159#bib.bib30),[31](https://arxiv.org/html/2606.14159#bib.bib31)\]\. The integration of ORC into graph neural networks has been demonstrated to be a successful strategy in various synthetic and real\-world graphs, particularly in large\-scale and dense graphs\[[7](https://arxiv.org/html/2606.14159#bib.bib7)\]\.

Inspired by the ability of Ricci curvature to capture both local connectivity and global structural organization in graphs, we propose RicciBind, a hierarchical molecular representation framework guided by Ricci curvature that integrates geometric deep learning with optimal transport \(OT\) theory for protein–ligand interaction modeling and binding affinity prediction\. Unlike existing structure\-based methods, RicciBind incorporates geometric and topological information encoded by ORC to refine atom\-level representations and generate geometrically consistent cluster\-level representations\. Moreover, RicciBind models proteins and ligands as two heterogeneous structural domains and employs an OT\-based cluster matching mechanism to compute bidirectional cross\-domain transport plans between protein and ligand cluster distributions, enabling geometrically consistent alignment of key functional clusters while suppressing irrelevant ones, thereby effectively characterizing complex cross\-domain interaction patterns\.

The key contributions of RicciBind are summarized as follows\.

- •A curvature\-aware graph embedding module is designed, which takes the protein–ligand interaction graph as input and incorporates ORC as a topological descriptor to enhance atom\-level node representations\.
- •A curvature\-driven clustering module is proposed to adaptively assign atoms to clusters under the guidance of the Ricci curvature, yielding higher\-order representations that are more consistent with the geometric–topological structure of molecular graphs\.
- •To our knowledge, this is the first study to treat proteins and ligands as two distinct heterogeneous structural domains and to model protein–ligand binding complexes using optimal transport principletheory, offering a theoretical support for the alignment and selection of key functional interaction clusters\.
- •RicciBind is comprehensively evaluated across multiple public datasets, with results showing consistently superior performance in protein–ligand affinity prediction and virtual screening compared with existing approaches\.

The rest of this paper is organized as follows\. Section[2](https://arxiv.org/html/2606.14159#S2)reviews deep learning approaches for predicting PLA and discusses recent advances in the application of Ollivier–Ricci curvature within graph neural networks\. Section[3](https://arxiv.org/html/2606.14159#S3)presents the problem formulation and the computation of Ollivier–Ricci curvature, then provides a detailed description of the proposed RicciBind model\. Section[4](https://arxiv.org/html/2606.14159#S4)reports comprehensive experimental results and their analysis\. Section[5](https://arxiv.org/html/2606.14159#S5)presents a summary and discussion of the study\.

## 2Related Work

### 2\.1Deep Learning Methods in PLA Prediction

The deep learning methods for predicting protein\-ligand binding affinity can be categorized into two distinct classifications: structure\-based methods and structure\-free methods\.

Structure\-free methods simplify the representation of protein\-ligand complexes by neglecting their 3D structural and interaction information\. For instance, DeepDTA\[[4](https://arxiv.org/html/2606.14159#bib.bib4)\]employs two independent 1D\-CNNs to extract features directly from protein and ligand sequences\. In contrast, Graph\-CNN\[[8](https://arxiv.org/html/2606.14159#bib.bib8)\]and GraphDTA\[[5](https://arxiv.org/html/2606.14159#bib.bib5)\]leverage 2D molecular graphs to improve the feature representation of both proteins and ligands\. To enhance protein modeling, DGraphDTA\[[32](https://arxiv.org/html/2606.14159#bib.bib32)\]and S\-MAN\[[33](https://arxiv.org/html/2606.14159#bib.bib33)\]incorporate protein contact maps and spatial distance features\. Despite these advancements, such approaches still overlook the molecular intrinsic structures and genuine interaction mechanisms in 3D space, thereby limiting their capacity to attain robust generalization and interpretability\.

Structure\-based methods, by contrast, consider the 3D molecular structural and interaction information between protein and ligand with the aim of improving model generalization and interpretability\. These methods can be further categorised into two distinct groups: 3D\-CNN based models and 3D\-GNN based models\. In the 3D\-CNN based models, ACNN\[[11](https://arxiv.org/html/2606.14159#bib.bib11)\]develops a general 3D spatial convolution operation for learning atomic\-level chemical interactions\.KD​E​E​PK\_\{DEEP\}\[[12](https://arxiv.org/html/2606.14159#bib.bib12)\]applies a 3D\-CNN to voxelized protein and ligand representations enriched with eight pharmacophoric\-like features\. However, due to the sparsity of voxelized grid representations and the resulting low computational efficiency of 3D\-CNN, recent research has predominantly focused on 3D\-GNN based models\. At the atomic\-level, GIGN\[[6](https://arxiv.org/html/2606.14159#bib.bib6)\]and EHIGN\[[15](https://arxiv.org/html/2606.14159#bib.bib15)\]represent protein\-ligand complexes as heterogeneous graphs that capture both covalent and non\-covalent interactions\. Hierarchical methods, including LEFTNet\[[18](https://arxiv.org/html/2606.14159#bib.bib18)\], GET\[[19](https://arxiv.org/html/2606.14159#bib.bib19)\]and CheapNet\[[20](https://arxiv.org/html/2606.14159#bib.bib20)\], extend this framework by integrating block\-level and atomic\-level information to construct multi\-scale representations\. These approaches enhance the expressiveness of interaction modeling by capturing protein–ligand relationships across multiple structural hierarchies\.

In contrast to existing structure\-based methods, RicciBind adopts a hierarchical representation paradigm and explicitly incorporates geometric priors of molecular structures to construct an informative and physically consistent representation of protein–ligand complexes\. This design not only enhances the model’s ability to capture complex interaction patterns but also improves the accuracy of binding affinity prediction\.

### 2\.2Ollivier\-Ricci Curvature \(ORC\) in GNN

ORC is a discrete analogue of the classical Ricci curvature from differential geometry, which effectively captures the local geometric and topological properties of graphs\. Recent studies have revealed a connection between ORC and the issues of “over\-smoothing” and “over\-squashing” in message aggregation, demonstrating its effectiveness in alleviating information distortion in message\-passing GNNs\[[34](https://arxiv.org/html/2606.14159#bib.bib34),[29](https://arxiv.org/html/2606.14159#bib.bib29)\]\. The application of ORC has yielded positive outcomes in a variety of graph neural network models, as evidenced by its notable efficacy in graph learning tasks\.

In the context of the node classification task, the Curvature Graph Network \(CurvGN\)\[[30](https://arxiv.org/html/2606.14159#bib.bib30)\]integrates ORC into graph convolution network, employing a multi\-layer perceptron \(MLP\) to learn edge curvature for guiding message passing and adapting to local topological structures\. Building on this, the Curvature Graph Neural Network \(CGNN\)\[[31](https://arxiv.org/html/2606.14159#bib.bib31)\]explicitly converts edge curvature into ORC\-related weights through negative curvature processing and curvature normalization modules, further improving the adaptive locality ability of GNNs\. In addition, for biomolecular interaction prediction tasks, a few methods have leveraged ORC to capture molecular structures and enrich molecular representations\. For instance, Curvature\-enhanced Graph Convolutional Network \(CGCN\)\[[7](https://arxiv.org/html/2606.14159#bib.bib7)\]incorporates ORC information of molecular graphs into the node feature aggregation process, employing an MLP to learn edge weight functions from ORC\-related vectors\.

In this study, RicciBind incorporate ORC into both graph embedding and hierarchical representation stages of molecular modeling, enabling the capture of intrinsic geometric structure and higher\-order interaction patterns in protein–ligand complexes\. Incorporating ORC enhances the expressive power of protein–ligand complex representations and provides a principled geometric foundation for curvature\-guided atomic clustering\.

## 3Methods

This section begins by formalizing the protein–ligand binding affinity prediction problem\. It then introduces the definition of Ollivier–Ricci curvature and highlights its geometric significance within molecular graphs\. Finally, the RicciBind model is described in detail, with its overall architecture illustrated in Fig\.[2](https://arxiv.org/html/2606.14159#S3.F2)\.

![Refer to caption](https://arxiv.org/html/2606.14159v1/x2.png)Figure 2:Detailed workflow of RicciBind\.A\. The graph construction module encodes the protein–ligand complex as a molecular graph\.B\. The curvature\-aware graph embedding module integrates Ricci curvature as a topological descriptor to enhance global atom\-level representations\.C\. The curvature\-driven clustering module groups atoms into functional clusters under Ricci curvature guidance\.D\. The optimal transport\-based cluster matching module identifies and aligns functionally critical interaction clusters across domains\.### 3\.1Problem Definition

In this study, the primary objective is to predict the binding affinity of protein\-ligand complexes\. Each complex is represented as a 3D molecular graphG=\(V,E,R,K\)=\(Vl∪Vp,El∪Ep∪El​p,Rl∪Rp,Kl∪Kp∪Kl​p\)G=\(V,E,R,K\)=\(V\_\{l\}\\cup V\_\{p\},E\_\{l\}\\cup E\_\{p\}\\cup E\_\{lp\},R\_\{l\}\\cup R\_\{p\},K\_\{l\}\\cup K\_\{p\}\\cup K\_\{lp\}\), whereVlV\_\{l\}andVpV\_\{p\}denote the sets of nodes corresponding to ligand and protein atoms, respectively;ElE\_\{l\}andEpE\_\{p\}are sets of ligand and protein edges representing covalent bonds, respectively;El​pE\_\{lp\}corresponds to non\-covalent interaction edges \(i\.e\., non\-covalent bonds\) between the ligand and protein;RlR\_\{l\}andRpR\_\{p\}are the 3D coordinates of ligand and protein nodes;KlK\_\{l\}andKpK\_\{p\}denote the edge\-wise ORC of the ligand and protein; andKl​pK\_\{lp\}represents the curvature of edges within the complex\. Each nodevi∈Vv\_\{i\}\\in Vis associated with an initial feature vector𝒙i∈ℝd\\bm\{x\}\_\{i\}\\in\\mathbb\{R\}^\{d\}and a 3D coordinate𝒓i∈ℝ3\\bm\{r\}\_\{i\}\\in\\mathbb\{R\}^\{3\}\. A covalent edgeei​j∈El∪Epe\_\{ij\}\\in E\_\{l\}\\cup E\_\{p\}exists if a chemical bond is present between nodesviv\_\{i\}andvjv\_\{j\}, whereas a non\-covalent edgeei​j∈El​pe\_\{ij\}\\in E\_\{lp\}is defined if the Euclidean distance between a ligand nodevi∈Vlv\_\{i\}\\in V\_\{l\}and a protein nodevk∈Vpv\_\{k\}\\in V\_\{p\}satisfies‖𝒓i−𝒓k‖2<5​Å\\\|\\bm\{r\}\_\{i\}\-\\bm\{r\}\_\{k\}\\\|^\{2\}<5\\text\{Å\}\. The goal is to learn a functionf:G→ℝf:G\\to\\mathbb\{R\}that maps a protein–ligand complex graphGGto its predicted binding affinityy^=f​\(G\)\\hat\{y\}=f\(G\)\.

### 3\.2Ollivier\-Ricci Curvature

In the field of Riemannian geometry, Ricci curvature describes several key properties, including the rate of volume growth of geodesic balls, the divergence of geodesics, the transportation distance between balls, and the meeting probability of coupled random walks\[[35](https://arxiv.org/html/2606.14159#bib.bib35)\],\[[36](https://arxiv.org/html/2606.14159#bib.bib36)\],\[[37](https://arxiv.org/html/2606.14159#bib.bib37)\]\. In particular, it determines both the growth of ball volumes as a function of radius and the degree of overlap between two balls\. A larger overlapping volume corresponds to a lower transportation cost, thereby revealing a natural connection between Ricci curvature and optimal transport theory\. Building on this connection, Ollivier extended the notion of Ricci curvature to general metric spaces via the framework of optimal transport\[[38](https://arxiv.org/html/2606.14159#bib.bib38),[22](https://arxiv.org/html/2606.14159#bib.bib22)\]and further applied to graphs\[[39](https://arxiv.org/html/2606.14159#bib.bib39)\]\.

###### Definition 1\(Probability Measure\)

For an undirected graphG=\(V,E\)G=\(V,E\), denote the set of neighbors of nodex∈Vx\\in Vas𝒩​\(x\)=\{y∣\(x,y\)∈E\}\\mathcal\{N\}\(x\)=\\\{y\\mid\(x,y\)\\in E\\\}, then a probability measuremxαm\_\{x\}^\{\\alpha\}centered at nodexxis defined as:

mxα​\(y\)=\{αif​y=x,1−α\|𝒩​\(x\)\|if​yi∈𝒩​\(x\),0otherwise,m\_\{x\}^\{\\alpha\}\(y\)=\\begin\{cases\}\\alpha&\\text\{if \}y=x,\\\\\[6\.0pt\] \\dfrac\{1\-\\alpha\}\{\|\\mathcal\{N\}\(x\)\|\}&\\text\{if \}y\_\{i\}\\in\\mathcal\{N\}\(x\),\\\\\[10\.0pt\] 0&\\text\{otherwise\},\\end\{cases\}\(1\)whereα∈\[0,1\]\\alpha\\in\[0,1\]is a parameter that controls the weight assigned to the center nodexx\. Similar to\[[26](https://arxiv.org/html/2606.14159#bib.bib26),[40](https://arxiv.org/html/2606.14159#bib.bib40)\], we setα=0\.5\\alpha=0\.5in this study\.

###### Definition 2\(Wasserstein Distance\)

Given two probability measuresmxm\_\{x\}andmym\_\{y\}defined on the nodesx,y∈Vx,y\\in Vof graphGG, the Wasserstein distance between them is defined as:

W1​\(mx,my\)=infπ∈Π​\(mx,my\)∑u,v∈Vd​\(u,v\)​π​\(u,v\),W\_\{1\}\(m\_\{x\},m\_\{y\}\)=\\inf\_\{\\pi\\in\\Pi\(m\_\{x\},m\_\{y\}\)\}\\sum\_\{u,v\\in V\}d\(u,v\)\\pi\(u,v\),\(2\)whered​\(u,v\)d\(u,v\)is the shortest path distance between nodesuuandvvin graphGG, andπ​\(u,v\)\\pi\(u,v\)is the amount of probability mass transported from nodeuuto nodevv, which is mass preserving, i\.e\.,

∑vj∈Vπ​\(x,vj\)=mx,∑ui∈Vπ​\(ui,y\)=my\.\\sum\_\{v\_\{j\}\\in V\}\\pi\(x,v\_\{j\}\)=m\_\{x\},\\quad\\sum\_\{u\_\{i\}\\in V\}\\pi\(u\_\{i\},y\)=m\_\{y\}\.\(3\)

###### Definition 3\(Ollivier\-Ricci Curvature\)

LetG=\(V,E\)G=\(V,E\)be a graph\. For any edge\(x,y\)∈E\(x,y\)\\in E, the Ollivier\-Ricci curvatureORC​\(x,y\)\\mathrm\{ORC\}\(x,y\)is defined by comparing the Wasserstein distance between the probability measuresmxm\_\{x\}andmym\_\{y\}at the endpoints with their shortest path distanced​\(x,y\)d\(x,y\)\. Formally,

ORC​\(x,y\)=1−W1​\(mx,my\)d​\(x,y\)\.\\mathrm\{ORC\}\(x,y\)=1\-\\frac\{W\_\{1\}\(m\_\{x\},m\_\{y\}\)\}\{d\(x,y\)\}\.\(4\)

The ORC provides a principled approach for capturing intricate topological information in graphs\. Intuitively, the ORC of an edge reflects the relationship between the neighborhoods of its two incident nodes\. As shown in Fig\.[3](https://arxiv.org/html/2606.14159#S3.F3)A, a positive ORC indicates strong connectivity, signifying that the neighboring nodes of the two vertices tend to coalesce into relatively dense local structures, thereby implying close structural relationships\. In the event that the majority of edges within a local region exhibit positive curvature, the corresponding substructure is frequently classified as a community\[[25](https://arxiv.org/html/2606.14159#bib.bib25)\]\. Conversely, a negative ORC is indicative of inadequate connectivity, manifesting as the tendency of neighboring nodes of the two vertices to diverge\. This phenomenon results in the formation of relatively sparse local structures, signifying structural divergence\.

![Refer to caption](https://arxiv.org/html/2606.14159v1/x3.png)Figure 3:An illustration of edge curvature across local graph structures and the calculation of ORC\. A\.Edge curvature reflects the intrinsic geometry of local graph neighborhoods\. From left to right: a tree\-like structure exhibiting negative curvature, an infinitely extended grid with zero curvature, and a complete graph characterized by positive curvature\.B\. Schematic diagram for calculating the ORC of an edge\(u,v\)\(u,v\)\.ORC​\(u,v\)\\mathrm\{ORC\}\(u,v\)is defined through the Wasserstein distance between two probability measures supported on the neighborhoods of nodesuuandvv\. The green nodeuuis connected to its green neighbors and to the orange nodevv; symmetrically, the orange nodevvis connected to its orange neighbors and to the green nodeuu\.
### 3\.3The RicciBind Framework

In this section, the RicciBind framework is introduced in detail\. As illustrated in Fig\.[2](https://arxiv.org/html/2606.14159#S3.F2), RicciBind consists of three main components designed to enhance the structural modeling of protein\-ligand interactions through curvature\-guided geometric deep learning: \(1\) Curvature\-Aware Graph Embedding \(CAGE\) encodes the protein–ligand complex as a molecular graph and refines atom\-level representations by incorporating ORC as a topological descriptor \(Fig\.[2](https://arxiv.org/html/2606.14159#S3.F2)B\); \(2\) Curvature\-Driven Clustering \(CDC\) aggregates atom\-level embeddings into geometrically coherent cluster\-level representations based on the local structural reflected by ORC \(Fig\.[2](https://arxiv.org/html/2606.14159#S3.F2)C\); \(3\) Optimal Transport\-based Cluster Matching \(OTCM\) treats the protein and ligand clusters as cross\-domain distributions and employs entropy\-regularized optimal transport to identify and align functionally critical interaction clusters, thereby enabling accurate modeling of higher\-order binding patterns \(Fig\.[2](https://arxiv.org/html/2606.14159#S3.F2)D\)\.

#### 3\.3\.1Curvature\-Aware Graph Embedding\(CAGE\)

In this section, the CAGE module is designed to encode global protein\-ligand complexes into expressive atom\-level representations by explicitly incorporating geometric and topological information derived from ORC\. By integrating curvature as a structural descriptor within the message\-passing process, CAGE captures both the local geometric configuration and the topological connectivity of molecular interactions\. This curvature\-aware formulation enhances the network’s ability to recognize regions of high structural complexity, thereby improving the fidelity of protein–ligand representation learning\.

CAGE operates as an E\(n\)\-invariant graph neural network \(EIGN\) that focuses on extracting discriminative atom\-level embeddings from native binding conformations\. Overall, the CAGE module can be written as

𝑯L=CAGE​\(𝑯0,𝑹,𝑨,ORC,L\)\\displaystyle\\bm\{H\}^\{L\}=\\mathrm\{CAGE\}\(\\bm\{H\}^\{0\},\\bm\{R\},\\bm\{A\},\\mathrm\{ORC\},L\)\(5\)where𝑯0\\bm\{H\}^\{0\}and𝑹\\bm\{R\}denote the initial features \(see Supplementary Table S1 for details\) and 3D coordinates of the atom nodes respectively,𝑨\\bm\{A\}is the adjacency matrix of the protein\-ligand complex graph, ORC represents the Ollivier\-Ricci curvature of edge in complex graph, andLLspecifies the number of layers\.

Formally, for a specific layerll, the message\-passing procedure in CAGE is defined as

κi​j\\displaystyle\\kappa\_\{ij\}=\\displaystyle=ORC​\(vi,vj\),\\displaystyle\\mathrm\{ORC\}\(v\_\{i\},v\_\{j\}\),\(6\)𝒎i​j\\displaystyle\\bm\{m\}\_\{ij\}=\\displaystyle=ϕe​\(𝒉il,𝒉jl,‖𝒓i−𝒓j‖2,κi​j\),\\displaystyle\\phi\_\{e\}\(\\bm\{h\}\_\{i\}^\{l\},\\bm\{h\}\_\{j\}^\{l\},\\\|\\bm\{r\}\_\{i\}\-\\bm\{r\}\_\{j\}\\\|^\{2\},\\kappa\_\{ij\}\),\(7\)𝒉il\+1\\displaystyle\\bm\{h\}\_\{i\}^\{l\+1\}=\\displaystyle=ϕh​\(𝒉il,∑j∈𝒩​\(i\)𝒎i​j\),\\displaystyle\\phi\_\{h\}\(\\bm\{h\}\_\{i\}^\{l\},\\sum\_\{j\\in\\mathcal\{N\}\(i\)\}\\bm\{m\}\_\{ij\}\),\(8\)whereκi​j\\kappa\_\{ij\}is the ORC associated with edge\(i,j\)\(i,j\),𝒉il\\bm\{h\}\_\{i\}^\{l\}and𝒉jl\\bm\{h\}\_\{j\}^\{l\}denote the features of nodesviv\_\{i\}andvjv\_\{j\}before updating,‖𝒓i−𝒓j‖2\\\|\\bm\{r\}\_\{i\}\-\\bm\{r\}\_\{j\}\\\|^\{2\}represents their squared Euclidean distance computed from the 3D coordinates\. Theϕe\\phi\_\{e\}andϕh\\phi\_\{h\}are differentiable functions that update edge\-wise and node\-wise representations, respectively\. Node features are initialized as𝒉i0=𝑾​𝒙i\\bm\{h\}\_\{i\}^\{0\}=\\bm\{W\}\\bm\{x\}\_\{i\}, where𝑾\\bm\{W\}is a trainable linear projection\. AfterLLmessage\-passing layers, the final atom\-level embeddings are obtained as𝑯=\{𝒉iL∣vi∈V\}\\bm\{H\}=\\\{\\bm\{h\}\_\{i\}^\{L\}\\mid v\_\{i\}\\in V\\\}\.

#### 3\.3\.2Curvature\-Driven Clustering\(CDC\)

Protein–ligand binding often arises from cooperative interactions among higher\-order structural domains and functional groups, which cannot be adequately characterized from a purely atom\-level perspective\[[20](https://arxiv.org/html/2606.14159#bib.bib20)\]\. Moreover, over\-reliance on fine\-grained atomic representations not only imposes substantial computational overhead but also amplifies irrelevant noise, ultimately compromising model stability and generalization\.

To overcome these limitations, we introduce the Curvature\-Driven Clustering \(CDC\) module, which incorporates ORC into a differentiable pooling framework\[[21](https://arxiv.org/html/2606.14159#bib.bib21)\]to guide the learning of cluster assignment matrices\. Exploiting the geometric insight that positive curvature indicates tightly coupled local node neighborhoods whereas negative curvature reveals separation tendencies, CDC adaptively strengthens aggregation within strongly connected \(positively curved\) regions and suppresses ineffective coupling across weakly connected \(negatively curved\) ones\. Through this curvature\-driven clustering mechanism, CDC effectively groups topologically coherent atomic nodes into functionally meaningful substructures while preserving sharp boundaries in sparse regions, thereby yielding higher\-order representations that are both physically grounded and structurally interpretable\.

To incorporate curvature\-derived geometric priors into the clustering process, the CDC module first constructs a curvature\-informed adjacency matrix for both protein and ligand graphs\. Specifically, for each molecular graph, the curvature\-weighted edge strength is defined by transforming the ORC into a normalized connectivity measure

a~i​j=11\+e−κi​j,\\displaystyle\\tilde\{a\}\_\{ij\}=\\frac\{1\}\{1\+\\text\{e\}^\{\-\\kappa\_\{ij\}\}\},\(9\)wherea~i​j\\tilde\{a\}\_\{ij\}represents the\(i,j\)\(i,j\)\-th entry of curvature\-modulated adjacency matrix𝑨~\\tilde\{\\bm\{A\}\}\. The sigmoid mapping normalizes curvature values into a stable range, assigning higher weights to geometrically cohesive regions and attenuating connections across structurally divergent areas\.

To further stabilize the clustering process, the curvature\-modulated adjacency matrix is augmented with self\-loops and symmetrically normalized to obtain the curvature\-informed adjacency matrix

𝑨curv=𝑫^−1/2​𝑨^​𝑫^−1/2\\displaystyle\\bm\{A\}^\{\\rm curv\}=\\hat\{\\bm\{D\}\}^\{\-1/2\}\\hat\{\\bm\{A\}\}\\hat\{\\bm\{D\}\}^\{\-1/2\}\(10\)where

𝑨^=𝑨~\+𝑰,𝒅^i​i=∑ja^i​j\.\\displaystyle\\hat\{\\bm\{A\}\}=\\tilde\{\\bm\{A\}\}\+\\bm\{I\},\\quad\\hat\{\\bm\{d\}\}\_\{ii\}=\\sum\_\{j\}\\hat\{a\}\_\{ij\}\.
With the curvature\-informed adjacency matrices𝑨pcurv\\bm\{A\}^\{\\rm curv\}\_\{p\}and𝑨lcurv\\bm\{A\}^\{\\rm curv\}\_\{l\}established, the CDC module computes the curvature\-guided soft cluster assignment matrices for protein and ligand nodes\. Given the atom\-level node embeddings𝑯p∈ℝ\|Vp\|×d\\bm\{H\}\_\{p\}\\in\\mathbb\{R\}^\{\|V\_\{p\}\|\\times d\}and𝑯l∈ℝ\|Vl\|×d\\bm\{H\}\_\{l\}\\in\\mathbb\{R\}^\{\|V\_\{l\}\|\\times d\}, the curvature\-guided soft cluster assignment matrices𝑺pcurv∈ℝ\|Vp\|×cp\\bm\{S\}^\{\\rm curv\}\_\{p\}\\in\\mathbb\{R\}^\{\|V\_\{p\}\|\\times c\_\{p\}\}and𝑺lcurv∈ℝ\|Vl\|×cl\\bm\{S\}^\{\\rm curv\}\_\{l\}\\in\\mathbb\{R\}^\{\|V\_\{l\}\|\\times c\_\{l\}\}are computed as

𝑺pcurv=Softmax​\(GNN​\(𝑨pcurv,𝑯p,𝜽Sp\)\),\\displaystyle\\bm\{S\}^\{\\rm curv\}\_\{p\}=\{\\mathrm\{Softmax\}\}\(\{\\rm GNN\}\(\\bm\{A\}^\{\\rm curv\}\_\{p\},\\bm\{H\}\_\{p\},\\boldsymbol\{\\theta\}\_\{S\_\{p\}\}\)\),\(11\)𝑺lcurv=Softmax​\(GNN​\(𝑨lcurv,𝑯l,𝜽Sl\)\),\\displaystyle\\bm\{S\}^\{\\rm curv\}\_\{l\}=\{\\mathrm\{Softmax\}\}\(\{\\rm GNN\}\(\\bm\{A\}^\{\\rm curv\}\_\{l\},\\bm\{H\}\_\{l\},\\boldsymbol\{\\theta\}\_\{S\_\{l\}\}\)\),\(12\)where𝜽Sp\\boldsymbol\{\\theta\}\_\{S\_\{p\}\}and𝜽Sl\\boldsymbol\{\\theta\}\_\{S\_\{l\}\}are learnable parameters, andcpc\_\{p\}andclc\_\{l\}represent the predefined numbers of clusters for the protein and ligand, respectively\.

Based on the curvature\-guided cluster assignment matrices, the CDC module aggregates atom\-level information into curvature\-aware cluster representations\. The resulting cluster\-level embeddings for protein and ligand, denoted as𝒁pcurv∈ℝcp×d\\bm\{Z\}^\{\\rm curv\}\_\{p\}\\in\\mathbb\{R\}^\{c\_\{p\}\\times d\}and𝒁lcurv∈ℝcl×d\\bm\{Z\}^\{\\rm curv\}\_\{l\}\\in\\mathbb\{R\}^\{c\_\{l\}\\times d\}, together with their corresponding curvature\-aware cluster adjacency matrices𝑨~pcurv∈ℝcp×cp\\tilde\{\\bm\{A\}\}^\{\\rm curv\}\_\{p\}\\in\\mathbb\{R\}^\{c\_\{p\}\\times c\_\{p\}\}and𝑨~lcurv∈ℝcl×cl\\tilde\{\\bm\{A\}\}^\{\\rm curv\}\_\{l\}\\in\\mathbb\{R\}^\{c\_\{l\}\\times c\_\{l\}\}are obtained as

𝒁pcurv=\(𝑺pcurv\)⊤​𝑯p,𝑨~pcurv=\(𝑺pcurv\)⊤​𝑨~p​𝑺pcurv,\\displaystyle\\bm\{Z\}^\{\\rm curv\}\_\{p\}=\(\{\\bm\{S\}^\{\\rm curv\}\_\{p\}\}\)^\{\\top\}\\bm\{H\}\_\{p\},\\ \\tilde\{\\bm\{A\}\}^\{\\rm curv\}\_\{p\}=\(\{\\bm\{S\}^\{\\rm curv\}\_\{p\}\}\)^\{\\top\}\\tilde\{\\bm\{A\}\}\_\{p\}\\bm\{S\}^\{\\rm curv\}\_\{p\},\(13\)𝒁lcurv=\(𝑺lcurv\)⊤​𝑯l,𝑨~lcurv=\(𝑺lcurv\)⊤​𝑨~l​𝑺lcurv\.\\displaystyle\\bm\{Z\}^\{\\rm curv\}\_\{l\}=\(\{\\bm\{S\}^\{\\rm curv\}\_\{l\}\}\)^\{\\top\}\\bm\{H\}\_\{l\},\\ \\tilde\{\\bm\{A\}\}^\{\\rm curv\}\_\{l\}=\(\{\\bm\{S\}^\{\\rm curv\}\_\{l\}\}\)^\{\\top\}\\tilde\{\\bm\{A\}\}\_\{l\}\\bm\{S\}^\{\\rm curv\}\_\{l\}\.\(14\)Here,𝒁pcurv\\bm\{Z\}^\{\\rm curv\}\_\{p\}and𝒁lcurv\\bm\{Z\}^\{\\rm curv\}\_\{l\}summarize atomic embeddings into higher\-level structural units, while𝑨~pcurv\\tilde\{\\bm\{A\}\}^\{\\rm curv\}\_\{p\}and𝑨~lcurv\\tilde\{\\bm\{A\}\}^\{\\rm curv\}\_\{l\}encode curvature\-enhanced inter\-cluster connectivity patterns, ensuring that the geometric and topological organization of the original molecular graphs is preserved after pooling\.

To further refine these hierarchical representations and capture interactions across clusters, additional GNNs are applied to the cluster\-level graphs:

𝑪pcurv=GNN​\(𝑨~pcurv,𝒁pcurv,𝜽Zp\),\\displaystyle\\bm\{C\}\_\{p\}^\{\\rm curv\}=\{\\rm GNN\}\(\\tilde\{\\bm\{A\}\}^\{\\rm curv\}\_\{p\},\\bm\{Z\}^\{\\rm curv\}\_\{p\},\\boldsymbol\{\\theta\}\_\{Z\_\{p\}\}\),\(15\)𝑪lcurv=GNN​\(𝑨~lcurv,𝒁lcurv,𝜽Zl\),\\displaystyle\\bm\{C\}\_\{l\}^\{\\rm curv\}=\{\\rm GNN\}\(\\tilde\{\\bm\{A\}\}^\{\\rm curv\}\_\{l\},\\bm\{Z\}^\{\\rm curv\}\_\{l\},\\boldsymbol\{\\theta\}\_\{Z\_\{l\}\}\),\(16\)where𝜽Zp\\boldsymbol\{\\theta\}\_\{Z\_\{p\}\}and𝜽Zl\\boldsymbol\{\\theta\}\_\{Z\_\{l\}\}are learnable parameters\. The resulting cluster embeddings,𝑪pcurv\\bm\{C\}\_\{p\}^\{\\rm curv\}and𝑪lcurv\\bm\{C\}\_\{l\}^\{\\rm curv\}serve as refined, semantically coherent representations of the protein and ligand at the cluster level\.

#### 3\.3\.3Optimal Transport\-Based Cluster Matching\(OTCM\)

To capture the correspondence between functionally critical clusters that drive protein–ligand interactions, we introduce the OTCM module\. Optimal transport \(OT\), as a classical mathematical framework for comparing and aligning probability distributions, establishes robust and interpretable correspondences by identifying the most cost\-efficient transport plan between them\[[41](https://arxiv.org/html/2606.14159#bib.bib41)\],\[[42](https://arxiv.org/html/2606.14159#bib.bib42)\],\[[43](https://arxiv.org/html/2606.14159#bib.bib43)\]\.

In this study, the protein and ligand are modeled as two heterogeneous structural domains,𝒟p\\mathcal\{D\}\_\{p\}and𝒟l\\mathcal\{D\}\_\{l\}, where the curvature\-refined clusters produced by the CDC module define discrete probability distributions over the cluster sets of each domain\. Building on this representation, cross\-domain interactions between protein clusters and ligand clusters are formulated as an entropy\-regularized OT problem that learns an OT plan aligning structurally and functionally related clusters across domains\.

Given the refined cluster embeddings𝑪pcurv∈ℝcp×d\\bm\{C\}\_\{p\}^\{\\rm curv\}\\in\\mathbb\{R\}^\{c\_\{p\}\\times d\}and𝑪lcurv∈ℝcl×d\\bm\{C\}\_\{l\}^\{\\rm curv\}\\in\\mathbb\{R\}^\{c\_\{l\}\\times d\}produced by the CDC module, we first project them into a shared latent space through learnable mappings𝑾p\\bm\{W\}\_\{p\}and𝑾l\\bm\{W\}\_\{l\}\. This ensures that both domains are represented in a united metric space, allowing for meaningful computation of cross\-domain distances\.

To quantify the correspondence between protein clusters and ligand clusters, the protein\-to\-ligand transport cost matrix𝑪p→l∈ℝcp×cl\\bm\{C\}\_\{p\\rightarrow l\}\\in\\mathbb\{R\}^\{c\_\{p\}\\times c\_\{l\}\}is then defined by the squared Euclidean distance between the projected cluster embeddings

𝑪p→l​\(i,j\)=‖fpi−flj‖22,\\displaystyle\\bm\{C\}\_\{p\\rightarrow l\}\(i,j\)=\\\|f^\{i\}\_\{p\}\-f^\{j\}\_\{l\}\\\|\_\{2\}^\{2\},\(17\)wherefpif^\{i\}\_\{p\}andfljf^\{j\}\_\{l\}denote the projected features of theii\-th protein cluster and thejj\-th ligand cluster, respectively\.

Let𝜸p→l∈ℝ\+cp×cl\\bm\{\\gamma\}\_\{p\\rightarrow l\}\\in\\mathbb\{R\}\_\{\+\}^\{c\_\{p\}\\times c\_\{l\}\}represents the transport assignment of protein\-to\-ligand pairwise transport cost matrix\. The cross\-domain matching is then formulated as an entropy\-regularized Kantorovich OT problem

min𝜸p→l⁡⟨𝜸p→l,𝑪p→l⟩\+λ​H​\(γp→l\),s\.t\.​𝜸p→l​𝟏cl=μ,𝜸p→l⊤​𝟏cp=ν,\\displaystyle\\begin\{aligned\} &\\min\_\{\\bm\{\\gamma\}\_\{p\\rightarrow l\}\}\\langle\\bm\{\\gamma\}\_\{p\\rightarrow l\},\\bm\{C\}\_\{p\\rightarrow l\}\\rangle\+\\lambda H\(\\gamma\_\{p\\rightarrow l\}\),\\\\\[6\.0pt\] &\\text\{s\.t\.\}\\ \\bm\{\\gamma\}\_\{p\\rightarrow l\}\\bm\{1\}\_\{c\_\{l\}\}=\\mu,\\ \\bm\{\\gamma\}\_\{p\\rightarrow l\}^\{\\top\}\\bm\{1\}\_\{c\_\{p\}\}=\\nu,\\end\{aligned\}\(18\)where𝟏cl∈ℝcl\\bm\{1\}\_\{c\_\{l\}\}\\in\\mathbb\{R\}^\{c\_\{l\}\}and𝟏cp∈ℝcp\\bm\{1\}\_\{c\_\{p\}\}\\in\\mathbb\{R\}^\{c\_\{p\}\}are all\-one vector,μ\\muandν\\nudenote the empirical marginal distribution of the protein and ligand clusters\. The entropy term is given byH​\(𝜸p→l\)=∑i,j𝜸p→li​j​log​𝜸p→li​jH\(\\bm\{\\gamma\}\_\{p\\rightarrow l\}\)=\\sum\_\{i,j\}\\bm\{\\gamma\}^\{ij\}\_\{p\\rightarrow l\}\{\\rm log\}\\bm\{\\gamma\}^\{ij\}\_\{p\\rightarrow l\}, for penalty parameterλ\>0\\lambda\>0, the OT solution𝜸p→l∗\\bm\{\\gamma\}^\{\*\}\_\{p\\rightarrow l\}is unique and has the closed form

𝜸p→l∗=diag​\(μ\)​exp​\(−𝑪p→lλ\)​diag​\(ν\)\.\\displaystyle\\bm\{\\gamma\}^\{\*\}\_\{p\\rightarrow l\}=\{\\rm diag\}\(\\mu\)\\mathrm\{exp\}\\left\(\-\\frac\{\\bm\{C\}\_\{p\\rightarrow l\}\}\{\\lambda\}\\right\)\{\\rm diag\}\(\\nu\)\.\(19\)
To characterize the strength of interactions from protein clusters to ligand clusters, we perform row normalization on the optimal coupling matrix to derive a matching matrix

𝑴p→l=diag​\(𝜸∗​𝟏n\)−1​𝜸∗∈ℝcp×cl,\\displaystyle\\bm\{M\}\_\{p\\rightarrow l\}=\{\\rm diag\}\(\\bm\{\\gamma\}^\{\*\}\\bm\{1\}\_\{n\}\)^\{\-1\}\\bm\{\\gamma\}^\{\*\}\\in\\mathbb\{R\}^\{c\_\{p\}\\times c\_\{l\}\},\(20\)which defines a many\-to\-many correspondence between protein clusters and ligand clusters\. Intuitively, larger matching weights indicate that a protein cluster exhibits a stronger focus on a ligand cluster\.

Based on the matching relationship, each protein cluster performs weighted aggregation of ligand clusters information according to the learned matching weights\. The protein\-to\-ligand interaction representation is represented as

𝒁p→l=𝑴p→l​𝑳,\\displaystyle\\bm\{Z\}\_\{p\\rightarrow l\}=\\bm\{M\}\_\{p\\rightarrow l\}\\bm\{L\},\(21\)where𝑳=𝑾p→l​𝑪lcurv\\bm\{L\}=\\bm\{W\}\_\{p\\rightarrow l\}\\bm\{C\}\_\{l\}^\{\\rm curv\}denotes a learnable linear transformation of ligand cluster representations\. This transformation allows the model to adaptively emphasize ligand substructures that are strongly coupled with specific protein subregions\.

A symmetric ligand\-to\-protein representation𝒁l→p\\bm\{Z\}\_\{l\\rightarrow p\}is obtained analogously to ensure bidirectional modeling of protein\-ligand interactions\. Through this bidirectional OT\-based matching, the OTCM module establishes a geometry\-consistent alignment between protein and ligand cluster representations, capturing key cross\-domain interactions and identifying inter\-cluster correspondences critical for molecular binding while attenuating the influence of irrelevant or weakly interacting clusters\.

#### 3\.3\.4Protein\-Ligand Interaction Representation and Binding Affinity Prediction

Finally, to derive a unified representation of the protein–ligand interaction, the curvature\-informed structural embeddings produced by the CDC module are integrated with the cross\-domain transport features learned by the OTCM module\. This fusion jointly captures \(i\) the intrinsic geometric–topological organization within each molecular domain and \(ii\) the directional interaction patterns between protein and ligand clusters\. The protein\-ligand interaction representation is computed as

𝒁pro\-lig=Φl→p​\(∑i=1cl𝒁l→p​\[i\]\+∑i=1cl𝑪lcurv​\[i\]\)\+Φp→l​\(∑j=1cp𝒁p→l​\[j\]\+∑j=1cp𝑪pcurv​\[j\]\)\.\\displaystyle\\begin\{aligned\} \\bm\{Z\}\_\{\\text\{pro\-lig\}\}&=\\Phi\_\{l\\rightarrow p\}\\\!\\left\(\\sum\_\{i=1\}^\{c\_\{l\}\}\\bm\{Z\}\_\{l\\rightarrow p\}\[i\]\+\\sum\_\{i=1\}^\{c\_\{l\}\}\\bm\{C\}^\{\\rm curv\}\_\{l\}\[i\]\\right\)\\\\ &\+\\Phi\_\{p\\rightarrow l\}\\\!\\left\(\\sum\_\{j=1\}^\{c\_\{p\}\}\\bm\{Z\}\_\{p\\rightarrow l\}\[j\]\+\\sum\_\{j=1\}^\{c\_\{p\}\}\\bm\{C\}^\{\\rm curv\}\_\{p\}\[j\]\\right\)\.\\end\{aligned\}\(22\)whereΦl→p\\Phi\_\{l\\rightarrow p\}andΦp→l\\Phi\_\{p\\rightarrow l\}are multilayer perceptrons \(MLPs\) with learnable parameters that nonlinearly fuse intra\-domain curvature\-aware representations with cross\-domain transport\-informed features\.

The protein\-ligand interaction representation𝒁pro\-lig\\bm\{Z\}\_\{\\text\{pro\-lig\}\}is then passed to the prediction head

y^=Φpro\-lig​\(𝒁pro\-lig\),\\displaystyle\\hat\{y\}=\\Phi\_\{\\text\{pro\-lig\}\}\(\\bm\{Z\}\_\{\\text\{pro\-lig\}\}\),\(23\)whereΦpro\-lig\\Phi\_\{\\text\{pro\-lig\}\}is an MLP used to regress the binding affinity\.

The overall model is trained end\-to\-end by minimizing the mean\-squared error \(MSE\) between the predicted and experimentally measured binding affinities

LMSE=1N​∑i=1N\(y^i−yi\)2,\\displaystyle L\_\{\\rm MSE\}=\\frac\{1\}\{N\}\\sum\_\{i=1\}^\{N\}\(\\hat\{y\}\_\{i\}\-y\_\{i\}\)^\{2\},\(24\)whereNNis the number of protein\-ligand complex samples,y^i\\hat\{y\}\_\{i\}denotes the predicted binding affinity of theii\-th complex, andyiy\_\{i\}represents its corresponding experimental ground truth\.

By joint integration curvature\-guided structural signals with OT\-based cross\-domain relational correspondences, RicciBind learns representations that encode both fine\-grained geometric context and high\-level interaction semantics, resulting in improved fidelity and interpretability in protein–ligand binding affinity prediction\.

## 4Results and Discussion

In this study, we evaluate RicciBind on diverse PLA benchmarks tasks, including binding affinity prediction and large\-scale structure\-based virtual screening\.

All experiments were implemented using PyTorch on an NVIDIA GeForce RTX 4080 GPU with 16 GB of memory\. We employed the Adam optimizer with a learning rate of1​e−41\\textbf\{e\}\{\-4\}to update model parameters and applied a weight decay of1​e−61\\textbf\{e\}\{\-6\}to mitigate overfitting\. The batch size was set to 128, and early stopping was adopted based on the validation RMSE to ensure stable convergence\.

### 4\.1Data Preparation and Experimental Setup

For the binding affinity prediction task, we follow the experimental protocol of EHIGN\[[15](https://arxiv.org/html/2606.14159#bib.bib15)\], training and validating RicciBind on the PDBbind v2016 general set\. After preprocessing with RDKit, a total of 12,904 protein\-ligand complexes are retained and randomly divided into training and validation sets\. To assess the model’s generalization capability, we evaluate its performance on three independent external test sets: the PDBbind v2013 core set, PDBbind v2016 core set, and PDBbind v2019 holdout set, ensuring that there is no overlap between the training, validation, and testing samples\.

Furthermore, to examine the RicciBind’s out\-of\-distribution generalization beyond PDBbind, we evaluate it on the CSAR NRC\-HiQ dataset\[[44](https://arxiv.org/html/2606.14159#bib.bib44)\]\. After removing redundant entries overlapping with the PDBbind training set and structures that cannot be processed by RDKit, 14 complexes are retained for evaluation\.

For diverse protein evaluation task, we adopt the standardized evaluation benchmark constructed by Holoprot\[[45](https://arxiv.org/html/2606.14159#bib.bib45)\]\. Specifically, the PDBbind v2019 refined set\[[46](https://arxiv.org/html/2606.14159#bib.bib46)\]is partitioned into training, validation, and test subsets based on a 60% protein sequence similarity threshold, thereby strictly limiting sequence homology between the training and test targets at the data level\. This splitting strategy effectively mitigates the risk of information leakage arising from protein family similarity and ensures that the target proteins in the test set are markedly distinct from those in the training set in both sequence and structural characteristics\.

To evaluate the applicability of RicciBind in realistic drug discovery scenarios, we follow the cold start and warm start evaluation protocols introduced by EHIGN\[[15](https://arxiv.org/html/2606.14159#bib.bib15)\], examining the predictive performance of RicciBind on previously unseen protein targets as well as on ligands with substantial structural divergence\. Specifically, in the cold start setting, the original PDBbind training set is partitioned based on protein sequence similarity or ligand scaffold diversity, and new training, validation, and test sets are constructed in an 8:1:1 ratio\. In the protein cold start configuration, the sequences of proteins in the validation and test sets share less than 30% sequence identity with those in the training set, ensuring substantial structural divergence between training and testing targets\. In the ligand cold start setting, all ligands in the test set possess scaffolds that are not observed during training, enabling the evaluation of the model’s generalization to novel chemical structures\. By contrast, in the warm start setting, the dataset is randomly partitioned, allowing proteins and ligands to appear across multiple subsets, thereby reflecting the model’s performance under relatively familiar molecular environments\. Comparative analyses under these two settings provide a systematic assessment of RicciBind’s performance on both seen and unseen molecular entities, allowing a comprehensive evaluation of its predictive robustness and generalization capability in challenging and realistic protein–ligand binding scenarios\.

To further assess the practical applicability of RicciBind, we conduct large\-scale structure\-based virtual screening \(SBVS\) experiments\. Unlike the regression\-based binding affinity prediction task, SBVS task aims to evaluate the model’s capability to distinguish active ligands from decoys—molecules that share similar physicochemical properties with actives but fail to bind to the target protein\. We conduct experiments on two widely adopted benchmarks, DUD\-E\[[47](https://arxiv.org/html/2606.14159#bib.bib47)\]and LIT\-PCBA\[[48](https://arxiv.org/html/2606.14159#bib.bib48)\]\.

The DUD\-E dataset consists of 102 target proteins, 22,886 active compounds, and a large number of decoys generated from the ZINC\[[49](https://arxiv.org/html/2606.14159#bib.bib49)\]database, maintaining an approximate 1:50 active\-to\-decoy ratio\. In this study, we focus on 57 well\-characterized and functionally classifiable targets spanning four major protein families: GPCRs, kinases, nuclear proteins, and proteases\. To evaluate the generalization capability of RicciBind in virtual screening, we adopt a cross\-family zero\-shot setting: 26 targets from the nuclear protein and protease families are used for training, while 31 targets from the GPCR and kinase families are reserved for testing\. This experimental design assesses the model’s ability to perform ligand screening on entirely unseen protein families, providing a stringent test of its capacity to learn transferable protein\-ligand interaction patterns\. The full list of training and testing targets is provided in Supplementary Table S4\.

The LIT\-PCBA dataset, in contrast, derived from PubChem\-based high\-throughput biological assay data and constructed using an asymmetric validation embedding \(AVE\) strategy, which mitigates hidden data biases and minimize structural redundancy between training and testing sets\. This design yields a more realistic and challenging benchmark for evaluating structure\-based screening models\. In this study, to further examine target\-specific generalization, we evaluate RicciBind on three particularly challenging targets \(FEN1, KAT2A, and PKM2\) from LIT\-PCBA, each exhibiting an extreme active\-to\-decoy ratio exceeding 1:1000, thereby reflecting conditions closer to real\-world drug discovery pipelines\.

Table 1:Performance Statistics Summary for Protein–Ligand Binding Affinity Predictions on Three External Test Sets- •Note: The evaluation uses RMSE for prediction accuracy, with the average affinity values in the test sets being 6\.318 for PDBBind v2013, 6\.486 for PDBBind v2016 and 6\.333 for PDBBind v2019\. Pearson correlation coefficient evaluates the correlation of predictions with experimental affinities\. An upward arrow \(↑\\uparrow\) denotes higher scores are better and a downward arrow \(↓\\downarrow\) denotes the reverse\. The mean and standard deviation of the results from three independent runs are reported\. The best performance for each metric is highlighted inboldand the second\-best performance isunderlined\.

### 4\.2RicciBind in Cross\-Dataset PLA Prediction

In the PLA prediction task, the objective is to predict the binding strength between a given pair of protein and ligand, which is a regression problem\. We first evaluated RicciBind across three non\-overlapping datasets from PDBbind, comparing it against a diverse set of representative baselines, including structure\-free models \(DeepDTA\[[4](https://arxiv.org/html/2606.14159#bib.bib4)\], GraphDTA\[[5](https://arxiv.org/html/2606.14159#bib.bib5)\]and MGraphDTA\[[50](https://arxiv.org/html/2606.14159#bib.bib50)\]\) and structure\-based models \(PotentialNet\[[14](https://arxiv.org/html/2606.14159#bib.bib14)\], GNN\-DTI\[[51](https://arxiv.org/html/2606.14159#bib.bib51)\], IGN\[[13](https://arxiv.org/html/2606.14159#bib.bib13)\], SchNet\[[52](https://arxiv.org/html/2606.14159#bib.bib52)\], EGNN\[[53](https://arxiv.org/html/2606.14159#bib.bib53)\], MetalProGNet\[[54](https://arxiv.org/html/2606.14159#bib.bib54)\], GIGN\[[6](https://arxiv.org/html/2606.14159#bib.bib6)\], SS\-GNN\[[55](https://arxiv.org/html/2606.14159#bib.bib55)\], EHIGN\[[15](https://arxiv.org/html/2606.14159#bib.bib15)\]and CheapNet\[[20](https://arxiv.org/html/2606.14159#bib.bib20)\]\)\. Following the experimental protocol of EHIGN, all experiments are repeated three times with different random seeds to ensure statistical reliability\. Model performance is assessed using the RMSE and the Pearson’s correlation coefficient, with the mean and standard deviation reported across runs\. The comparative results on three independent external test sets are summarized in Table[1](https://arxiv.org/html/2606.14159#S4.T1), where baseline results are referenced from the official reports of EHIGN\[[15](https://arxiv.org/html/2606.14159#bib.bib15)\]and CheapNet\[[20](https://arxiv.org/html/2606.14159#bib.bib20)\]\.

As shown in Table[1](https://arxiv.org/html/2606.14159#S4.T1), RicciBind achieves competitive performance in cross\-dataset binding affinity prediction tasks\. Overall, RicciBind substantially outperforms structure\-free models and achieves performance comparable to or exceeding that of representative structure\-based approaches\. Specifically, on the PDBbind v2013 core set, RicciBind attains a Pearson correlation coefficient of 0\.872, representing a significant improvement over existing methods\. On the PDBbind v2016 core set, RicciBind similarly achieves the highest Pearson correlation of 0\.871, confirming its stability and robustness across different dataset\. Furthermore, on the more challenging and larger\-scale PDBbind v2019 holdout set, RicciBind exhibites superior overall performance, with an RMSE of 1\.335 and a Pearson correlation of 0\.667, indicating strong cross\-dataset generalization capabilities to previously unseen protein–ligand complexes\. Notably, evaluation on the PDBbind v2019 holdout set follows a temporal split scenario, where the model is trained on historical structural data and tested on newly released protein–ligand complexes\. This setup more realistically simulates the predictive requirements encountered in real\-world drug discovery\[[15](https://arxiv.org/html/2606.14159#bib.bib15)\]\.

In summary, across the three PDBbind datasets, RicciBind consistently and substantially outperforms structure\-free methods, highlighting the critical importance of explicitly incorporating three\-dimensional molecular structural information into binding affinity modeling for accurately characterizing protein–ligand interactions\. Moreover, compared with existing structure\-based models, RicciBind achieved superior or at least competitive performance across all benchmarks, demonstrating strong stability and robust cross\-dataset generalization\. These advantages can be attributed to the curvature\-aware geometric representations, which jointly capture local geometric constraints and global topological properties of molecular structures, and to the modeling of cluster\-level interactions that effectively encode higher\-order structural patterns driving protein–ligand binding, thereby enabling a more fine\-grained representation of protein–ligand interactions\.

![Refer to caption](https://arxiv.org/html/2606.14159v1/x4.png)Figure 4:Performance evaluation of RicciBind on non\-PDBbind external test sets and unseen proteins\.a, Comparison of Pearson correlation coefficients obtained by training RicciBind on the PDBbind dataset and evaluating on the external CSAR NRC\-HiQ test set\. b, Comparative analysis of predicted binding affinities for a representative protein–ligand complex \(PDB ID: 5a0b\)\. c\-e, Performance of RicciBind on unseen proteins, evaluated in terms of root mean squared error\(c\), Pearson correlation coefficient\(d\), and Spearman rank correlation coefficient\. All the compared results are taken from\[[20](https://arxiv.org/html/2606.14159#bib.bib20)\]\.
### 4\.3RicciBind on Non\-PDBbind External Test Set

To further assess the reliability and generalization capability of RicciBind beyond the PDBbind domain, we evaluate models trained on PDBbind dataset using the CSAR NRC\-HiQ dataset\[[44](https://arxiv.org/html/2606.14159#bib.bib44)\]\. Compared with PDBbind, CSAR NRC\-HiQ adopts more stringent curation and quality\-control criteria, and is therefore widely regarded as a rigorous benchmark for evaluating the practical applicability of protein–ligand binding affinity prediction models\. As summarized in Fig\.[4](https://arxiv.org/html/2606.14159#S4.F4)a, RicciBind achieves the highest Pearson correlation coefficient of 0\.92 among structure\-based baselines, demonstrating highly competitive performance under this challenging cross\-dataset evaluation setting\. These results demonstrate that RicciBind is capable of learning transferable protein–ligand interaction patterns, achieving stable predictive performance on previously unseen complex structures\.

### 4\.4RicciBind in Diverse Protein Dataset

To evaluate the generalization capability of RicciBind in characterising the structures of unknown target proteins under substantially increased target protein structural diversity, we adopted the standardized evaluation benchmark constructed by Holoprot \(see Subsection[4\.1](https://arxiv.org/html/2606.14159#S4.SS1)for details of the dataset\)\.

The comparison results are summarized in Fig\.[4](https://arxiv.org/html/2606.14159#S4.F4)c\-e, with baseline performances referenced from CheapNet\[[20](https://arxiv.org/html/2606.14159#bib.bib20)\], ProNet\[[56](https://arxiv.org/html/2606.14159#bib.bib56)\]and BindNet\[[57](https://arxiv.org/html/2606.14159#bib.bib57)\]\. RicciBind consistently outperforms all baselines across RMSE \(Fig\.[4](https://arxiv.org/html/2606.14159#S4.F4)c\), Pearson’s correlation coefficient \(Fig\.[4](https://arxiv.org/html/2606.14159#S4.F4)d\), and Spearman rank correlation coefficient \(Fig\.[4](https://arxiv.org/html/2606.14159#S4.F4)e\)\. These results demonstrate its clear advantages in both predictive accuracy and ranking consistency\. Specifically, RicciBind achieved an RMSE of 1\.197, a Pearson correlation of 0\.811, and a Spearman correlation of 0\.812, representing relative improvements of 3\.3%, 2\.1%, and 2\.9% over the strongest baseline, respectively\.

Notably, as a structure\-based hierarchical higher\-order geometric representation method, RicciBind significantly outperforms models that rely primarily on protein surface information \(e\.g\., MaSIF\[[58](https://arxiv.org/html/2606.14159#bib.bib58)\]\) as well as structure\-based molecular modeling approaches \(e\.g\., BindNet\[[57](https://arxiv.org/html/2606.14159#bib.bib57)\]\)\. These results suggest that, when target protein structural diversity is substantially increased, ORC provides an effective geometric prior for jointly encoding fine\-grained molecular structure and higher\-order interaction patterns\. This design contributes to the strong generalization ability and predictive stability of RicciBind across diverse and unseen target proteins \(see Supplementary Table S2 for details\)\.

Table 2:Performance Evaluation for Structure\-Base Models in the Warm Start \(Random Split\) and Cold Start \(Split Based on Sequence Identity and Scaffold\) Scenarios
### 4\.5RicciBind in Cold Start Scenario

In real\-world drug discovery, PLA prediction models are expected to exhibit strong generalization capabilities, enabling reliable predictions for proteins or ligands that were not observed during training and that exhibit substantial structural differences from known samples\. To evaluate RicciBind under such challenging generalization scenarios, we follow the experimental protocol proposed in EHIGN\[[15](https://arxiv.org/html/2606.14159#bib.bib15)\]and assessed its performance under both cold start and warm start settings \(see Subsection[4\.1](https://arxiv.org/html/2606.14159#S4.SS1)for details of the dataset\)\.

Table[2](https://arxiv.org/html/2606.14159#S4.T2)summarizes the performance of RicciBind under both warm start and cold start evaluation scenarios\. Overall, RicciBind achieves stable and consistent advantages across all three data partitioning strategies, outperforming the SOTA structure\-based method EHIGN\[[15](https://arxiv.org/html/2606.14159#bib.bib15)\]\. This highlights its reliability and robustness in challenging prediction settings, with particularly notable generalization performance on unseen protein–ligand complexes\. Specifically, in the warm start setting, RicciBind achieves the lowest RMSE and the highest Pearson correlation coefficient among all competing methods, yielding relative improvements of 2\.7% and 1\.3%, respectively\. This indicates that when the test distribution is closely aligned with the training data, RicciBind can more effectively exploit structural information to finely characterize protein–ligand binding patterns, resulting in improved predictive accuracy and stability\. By contrast, the cold start setting poses more stringent challenges to model generalization\. In both the protein cold start split based on a 30% sequence identity threshold and the ligand cold start split based on scaffold dissimilarity, RicciBind maintained its leading performance, achieving the lowest RMSE values \(1\.437 and 1\.248\) and the highest Pearson correlation coefficients \(0\.622 and 0\.722\), respectively\. This demonstrates that RicciBind remains capable of effectively learning and capturing key protein–ligand interaction patterns even under conditions of pronounced structural variation in either proteins or ligands\.

Notably, under cold start scenarios, all models exhibit varying degrees of performance degradation, highlighting the persistent challenge of improving generalization to entirely unseen structures\. Nevertheless, RicciBind consistently achieves the best average performance and the lowest standard deviation across all evaluation scenarios, demonstrating its robustness and predictive stability under diverse data distributions\. Collectively, these results suggest that RicciBind not only effectively learns geometric characteristics from previously observed molecules, but also retains the ability to extract key interaction patterns when substantial structural variations occur\. This reflects the potential advantages of its curvature\-guided geometric characterization and cluster\-level interaction modeling mechanism in mitigating distribution shifts caused by structural differences, thereby supporting more robust and reliable PLA prediction in challenging generalization settings\.

### 4\.6RicciBind in Structure\-Based Virtual Screening

To assess the practical applicability of RicciBind in real\-world drug discovery pipelines, we conduct comprehensive experiments on two structure\-based virtual screening \(SBVS\) benchmarks\. Virtual screening aims to prioritize the identification of potentially active molecules from large\-scale compound libraries by leveraging protein\-ligand 3D structural information\. Its core challenge lies in the model’s ability to identify true binders at an early stage\. Following the standard evaluation protocols, we employed the enrichment factor \(EF\) as the primary performance metric and reported EF values at the top 0\.1%, 0\.5%, 1%, and 5% of ranked compound lists, enabling a quantitative comparison of the early enrichment capability across different methods\.

We first evaluated the virtual screening performance of RicciBind on a high\-confidence subset of the DUD\-E benchmark\. The assessment was conducted under a stringent cross\-family zero\-shot setting, designed to test the RicciBind’s generalization capability towards unknown targets under conditions approximating real\-world drug discovery scenarios\. As shown in Fig\.[5](https://arxiv.org/html/2606.14159#S4.F5), RicciBind consistently outperforms all competing methods across four evaluation thresholds \(top 0\.1%, 0\.5%, 1%, and 5%\)\. Notably, under the most stringent early enrichment criteria, RicciBind achieves enrichment factors of 24\.346 and 17\.630 at the top 0\.1% and 0\.5% thresholds, respectively, indicating its strong ability to screen out genuinely active compounds from a vast array of candidate molecules\. These results demonstrate the robust generalization of RicciBind under cross\-family zero\-shot conditions and highlight its potential value for early hit identification in practical high\-throughput virtual screening tasks \(see Subsection[4\.1](https://arxiv.org/html/2606.14159#S4.SS1)and Supplementary Table S3 for details\)\.

Subsequently, We conduct target\-specific SBVS experiments on the LIT\-PCBA benchmark, focusing on three well\-recognized and particularly challenging targets, namely FEN1, KAT2A, and PKM2, to further validate the robustness and generalization capability of RicciBind in complex screening scenarios\. In this setting, RicciBind was compared against docking\-based scoring functions and learning\-based approaches\. Relevant benchmark results were uniformly sourced from EHIGN\[[15](https://arxiv.org/html/2606.14159#bib.bib15)\]to ensure consistency in evaluation\. As illustrated in Table[3](https://arxiv.org/html/2606.14159#S4.T3), traditional docking scoring functions \(e\.g\., Glide SP\[[60](https://arxiv.org/html/2606.14159#bib.bib60)\]\) exhibit inferior screening performance across all three targets compared with learning\-based methods\. This highlights the challenges posed by complex structure\-activity relationships in the LIT\-PCBA dataset to purely physical scoring models, while also revealing the inherent limitations of docking\-based approaches in early enrichment tasks\. In contrast, RicciBind achieves competitive and stable enrichment performance across all three targets\. Specifically, for the KAT2A target, RicciBind achieves a improvement of approximately 50% over the current SOTA structure\-based model EHIGN on the EF0\.1%\{0\.1\\%\}metric, indicating its enhanced ability to effectively identify active compounds across diverse and challenging screening scenarios\.

Overall, the evaluations on cross\-family zero\-shot virtual screening using DUD\-E and target\-specific virtual screening on LIT\-PCBA demonstrate that RicciBind maintains robust and superior early enrichment performance even under screening scenarios characterised by high chemical space diversity, structurally complex protein targets, and pronounced distributional shifts\. These results underscore the practical utility and translational potential of RicciBind for real\-world structure\-based drug discovery applications\.

![Refer to caption](https://arxiv.org/html/2606.14159v1/x5.png)Figure 5:Performance Comparison of Different Models on the DUD\-E Dataset\.Table 3:Target\-Specific SBVS on the LIT\-PCBA Dataset
### 4\.7Ablation Study

To validate the critical role of ORC in capturing the geometric information of protein\-ligand interactions, we conduct a series of ablation studies to evaluate the impact of removing ORC information on RicciBind performance within both the CAGE and CDC modules\. Specifically, we designed three RicciBind variants to analyze the contribution of Ricci curvature features across different modules, thereby elucidating the significance of curvature\-guided geometric representations in protein\-ligand interaction modeling\.

Table 4:Results of RicciBind with varying settings on the 2013 core set, 2016 core set, and the 2019 holdout set\.- •Variant 1: ORC information is removed from both the CAGE and CDC modules\. This variant evaluates the overall contribution of curvature to protein–ligand interaction modeling in RicciBind\.
- •Variant 2: ORC information is removed only from the CAGE module, while the complex graph node embeddings are encoded using the standard EIGN framework\. This configuration investigates the role of curvature in the explicitly modeling higher\-order topological and geometric features within the interaction graph\.
- •Variant 3: ORC information is removed only from the CDC module, and conventional DiffPool is used to cluster protein and ligand nodes\. This variant assesses the importance of curvature\-guided clustering for capturing local compactness and geometrically cohesive substructures in molecular graphs\.

We evaluated the above RicciBind variants on cross\-dataset PLA prediction task, with the results summarized in Table[4](https://arxiv.org/html/2606.14159#S4.T4)\. Overall, all three ablation variants exhibited varying degrees of performance degradation compared with the full RicciBind model during cross\-dataset testing, underscoring the importance of curvature information in characterizing protein–ligand interactions\. In particular, Variant 1, which completely removes curvature information from both the CAGE and CDC modules, suffers the most pronounced performance decline across all evaluation metrics\. This indicates that without curvature\-guided geometric representations, the model struggles to adequately capture the intricate structural patterns inherent in protein–ligand interactions, highlighting the central role of curvature information within RicciBind\. By contrast, Variant 2 and Variant 3 retain curvature information in the CDC and CAGE modules, respectively, and both achieve marked performance recovery relative to Variant 1\. This suggests that curvature\-guided geometric representations not only effectively enhances the model’s ability to capture higher\-order topological relationships but also significantly improves its geometric sensitivity, predictive stability, and cross\-dataset generalization performance\.

## 5Conclusion

The curvature\-guided geometric deep learning framework, called RicciBind, is proposed to address the challenge of effectively modeling the complex geometric and topological structures underlying molecular interactions\. It integrates Ollivier Ricci curvature–driven hierarchical geometric representations with an optimal transport–based structural matching mechanism to model protein–ligand interactions and enhance binding affinity prediction\. Extensive experiments across multiple benchmark datasets demonstrate that RicciBind consistently exhibits competitive predictive performance and strong generalization capabilities in binding affinity prediction and large\-scale structure\-based virtual screening tasks\. This indicates that integrating curvature\-aware geometric representations with optimal transport theory provides an effective strategy for modeling molecular interactions and predicting protein\-ligand binding affinities with high accuracy, thereby offering a promising new paradigm for structure\-based drug discovery\.

RicciBind focuses on capturing protein–ligand interaction representations, its underlying modeling paradigm is highly extensible and has the potential to generalize to broader biomolecular representations and related tasks\. In the future, we plan to refine RicciBind’s modeling strategies and explore its applicability to a wider range of tasks, such as pocket detection and molecular docking, thereby advancing the drug development process\.

## References

## References

- \[1\]N\. Fleming, “How artificial intelligence is changing drug discovery,”*Nature*, vol\. 557, no\. 7706, pp\. S55–S57, 2018\.
- \[2\]H\. Stärk, O\. Ganea, L\. Pattanaik, R\. Barzilay, and T\. Jaakkola, “Equibind: Geometric deep learning for drug binding structure prediction,” in*International Conference on Machine Learning*\. PMLR, 2022, pp\. 20 503–20 521\.
- \[3\]D\. Chen, J\. Liu, and G\.\-W\. Wei, “Multiscale topology\-enabled structure\-to\-sequence transformer for protein–ligand interaction predictions,”*Nature Machine Intelligence*, vol\. 6, no\. 7, pp\. 799–810, 2024\.
- \[4\]H\. Öztürk, A\. Özgür, and E\. Ozkirimli, “Deepdta: deep drug–target binding affinity prediction,”*Bioinformatics*, vol\. 34, no\. 17, pp\. i821–i829, 2018\.
- \[5\]T\. Nguyen, H\. Le, T\. P\. Quinn, T\. Nguyen, T\. D\. Le, and S\. Venkatesh, “Graphdta: predicting drug–target binding affinity with graph neural networks,”*Bioinformatics*, vol\. 37, no\. 8, pp\. 1140–1147, 2021\.
- \[6\]Z\. Yang, W\. Zhong, Q\. Lv, T\. Dong, and C\. Yu\-Chian Chen, “Geometric interaction graph neural network for predicting protein–ligand binding affinities from 3d structures \(gign\),”*The Journal of Physical Chemistry Letters*, vol\. 14, no\. 8, pp\. 2020–2033, 2023\.
- \[7\]C\. Shen, P\. Ding, J\. Wee, J\. Bi, J\. Luo, and K\. Xia, “Curvature\-enhanced graph convolutional network for biomolecular interaction prediction,”*Computational and Structural Biotechnology Journal*, vol\. 23, pp\. 1016–1025, 2024\.
- \[8\]W\. Torng and R\. B\. Altman, “Graph convolutional neural networks for predicting drug\-target interactions,”*Journal of Chemical Information and Modeling*, vol\. 59, no\. 10, pp\. 4131–4149, 2019\.
- \[9\]T\. Li, X\.\-M\. Zhao, and L\. Li, “Co\-vae: Drug\-target binding affinity prediction by co\-regularized variational autoencoders,”*IEEE Transactions on Pattern Analysis and Machine Intelligence*, vol\. 44, no\. 12, pp\. 8861–8873, 2021\.
- \[10\]P\. Bai, F\. Miljković, B\. John, and H\. Lu, “Interpretable bilinear attention network with domain adaptation improves drug–target prediction,”*Nature Machine Intelligence*, vol\. 5, no\. 2, pp\. 126–136, 2023\.
- \[11\]J\. Gomes, B\. Ramsundar, E\. N\. Feinberg, and V\. S\. Pande, “Atomic convolutional networks for predicting protein\-ligand binding affinity,”*arXiv preprint arXiv:1703\.10603*, 2017\.
- \[12\]J\. Jiménez, M\. Skalic, G\. Martinez\-Rosell, and G\. De Fabritiis, “K deep: protein–ligand absolute binding affinity prediction via 3d\-convolutional neural networks,”*Journal of Chemical Information and Modeling*, vol\. 58, no\. 2, pp\. 287–296, 2018\.
- \[13\]D\. Jiang, C\.\-Y\. Hsieh, Z\. Wu, Y\. Kang, J\. Wang, E\. Wang, B\. Liao, C\. Shen, L\. Xu, J\. Wu*et al\.*, “Interactiongraphnet: a novel and efficient deep graph representation learning framework for accurate protein–ligand interaction predictions,”*Journal of Medicinal Chemistry*, vol\. 64, no\. 24, pp\. 18 209–18 232, 2021\.
- \[14\]E\. N\. Feinberg, D\. Sur, Z\. Wu, B\. E\. Husic, H\. Mai, Y\. Li, S\. Sun, J\. Yang, B\. Ramsundar, and V\. S\. Pande, “Potentialnet for molecular property prediction,”*ACS Central Science*, vol\. 4, no\. 11, pp\. 1520–1530, 2018\.
- \[15\]Z\. Yang, W\. Zhong, Q\. Lv, T\. Dong, G\. Chen, and C\. Y\.\-C\. Chen, “Interaction\-based inductive bias in graph neural networks: enhancing protein\-ligand binding affinity predictions from 3d structures,”*IEEE Transactions on Pattern Analysis and Machine Intelligence*, vol\. 46, no\. 12, pp\. 8191–8208, 2024\.
- \[16\]X\. Zhang, H\. Gao, H\. Wang, Z\. Chen, Z\. Zhang, X\. Chen, Y\. Li, Y\. Qi, and R\. Wang, “Planet: a multi\-objective graph neural network model for protein–ligand binding affinity prediction,”*Journal of Chemical Information and Modeling*, vol\. 64, no\. 7, pp\. 2205–2220, 2023\.
- \[17\]S\. Chen, H\. Yi, Z\. You, X\. Shang, Y\.\-A\. Huang, L\. Wang, and Z\. Wang, “Local–global structure\-aware geometric equivariant graph representation learning for predicting protein–ligand binding affinity,”*IEEE Transactions on Neural Networks and Learning Systems*, 2025\.
- \[18\]Y\. Du, L\. Wang, D\. Feng, G\. Wang, S\. Ji, C\. P\. Gomes, Z\.\-M\. Ma*et al\.*, “A new perspective on building efficient and expressive 3d equivariant graph neural networks,”*Advances in Neural Information Processing Systems*, vol\. 36, pp\. 66 647–66 674, 2023\.
- \[19\]X\. Kong, W\. Huang, and Y\. Liu, “Generalist equivariant transformer towards 3d molecular interaction learning,” in*International Conference on Machine Learning*\. PMLR, 2024, pp\. 25 149–25 175\.
- \[20\]H\. Lim, S\. Kim, and S\. Lee, “Cheapnet: Cross\-attention on hierarchical representations for efficient protein\-ligand binding affinity prediction,” in*The Thirteenth International Conference on Learning Representations*, 2025\.
- \[21\]Z\. Ying, J\. You, C\. Morris, X\. Ren, W\. Hamilton, and J\. Leskovec, “Hierarchical graph representation learning with differentiable pooling,”*Advances in Neural Information Processing Systems*, vol\. 31, 2018\.
- \[22\]Y\. Ollivier, “Ricci curvature of markov chains on metric spaces,”*Journal of Functional Analysis*, vol\. 256, no\. 3, pp\. 810–864, 2009\.
- \[23\]Forman, “Bochner’s method for cell complexes and combinatorial ricci curvature,”*Discrete & Computational Geometry*, vol\. 29, no\. 3, pp\. 323–374, 2003\.
- \[24\]C\.\-C\. Ni, Y\.\-Y\. Lin, F\. Luo, and J\. Gao, “Community detection on networks with ricci flow,”*Scientific Reports*, vol\. 9, no\. 1, p\. 9984, 2019\.
- \[25\]J\. Sia, E\. Jonckheere, and P\. Bogdan, “Ollivier\-ricci curvature\-based method to community detection in complex networks,”*Scientific Reports*, vol\. 9, no\. 1, p\. 9800, 2019\.
- \[26\]C\.\-C\. Ni, Y\.\-Y\. Lin, J\. Gao, X\. D\. Gu, and E\. Saucan, “Ricci curvature of the internet topology,” in*IEEE Conference on Computer Communications \(INFOCOM\)*\. IEEE, 2015, pp\. 2758–2766\.
- \[27\]J\. Wee and K\. Xia, “Forman persistent ricci curvature \(fprc\)\-based machine learning models for protein–ligand binding affinity prediction,”*Briefings in Bioinformatics*, vol\. 22, no\. 6, 2021\.
- \[28\]——, “Ollivier persistent ricci curvature\-based machine learning for the protein–ligand binding affinity prediction,”*Journal of Chemical Information and Modeling*, vol\. 61, no\. 4, pp\. 1617–1626, 2021\.
- \[29\]K\. Nguyen, N\. M\. Hieu, V\. D\. Nguyen, N\. Ho, S\. Osher, and T\. M\. Nguyen, “Revisiting over\-smoothing and over\-squashing using ollivier\-ricci curvature,” in*International Conference on Machine Learning*\. PMLR, 2023, pp\. 25 956–25 979\.
- \[30\]Z\. Ye, K\. S\. Liu, T\. Ma, J\. Gao, and C\. Chen, “Curvature graph network,” in*International Conference on Learning Representations*, 2019\.
- \[31\]H\. Li, J\. Cao, J\. Zhu, Y\. Liu, Q\. Zhu, and G\. Wu, “Curvature graph neural network,”*Information Sciences*, vol\. 592, pp\. 50–66, 2022\.
- \[32\]M\. Jiang, Z\. Li, S\. Zhang, S\. Wang, X\. Wang, Q\. Yuan, and Z\. Wei, “Drug–target affinity prediction using graph neural network and contact maps,”*RSC Advances*, vol\. 10, no\. 35, pp\. 20 701–20 712, 2020\.
- \[33\]J\. Zhou, S\. Li, L\. Huang, H\. Xiong, F\. Wang, T\. Xu, H\. Xiong, and D\. Dou, “Distance\-aware molecule graph attention network for drug\-target binding affinity prediction,”*arXiv preprint arXiv:2012\.09624*, 2020\.
- \[34\]J\. Topping, F\. Di Giovanni, B\. P\. Chamberlain, X\. Dong, and M\. M\. Bronstein, “Understanding over\-squashing and bottlenecks on graphs via curvature,” in*International Conference on Learning Representations*, 2022\.
- \[35\]R\. S\. Hamilton, “Three\-manifolds with positive ricci curvature,”*Journal of Differential Geometry*, vol\. 17, no\. 2, pp\. 255–306, 1982\.
- \[36\]T\. H\. Colding, “Ricci curvature and volume convergence,”*Annals of Mathematics*, vol\. 145, no\. 3, pp\. 477–501, 1997\.
- \[37\]A\. Samal, R\. Sreejith, J\. Gu, S\. Liu, E\. Saucan, and J\. Jost, “Comparative analysis of two discretizations of ricci curvature for complex networks,”*Scientific Reports*, vol\. 8, no\. 1, p\. 8650, 2018\.
- \[38\]Y\. Ollivier, “Ricci curvature of metric spaces,”*Comptes Rendus Mathematique*, vol\. 345, no\. 11, pp\. 643–646, 2007\.
- \[39\]Y\. Lin, L\. Lu, and S\.\-T\. Yau, “Ricci curvature of graphs,”*Tohoku Mathematical Journal, Second Series*, vol\. 63, no\. 4, pp\. 605–627, 2011\.
- \[40\]C\.\-C\. Ni, Y\.\-Y\. Lin, J\. Gao, and X\. Gu, “Network alignment by discrete ollivier\-ricci flow,” in*International Symposium on Graph Drawing and Network Visualization*\. Springer, 2018, pp\. 447–462\.
- \[41\]G\. Monge, “Mémoire sur la théorie des déblais et des remblais,”*Mem\. Math\. Phys\. Acad\. Royale Sci\.*, pp\. 666–704, 1781\.
- \[42\]C\. Villani*et al\.*,*Optimal transport: old and new*\. Springer, 2008, vol\. 338\.
- \[43\]C\.\-X\. Ren, Y\. Zhai, Y\.\-W\. Luo, and H\. Yan, “Towards unsupervised domain adaptation via domain\-transformer,”*International Journal of Computer Vision*, vol\. 132, no\. 12, pp\. 6163–6183, 2024\.
- \[44\]J\. B\. Dunbar Jr, R\. D\. Smith, K\. L\. Damm\-Ganamet, A\. Ahmed, E\. X\. Esposito, J\. Delproposto, K\. Chinnaswamy, Y\.\-N\. Kang, G\. Kubish, J\. E\. Gestwicki*et al\.*, “Csar data set release 2012: ligands, affinities, complexes, and docking decoys,”*Journal of Chemical Information and Modeling*, vol\. 53, no\. 8, pp\. 1842–1852, 2013\.
- \[45\]V\. R\. Somnath, C\. Bunne, and A\. Krause, “Multi\-scale representation learning on proteins,”*Advances in Neural Information Processing Systems*, vol\. 34, pp\. 25 244–25 255, 2021\.
- \[46\]Z\. Liu, M\. Su, L\. Han, J\. Liu, Q\. Yang, Y\. Li, and R\. Wang, “Forging the basis for developing protein–ligand interaction scoring functions,”*Accounts of Chemical Research*, vol\. 50, no\. 2, pp\. 302–309, 2017\.
- \[47\]M\. M\. Mysinger, M\. Carchia, J\. J\. Irwin, and B\. K\. Shoichet, “Directory of useful decoys, enhanced \(dud\-e\): better ligands and decoys for better benchmarking,”*Journal of Medicinal Chemistry*, vol\. 55, no\. 14, pp\. 6582–6594, 2012\.
- \[48\]V\.\-K\. Tran\-Nguyen, C\. Jacquemard, and D\. Rognan, “Lit\-pcba: an unbiased data set for machine learning and virtual screening,”*Journal of Chemical Information and Modeling*, vol\. 60, no\. 9, pp\. 4263–4273, 2020\.
- \[49\]J\. J\. Irwin and B\. K\. Shoichet, “Zinc\- a free database of commercially available compounds for virtual screening,”*Journal of Chemical Information and Modeling*, vol\. 45, no\. 1, pp\. 177–182, 2005\.
- \[50\]Z\. Yang, W\. Zhong, L\. Zhao, and C\. Y\.\-C\. Chen, “Mgraphdta: deep multiscale graph neural network for explainable drug–target binding affinity prediction,”*Chemical Science*, vol\. 13, no\. 3, pp\. 816–833, 2022\.
- \[51\]J\. Lim, S\. Ryu, K\. Park, Y\. J\. Choe, J\. Ham, and W\. Y\. Kim, “Predicting drug–target interaction using a novel graph neural network with 3d structure\-embedded graph representation,”*Journal of Chemical Information and Modeling*, vol\. 59, no\. 9, pp\. 3981–3988, 2019\.
- \[52\]K\. T\. Schütt, H\. E\. Sauceda, P\.\-J\. Kindermans, A\. Tkatchenko, and K\.\-R\. Müller, “Schnet–a deep learning architecture for molecules and materials,”*The Journal of Chemical Physics*, vol\. 148, no\. 24, 2018\.
- \[53\]V\. G\. Satorras, E\. Hoogeboom, and M\. Welling, “E \(n\) equivariant graph neural networks,” in*International Conference on Machine Learning*\. PMLR, 2021, pp\. 9323–9332\.
- \[54\]D\. Jiang, Z\. Ye, C\.\-Y\. Hsieh, Z\. Yang, X\. Zhang, Y\. Kang, H\. Du, Z\. Wu, J\. Wang, Y\. Zeng*et al\.*, “Metalprognet: a structure\-based deep graph model for metalloprotein–ligand interaction predictions,”*Chemical Science*, vol\. 14, no\. 8, pp\. 2054–2069, 2023\.
- \[55\]S\. Zhang, Y\. Jin, T\. Liu, Q\. Wang, Z\. Zhang, S\. Zhao, and B\. Shan, “Ss\-gnn: a simple\-structured graph neural network for affinity prediction,”*ACS Omega*, vol\. 8, no\. 25, pp\. 22 496–22 507, 2023\.
- \[56\]L\. Wang, H\. Liu, Y\. Liu, J\. Kurtin, and S\. Ji, “Learning hierarchical protein representations via complete 3d graph networks,”*arXiv preprint arXiv:2207\.12600*, 2022\.
- \[57\]S\. Feng, M\. Li, Y\. Jia, W\. Ma, and Y\. Lan, “Protein\-ligand binding representation learning from fine\-grained interactions,”*arXiv preprint arXiv:2311\.16160*, 2023\.
- \[58\]P\. Gainza, F\. Sverrisson, F\. Monti, E\. Rodola, D\. Boscaini, M\. M\. Bronstein, and B\. E\. Correia, “Deciphering interaction fingerprints from protein molecular surfaces using geometric deep learning,”*Nature Methods*, vol\. 17, no\. 2, pp\. 184–192, 2020\.
- \[59\]P\. J\. Ballester and J\. B\. Mitchell, “A machine learning approach to predicting protein–ligand binding affinity with applications to molecular docking,”*Bioinformatics*, vol\. 26, no\. 9, pp\. 1169–1175, 2010\.
- \[60\]R\. A\. Friesner, J\. L\. Banks, R\. B\. Murphy, T\. A\. Halgren, J\. J\. Klicic, D\. T\. Mainz, M\. P\. Repasky, E\. H\. Knoll, M\. Shelley, J\. K\. Perry*et al\.*, “Glide: a new approach for rapid, accurate docking and scoring\. 1\. method and assessment of docking accuracy,”*Journal of Medicinal Chemistry*, vol\. 47, no\. 7, pp\. 1739–1749, 2004\.
- \[61\]G\. Marcou and D\. Rognan, “Optimizing fragment and scaffold docking by use of molecular interaction fingerprints,”*Journal of Chemical Information and Modeling*, vol\. 47, no\. 1, pp\. 195–207, 2007\.
- \[62\]J\. Desaphy, E\. Raimbaud, P\. Ducrot, and D\. Rognan, “Encoding protein–ligand interaction patterns in fingerprints and graphs,”*Journal of Chemical Information and Modeling*, vol\. 53, no\. 3, pp\. 623–637, 2013\.
- \[63\]A\. N\. Jain, “Surflex\-dock 2\.1: robust performance from ligand energetic modeling, ring flexibility, and knowledge\-based search,”*Journal of Computer\-Aided Molecular Design*, vol\. 21, no\. 5, pp\. 281–306, 2007\.
- \[64\]M\. Wójcikowski, P\. J\. Ballester, and P\. Siedlecki, “Performance of machine\-learning scoring functions in structure\-based virtual screening,”*Scientific Reports*, vol\. 7, no\. 1, p\. 46710, 2017\.

Similar Articles

Co-folding model guided by structural proteomics

arXiv cs.LG

Introduces AIMS-Fold, an inference-time guided-diffusion framework that integrates cross-linking mass spectrometry (XL-MS) and hydrogen-deuterium exchange (HDX-MS) data to improve protein co-folding predictions for induced proximity drug targets.