protein-language-models

#protein-language-models

Two-Stage Fine-Tuning for Protein Sequence Generation with Targeted Amino-Acid Composition

arXiv cs.LG ↗ · yesterday Cached

This paper proposes a two-stage fine-tuning pipeline combining domain-adaptive fine-tuning and reinforcement learning to generate protein sequences that match a desired amino-acid composition profile while maintaining sequence quality.

0 favorites 0 likes

#protein-language-models

Pepti-drift: Toxicity-Repulsive Drifting for Antigen-Conditioned Discrete Peptide Generation

arXiv cs.LG ↗ · yesterday Cached

Pepti-drift is a toxicity-aware latent refinement framework for generating antigen-specific peptides that avoids toxicity while maintaining binding affinity. It achieves large speedups over existing methods and produces diverse, valid, and low-toxicity peptide candidates.

0 favorites 0 likes

#protein-language-models

PROTOCOL: Late Interaction Retrieval for Protein Homolog Search

arXiv cs.LG ↗ · 2026-05-29 Cached

ProtoCol applies late-interaction retrieval to protein homology search, representing proteins as sets of residue embeddings and using MaxSim for scoring, outperforming pooled and alignment-based methods on remote homology benchmarks.

0 favorites 0 likes

#protein-language-models

Structural Interpretations of Protein Language Model Representations via Differentiable Graph Partitioning

arXiv cs.LG ↗ · 2026-05-13 Cached

This paper proposes SoftBlobGIN, a framework that enhances the interpretability of protein language model representations by projecting them onto contact graphs for structure-aware message passing. It demonstrates improved performance on enzyme classification and binding-site detection while providing auditable structural explanations.

0 favorites 0 likes

#protein-language-models

Conditional generation of antibody sequences with classifier-guided germline-absorbing discrete diffusion

arXiv cs.LG ↗ · 2026-05-11 Cached

This paper introduces a discrete diffusion model with a novel 'germline absorbing' modification to improve conditional antibody sequence generation. It addresses germline bias in protein language models and demonstrates superior performance in optimizing antibody binding affinity and developability compared to existing methods like EvoProtGrad.

0 favorites 0 likes

protein-language-models

Two-Stage Fine-Tuning for Protein Sequence Generation with Targeted Amino-Acid Composition

Pepti-drift: Toxicity-Repulsive Drifting for Antigen-Conditioned Discrete Peptide Generation

PROTOCOL: Late Interaction Retrieval for Protein Homolog Search

Structural Interpretations of Protein Language Model Representations via Differentiable Graph Partitioning

Conditional generation of antibody sequences with classifier-guided germline-absorbing discrete diffusion

Submit Feedback