OneRank: Unified Transformer-Native Ranking Architecture for Multi-Task Recommendation

Hugging Face Daily Papers 06/15/26, 12:00 AM Papers

Summary

OneRank proposes a Transformer-native multi-task ranking framework that integrates feature encoding and prediction to reduce inter-task interference and improve ranking performance in recommender systems.

Multi-task learning (MTL) is essential in recommender systems to enable complementary learning among diverse user feedback. While modern industrial practices have shifted from DNNs to Transformer-centric architectures to strengthen sequence modeling and scaling capacity, they still decouple feature encoding from multi-task prediction, treating the Transformer as a task-agnostic encoder. This design fundamentally limits the performance and scalability by (1) creating an information bottleneck under heterogeneous task objectives, (2) inducing gradient interference that leads to the seesaw phenomenon, and (3) forcing a dataflow transition in which attention-based, context-adaptive representation learning is converted to static feed-forward task prediction with incompatible information read-write dynamics. We propose OneRank, a Transformer-native multi-task ranking framework that eliminates encoder-predictor separation and introduces task-private channels for forward representation learning and backward optimization, enabling task-specialized learning while reducing inter-task interference. In the forward pass, OneRank learns task-specific representations bottom-up through task-conditioned information selection, candidate-aware contextualization, and controlled cross-task interaction. In the backward pass, cross-task gradient detachment isolates task-private parameter updates from shared knowledge extraction modules, preventing negative transfer. We further replace static task-specific MLP scorers with dynamic matching-based scoring for context-aware personalized ranking. By internalizing multi-task reasoning within the Transformer stack, OneRank establishes a unified and scalable architectural paradigm. Offline and online experiments on large-scale industrial datasets show that OneRank significantly outperforms state-of-the-art baselines while maintaining computational efficiency.
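The abstract does not spell out how cross-task gradient detachment is wired, so the following is only a toy sketch of one possible reading: task B reads task A's private representation in the forward pass, but task B's loss is blocked from updating task A's private weight in the backward pass. The scalar model, the hand-derived gradients, and every name here (`task_b_grads`, `a`, `b`, `c`, `detach_cross_task`) are illustrative assumptions, not the paper's implementation.

```python
# Toy illustration of cross-task gradient detachment (stop-gradient).
# Hypothetical scalar model; names and structure are assumptions, not OneRank's code.

def task_b_grads(x, y_b, a, b, c, detach_cross_task):
    """Return gradients of task B's squared error w.r.t. (a, b, c).

    a: task A's private weight, producing h_a = a * x
    b: task B's private weight
    c: weight with which task B reads task A's representation
    """
    h_a = a * x                # task A's private representation
    h_b = b * x + c * h_a      # task B reads h_a (cross-task interaction)
    err = h_b - y_b
    d_loss_d_hb = 2.0 * err    # derivative of (h_b - y_b)**2 w.r.t. h_b

    grad_b = d_loss_d_hb * x
    grad_c = d_loss_d_hb * h_a
    # With detachment, h_a is treated as a constant in the backward pass,
    # so task B's loss sends no gradient into task A's private weight.
    grad_a = 0.0 if detach_cross_task else d_loss_d_hb * c * x
    return grad_a, grad_b, grad_c


inputs = dict(x=1.5, y_b=1.0, a=0.8, b=0.4, c=0.5)
coupled = task_b_grads(**inputs, detach_cross_task=False)
isolated = task_b_grads(**inputs, detach_cross_task=True)
print("without detachment (grad_a, grad_b, grad_c):", coupled)
print("with detachment    (grad_a, grad_b, grad_c):", isolated)
```

The forward computation is identical in both calls; only the gradient reaching `a` changes. In an autodiff framework the same effect is usually obtained with a stop-gradient operation on the tensor being read rather than by hand-written derivatives.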


Cached at: 06/16/26, 11:33 AM


Source: https://huggingface.co/papers/2606.16838




Similar Articles

Expand More, Shrink Less: Shaping Effective-Rank Dynamics for Dense Scaling in Recommendation

Response-free item difficulty modelling for multiple-choice items with fine-tuned transformers: Component-wise representation and multi-task learning

Breaking the Filter Bubble: A Semantic Pareto-DQN Framework for Multi-Objective Recommendation

Toward Native Multimodal Modeling: A Roadmap

Multimodal Embedding & Reranker Models with Sentence Transformers
