multi-task

Ultralytics YOLO26: Unified Real-Time End-to-End Vision Models

Hugging Face Daily Papers · 2d ago

Ultralytics YOLO26 introduces a unified real-time vision model family with NMS-free inference, improved training strategies, and multi-task capabilities for detection, segmentation, and pose estimation, achieving state-of-the-art accuracy-latency trade-offs.

Model Merging by Output-Space Projection

arXiv cs.LG · 6d ago

This paper presents a new framework for model merging that casts the problem as a convex quadratic program over residual updates, minimizing a squared-output calibration objective. It subsumes existing heuristic methods and provides a closed-form diagnostic to predict merge quality, showing consistent gains on language and vision benchmarks.

Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

Hugging Face Daily Papers · 2026-05-28

Qwen-VLA is a unified vision-language-action model for embodied decision-making, integrating manipulation, navigation, and trajectory prediction across different robot platforms. It uses a DiT-based action decoder and embodiment-aware prompt conditioning, achieving strong performance and out-of-distribution generalization.

AsyncTool: Evaluating the Asynchronous Function Calling Capability under Multi-Task Scenarios

Hugging Face Daily Papers · 2026-05-27

This paper introduces AsyncTool, a benchmark for evaluating LLM-based agents' asynchronous function calling abilities in multi-task scenarios with delayed tool responses. It proposes efficiency-oriented metrics and identifies key failure modes of current tool-using agents.

First time fine-tuning, need a sanity check — 3B or 7B for multi-task reasoning? [D]

Reddit r/MachineLearning · 2026-04-23

A self-taught developer asks for advice on choosing between 3B and 7B models for a first multi-task fine-tuning project focused on deeper reasoning about underlying questions.
