zero-shot-learning

Tag

Cards List
#zero-shot-learning

Zero-Shot Learning in Industrial Scenarios: New Large-Scale Benchmark, Challenges and Baseline

arXiv cs.AI · 2026-06-09 Cached

This paper proposes a large-scale multi-modal dataset (MMIO) for zero-shot industrial defect detection and introduces the Refined Text-Visual Prompt (RTVP) method, achieving state-of-the-art results on the benchmark.

0 favorites 0 likes
#zero-shot-learning

GlucoFM-Bench: Benchmarking Time-Series Foundation Models for Blood Glucose Forecasting

arXiv cs.LG · 2026-06-08 Cached

GlucoFM-Bench evaluates time-series foundation models for blood glucose forecasting across 15 datasets, showing strong zero-shot/few-shot transfer by Chronos-2 and TimesFM but superior performance of a lightweight LSTM when full training data is available.

0 favorites 0 likes
#zero-shot-learning

Long Live Fine-Tuning: Task-Specific Transformers Outperform Zero-Shot LLMs for Misinformation Response Classification on Reddit

arXiv cs.CL · 2026-06-04 Cached

Researchers from University of Technology Sydney compare fine-tuned transformers (DistilBERT, RoBERTa) against zero-shot LLMs (Llama variants, Claude, Gemini) for classifying misinformation responses on Reddit, finding that fine-tuned RoBERTa achieves 0.62 macro-F1 versus 0.50 for the best zero-shot model. The study shows that task-specific fine-tuning outperforms larger generalist models, particularly for detecting belief propagation, and that safety-alignment artifacts in frontier models can degrade performance.

0 favorites 0 likes
#zero-shot-learning

RESCAST-100K: A Comprehensive Dataset for Cross-Domain Residential Load and Indoor Temperature Forecasting

arXiv cs.LG · 2026-06-03 Cached

Introduces RESCAST-100K, a large-scale benchmark dataset for cross-domain residential load and indoor temperature forecasting, featuring simulated and real data to evaluate transfer learning, domain adaptation, and zero-shot generalization.

0 favorites 0 likes
#zero-shot-learning

From Data to Insights: Exploring Program-of-Thoughts Prompting for Chart Summarization

arXiv cs.CL · 2026-05-29 Cached

This paper introduces a zero-shot strategy for chart summarization using Program-of-Thoughts prompting, where lightweight visual language models (VLMs) generate Python programs to compute statistics, improving factual accuracy over existing methods.

0 favorites 0 likes
#zero-shot-learning

AnyMo: Geometry-Aware Setup-Agnostic Modeling of Human Motion in the Wild

Hugging Face Daily Papers · 2026-05-21 Cached

AnyMo is a geometry-aware framework for setup-agnostic human motion modeling using physics-grounded IMU simulation and graph encoding, achieving significant improvements in zero-shot activity recognition, cross-modal retrieval, and motion captioning across multiple datasets.

0 favorites 0 likes
#zero-shot-learning

Predicting Psychological Well-Being from Spontaneous Speech using LLMs

arXiv cs.CL · 2026-05-13 Cached

This academic paper investigates using LLMs for zero-shot prediction of psychological well-being scores from spontaneous speech, evaluating 12 models and achieving high correlation with clinical metrics.

0 favorites 0 likes
#zero-shot-learning

A Foundation Model for Zero-Shot Logical Rule Induction

Hugging Face Daily Papers · 2026-05-06 Cached

This paper introduces the Neural Rule Inducer (NRI), a foundation model for zero-shot logical rule induction that uses domain-agnostic statistical properties to generalize across tasks without retraining.

0 favorites 0 likes
#zero-shot-learning

DiZiNER: Disagreement-guided Instruction Refinement via Pilot Annotation Simulation for Zero-shot Named Entity Recognition

arXiv cs.CL · 2026-04-20 Cached

DiZiNER is a framework that uses disagreement between multiple LLMs to refine task instructions for zero-shot named entity recognition, achieving state-of-the-art results on 14 out of 18 benchmarks and significantly reducing the performance gap between zero-shot and supervised systems.

0 favorites 0 likes
#zero-shot-learning

Zero-shot World Models Are Developmentally Efficient Learners [R]

Reddit r/MachineLearning · 2026-04-18

Researchers introduce Zero-shot World Models (ZWM), an approach that achieves visual competence comparable to state-of-the-art models while trained on minimal data (single child's visual experience) without task-specific training. This work demonstrates a path toward more data-efficient AI systems that match human developmental learning efficiency.

0 favorites 0 likes
← Back to home

Submit Feedback