distilbert

#distilbert

Cognitive-Linguistic Indicators of Depression in Online Communities: Analysed by DistilBERT and Holographic Reduced Representation

arXiv cs.CL ↗ · 2026-06-02 Cached

This paper presents a hybrid model combining DistilBERT embeddings with Holographic Reduced Representation vectors encoding cognitive-linguistic features (first-person pronouns, absolutist words, negative emotion ratios) to detect depression in Reddit posts, achieving a macro F1 of 0.94 and demonstrating that theory-driven features complement contextual embeddings for explainable mental health NLP.

0 favorites 0 likes

#distilbert

trained a prompt injection detector using ml-intern and DeepSeek v4 Flash, runs in the browser

Reddit r/LocalLLaMA ↗ · 2026-05-22

Trained a prompt injection classifier using ml-intern and DeepSeek V4 Flash, achieving 99% F1 with DistilBERT, optimized to ONNX int8 (~65MB) and deployable in the browser via Transformers.js v3.

0 favorites 0 likes

#distilbert

Switchcraft: AI Model Router for Agentic Tool Calling

arXiv cs.AI ↗ · 2026-05-11 Cached

This paper introduces Switchcraft, the first AI model router specifically optimized for agentic tool calling to reduce inference costs. By using a lightweight DistilBERT classifier, it achieves significant cost savings while maintaining high accuracy in tool-use tasks.

0 favorites 0 likes

#distilbert

Applied Explainability for Large Language Models: A Comparative Study

arXiv cs.CL ↗ · 2026-04-20 Cached

A comparative study evaluating three explainability techniques (Integrated Gradients, Attention Rollout, SHAP) on fine-tuned DistilBERT for sentiment classification, highlighting trade-offs between gradient-based, attention-based, and model-agnostic approaches for LLM interpretability.

0 favorites 0 likes

distilbert

Cognitive-Linguistic Indicators of Depression in Online Communities: Analysed by DistilBERT and Holographic Reduced Representation

trained a prompt injection detector using ml-intern and DeepSeek v4 Flash, runs in the browser

Switchcraft: AI Model Router for Agentic Tool Calling

Applied Explainability for Large Language Models: A Comparative Study

Submit Feedback