open-weight-models

#open-weight-models

@rohanpaul_ai: FT: Trump will never support a US AI regulator, says outgoing adviser Sriram Krishnan. The policy is selective pressure…

X AI KOLs Timeline · 4h ago

An FT article reports that outgoing Trump adviser Sriram Krishnan says Trump will not support a US AI regulator, advocating instead for selective pressure on cyber risk managed by companies and agencies. Krishnan also expresses concern that there is no leading American open-weight model compared to Chinese ones.

Do you agree with Palantir CEO Alex Karp that the enterprise "tokenmaxxing" business model has "gone completely wrong" with minimal ROI? Will open-weight models inevitably win?

Reddit r/artificial · 8h ago

Palantir CEO Alex Karp criticized the API token pricing model of commercial AI labs like OpenAI and Anthropic, arguing it offers minimal ROI and that open-weight models are winning as enterprises seek control over their data and compute.

What does "Safe AI" look like? [D]

Reddit r/MachineLearning · yesterday

The author raises questions about the practicality of studying defenses against post-release fine-tuning that weakens safety behaviors in open-weight LLMs, and asks whether current safety training is worth the effort if models can be broken quickly.

Researchers Build Self-Replicating AI Worm That Operates Entirely on Local, Open-Weight Models

Reddit r/LocalLLaMA · yesterday

University of Toronto researchers developed a proof-of-concept AI worm that uses a local open-weight LLM to autonomously reason about network vulnerabilities, generate tailored exploits, and replicate across hosts without human intervention, achieving 62% network infection in controlled tests.

@0xSero: Are you worried about your right to access intelligence? - local ai - open weight models - access to US frontier All th…

X AI KOLs Following · yesterday

A tweet warns that the right to access intelligence is at risk because of efforts to ban certain AI labs and open-weight models, with Anthropic allegedly aiming to be the sole player.

@rohanpaul_ai: Coinbase CEO Brian Armstrong said Coinbase is experimenting with defaulting to Chinese open-weight models such as GLM 5…

X AI KOLs Timeline · 4d ago

Coinbase CEO Brian Armstrong announced the company is experimenting with using Chinese open-weight AI models like GLM 5.2 and Kimi 2.7 for its LLM gateway, routing prompts by difficulty, suggesting that frontier models may be overkill for execution tasks.

@cline: We’ve been impressed with GLM-5.2 and so are introducing a $9.99/month subscription to give you 2-5x discounted access …

X AI KOLs Following · 4d ago

Cline announces a $9.99/month subscription offering discounted access to GLM-5.2 and other open-weight models, with a $1.99 special promo for new users on Cline CLI and IDE.

@FinanceYF5: Source:

X AI KOLs Following · 5d ago

OpenRouter announces that four open-weight models are now powering real agentic pipelines, with a new blog post detailing why companies are choosing them as of June.

@FinanceYF5: Read the article: https://openrouter.ai/blog/insights/the-open-weight-models-that-matter-june-2026/…

X AI KOLs Following · 5d ago

The article highlights the growing importance of open-weight AI models as of June 2026, with DeepSeek V4 Flash emerging as a cost-effective, high-performance model that rivals frontier models like GPT-5.5 for agentic tasks.

@FinanceYF5: Four open-weight models have entered a stage where they can support real agent workflows. OpenRouter published a new article on the Insights blog discussing why the company chose these models in June:

X AI KOLs Following · 5d ago

In a post on its Insights blog, OpenRouter says that four open-weight models can now support real agent workflows and explains why the company chose these models in June.

@rasbt: I put together a new article on setting up local coding agents with open-weight models. Everything runs 100% locally. I…

X AI KOLs Timeline · 6d ago

Sebastian Raschka shares a new tutorial on setting up fully local coding agents using open-weight LLMs, including a walkthrough and assessment checklist for choosing models.

Bio-preparedness in 2026

Reddit r/singularity · 2026-06-26

This article warns that current and upcoming AI models significantly lower the barrier to creating bioweapons, citing distillation attacks on open-weight models and the inability to prevent safety ablation. It calls for public funding of broad-spectrum countermeasures as a necessary response.

Why current LLM costs are not sustainable

Hacker News Top · 2026-06-26

The article argues that current high LLM pricing is unsustainable due to diminishing performance gains, the rise of open-weight models, specialized AI chips reducing inference costs, and zero switching costs, predicting significant price drops as competition intensifies.

The Unbearable Cheapness of Open Weight Models

Hacker News Top · 2026-06-25

The article examines the dramatic cost difference between open-weight models like DeepSeek V4 and closed models from Anthropic and OpenAI, arguing that the latter sustain high prices through artificial scarcity and branding rather than technical superiority.

What a model reads beforehand changes how it answers later - and you can see it in the hidden states

Reddit r/artificial · 2026-06-23

This post reports an observation that reading a long, structured text before answering alters a model's later responses, with behavioral evidence from Claude and mechanistic analysis on open-weight Gemma models showing separable hidden states and sharper probability distributions in instruction-tuned variants.

@MaximeRivest: 2 years ago sonnet 3.5 was released and it triggered the viral adoption of Cursor. Look at all these open weight models…

X AI KOLs Following · 2026-06-18

Two years after the release of Sonnet 3.5 sparked Cursor's viral adoption, open-weight models that run on consumer hardware now surpass it. The post frames this as a pivotal moment for open-source AI.

@haider1: GLM 5.2 feels like the opus 4.5 moment for open-weight models what genuinely impressed me was during long, multi-step a…

X AI KOLs Following · 2026-06-17

GLM 5.2 marks a significant milestone for open-weight models, demonstrating strong context retention across long multi-step tasks and more reliable tool calling.

GeoNatureAgent Benchmark: Benchmarking LLM Agents for Environmental Geospatial Analysis Across Frontier and Open-Weight Foundation Models

arXiv cs.AI · 2026-06-12

The paper introduces GeoNatureAgent Benchmark, the first benchmark for evaluating LLM agents on environmental geospatial analysis tasks via structured tool calls. It evaluates seven models on 93 tasks across 18 categories and finds that Claude Sonnet 4 achieves the highest accuracy at 60.8%, while open-weight models like DeepSeek V3.2 offer strong cost-performance tradeoffs.

Mythos-class models will diffuse throughout the world by 2029 (7 minute read)

TLDR AI · 2026-06-12

Saagar Pateder analyzes the diminishing marginal returns of AI intelligence for consumer and enterprise tasks, and predicts that open-weight models will diffuse globally by 2029, based on historical trends in model performance and cost.

ERRORQUAKE: Heavy-Tailed Error Severity Distributions in Open-Weight Large Language Models

arXiv cs.LG · 2026-06-05

The paper introduces Errorquake-10k, a benchmark for evaluating error severity in open-weight LLMs, showing that models with matched accuracy can have vastly different error severity distributions, and argues that severity should be reported alongside accuracy.
