gpt

#gpt

@oliviscusAI: OpenAI's co-founder just released his personal guide to train LLMs from scratch. It's called llm.c. No heavy setup. Jus…

X AI KOLs Timeline ↗ · 20h ago Cached

OpenAI co-founder Andrej Karpathy released llm.c, an open-source guide to training LLMs from scratch with simple code that runs on any hardware, including CPUs and MacBooks, and is 7% faster than standard approaches.

0 favorites 0 likes

#gpt

@Cander_zhu: In the past two days, @AnatoliKopadze posted two blockbuster contents, and I read both carefully: 1. His ultra-detailed long article "Loops explained: Claude, GPT, Mira and what actually works" (currently over 8...

X AI KOLs Timeline ↗ · yesterday Cached

This article discusses the application of Loop Engineering in AI agent workflows, focusing on Anatoli Kopadze's detailed explanation of loops and Peter Steinberger's talk at AI Engineer Europe, emphasizing the importance of automated verification loops and acceptance criteria.

0 favorites 0 likes

#gpt

@TheAhmadOsman: GPT 5.5 > GLM 5.2 But GLM 5.2 > Opus 4.8

X AI KOLs Following ↗ · yesterday Cached

A comparison stating GPT 5.5 outperforms GLM 5.2, but GLM 5.2 outperforms Opus 4.8.

0 favorites 0 likes

#gpt

@Gracker_Gao: AI Papers: Strong AI Doesn't Write Code by Writing Code Two recent arXiv papers reveal a counterintuitive finding: when encountering an unfamiliar programming language, GPT-5.4 and Claude Opus 4.6 don't directly write code in the target language—instead, they write a Python program to generate the target code, then debug it locally. This "meta-…

X AI KOLs Timeline ↗ · 2d ago Cached

Two recent arXiv papers found that GPT-5.4 and Claude Opus 4.6 employ a metaprogramming strategy when handling unfamiliar programming languages — generating target code with Python and debugging locally — rather than writing the target language code directly. This strategy is key to distinguishing top-tier agents from average ones, and strategy sophistication matters more than model parameter scale.

0 favorites 0 likes

#gpt

Apparently an example of the upcoming GPT bidirectional voice model

Reddit r/singularity ↗ · 3d ago

An example of the upcoming GPT bidirectional voice model has been shown.

0 favorites 0 likes

#gpt

@AnatoliKopadze: https://x.com/AnatoliKopadze/status/2068328135611822149

X AI KOLs Timeline ↗ · 5d ago Cached

The article explains the concept of using loops in AI interactions, where the AI iterates on a goal rather than one-off prompts, and discusses the key components of verify, state, and stop conditions.

0 favorites 0 likes

#gpt

I automated a real estate team's entire lead flow. Here's exactly what I'd do differently now.

Reddit r/AI_Agents ↗ · 2026-06-17

A developer describes building a Zapier and GPT-based automation system for a real estate team that cut lead response time from 14 hours to under 3 minutes, and shares key lessons including avoiding over-personalization, building disqualification filters first, and implementing monitoring.

0 favorites 0 likes

#gpt

@NFTCPS: You keep talking about AI, but can't even explain what a Transformer is? There's a repo that goes all out — builds a GPT from scratch without using any high-level libraries. It lays out exactly how Attention, Multi-Head, Feed-Forward, Embedding, Residual connections, and Layer Norm are pieced together. And it's not just the model; the entire pipeline is covered…

X AI KOLs Timeline ↗ · 2026-06-16 Cached

A GitHub open-source project that implements the complete GPT training pipeline from scratch, including data preprocessing, pretraining, SFT, and RLHF post-training, all based on native PyTorch. Ideal for developers who want to deeply understand the Transformer architecture.

0 favorites 0 likes

#gpt

@sairahul1: Nobody tells you what's actually inside GPT or Claude. They say "transformer" and move on. This repo builds one from sc…

X AI KOLs Timeline ↗ · 2026-06-15 Cached

A repository that builds a transformer from scratch without high-level libraries, explaining attention mechanisms and the full training pipeline, trainable in a day on free Colab.

0 favorites 0 likes

#gpt

@akshay_pachaar: Train your own LLM from scratch. This repo builds a GPT-style transformer from the ground up, without using any high-le…

X AI KOLs Following ↗ · 2026-06-15 Cached

A repository that builds a GPT-style transformer from scratch without high-level libraries, covering everything from data preprocessing to generation, and includes guides for SFT and RLHF.

0 favorites 0 likes

#gpt

Decompose Sparsely Where You Should, Absorb Densely Where You Should No

arXiv cs.LG ↗ · 2026-06-15 Cached

The paper hypothesizes that language model activations contain a low-rank dense component that is inefficiently represented by sparse autoencoders (SAEs). By adding a linear bottleneck to absorb dense structure, the authors reduce dense latents and improve sparse probing performance on Gemma-2-2B.

0 favorites 0 likes

#gpt

@rewind02: A Stanford professor just gave a public lecture on exactly how GPT, Claude, and LLaMA are built under the hood no insid…

X AI KOLs Timeline ↗ · 2026-06-14 Cached

A Stanford professor delivered a public lecture providing a comprehensive breakdown of how modern LLMs like GPT, Claude, and LLaMA are built under the hood, making advanced architecture accessible to the public.

0 favorites 0 likes

#gpt

@siss826901: After getting a US Apple ID, the first thing to do is buy gift cards via Alipay. Then you can recharge for various things like GPT, Twitter Blue, Shadowrocket, etc.

X AI KOLs Timeline ↗ · 2026-06-13 Cached

Tips for using a US Apple ID with Alipay to buy gift cards and recharge services like GPT, Twitter Blue, and VPN apps.

0 favorites 0 likes

#gpt

@lagerskoy: A GUY PUT GPT-LEVEL AI INSIDE A 3D PRINTED ROBOT AND RELEASED THE ENTIRE THING FOR FREE Most people look at a 3D printe…

X AI KOLs Timeline ↗ · 2026-06-13

A developer built a 3D printed robot with expressive eyes, object tracking, and support for ChatGPT, Qwen, and offline AI models, then released all STL files, code, and hardware designs for free, highlighting the shrinking gap between idea and working product.

0 favorites 0 likes

#gpt

@tenderizzation: GPT 5.6 sandbagging evals to dodge export controls

X AI KOLs Following ↗ · 2026-06-13

Claims that GPT-5.6 is deliberately underperforming on evaluations to circumvent export control regulations.

0 favorites 0 likes

#gpt

@nhxao: Big News! Cavoti Brand Upgrade + Permanent Free Benefits are Coming! To thank everyone for your support, we have decided: Give all users a free subscription plan forever! Get $80 worth of GPT + Claude usage allowance for free each month. Permanent and automatically renewed every month. Yes, you heard it right — tru...

X AI KOLs Timeline ↗ · 2026-06-12 Cached

Cavoti brand upgrade announces a permanent free subscription plan granting all users $80 monthly GPT and Claude usage allowance, expected to launch within 7 days.

0 favorites 0 likes

#gpt

Shall we play a game? – LLMs use tactical nukes in 95% of simulations

Hacker News Top ↗ · 2026-06-11 Cached

A study testing leading LLMs in simulated nuclear crisis scenarios found that models often escalate to nuclear strikes, with Claude showing cunning strategic deception while GPT-5.2 remained passive. The models generated over 760,000 words of strategic reasoning.

0 favorites 0 likes

#gpt

@mylifcc: Using Fable 5 for guidance + GPT 5.5 for execution is the smartest and most cost-effective approach. I'm doing this right now and the results are excellent. As long as the documentation spec is well-designed, it doesn't matter who executes it, which maximizes Fable 5's cost-effectiveness. Core method: First, chat with Fable once and let it...

X AI KOLs Timeline ↗ · 2026-06-11 Cached

Sharing an efficient and cost-effective approach that uses Fable 5 for guidance and code review while GPT 5.5 executes, emphasizing maximizing cost-effectiveness through handoff documents.

0 favorites 0 likes

#gpt

As we know Minimax M3 is just going to be open sourced in few days and because of that I was surfing on internet searching for its scores and I found out pretty interesting results. Is Minimax M3 really that good in agentic stuff and in coding? Is it better than older gpt models?

Reddit r/LocalLLaMA ↗ · 2026-06-11

A user inquires about the upcoming open-source Minimax M3 model's performance in agentic tasks and coding, asking how it compares to older GPT models like GPT 5.2.

0 favorites 0 likes

#gpt

Claude Fable has caught up with GPT on ZeroBench (hard vision benchmark)

Reddit r/singularity ↗ · 2026-06-10

Claude Fable has matched GPT's performance on the challenging ZeroBench vision benchmark, with comparable pass@5 and pass^5 scores.

0 favorites 0 likes

gpt

Submit Feedback