Tag
A developer shares a workflow using Claude Fable 5 as architect and GPT 5.5 Codex as builder, with a handoff memory system to manage AI-assisted development efficiently.
Developer @charliermarsh reports reducing ty's retained memory by another 15% using Codex, achieving a total ~25% memory reduction via /goal, and expresses enthusiasm for GPT models.
An open-source project named GordenSuperPPTSkills uses GPT to generate luxurious image-format PPTs, then 'decompiles' them into fully editable PPTX files, solving the pain point of AI-generated content being non-editable. The project is split into three independent skills, supporting either just image generation or just image-to-editable conversion, suitable for Codex environment.
This paper introduces Multi-Resolution Residual Routing (WAV v1), an extension of Block Attention Residuals that augments block representations with directional detail bases, improving deep decoder-only Transformer training.
A user reports that using a GPT model (possibly GPT-5.5) for a spreadsheet task cost $10 in heavily subsidized tokens, with actual compute cost estimated at $100, arguing that current AI pricing is unsustainable.
A minimal, hackable CUDA implementation of a GPT-like transformer language model that processes byte sequences, with sample outputs and build instructions.
A beginner-friendly, hands-on GitHub repository that breaks down GPT-like LLM architecture into simple parts, with 10 Jupyter notebooks covering tokenization, attention, transformer blocks, and a mini GPT implementation in PyTorch.
This paper describes the development of an LLM-based tool using OpenAI's GPT models to evaluate approximately 1,200 Statements of Purpose for Purdue's SURF program, processing them in 4.6 hours and accelerating the review process compared to traditional human grading.
Morgan Linton successfully boots a custom operating system built from scratch using OpenAI's Codex and GPT 5.5, a project started on May 4th.
TaskGPT is a voice agent for MacOS, enabling voice-based interactions with AI.
The DeepSWE benchmark costs are per task, not per total run. Running models like Mimo V2.5 Pro can cost ~$225 for a full run, while Mimo V2.5 non-pro costs ~$7.15. Users should be aware of this before running expensive models.
A GPT model discovered a bug in mlx-c 0.31.2, and the author is considering submitting a pull request without personally verifying the issue.
Discussion of recent benchmark results for Claude Opus 4.8 and GPT-5.5 on DeepSWE Bench, with speculation about future GPT-5.6 performance and efficiency trends.
GordenSun released an open-source project called GordenPPTSkill, which allows AI assistants (such as GPT, DeepSeek, Claude, etc.) to generate editable PPT files with high information density and complex layouts in one sentence. It supports automatic template updates and is suitable for use in state-owned enterprises and major internet companies.
An analysis reveals that 28.9% of GPT 5.5's failures on SWEBench Pro are due to broken or incorrect test cases, and similar issues affect other major AI benchmarks, raising concerns about the accuracy of current evaluation methods.
The author recounts building an AI agent using GPT-5.5 Codex and Meta Ads MCP that automates the entire media buying process, from research and copywriting to ad deployment and reporting, replacing the role of a traditional media buyer for less than $20/month.
A 12-chapter interactive textbook that teaches how to build a GPT-like language model from absolute scratch, with fully annotated code and beginner-friendly explanations.
Introduces BenchBench, a benchmark that tests AI models' ability to create effective benchmarks for other models, with GPT 5.2 being the only successful winner so far while frontier models like GPT 5.5 and Opus 4.6 struggled.
At the Physical AI Hackathon, a team built a 'Panda Master' using a modified ReachyMini robot that speaks via GPT, while an Agilex Robotics arm draws fortunes in real-time.
A tweet introduces a workflow where GPT-5.5 xhigh plans and delegates implementation to Composer 2.5 subagents via the pi-cursor-sdk, claiming it outperforms using either model alone. The linked GitHub repo is an open-source SDK that integrates Cursor models into the pi agent runtime.