Thinking Machines Lab is hiring supercomputing engineers in NYC and SF to build infrastructure for real-time interactive models and large-scale training.
This article describes a global email-based book club for senior developers that reads technical books on databases, distributed systems, and software performance; the current pick is 'Operating Systems: Three Easy Pieces'.
A GitHub repository containing comprehensive system design interview notes based on Alex Xu's bestselling books, covering topics like scaling, consistent hashing, and distributed systems.
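Since consistent hashing shows up constantly in those interviews, here is a minimal Python sketch of a hash ring with virtual nodes; the MD5 hash, vnode count, and node names are illustrative choices, not taken from the repository.

```python
import bisect
import hashlib

class ConsistentHashRing:
    """Minimal consistent-hash ring with virtual nodes (illustrative only)."""

    def __init__(self, nodes=(), vnodes=100):
        self.vnodes = vnodes   # virtual nodes per physical node smooth the key distribution
        self._ring = []        # sorted list of (hash, node) points on the ring
        for node in nodes:
            self.add(node)

    @staticmethod
    def _hash(key: str) -> int:
        return int(hashlib.md5(key.encode()).hexdigest(), 16)

    def add(self, node: str) -> None:
        for i in range(self.vnodes):
            bisect.insort(self._ring, (self._hash(f"{node}#{i}"), node))

    def remove(self, node: str) -> None:
        # Only keys owned by the removed node's vnodes get remapped.
        self._ring = [(h, n) for h, n in self._ring if n != node]

    def get(self, key: str) -> str:
        # First ring point clockwise from the key's hash; wrap past the end.
        h = self._hash(key)
        idx = bisect.bisect(self._ring, (h, chr(0x10FFFF)))
        return self._ring[idx % len(self._ring)][1]

ring = ConsistentHashRing(["cache-a", "cache-b", "cache-c"])
print(ring.get("user:42"))  # a key maps to the same node until the ring changes
```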
This paper explores collaborative intelligence paradigms where distributed Large Language Models work together across devices and clouds to handle resource constraints. It covers vertical device-cloud collaboration, horizontal multi-agent collaboration, routing policies, and open research challenges in scalable and trustworthy cooperative AI.
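To make the vertical device-cloud pattern concrete, the sketch below shows one possible cascade-style routing policy; the confidence threshold, model stubs, and function names are assumptions for illustration rather than anything specified in the paper.

```python
from dataclasses import dataclass

@dataclass
class Answer:
    text: str
    confidence: float  # model's self-reported confidence in [0, 1]

def small_device_model(prompt: str) -> Answer:
    # Stand-in for an on-device small model; a real system would run it locally.
    return Answer(text=f"[device] {prompt[:32]}...", confidence=0.55)

def large_cloud_model(prompt: str) -> Answer:
    # Stand-in for a cloud-hosted LLM behind an API.
    return Answer(text=f"[cloud] detailed answer for: {prompt}", confidence=0.95)

def route(prompt: str, threshold: float = 0.7) -> Answer:
    """Cascade policy: try the cheap on-device model first and escalate to
    the cloud only when its confidence falls below the threshold."""
    local = small_device_model(prompt)
    if local.confidence >= threshold:
        return local
    return large_cloud_model(prompt)

print(route("Summarize this 40-page contract").text)
```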
Modular published a blog post explaining why traditional HTTP routing doesn't work for LLM inference workloads. The article describes how their distributed inference framework handles stateful, heterogeneous GPU pods with KV caches and specialized prefill/decode backends, using conversation-level routing that stateless routing algorithms cannot provide.
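Modular's framework itself is not reproduced here, but the core idea of conversation-level routing can be sketched as sticky, cache-aware assignment; the pod names, rendezvous-hash fallback, and eviction hook below are illustrative assumptions.

```python
import hashlib

class ConversationRouter:
    """Pin each conversation to the pod holding its KV cache (illustrative).

    Unlike stateless round-robin, follow-up turns must land on the pod that
    already holds the conversation's KV cache, or the prefill is recomputed.
    """

    def __init__(self, pods):
        self.pods = list(pods)
        self.affinity = {}  # conversation_id -> pod

    def route(self, conversation_id: str) -> str:
        # Sticky assignment: reuse the pod that holds this conversation's cache.
        if conversation_id in self.affinity:
            return self.affinity[conversation_id]
        # New conversation: pick the pod with the highest rendezvous-hash
        # weight so assignments stay stable as pods join or leave.
        pod = max(self.pods,
                  key=lambda p: hashlib.sha1(f"{conversation_id}:{p}".encode()).digest())
        self.affinity[conversation_id] = pod
        return pod

    def evict(self, conversation_id: str) -> None:
        # Called when a pod drops the KV cache (e.g., under memory pressure).
        self.affinity.pop(conversation_id, None)

router = ConversationRouter(["decode-pod-0", "decode-pod-1", "decode-pod-2"])
print(router.route("conv-123"))  # same pod on every subsequent turn
print(router.route("conv-123"))
```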
Researchers from the Specula team created SysMoBench, a benchmark evaluating whether LLMs can faithfully model real-world computing systems in TLA+ or merely recite textbook specifications. The benchmark tests 11 systems across four phases and reveals systematic gaps in current LLMs' ability to model actual system implementations rather than their reference papers.
The article discusses the complexities of implementing idempotency in APIs, arguing that handling edge cases like concurrent requests and content mismatches is harder than simple replay caching.
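A hedged sketch of why this is harder than replay caching: besides storing the response under an idempotency key, a server has to reject reuse of the key with a different payload and serialize concurrent retries on the same key. The in-memory store and function names below are illustrative, not from the article.

```python
import hashlib
import json
import threading

_store = {}          # idempotency_key -> {"fingerprint": ..., "response": ...}
_locks = {}          # per-key locks so concurrent retries don't both execute
_registry_lock = threading.Lock()

def _fingerprint(payload: dict) -> str:
    return hashlib.sha256(json.dumps(payload, sort_keys=True).encode()).hexdigest()

def handle(idempotency_key: str, payload: dict, execute):
    # One lock per key: a retry that races the first attempt blocks here
    # instead of executing the operation twice.
    with _registry_lock:
        lock = _locks.setdefault(idempotency_key, threading.Lock())
    with lock:
        entry = _store.get(idempotency_key)
        if entry is not None:
            # Same key with a different body is a client bug, not a replay.
            if entry["fingerprint"] != _fingerprint(payload):
                raise ValueError("409: idempotency key reused with different payload")
            return entry["response"]  # replay: return the stored response
        response = execute(payload)
        _store[idempotency_key] = {"fingerprint": _fingerprint(payload),
                                   "response": response}
        return response

charge = lambda p: {"status": "charged", "amount": p["amount"]}
print(handle("key-1", {"amount": 100}, charge))   # executes once
print(handle("key-1", {"amount": 100}, charge))   # replayed from the store
```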
Martin Kleppmann discusses how the fundamentals of building large, distributed systems have evolved over the past decade in light of the updated second edition of his book "Designing Data-Intensive Applications."
A developer seeking recommendations on advanced AI workflow orchestration tools and patterns, including LangChain, LangGraph, and AWS Step Functions, to build more robust and future-proof systems.
The article explains the concept of Federated Learning as a privacy-preserving machine learning technique that trains models on local devices rather than central servers. It details the process of encrypted parameter updates and aggregation to mitigate data leakage risks while maintaining model performance.
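The aggregation step the article describes is, at its core, federated averaging; the NumPy sketch below shows a round-based version with plain (unencrypted) updates for clarity. The learning rate, client count, and synthetic data are illustrative, and a real deployment would encrypt or securely aggregate the updates before the server sees them.

```python
import numpy as np

def local_update(global_weights, X, y, lr=0.1, epochs=5):
    """Client-side step: train on local data only; the raw data never
    leaves the device, only the updated weights do."""
    w = global_weights.copy()
    for _ in range(epochs):
        grad = X.T @ (X @ w - y) / len(y)   # least-squares gradient
        w -= lr * grad
    return w, len(y)

def federated_average(updates):
    """Server-side step: weight each client's update by its sample count
    (FedAvg). Real systems apply secure aggregation or encryption here."""
    total = sum(n for _, n in updates)
    return sum(w * (n / total) for w, n in updates)

rng = np.random.default_rng(0)
true_w = np.array([2.0, -1.0])
global_w = np.zeros(2)

for _ in range(10):        # communication rounds
    updates = []
    for _ in range(3):     # three simulated clients with private data
        X = rng.normal(size=(20, 2))
        y = X @ true_w + rng.normal(scale=0.1, size=20)
        updates.append(local_update(global_w, X, y))
    global_w = federated_average(updates)

print(global_w)  # approaches [2, -1] without any client sharing raw data
```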