The Transformer Pill

Reddit r/ArtificialInteligence 06/12/26, 04:20 PM News

transformers artificial-intelligence deep-learning causality bioinformatics linguistics scientific-impact

Summary

A reflection on the broad implications of transformer architectures beyond LLMs, including potential impacts on linguistics, genetics, and causal modeling, comparing their significance to the Haber-Bosch process.

I just watched a YouTube video that explained the maths behind transformers in lay terms. I feel like I have been living under a rock for the last 10 years. My knowledge of AI basically stopped at CNNs (Convolutional Neural Networks). The theoretical and practical consequences of transformers are vast and go way beyond the current LLM hype once you understand what they imply:

* In linguistics: they completely shatter many of the dominant ideas in the field, like the signifier/signified divide, and grammar seems to be a system emerging from statistical correlations rather than one we are born with.
* In genetics: most genes responsible for monogenic diseases are already well known. What is left are polygenic diseases, like most autoimmune diseases or mental illnesses. Bioinformatics could combine the power of transformers with GWAS data to map the complex relationship between genes and illnesses.
* When transformers are paired with time-series data, they cease to be correlation engines and become causality engines. Governments, wealthy individuals and companies like Palantir are mapping supply chains to predict crises, price hikes and potential wars. When you apply these predictive capabilities to human behavior, you get very close to Minority Report.

When I tried to find an equivalent in the history of science in terms of impact, the only thing I could think of was the Haber-Bosch process, which basically defined the whole 20th century (fertilizers, bombs, toxic gases…).

What are your insights about the revolution transformers are about to bring, which the general public seems to be completely unaware of?

Similar Articles

Transformer-Based Language Models Across Domain Verticals: Architectures, Applications and Critical Assessment

When transformers learn "impossible" languages, what do they learn?

How LLMs Actually Work (26 minute read)

@v0xium: I strongly recommend reading the Transformer chapter from Speech and Language Processing by Dan Jurafsky and James H. M…

We are hitting a wall trying to force transformers to do actual logic [D]
