research

Tag

Cards List
#research

@rohanpaul_ai: LLMs often cannot tell when an attack made them say something unsafe. Asking an LLM whether its own previous answer was…

X AI KOLs Timeline · 1h ago Cached

This paper investigates whether LLMs can reliably self-report when their outputs have been compromised by adversarial prefills, finding that models often cannot distinguish between compromised and intentional outputs, and their limited recognition stems from normal refusal behavior rather than true self-awareness.

0 favorites 0 likes
#research

@yoheinakajima: ActiveGraph: 1 month in: Paper #1: The Log is the Agent 3 LongMemEval Experiments Paper #2: Regimes, self-improvement l…

X AI KOLs Following · 2h ago Cached

ActiveGraph announces two new papers on agent memory (LongMemEval) and self-improvement regimes, along with reference agents, pack templates, and upcoming meetups in Seattle and San Francisco.

0 favorites 0 likes
#research

I Figured Out What Causes 'Super Weights'

Reddit r/ArtificialInteligence · 4h ago

Explains that super weights in large language models arise from the SoftMax-Attention interaction creating a 'Nothing Dump' token that serves as a stable reference point; removing these weights cripples performance.

0 favorites 0 likes
#research

Certainty Is All You Need

Reddit r/artificial · 6h ago

This paper introduces a new approach leveraging certainty in transformer models, building on the 'Attention Is All You Need' paradigm.

0 favorites 0 likes
#research

@SpaceX: Today’s mission includes a demo of a new vehicle that will enable affordable, routine access to the microgravity enviro…

X AI KOLs Following · 14h ago Cached

SpaceX's mission includes a demo of a new vehicle for affordable, routine access to microgravity for scientific research and in-space manufacturing, with a planned splashdown in the Pacific Ocean.

0 favorites 0 likes
#research

Thermodynamic Measure Of Intelligence

Reddit r/singularity · yesterday Cached

This paper proposes a thermodynamic measure of intelligence defined as 'rare-valid lift' and argues that recursive self-simulation is necessary and nearly sufficient for high thermodynamic intelligence, making intelligence measurable on a universal scale.

0 favorites 0 likes
#research

Prompt Injection as Role Confusion

Simon Willison's Blog · yesterday Cached

Research paper shows that LLMs suffer from 'role confusion', where they prioritize the style of text over its actual role tags, enabling prompt injection attacks. Destyling text reduces attack success from 61% to 10%, indicating a fundamental challenge for LLM security.

0 favorites 0 likes
#research

Prompt Injection as Role Confusion

Hacker News Top · yesterday Cached

This paper presents a theory that prompt injection attacks on LLMs stem from a fundamental flaw in how models perceive roles, treating roles as a type system for language. It explains existing attacks, predicts new ones, and proposes a research agenda for a science of roles.

0 favorites 0 likes
#research

Revised: Estimated share of newly written code exposed to AI generation and review

Reddit r/singularity · yesterday

This paper revises the estimated proportion of newly written code that is generated or reviewed by AI, analyzing its impact on software development.

0 favorites 0 likes
#research

@AlphaSignalAI: https://x.com/AlphaSignalAI/status/2069064122218717387

X AI KOLs Timeline · yesterday Cached

This article explores how AI agents can automatically write and optimize their skill files using techniques like SkillOpt from Microsoft Research, which treats skill documents as trainable state and delivers significant performance improvements. It addresses the challenge of manual skill tuning and presents frameworks like GEPA and EvoSkill as evolutionary approaches.

0 favorites 0 likes
#research

@yunxi0623: https://x.com/yunxi0623/status/2069054269332889793

X AI KOLs Timeline · yesterday Cached

Introduce 5 Codex Skills to improve research efficiency, including paper framework construction, image to PPT conversion, scientific diagram editing, academic writing assistance, and learning high-level paper structures, emphasizing turning repetitive processes into reusable skills.

0 favorites 0 likes
#research

@geekbb: Organized the quarterly reports, notes, and interviews of fund manager Zheng Xi from over a decade into a structured corpus, built as a traceable AI skill, enabling AI to conduct investment research Q&A and fund analysis based on real data rather than model hallucinations. https://github.com/lyra81604/zhengxi-views…

X AI KOLs Timeline · yesterday Cached

Compiled the public quarterly reports, notes, and interviews of fund manager Zheng Xi into a structured corpus, and built it as a traceable skill across AI platforms for real data-driven investment research Q&A and fund analysis.

0 favorites 0 likes
#research

@arxivblog: Computational complexity theorists show gravity must be quantised https://arxivblog.substack.com/p/computational-comple…

X AI KOLs Timeline · yesterday Cached

Computational complexity theorists argue that semiclassical gravity's non-linear dynamics would enable impossibly powerful computation, proving gravity must be quantized. The paper uses the Schrödinger-Newton equation to show that classical gravity coupled to quantum matter leads to computational contradictions.

0 favorites 0 likes
#research

@Phoenixyin13: Congratulations @alisawuffles on officially joining OpenAI! UW NLP PhD final year, OpenAI SuperAlignment Fellowship recipient, Alisa Liu announced she will join OpenAI next week. In her recent blog, she detailed the whole...

X AI KOLs Timeline · yesterday Cached

Alisa Liu (alisawuffles), a UW NLP PhD in her final year and recipient of the OpenAI SuperAlignment Fellowship, announced she will be joining OpenAI next week. In her blog, she transparently detailed the entire job search process, including 46 recruiter screens and interviews with 11 top AI labs.

0 favorites 0 likes
#research

AIs can do world-modeling now, as seen via the Anthropic Fable standoff

Reddit r/artificial · 2d ago

Anthropic demonstrates that AI systems can now perform world-modeling, as evidenced by the Fable standoff experiment.

0 favorites 0 likes
#research

@akshay_pachaar: Turn any paper into running code. Just swap arxiv → autoarxiv in the paper url. That hands the paper to an AI agent fro…

X AI KOLs Following · 2d ago Cached

autoarxiv lets you turn any arxiv paper into running code by simply changing the URL to autoarxiv.org. An AI agent from alphaXiv reads the paper, clones the repo, sets up dependencies, and runs a minimal reproduction to verify claims, logging everything live.

0 favorites 0 likes
#research

Is AI ruining our skills? Early results are in and they’re not good

Lobsters Hottest · 2d ago Cached

A new study reveals early results suggesting that AI is negatively affecting human skills, raising concerns about cognitive decline.

0 favorites 0 likes
#research

@FinanceYF5: 3/ He believes the AI capability leap in the past 5 months comes not only from tool advancements like Claude Code, but because of 【Mythos】—a new Anthropic model that quietly changed the entire R&D rhythm after its training completed in February this year. Key takeaway: Leading models are helping to train the next generation of leading models...

X AI KOLs Following · 3d ago Cached

According to speculation, Anthropic's new model Mythos, after completing training in February this year, quietly changed the R&D rhythm, leading to a significant leap in AI capabilities over the past 5 months. Leading models are helping to train the next generation of models.

0 favorites 0 likes
#research

Researchers used math to crack Wordle

Hacker News Top · 3d ago Cached

Researchers at Binghamton University used Shannon entropy to develop a mathematical method that solves Wordle puzzles with a 99% success rate, prioritizing informative guesses over likely answers.

0 favorites 0 likes
#research

Slow breathing modulates brain function and risk behavior

Hacker News Top · 3d ago

This paper reports that slow breathing can modulate brain function and influence risk-taking behavior.

0 favorites 0 likes
Next →
← Back to home

Submit Feedback