generative-ai

#generative-ai

Exploring the "Banality" of Deception in Generative AI

Reddit r/ArtificialInteligence ↗ · 7h ago Cached

This position paper explores 'banal deception' in generative AI, arguing that subtle manipulation is becoming normalized in chatbot interactions and requires new safeguards.

0 favorites 0 likes

#generative-ai

@FinanceYF5: Some say it’s one of the best short films they’ve seen in recent years. The film is titled Zombie Scavenger, created by MX-Shell. Soon, we won’t call it an “AI film” anymore—just a “film.”

X AI KOLs Following ↗ · 11h ago Cached

Introducing *Zombie Scavenger*, a short film by MX-Shell, regarded as one of the best in recent years. It highlights how AI-generated video is increasingly being accepted as standard cinematic content.

0 favorites 0 likes

#generative-ai

Language Modeling with Hyperspherical Flows

arXiv cs.LG ↗ · 13h ago Cached

This paper introduces S-FLM, a novel flow-based language model that operates in a hyperspherical latent space to address the computational costs and semantic limitations of existing discrete diffusion and continuous flow models.

0 favorites 0 likes

#generative-ai

TMPO: Trajectory Matching Policy Optimization for Diverse and Efficient Diffusion Alignment

arXiv cs.LG ↗ · 13h ago Cached

This paper introduces Trajectory Matching Policy Optimization (TMPO), a method for aligning diffusion models that addresses reward hacking and visual mode collapse by matching trajectory-level reward distributions rather than maximizing scalar rewards.

0 favorites 0 likes

#generative-ai

Sampling More, Getting Less: Calibration is the Diversity Bottleneck in LLMs

arXiv cs.CL ↗ · 13h ago Cached

This paper introduces a validity-diversity framework attributing diversity collapse in LLMs to order and shape miscalibration during decoding, validated across 14 language models.

0 favorites 0 likes

#generative-ai

@GoogleDeepMind: We’re reimagining a 50-year-old interface - the mouse pointer - with AI. These experimental demos show how people can i…

X AI KOLs ↗ · yesterday Cached

Google DeepMind is experimenting with reimagining the mouse pointer interface using Gemini AI, allowing users to control screens through motion, speech, and natural shorthand.

0 favorites 0 likes

#generative-ai

@wsl8297: When learning AI, the scariest part is getting stuck at "understanding the theory" and freezing when it's time to write code — not knowing where to start, and unable to find decent practice projects. I unearthed a practical treasure trove on GitHub: AI-Project-Gallery. It collects 30+ high-quality AI projects, covering classic topics like house price prediction and disease classification, as well as hot applications like Gemini chatbot and document generator...

X AI KOLs Timeline ↗ · yesterday Cached

This post shares a curated GitHub repository containing over 30 practical AI projects, covering domains from regression to generative AI, with many end-to-end examples, suitable for learners and developers.

0 favorites 0 likes

#generative-ai

Taught Claude to talk like a caveman to use 75% less tokens.

Reddit r/ArtificialInteligence ↗ · yesterday

A user experimented with prompting Claude to communicate concisely, resulting in a 75% reduction in token usage while monitoring potential impacts on model intelligence.

0 favorites 0 likes

#generative-ai

AI turning aggressive generalists into fucking institutions

Reddit r/ArtificialInteligence ↗ · yesterday

The author recounts using AI coding tools to build complex web infrastructure alone, arguing that AI empowers individual operators to achieve institutional-level output without large teams.

0 favorites 0 likes

#generative-ai

NoiseRater: Meta-Learned Noise Valuation for Diffusion Model Training

arXiv cs.LG ↗ · yesterday Cached

This paper introduces NoiseRater, a meta-learning framework that assigns importance scores to individual noise samples during diffusion model training to improve efficiency and generation quality.

0 favorites 0 likes

#generative-ai

Towards Customized Multimodal Role-Play

arXiv cs.LG ↗ · yesterday Cached

This paper introduces UniCharacter, a two-stage training framework for Customized Multimodal Role-Play (CMRP) that enables unified customization of persona, dialogue style, and visual identity. It presents the RoleScape-20 dataset and demonstrates that the model can achieve coherent cross-modal generation with minimal data.

0 favorites 0 likes

#generative-ai

Reinforcement learning for inverse structural design and rapid laser cutting of kirigami prototypes

arXiv cs.LG ↗ · yesterday Cached

This paper introduces RL-Kirigami, a framework combining optimal-transport conditional flow matching and reinforcement learning to solve the inverse design problem for kirigami metamaterials, achieving high accuracy and enabling rapid laser-cut prototype fabrication.

0 favorites 0 likes

#generative-ai

MoCam: Unified Novel View Synthesis via Structured Denoising Dynamics

Hugging Face Daily Papers ↗ · yesterday Cached

MoCam is a research paper introducing a diffusion-based framework for unified novel view synthesis that dynamically coordinates geometric and appearance priors to improve robustness against geometric errors.

0 favorites 0 likes

#generative-ai

VidSplat: Gaussian Splatting Reconstruction with Geometry-Guided Video Diffusion Priors

Hugging Face Daily Papers ↗ · yesterday Cached

VidSplat is a training-free generative reconstruction framework that uses video diffusion priors to recover complete 3D scenes from sparse inputs by synthesizing novel views.

0 favorites 0 likes

#generative-ai

@karpathy: This works really well btw, at the end of your query ask your LLM to "structure your response as HTML", then view the g…

X AI KOLs Following ↗ · 2d ago

Andrej Karpathy suggests prompting LLMs to structure responses as HTML for better visualization and predicts AI output will evolve from text to interactive neural videos.

0 favorites 0 likes

#generative-ai

20,000 Romans Entered Teutoburg Forest - I Made a Dark 15-Minute AI War Vid About It

Reddit r/singularity ↗ · 2d ago

A creator showcases a 15-minute AI-generated cinematic video about the Battle of the Teutoburg Forest, describing a 60-hour workflow utilizing AI for video, voice, and sound design.

0 favorites 0 likes

#generative-ai

Microsoft Copilot May Quietly Win Enterprise AI

Reddit r/ArtificialInteligence ↗ · 2d ago Cached

This analysis argues Microsoft Copilot may win the enterprise AI race through deep workflow integration in existing Microsoft tools rather than pure model superiority. It highlights how organizational habits and path-dependency often dictate technology adoption over technical capabilities.

0 favorites 0 likes

#generative-ai

Who Will Solve the AI Productivity Puzzle?

Reddit r/ArtificialInteligence ↗ · 2d ago Cached

Despite widespread adoption, generative AI has not yielded sustained productivity growth, leading OpenAI and Anthropic to launch private-equity-backed consulting ventures for enterprise integration.

0 favorites 0 likes

#generative-ai

ChatGPT is now creating content for textbooks.

Reddit r/singularity ↗ · 2d ago

This article reports that the AI tool ChatGPT is now being used to create material for educational textbooks. It signifies a new application area for large language models in the publishing industry.

0 favorites 0 likes

#generative-ai

Why there isn't any top LLM providers investing on diffusion LLM?

Reddit r/singularity ↗ · 2d ago

This article questions why major LLM providers are not investing in Diffusion LLMs despite recent advancements like Mercury 2. It explores potential fundamental issues or hardware bottlenecks hindering broader adoption.

0 favorites 0 likes

generative-ai

Submit Feedback