Intent to Prototype: Embedding API

Lobsters Hottest Tools

Summary

The Chromium team proposes a new Embedding API for the web platform that allows developers to generate vector embeddings on-device using Chrome's AI infrastructure, enabling privacy-preserving semantic search, retrieval-augmented generation, and content clustering while reducing latency and cost.

<p><a href="https://lobste.rs/s/czctjh/intent_prototype_embedding_api">Comments</a></p>
Original Article
View Cached Full Text

Cached at: 05/27/26, 01:26 AM

# Intent to Prototype: Embedding API Source: [https://groups.google.com/a/chromium.org/g/blink-dev/c/EjL1gAy3k3Q/m/31Cnh22MBgAJ](https://groups.google.com/a/chromium.org/g/blink-dev/c/EjL1gAy3k3Q/m/31Cnh22MBgAJ) ### Ian Zhao unread, 2:24 PM \(4 hours ago\)2:24 PM to blin\.\.\.@chromium\.org, m\.\.\.@chromium\.org, rei\.\.\.@chromium\.org, kenji\.\.\.@chromium\.org, dbo\.\.\.@chromium\.org **Contact emails** ying\.\.\.@chromium\.org,m\.\.\.@chromium\.org,rei\.\.\.@chromium\.org,kenji\.\.\.@chromium\.org,dbo\.\.\.@chromium\.org **Explainer** [https://github\.com/explainers\-by\-googlers/embedding\-api](https://github.com/explainers-by-googlers/embedding-api) **Specification** *No information provided* **Summary** The Embedding API is a proposed Web Platform API that allows developers to generate high\-dimensional vector representations \(embeddings\) of content directly on the user's device\. By leveraging Chrome's on\-device AI infrastructure and a shared on\-device model, this API enables powerful semantic understanding features—such as semantic search, Retrieval\-Augmented Generation \(RAG\), and content clustering\. It eliminates the latency, cost, and privacy trade\-offs of cloud services\. Furthermore, compared to DIY client\-side approaches, it provides significant user benefits \(saving bandwidth and local storage by preventing each site from downloading its own massive model\) and developer benefits \(abstracting away complex model delivery and keeping WebAssembly/WebGPU frameworks up\-to\-date\)\. **Blink component** [Blink\>AI\>Embedder](https://issues.chromium.org/issues?q=customfield1222907:%22Blink%3EAI%3EEmbedder%22) **Web Feature ID** Missing feature **Motivation** While existing web technologies like WebAssembly and WebGPU provide standardized, high\-performance, and privacy\-preserving execution environments, deploying an embedding model still forces developers into a difficult trade\-off: - WebAssembly/WebGPU \(DIY\): Leads to significant storage and memory bloat, as every site must download its own multi\-hundred megabyte model\. - Cloud APIs: Introduce network latency, financial costs for developers, and require sending potentially sensitive user text to third\-party servers\. By ensuring stateless execution and explicitly not persisting embeddings globally, an on\-device API allows the browser to safely share a single, optimized model across all origins, drastically reducing the resource footprint while providing a simple, high\-level JavaScript primitive for generalist developers\. **Key Use Cases** - Semantic Search: Enable note\-taking or documentation apps to find content based on meaning rather than keywords, entirely offline and private\. - On\-Device RAG: Power local Q&A bots that retrieve relevant context from a user’s own data\. - Real\-time Content Intelligence: Provide proactive moderation hints or content categorization as a user types, before content is ever transmitted to a server\. **Anticipated questions** Here's a list of problems that we want to discuss with other browser vendors and the Web Machine Learning Community Group \(WebML CG\) as part of Standards to ensure interoperability \(Note: the explainer lists more in the "Ensuring an Interoperable API Design" section\) - Model and Space Choices: Exploring requirements for open\-weight models and allowing developers to specify or provide their own models, to ensure compatibility with server\-side embedding databases\. - Content Mediation: Can we develop some sort of mediation when embeddings must be used server\-side? **Initial public proposal** [https://github\.com/webmachinelearning/proposals/issues/18](https://github.com/webmachinelearning/proposals/issues/18) **Requires code in //chrome?** True **Tracking bug** [https://crbug\.com/428233906](https://crbug.com/428233906) **Estimated milestones** No milestones specified **Link to entry on the Chrome Platform Status** [https://chromestatus\.com/feature/5115796490682368?gate=5187435874091008](https://chromestatus.com/feature/5115796490682368?gate=5187435874091008)

Similar Articles

Introducing text and code embeddings

OpenAI Blog

OpenAI introduces a new embeddings API endpoint that converts text and code into numerical vector representations for semantic search, clustering, and classification tasks. The models achieve state-of-the-art results on standard benchmarks including a 20% relative improvement in code search performance.

Embedist

Product Hunt

Embedist is an open-source AI-native embedded development environment tool, launched on Product Hunt, designed for developers working on embedded systems.

A new way to explore the web with AI Mode in Chrome

Google AI Blog

Google has updated Chrome's AI Mode to allow users to explore web content side-by-side with AI assistance without switching tabs, and added the ability to search across recent tabs and files for deeper context.

It&#8217;s make or break time for AI labeling systems

The Verge

Google is expanding SynthID and C2PA Content Credentials verification into Chrome and Search, while OpenAI embeds SynthID into images generated by its tools, marking a major push to make AI-generated content easier to detect online.