Towards Resilient and Autonomous Networks: A BlueSky Vision on AI-Native 6G

arXiv cs.AI 05/22/26, 04:00 AM Papers

6g ai-native foundation-model multi-agent-systems network-autonomy resilient-networks wireless-communications

Summary

This paper presents a visionary framework for AI-native 6G networks, proposing a unified foundation model and collaborative multi-agent systems to achieve autonomous, resilient network management beyond fragmented 5G approaches.

arXiv:2605.21395v1 Announce Type: new Abstract: The proliferation of emerging applications, such as autonomous driving and immersive experiences, demands cellular networks that are not only faster, but fundamentally more resilient and autonomous. This paper presents a BlueSky vision on how Artificial Intelligence will be natively integrated into 6G, shifting the paradigm from Network for AI to AI for Network. We envision that, unlike 5G's reliance on scattered, ad-hoc models each trained for a single task, native AI in the 6G era will be anchored by a foundation model and orchestrated via collaborative multi-agent systems, framing network management as a unified, multi-modal, multi-task optimization problem. Built on this vision, we outline two transformative directions. The first focuses on developing a 6G foundation model as a unified backbone, with task-specific knowledge distilled into compact models suited for diverse edge deployments. The second advances multi-agent systems designed to autonomously diagnose, maintain, and recover networks with minimal human intervention. These directions chart a roadmap for 6G to evolve into an intelligent, self-sustaining communication infrastructure.

# A BlueSky Vision on AI-Native 6G
Source: [https://arxiv.org/html/2605.21395](https://arxiv.org/html/2605.21395)
## Towards Resilient and Autonomous Networks: A BlueSky Vision on AI\-Native 6G

Liang Wu, Kelly Wan, Mayank Darbari, and Liangjie Hong

###### Abstract\.

The proliferation of emerging applications, such as autonomous driving and immersive experiences, demands cellular networks that are not only faster, but fundamentally more resilient and autonomous\. This paper presents a BlueSky vision on how Artificial Intelligence will be natively integrated into 6G, shifting the paradigm from Network for AI to AI for Network\. We envision that, unlike 5G’s reliance on scattered, ad\-hoc models each trained for a single task, native AI in the 6G era will be anchored by a foundation model and orchestrated via collaborative multi\-agent systems, framing network management as a unified, multi\-modal, multi\-task optimization problem\. Built on this vision, we outline two transformative directions\. The first focuses on developing a 6G foundation model as a unified backbone, with task\-specific knowledge distilled into compact models suited for diverse edge deployments\. The second advances multi\-agent systems designed to autonomously diagnose, maintain, and recover networks with minimal human intervention\. These directions chart a roadmap for 6G to evolve into an intelligent, self\-sustaining communication infrastructure\.

Conference: Proceedings of the 32nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining \(KDD ’26\); August 9–13, 2026; Jeju, South Korea

## 1\.Introduction

The integration of machine learning into cellular networks has demonstrated compelling gains throughout the 5G and 5G\-Advanced era\(Shahid et al\.,[2025](https://arxiv.org/html/2605.21395#bib.bib36)\)\. Task\-specific models have been shown to improve a wide range of network functions, including traffic prediction, beamforming optimization, and handover anticipation\. Yet as we look toward 6G, the demands placed on wireless infrastructure extend far beyond transmitting bits faster from one point to another\. Emerging applications such as autonomous driving, unmanned aerial vehicles, and intelligent wearables introduce qualitatively new requirements\. For example, networks must sense device locations, infer states of cellular network users, and respond to failure modes that have no precedent in prior generations\(Tataria et al\.,[2021](https://arxiv.org/html/2605.21395#bib.bib39); Chataut et al\.,[2024](https://arxiv.org/html/2605.21395#bib.bib9)\)\. Addressing each requirement by training a dedicated, ad\-hoc model, as has been the prevailing practice in 5G\-Advanced\(Chen et al\.,[2023](https://arxiv.org/html/2605.21395#bib.bib11)\), leads to a fragile and costly ecosystem\. The proliferation of narrow models inflates development overhead, complicates maintenance, and introduces subtle incompatibilities when the output of one model serves as input to another\. More fundamentally, a collection of siloed, single\-objective models cannot coherently balance competing network goals—throughput, latency, energy efficiency, and user experience—when these objectives must be jointly optimized in real time\.

To address these challenges, we propose a vision centered on two complementary pillars\. The first is a 6G foundation model: a unified, multi\-modal model capable of jointly processing heterogeneous inputs—natural language, raw wireless signals, positioning data, and sensory streams from connected devices\. Rather than maintaining a fragmented library of task\-specific models, this foundation model serves as a shared knowledge backbone from which task\-specific behaviors can be derived through fine\-tuning, and efficient, deployment\-ready variants can be obtained through knowledge distillation tailored to the resource constraints of diverse edge devices\. This design consolidates semantic understanding across modules, reduces development and maintenance costs, and ensures coherent inter\-module communication\. The second pillar is a multi\-agent system that translates the foundation model’s intelligence into coordinated network action\. Individual agents operate in parallel across network domains—access, core, and transport—handling maintenance, configuration, and recovery tasks autonomously\. The two pillars constitute our vision for AI\-native 6G: a system where a unified foundation model resolves the challenge of knowledge integration, and a multi\-agent architecture resolves the challenge of action coordination, jointly enabling networks that are at once more resilient and more autonomous\. Table[1](https://arxiv.org/html/2605.21395#S1.T1)summarizes the key differences between 5G and our proposed 6G AI\-native vision across three dimensions\. Unlike 5G, which treats AI as an add\-on for isolated tasks, 6G natively integrates a unified foundation model and autonomous multi\-agent systems, pivoting the paradigm from ’Network Supporting AI’ to a fundamentally ’AI\-Native Network’\.

Table 1\.Comparison of 5G and the Proposed 6G AI\-Native Vision
## 2\.Vision: A 6G Foundation Model

A truly AI\-native 6G network demands a new class of model—one that is not assembled from a library of task\-specific modules, but is instead built from the ground up to reflect the diversity, scale, and criticality of telecommunications workloads\. We envision a 6G foundation model organized around three defining properties\. First, it must be multi\-modal, capable of natively ingesting and jointly reasoning over the heterogeneous data streams that define modern wireless networks: raw signal measurements, time\-series telemetry, and spatiotemporal traces from connected devices\. Second, it must be multi\-task, able to perform not only the forecasting tasks familiar from prior work, but also anomaly detection, root cause analysis, and human\-readable explanatory reasoning—all within a unified model\. Third, it must support efficient adaptation, serving as a general\-purpose backbone that can be rapidly specialized, compressed, or distilled to meet the stringent latency and resource constraints of diverse deployment contexts, from core network controllers to the most constrained edge devices\. The three properties define not merely an incremental improvement over existing models, but a qualitative rearchitecting of how intelligence is embedded in the network\.

### 2\.1\.Multi\-Modal Telecom Understanding

The dominant paradigm in telecom\-oriented foundation models today is to adapt large language models to the telecommunications domain through fine\-tuning on domain\-specific corpora—technical standards, configuration logs, and operational manuals\(Qu et al\.,[2025](https://arxiv.org/html/2605.21395#bib.bib33); Chen et al\.,[2025](https://arxiv.org/html/2605.21395#bib.bib10); Maatouk et al\.,[2024](https://arxiv.org/html/2605.21395#bib.bib24); Zhou et al\.,[2024](https://arxiv.org/html/2605.21395#bib.bib45)\)\. While this approach yields models with useful linguistic fluency about network concepts, it remains fundamentally a language modeling exercise\. Such models are ill\-suited to the core demands of AI\-native 6G, which center not on text comprehension, but on the real\-time interpretation of raw network signals, sensor streams, and spatiotemporal data that are native to the wireless medium\.

Time\-series foundation models \(TSFMs\) represent a more promising point of departure, yet existing benchmarks on telecommunications datasets reveal a consistent performance gap: models trained predominantly on meteorological, financial, or IoT sensor data fail to transfer effectively to telecom workloads\(Feng et al\.,[2025](https://arxiv.org/html/2605.21395#bib.bib13)\)\. Two structural factors explain this gap\. The first is a data mismatch: public TSFM training corpora contain negligible quantities of telecommunications\-origin signals, and the statistical properties of network traffic—bursty, non\-stationary, and driven by the superposition of many simultaneous user behaviors—differ markedly from the low\-dimensional, smooth time series these models were designed for\(Ansari et al\.,[2024](https://arxiv.org/html/2605.21395#bib.bib6),[2025](https://arxiv.org/html/2605.21395#bib.bib5); Liu et al\.,[2025](https://arxiv.org/html/2605.21395#bib.bib23); Cohen et al\.,[2024](https://arxiv.org/html/2605.21395#bib.bib12)\)\. The second is an architectural mismatch: standard TSFM tokenization strategies, such as patch\-based segmentation\(Ansari et al\.,[2024](https://arxiv.org/html/2605.21395#bib.bib6),[2025](https://arxiv.org/html/2605.21395#bib.bib5); Liu et al\.,[2025](https://arxiv.org/html/2605.21395#bib.bib23); Cohen et al\.,[2024](https://arxiv.org/html/2605.21395#bib.bib12)\), were designed for signals that are well\-approximated by a small number of latent factors \(temperature, humidity, seasonal effects\)\. Telecommunications data is inherently high\-rank: a traffic matrix aggregated across a cell sector reflects the simultaneous behavior of hundreds to thousands of users, each subject to independent behavioral dynamics\. Patch\-based tokenizers collapse this structure, discarding the cross\-variate correlations that carry diagnostic information\(Feng et al\.,[2025](https://arxiv.org/html/2605.21395#bib.bib13)\)\. 
A telecommunications foundation model therefore requires new architectural primitives: tokenizers designed for high\-rank, high\-frequency multivariate signals; attention mechanisms that explicitly model covariate relationships across antenna ports or frequency bands; and decoding strategies capable of producing both autoregressive outputs and quantized discrete predictions, as emerging approaches such as Chronos\-2 have begun to explore\.
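
To make the tokenization argument concrete, the following is a minimal Python sketch, not the architecture of any specific TSFM\. It assumes a channel\-independent patching scheme \(each variate is split into patches on its own\) and contrasts it with a hypothetical cross\-variate token that keeps the same time window of all channels together\. The function names, the two KPI channels, and their values are illustrative assumptions\.

```python
from typing import List

Series = List[float]


def patchify(series: Series, patch_len: int, stride: int) -> List[Series]:
    """Split one univariate series into fixed-length patches (one token each)."""
    last_start = len(series) - patch_len
    return [series[s:s + patch_len] for s in range(0, last_start + 1, stride)]


def channel_independent_tokens(
    channels: List[Series], patch_len: int, stride: int
) -> List[List[Series]]:
    """Tokenize every channel separately: no token sees more than one channel."""
    return [patchify(ch, patch_len, stride) for ch in channels]


def cross_variate_tokens(
    channels: List[Series], patch_len: int, stride: int
) -> List[Series]:
    """Hypothetical alternative: each token concatenates the same time window
    of all channels, so co-occurring values stay in one token."""
    per_channel = channel_independent_tokens(channels, patch_len, stride)
    n_tokens = len(per_channel[0])
    return [
        [x for ch_patches in per_channel for x in ch_patches[t]]
        for t in range(n_tokens)
    ]


# Two toy KPI channels for one cell sector (values are made up).
throughput = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
prb_usage = [10.0, 20.0, 30.0, 40.0, 50.0, 60.0]

independent = channel_independent_tokens([throughput, prb_usage], patch_len=2, stride=2)
joint = cross_variate_tokens([throughput, prb_usage], patch_len=2, stride=2)
```

In the channel\-independent output, no single token contains both throughput and PRB usage, so any relationship between them has to be recovered later by the model\. The cross\-variate variant keeps that co\-occurrence inside each token, at the cost of longer tokens\.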

Beyond time\-series telemetry, the 6G ecosystem introduces an additional layer of modality complexity\. As 6G networks integrate dense sensor infrastructure to support autonomous vehicles\(He et al\.,[2020](https://arxiv.org/html/2605.21395#bib.bib14)\), unmanned aerial systems\(Shrestha et al\.,[2021](https://arxiv.org/html/2605.21395#bib.bib37)\), and extended reality applications\(Yu et al\.,[2023](https://arxiv.org/html/2605.21395#bib.bib41); Ahmad et al\.,[2023](https://arxiv.org/html/2605.21395#bib.bib3)\), the foundation model must reason over three\-dimensional spatiotemporal data \(positional traces, velocity fields, and mobility patterns\) whose geometry bears no resemblance to one\-dimensional time series\. These trajectories unfold in physical space, carry explicit coordinate structure, and demand architectural treatment suited to their spatial semantics, whether through geometric encoders, graph\-based representations of device neighborhoods, or coordinate\-aware attention\. A 6G foundation model that cannot natively process this spatiotemporal modality will be systematically blind to the most distinctive AI requirements of the new generation, which emphasizes connecting edge devices with remote sensors that track locations and trajectories\.

### 2\.2\.Multi\-Task Telecom Reasoning

Existing foundation models for time series are largely designed around a single canonical objective: predicting the next time step or the next patch of time steps\(Kottapalli et al\.,[2025](https://arxiv.org/html/2605.21395#bib.bib16)\)\. This framing captures prediction accuracy as the primary success criterion, and it is well\-suited to domains where forecasting is the end goal\. Telecommunications, however, is a domain where forecasting is rarely sufficient\. The network operations center does not merely want to know what traffic will look like in the next ten minutes; it needs to know when something has gone wrong, why it went wrong, and what should be done about it\.

This broader task space demands a model that is simultaneously anomaly\-aware during training and capable of structured causal reasoning at inference time\. The challenge is particularly acute because telecommunications time series are inherently high\-dimensional: a single network incident may manifest as coordinated deviations across dozens of correlated metrics spanning multiple layers of the protocol stack\(Sun et al\.,[2024](https://arxiv.org/html/2605.21395#bib.bib38)\)\. In 5G networks, such fault diagnosis already taxes human operators; in 6G, where the data volume and source diversity grow substantially, it becomes practically intractable without automated support\. We envision a 6G foundation model trained with explicit supervision over both normal operational patterns and failure signatures, enabling it to not only flag deviations but to trace their propagation across dependent subsystems and generate human\-readable explanations of likely root causes\. This shift from accurate prediction to interpretable causal narration is essential to realizing the 6G design philosophy of moving from human as doer to human as supervisor\(Pang et al\.,[2026](https://arxiv.org/html/2605.21395#bib.bib27)\)\.
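
The idea that one incident shows up as coordinated deviations across several metrics can be illustrated with a deliberately simple statistical stand\-in\. The sketch below is not the foundation model described above and performs no root\-cause reasoning: it only flags an incident when at least `min_metrics` metrics deviate from their own baseline at the same time\. Metric names, thresholds, and sample values are assumptions chosen for illustration\.

```python
from statistics import mean, pstdev
from typing import Dict, List


def zscore(history: List[float], value: float) -> float:
    """Deviation of `value` from the historical baseline, in standard deviations."""
    mu = mean(history)
    sigma = pstdev(history)
    if sigma == 0.0:
        return 0.0 if value == mu else float("inf")
    return (value - mu) / sigma


def detect_incident(
    baseline: Dict[str, List[float]],
    current: Dict[str, float],
    z_threshold: float = 3.0,
    min_metrics: int = 2,
) -> List[str]:
    """Return the deviating metrics if enough of them deviate together, else []."""
    deviating = sorted(
        name
        for name, value in current.items()
        if abs(zscore(baseline[name], value)) > z_threshold
    )
    return deviating if len(deviating) >= min_metrics else []


# Hypothetical per-cell baselines and one current observation.
baseline = {
    "rrc_setup_failures": [2, 3, 2, 3, 2, 3],
    "dl_throughput_mbps": [100, 102, 98, 101, 99, 100],
    "cpu_load": [0.40, 0.42, 0.41, 0.39, 0.40, 0.38],
}
current = {"rrc_setup_failures": 40, "dl_throughput_mbps": 20, "cpu_load": 0.41}

incident_metrics = detect_incident(baseline, current)
```

Requiring several metrics to move together is one simple way to separate a genuine incident from noise on a single counter\. Tracing propagation and explaining causes, as envisioned above, would need a far richer model than this per\-metric threshold\.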

The multi\-task imperative extends beyond time\-series telemetry to the spatiotemporal modalities described above\. When the foundation model monitors the trajectories of unmanned aerial vehicles\(Yang et al\.,[2024](https://arxiv.org/html/2605.21395#bib.bib40)\)or the mobility patterns of autonomous vehicles\(Baccari et al\.,[2024](https://arxiv.org/html/2605.21395#bib.bib7)\), it must simultaneously maintain predictive models of nominal behavior and anomaly detectors sensitive to deviations that may signal mechanical failure, adversarial interference, or environmental disruption\. These tasks are not independent: accurate trajectory forecasting improves anomaly sensitivity, and anomaly detection provides feedback that refines the generative model\. A unified foundation model architecture that couples these objectives—rather than maintaining separate models for prediction and detection—naturally captures this interdependence and supports the kind of joint optimization that high\-stakes 6G applications require\.

### 2\.3\.Efficient Adaptation and Deployment

The consolidation of diverse capabilities into a single foundation model offers substantial efficiency benefits in development and maintenance, but it introduces a complementary challenge: the model must be rapidly and reliably specialized to serve specific operational contexts, and it must be made lightweight enough to function on the heterogeneous hardware landscape that characterizes real 6G deployments\(Zeeshan et al\.,[2025](https://arxiv.org/html/2605.21395#bib.bib42)\)\.

Specialization requirements arise across two dimensions\. First, operational scenarios may demand heightened precision for particular task types: a core network controller managing high\-value enterprise traffic may require anomaly detection sensitivity far exceeding baseline performance, while an edge node serving a dense urban venue may prioritize latency\-sensitive load forecasting\. Rather than retraining from scratch, we envision capability\-specific fine\-tuning pipelines that can selectively amplify a targeted capacity—automatically curating the training signal most informative for that capability, or applying structured fine\-tuning that updates only the parameters most responsible for it\. This second path, in turn, connects to the broader program of interpretability for neural network models: understanding which circuits within the foundation model implement which functions makes it possible to route fine\-tuning updates to where they will have the greatest effect\(Lei et al\.,[\[n\. d\.\]](https://arxiv.org/html/2605.21395#bib.bib20)\), and to prune functionality that is irrelevant or counterproductive in a given deployment context\(Zhang et al\.,[2025](https://arxiv.org/html/2605.21395#bib.bib43)\)\.

Second, inference efficiency is a first\-class constraint in telecommunications\. Many network functions operate under strict latency budgets that a large foundation model cannot meet without modification\. For reasoning\-intensive tasks such as root\-cause analysis, this requires fine\-tuning the model toward more concise reasoning chains—fewer tokens, lower computational cost, no sacrifice in answer quality\(Pedroso et al\.,[2025](https://arxiv.org/html/2605.21395#bib.bib30)\)\. For continuous, high\-throughput monitoring tasks at the edge, the relevant mechanism is knowledge distillation: extracting a compact student model from the foundation model backbone, targeted at a specific capability rather than attempting to replicate the full model’s generality\. Edge devices in a 6G network are not interchangeable; they differ widely in compute, memory, and the nature of the raw data streams they observe\. The distillation pipeline must therefore be automated and capability\-aware, producing a family of deployment\-specific models each optimized for its local task, resource budget, and data regime\. In the most aggressive case, rather than distilling a new model end\-to\-end, it may be possible to extract and prune specific functional subnetworks from the foundation model directly, yielding lightweight modules that retain a targeted capability while discarding the rest—a form of surgical model decomposition that becomes tractable only when the model’s internal structure is sufficiently understood\.
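
The paper does not specify a distillation objective\. As one possible instance, the sketch below implements the classic soft\-target loss: the Kullback\-Leibler divergence between temperature\-softened teacher and student distributions, scaled by the squared temperature\. The logits and the temperature value are illustrative assumptions, and a real pipeline would typically combine this term with a task loss\.

```python
import math
from typing import List


def softmax(logits: List[float], temperature: float = 1.0) -> List[float]:
    """Numerically stable softmax over temperature-scaled logits."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)
    exps = [math.exp(z - peak) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(
    teacher_logits: List[float],
    student_logits: List[float],
    temperature: float = 2.0,
) -> float:
    """KL(teacher || student) on softened distributions, scaled by T^2."""
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0.0)
    return temperature ** 2 * kl


# Toy logits over three hypothetical fault classes.
teacher = [2.0, 1.0, 0.1]
matching_student = [2.0, 1.0, 0.1]
diverging_student = [0.1, 1.0, 2.0]

loss_match = distillation_loss(teacher, matching_student)
loss_diverge = distillation_loss(teacher, diverging_student)
```

A capability\-aware pipeline, in the sense used above, could restrict the classes or inputs on which this loss is computed to the single capability the edge model needs\.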

## 3\.Vision: Multi\-Agent Systems for Autonomous 6G Networks

Throughout the 5G and 5G\-Advanced era, machine learning has been progressively integrated into network operations, yet the role of human operators has remained central\(Larsson,[2025](https://arxiv.org/html/2605.21395#bib.bib19); Nam et al\.,[2014](https://arxiv.org/html/2605.21395#bib.bib25)\)\. Routine tasks such as fault triage\(Hu and Zhang,[2020](https://arxiv.org/html/2605.21395#bib.bib15)\), configuration updates\(Pozza et al\.,[2020](https://arxiv.org/html/2605.21395#bib.bib32)\), and capacity planning\(Pérez\-Romero et al\.,[2015](https://arxiv.org/html/2605.21395#bib.bib31)\)still require direct human execution, while only a narrow set of ML\-assisted functions has advanced to a state where humans serve as guides rather than doers\. The result is a network management paradigm that is partially automated but fundamentally human\-dependent\. The vision we advance here is more ambitious: by deploying coordinated multi\-agent systems across both the radio access and core network domains, 6G can achieve a state in which humans serve primarily as supervisors, setting goals and overriding decisions only in exceptional cases, while the network autonomously handles the vast majority of operational, maintenance, and recovery tasks\. This section articulates how that vision translates into concrete agent architectures on each side of the network, and identifies the shared knowledge infrastructure required to support both\.

### 3\.1\.RAN\-Side Multi\-Agent Systems

A distinguishing hardware characteristic of 6G base stations is the transition from fixed\-function, application\-specific processing units toward general\-purpose, scalable GPU\-based compute platforms\(Basaran et al\.,[2025](https://arxiv.org/html/2605.21395#bib.bib8); Kumar,[2025](https://arxiv.org/html/2605.21395#bib.bib18)\)\. Unlike their predecessors, these platforms support elastic capacity expansion and are natively suited to the batch processing workloads that neural inference demands\. Crucially, this architectural shift dissolves a longstanding bottleneck: rather than transmitting raw data from the device to a remote compute facility for processing, a significant portion of inference and decision\-making can now be executed locally at the base station, eliminating the associated transmission latency and backhaul cost\. This co\-location of computation and radio access creates the foundation for responsive, low\-latency agent operation, but it simultaneously introduces a new class of coordination challenge that did not exist when compute and radio were physically separated\.

The most immediate coordination requirement is task routing\. A compute offloading agent must continuously evaluate how inference workloads should be distributed across the three available tiers: user equipment, base station, and remote cloud or core, balancing latency constraints against energy budgets and real\-time compute availability\. This routing decision is neither static nor independently solvable\. It depends on the current channel state, the nature of the task, the mobility profile of the device, and the load on upstream compute resources, all of which evolve on timescales of milliseconds to seconds\.
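
A minimal sketch of the routing decision follows, under strong simplifying assumptions: each tier exposes point estimates of compute time, transfer time, and device\-side energy, and the agent picks the lowest\-energy tier that fits the latency budget\. The `Tier` fields, the selection rule, and all numbers are hypothetical\. A real agent would re\-estimate these quantities continuously from channel state, mobility, and upstream load\.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Tier:
    name: str
    compute_ms: float      # estimated inference time on this tier
    transfer_ms: float     # estimated time to move the input to this tier
    energy_mj: float       # estimated device-side energy cost
    available: bool = True

    @property
    def latency_ms(self) -> float:
        return self.compute_ms + self.transfer_ms


def choose_tier(tiers: List[Tier], latency_budget_ms: float) -> Optional[Tier]:
    """Pick the lowest-energy available tier whose total latency fits the budget."""
    feasible = [t for t in tiers if t.available and t.latency_ms <= latency_budget_ms]
    if not feasible:
        return None
    # Break energy ties in favor of the faster tier.
    return min(feasible, key=lambda t: (t.energy_mj, t.latency_ms))


# Made-up estimates for the three tiers named in the text.
tiers = [
    Tier("user_equipment", compute_ms=40.0, transfer_ms=0.0, energy_mj=12.0),
    Tier("base_station", compute_ms=5.0, transfer_ms=4.0, energy_mj=3.0),
    Tier("cloud", compute_ms=2.0, transfer_ms=30.0, energy_mj=3.5),
]

tight = choose_tier(tiers, latency_budget_ms=10.0)
impossible = choose_tier(tiers, latency_budget_ms=5.0)
```

Returning `None` when no tier is feasible leaves the fallback policy \(reject, degrade, or escalate\) to the caller, which seems preferable to silently violating the budget\.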

Mobility management in 6G extends substantially beyond the handover prediction\(Lima et al\.,[2023](https://arxiv.org/html/2605.21395#bib.bib21)\)familiar from prior generations\. As 6G networks integrate dense sensing infrastructure to support autonomous vehicles, unmanned aerial systems, and extended reality devices, a dedicated sensing and localization agent must process Integrated Sensing and Communication \(ISAC\)\(Liu et al\.,[2022](https://arxiv.org/html/2605.21395#bib.bib22)\)data streams to maintain real\-time estimates of device position, velocity, and trajectory within the three\-dimensional deployment space\. These estimates feed directly into mobility decisions and anomaly detection, and they represent a qualitatively new input modality that prior handover models were not designed to consume\.

Classical radio resource management\(Agarwal et al\.,[2022](https://arxiv.org/html/2605.21395#bib.bib2)\)tasks are similarly transformed by agentic treatment\. A beamforming agent, rather than optimizing beam weights from instantaneous channel measurements alone\(Ahmed et al\.,[2018](https://arxiv.org/html/2605.21395#bib.bib4)\), can draw on a persistent context window comprising user mobility history, predicted trajectory, and long\-term channel statistics to produce personalized, anticipatory beam configurations that improve link quality under mobility conditions where reactive approaches degrade\. A spectrum and power management agent must simultaneously determine how spectrum resources\(Ravi and Verma,[2025](https://arxiv.org/html/2605.21395#bib.bib34)\)are allocated across concurrent tasks of differing criticality, for example prioritizing the low\-latency control channel of an autonomous vehicle over a background file transfer\. These decisions are interdependent: power levels affect interference, interference may affect spectrum efficiency, and spectrum allocation constrains which beam configurations are feasible\.

As the number of specialized agents grows, inter\-agent coordination itself becomes a first\-order problem\. Individual agents optimize local objectives; without a mechanism for arbitration, their actions can conflict in ways that degrade global network performance\. A spectrum agent reducing interference may inadvertently starve a beamforming agent of the resources it requires; a power\-saving agent may conflict with a coverage assurance agent during a sudden demand surge\. Resolving these conflicts requires a RAN orchestrator agent that maintains a global view of agent states and objectives, detects emerging conflicts, and adjudicates among competing actions using a principled priority ordering aligned with network\-level goals\. Such hierarchical multi\-agent architectures for telecommunications represent an open research frontier: no deployed system currently implements this form of coordinated RAN autonomy, and realizing it will require advances in multi\-agent reinforcement learning—both for training individual task agents and for designing orchestrators capable of stable coordination under dynamic, partially observable conditions\. Federated learning will play an equally important role, enabling agents distributed across base stations to collaboratively improve shared models without centralizing sensitive user data\.
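
The arbitration step can be sketched as follows\. This is a static, rule\-based stand\-in for the learned orchestrator discussed above: agents submit proposals for named resources, and a fixed priority table decides contested ones\. The agent names, the priority ordering, and the resource identifiers are hypothetical\.

```python
from dataclasses import dataclass
from typing import Dict, List

# Hypothetical priority ordering: a lower number wins a conflict.
PRIORITY: Dict[str, int] = {
    "coverage_assurance": 0,
    "beamforming": 1,
    "spectrum": 2,
    "power_saving": 3,
}


@dataclass(frozen=True)
class Proposal:
    agent: str
    resource: str   # e.g. "cell-17/tx_power_dbm"
    value: float


def arbitrate(proposals: List[Proposal]) -> Dict[str, Proposal]:
    """For each contested resource, keep the proposal of the highest-priority agent."""
    winners: Dict[str, Proposal] = {}
    for proposal in proposals:
        current = winners.get(proposal.resource)
        if current is None or PRIORITY[proposal.agent] < PRIORITY[current.agent]:
            winners[proposal.resource] = proposal
    return winners


# A demand surge: power saving and coverage assurance disagree on transmit power.
proposals = [
    Proposal("power_saving", "cell-17/tx_power_dbm", 20.0),
    Proposal("coverage_assurance", "cell-17/tx_power_dbm", 43.0),
    Proposal("spectrum", "cell-17/carrier_mhz", 100.0),
]

decisions = arbitrate(proposals)
```

A fixed table cannot adapt to changing network\-level goals, which is part of why the text frames stable orchestration under partial observability as an open research problem\.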

### 3\.2\.Core Network\-Side Multi\-Agent Systems

The core network is the functional heart of a cellular system\(Parvez et al\.,[2018](https://arxiv.org/html/2605.21395#bib.bib29)\), responsible for authenticating devices, managing sessions, enforcing policies, routing traffic to its destination, and exposing network capabilities to external services\. In 5G, these functions are implemented as a set of loosely coupled, cloud\-native network functions—the Access and Mobility Management Function \(AMF\), Session Management Function \(SMF\), Policy Control Function \(PCF\), Unified Data Management \(UDM\), and others—each handling a distinct slice of the control plane\. While this service\-based architecture introduced valuable modularity, its operation remains heavily human\-supervised: slice provisioning, charging configuration, policy updates, and fault resolution all require substantial operator involvement\. The 6G core network, as we envision it, replaces this human\-in\-the\-loop model with a layer of autonomous agents that manage each functional domain independently and coordinate across domains through structured orchestration\.

Session management\(Park et al\.,[2022](https://arxiv.org/html/2605.21395#bib.bib28)\)is a natural first target for agent deployment\. In a 6G network carrying simultaneously the traffic of consumer broadband, autonomous vehicle telemetry, industrial IoT control loops, and immersive media, sessions differ not only in their quality\-of\-service requirements but in their failure semantics, latency tolerances, and billing models\. Dedicated session management agents—specialized respectively for data, voice, and control traffic—can maintain awareness of these distinctions and act on them in real time: renegotiating session parameters in response to changing conditions, preemptively migrating sessions ahead of predicted congestion, and escalating to the orchestration layer when cross\-session tradeoffs must be resolved\.

Charging and subscription management\(Chen et al\.,[2023](https://arxiv.org/html/2605.21395#bib.bib11)\), historically among the most labor\-intensive and error\-prone aspects of network operations, are particularly well\-suited to agent\-based automation\. A charging agent continuously monitors usage patterns against active subscription terms, detects billing anomalies—whether arising from misconfiguration, equipment failure, or fraudulent behavior—and autonomously triggers corrective actions such as personalized alerts, adaptive rate limiting, or billing adjustments, without requiring a human operator to review each event\. The shift from periodic, batch\-oriented billing reconciliation to continuous, real\-time agent monitoring represents a meaningful improvement in both operational efficiency and user experience\.

Network slicing\(Zhang,[2019](https://arxiv.org/html/2605.21395#bib.bib44)\), which refers to the virtualization of the physical network into isolated logical instances tailored to the requirements of different customers or applications, is a 5G capability that has not yet been operationalized at scale, largely because provisioning, monitoring, and lifecycle management of slices still demand significant manual effort\. In 6G, a slice lifecycle agent can autonomously instantiate new slices in response to service requests, scale capacity in anticipation of demand surges, enforce SLA compliance through continuous KPI monitoring, and decommission slices when they are no longer needed\. This closes the loop between business intent and network configuration in a way that manual processes cannot match at 6G timescales\.
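
The lifecycle described above \(instantiate, scale, enforce the SLA, decommission\) maps naturally onto a small state machine\. The sketch below is one hypothetical shape for such a control loop\. The states, the single latency KPI, the capacity units, and the scaling rule are all illustrative assumptions rather than part of any standard\.

```python
from dataclasses import dataclass
from enum import Enum


class SliceState(Enum):
    REQUESTED = "requested"
    ACTIVE = "active"
    DECOMMISSIONED = "decommissioned"


@dataclass
class Slice:
    name: str
    sla_latency_ms: float
    capacity_units: int = 0
    state: SliceState = SliceState.REQUESTED


def step(s: Slice, observed_latency_ms: float, forecast_load: float, in_use: bool) -> str:
    """Advance one control-loop tick and return the action taken."""
    if s.state is SliceState.REQUESTED:
        s.state, s.capacity_units = SliceState.ACTIVE, 1
        return "instantiate"
    if s.state is SliceState.ACTIVE:
        if not in_use:
            s.state, s.capacity_units = SliceState.DECOMMISSIONED, 0
            return "decommission"
        # Scale on an SLA violation or when forecast demand exceeds capacity.
        if observed_latency_ms > s.sla_latency_ms or forecast_load > s.capacity_units:
            s.capacity_units += 1
            return "scale_up"
        return "hold"
    return "noop"


# One hypothetical slice walked through its lifecycle.
v2x = Slice(name="v2x-control", sla_latency_ms=10.0)
actions = [
    step(v2x, observed_latency_ms=0.0, forecast_load=0.5, in_use=True),
    step(v2x, observed_latency_ms=12.0, forecast_load=0.5, in_use=True),
    step(v2x, observed_latency_ms=5.0, forecast_load=1.5, in_use=True),
    step(v2x, observed_latency_ms=5.0, forecast_load=0.0, in_use=False),
]
```

Scaling on the forecast rather than only on observed violations reflects the anticipatory behavior the text asks for\.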

As on the RAN side, the plurality of core network agents necessitates a core orchestrator agent that coordinates across functional domains, resolves policy conflicts—for instance, between a slice SLA agent demanding more compute and an energy efficiency agent constraining it—and maintains end\-to\-end service coherence across the RAN\-core boundary\. Federated learning and distributed inference techniques will be equally relevant here, enabling models that span multiple operator domains or geographic regions to be trained collaboratively while preserving data sovereignty\.
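
The text does not name a federated learning algorithm\. As one well\-known option, the sketch below shows the aggregation step of federated averaging: each site sends only its locally trained parameter vector and its sample count, and the coordinator returns the sample\-weighted mean\. The parameter vectors and counts are toy values\.

```python
from typing import List, Sequence, Tuple

ClientUpdate = Tuple[List[float], int]  # (parameter vector, local sample count)


def federated_average(updates: Sequence[ClientUpdate]) -> List[float]:
    """Sample-weighted mean of client parameter vectors; raw data never leaves a site."""
    total = sum(count for _, count in updates)
    if total == 0:
        raise ValueError("no samples reported by any client")
    dim = len(updates[0][0])
    return [
        sum(params[i] * count for params, count in updates) / total
        for i in range(dim)
    ]


# Two hypothetical operator domains with different amounts of local data.
updates = [
    ([1.0, 2.0], 100),
    ([3.0, 6.0], 300),
]

global_params = federated_average(updates)
```

Averaging alone does not guarantee privacy, so a deployment concerned with data sovereignty would likely pair it with additional protections such as secure aggregation\.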

### 3\.3\.Supporting Knowledge Infrastructure

Realizing the multi\-agent architectures described above requires two foundational infrastructure components that do not yet exist in deployable form\. The first is a network digital twin\(Nguyen et al\.,[2021](https://arxiv.org/html/2605.21395#bib.bib26)\), i\.e\., a continuously updated, generative simulation of the live network that serves as a training environment for reinforcement learning agents and a sandbox for evaluating proposed actions before they are applied to production systems\. A digital twin, in the telecommunications context, is more than a static topology model: it is a dynamic, data\-driven simulator that ingests real\-time telemetry from across the network and maintains a high\-fidelity replica of network state, including traffic distributions, interference patterns, device mobility, and failure propagation dynamics\. Agents trained against a faithful digital twin can acquire robust policies without incurring the cost and risk of learning directly from live network interactions\. Generating such a twin at the fidelity required for 6G, encompassing heterogeneous access technologies, non\-terrestrial network \(NTN\) integration\(Saleh et al\.,[2025](https://arxiv.org/html/2605.21395#bib.bib35)\), and dense sensing modalities, is itself an open research challenge that is not limited to wireless networks, and one that will demand generative modeling techniques capable of capturing the non\-stationary, multi\-scale dynamics of real deployments\.

The second infrastructure component is a telecommunications knowledge graph\(Krinkin et al\.,[2020](https://arxiv.org/html/2605.21395#bib.bib17)\)that encodes the structured relationships among network entities, configurations, failure modes, and remediation procedures\. Where the foundation model provides statistical pattern recognition over raw signals, the knowledge graph provides symbolic, interpretable structure that agents can query through retrieval\-augmented generation to ground their decisions in domain knowledge\. A knowledge graph that links, for instance, a particular alarm signature to its known root causes, the network functions involved, and the recommended resolution procedures enables agents to reason over failure scenarios that may be rare in the training data but well\-documented in operational knowledge bases\. The digital twin and the knowledge graph constitute the epistemic foundation on which autonomous 6G agents must be built: one provides experiential knowledge through simulation, the other provides symbolic knowledge through structured representation\.
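
The alarm\-to\-resolution lookup described above can be sketched with a tiny in\-memory triple store\. Everything here is hypothetical: the alarm, cause, and procedure identifiers are invented for illustration, and a production system would use a real graph database and feed the retrieved facts into the agent's prompt\.

```python
from typing import Dict, List, Tuple

Triple = Tuple[str, str, str]  # (subject, predicate, object)

# Hypothetical triples; none of these identifiers come from a real knowledge base.
TRIPLES: List[Triple] = [
    ("alarm:N2_LINK_DOWN", "caused_by", "cause:fiber_cut"),
    ("alarm:N2_LINK_DOWN", "caused_by", "cause:transport_misconfig"),
    ("alarm:N2_LINK_DOWN", "involves", "nf:AMF"),
    ("cause:fiber_cut", "resolved_by", "proc:dispatch_field_team"),
    ("cause:transport_misconfig", "resolved_by", "proc:rollback_transport_config"),
]


def neighbors(subject: str, predicate: str) -> List[str]:
    """All objects linked to `subject` by `predicate`, in insertion order."""
    return [o for s, p, o in TRIPLES if s == subject and p == predicate]


def retrieve_context(alarm: str) -> Dict[str, List[str]]:
    """Collect the facts an agent could ground its reasoning on for this alarm."""
    causes = neighbors(alarm, "caused_by")
    return {
        "causes": causes,
        "functions": neighbors(alarm, "involves"),
        "procedures": [proc for c in causes for proc in neighbors(c, "resolved_by")],
    }


context = retrieve_context("alarm:N2_LINK_DOWN")
```

Because the lookup is symbolic, it returns the same documented procedures even for an alarm the model has rarely or never seen in training data, which is the complementary role the text assigns to the knowledge graph\.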

The proliferation of agentic AI across both network domains, however, introduces a new attack surface and a new class of privacy risks that the architectures described above do not yet fully address\. We turn to these concerns in the following section\.

## 4\. Security and Privacy of AI\-Native 6G

The shift from a fragmented ecosystem of task\-specific models to a unified foundation model resolves many of the coordination and maintenance problems identified in prior sections—but it simultaneously concentrates risk in a way that has no precedent in 5G deployments\. In 5G, a compromised or corrupted model affects a single task in a bounded operational context\. In 6G, the foundation model is the shared cognitive substrate from which all agent behaviors are derived; a vulnerability in the foundation model is therefore a vulnerability in the entire network\.

A defining feature of the 6G foundation model, as envisioned in this paper, is its ability to receive and act on natural language instructions from network operators and, through intent\-based interfaces, from enterprise customers\. This capability is precisely what enables the shift from human\-as\-doer to human\-as\-supervisor—but it also introduces a class of vulnerabilities with no analog in prior network management systems\. Prompt injection attacks, in which adversarial instructions are embedded within seemingly legitimate inputs, can redirect the foundation model’s behavior in ways that are difficult to detect through conventional input validation\. In the context of network management, a successfully injected prompt does not merely produce a harmful text output; it can translate directly into a network reconfiguration, a slice provisioning decision, or a traffic rerouting action that affects millions of users\. The natural language interface that makes the foundation model accessible and powerful is the same interface that makes it exploitable, and securing it requires advances in input sanitization, behavioral constraint enforcement, and anomaly detection over the model’s action space rather than its output text alone\.
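One way to read "constraint enforcement over the action space" is a gate that inspects the structured action the model proposes, independent of the text that produced it\. The sketch below is a simple rule\-based check rather than a learned anomaly detector; the action kinds, the `AUTO_APPROVE_LIMIT` thresholds, and the `source` labels are hypothetical\.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ProposedAction:
    kind: str            # e.g. "reroute_traffic"
    affected_users: int  # estimated number of users impacted


# Hypothetical policy: permitted action kinds and the largest user impact
# that may proceed without human review.
AUTO_APPROVE_LIMIT = {"reroute_traffic": 10_000, "provision_slice": 1_000}


def gate(action, source):
    """Return 'allow', 'escalate', or 'reject' for a model-proposed action."""
    if action.kind not in AUTO_APPROVE_LIMIT:
        return "reject"  # action kind is outside the permitted action space
    if source == "enterprise_customer" and action.kind != "provision_slice":
        return "reject"  # customer intents may only touch their own slices
    if action.affected_users > AUTO_APPROVE_LIMIT[action.kind]:
        return "escalate"  # large impact requires a human supervisor
    return "allow"


print(gate(ProposedAction("reroute_traffic", 500), "operator"))
print(gate(ProposedAction("reroute_traffic", 2_000_000), "operator"))
```

Because the gate never reads the prompt, an injected instruction can at most request an action that the policy already permits for that source and impact level\.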

In 5G, location information is a derived quantity, inferred from signal strength measurements with limited spatial resolution\. In 6G, precise three\-dimensional position is a first\-class data type, continuously produced by the sensing and localization agents and ingested directly into the foundation model’s spatiotemporal reasoning pipeline\. The foundation model that can predict an autonomous vehicle’s trajectory with high accuracy can, by the same mechanism, maintain persistent behavioral profiles of individual users—profiles that reveal home and work locations, daily routines, social associations, and movement anomalies\. The precision that makes 6G sensing valuable for safety\-critical applications is the same precision that makes it dangerous as a surveillance instrument\.

Conventional access control models, including role\-based and attribute\-based access control frameworks, were designed for environments in which access decisions are made at authentication time and remain stable for the duration of a session\. Autonomous agents present a fundamentally different access control problem: an agent’s required permissions change dynamically as its task context evolves, and the appropriate scope of its access at any moment depends on what it is currently trying to accomplish and on the current state of the network\. A slice lifecycle agent that legitimately requires write access to provisioning interfaces during a scale\-out operation should not retain that access during normal monitoring phases\. These challenges define a security and privacy research agenda that is as ambitious as the AI\-native network architecture it must protect, an architecture that underpins all data\-driven applications in 6G, including AI and ML\.
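The slice lifecycle example can be sketched as task\-scoped permissions, where the permission set is derived from the agent's current phase instead of being fixed at authentication time\. The phase names, the permission strings, and the `ScopedAgent` class are hypothetical and serve only to illustrate the idea\.

```python
# Hypothetical mapping from task phase to the permissions that phase needs.
PHASE_PERMISSIONS = {
    "monitoring": frozenset({"telemetry:read"}),
    "scale_out": frozenset({"telemetry:read", "provisioning:write"}),
}


class ScopedAgent:
    """Agent whose permissions follow its current task phase."""

    def __init__(self, name, phase="monitoring"):
        self.name = name
        self.phase = phase

    def enter_phase(self, phase):
        if phase not in PHASE_PERMISSIONS:
            raise ValueError(f"unknown phase: {phase}")
        self.phase = phase

    def can(self, permission):
        # Evaluated on every request, so access is dropped as soon as
        # the agent leaves the phase that justified it.
        return permission in PHASE_PERMISSIONS[self.phase]


agent = ScopedAgent("slice_lifecycle")
print(agent.can("provisioning:write"))  # monitoring phase
agent.enter_phase("scale_out")
print(agent.can("provisioning:write"))  # scale-out phase
agent.enter_phase("monitoring")
print(agent.can("provisioning:write"))  # back to monitoring
```

In a real deployment the phase transition itself would need to be authorized and audited, since an agent that can freely declare its own phase could escalate its own privileges\.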

## 5\. Conclusion

This paper has presented a BlueSky vision for AI\-native 6G, arguing that the limitations of 5G’s fragmented, task\-specific ML paradigm cannot be resolved by incremental refinement, but only by a fundamental rearchitecting of how intelligence is embedded in the network\. We have proposed two complementary pillars toward this end\. The first is a 6G foundation model—multi\-modal, multi\-task, and efficiently adaptable—that consolidates the network’s cognitive capabilities into a unified backbone from which specialized, deployment\-ready variants can be derived\. The second is a multi\-agent system that operationalizes this intelligence as coordinated autonomous action across both the radio access and core network domains, supported by a digital twin for experiential learning and a knowledge graph for structured reasoning\. These pillars pave the way from today’s human\-as\-doer network operations toward a 6G paradigm in which humans serve as supervisors of a largely self\-managing infrastructure\.

## References

- Agarwal et al\.\(2022\)Bharat Agarwal, Mohammed Amine Togou, Marco Ruffini, and Gabriel\-Miro Muntean\. 2022\.A comprehensive survey on radio resource management in 5G HetNets: Current solutions, future trends and open issues\.*IEEE Communications Surveys & Tutorials*24, 4 \(2022\), 2495–2534\.
- Ahmad et al\.\(2023\)Hafiz Farooq Ahmad, Wajid Rafique, Raihan Ur Rasool, Abdulaziz Alhumam, Zahid Anwar, and Junaid Qadir\. 2023\.Leveraging 6G, extended reality, and IoT big data analytics for healthcare: A review\.*Computer Science Review*48 \(2023\), 100558\.
- Ahmed et al\.\(2018\)Irfan Ahmed, Hedi Khammari, Adnan Shahid, Ahmed Musa, Kwang Soon Kim, Eli De Poorter, and Ingrid Moerman\. 2018\.A survey on hybrid beamforming techniques in 5G: Architecture and system model perspectives\.*IEEE Communications Surveys & Tutorials*20, 4 \(2018\), 3060–3097\.
- Ansari et al\.\(2025\)Abdul Fatir Ansari, Oleksandr Shchur, Jaris Küken, Andreas Auer, Boran Han, Pedro Mercado, Syama Sundar Rangapuram, Huibin Shen, Lorenzo Stella, Xiyuan Zhang, et al\.2025\.Chronos\-2: From univariate to universal forecasting\.*arXiv preprint arXiv:2510\.15821*\(2025\)\.
- Ansari et al\.\(2024\)Abdul Fatir Ansari, Lorenzo Stella, Caner Turkmen, Xiyuan Zhang, Pedro Mercado, Huibin Shen, Oleksandr Shchur, Syama Sundar Rangapuram, Sebastian Pineda Arango, Shubham Kapoor, et al\.2024\.Chronos: Learning the language of time series\.*arXiv preprint arXiv:2403\.07815*\(2024\)\.
- Baccari et al\.\(2024\)Sihem Baccari, Mohamed Hadded, Hakim Ghazzai, Haifa Touati, and Mourad Elhadef\. 2024\.Anomaly detection in connected and autonomous vehicles: A survey, analysis, and research challenges\.*IEEE Access*12 \(2024\), 19250–19276\.
- Basaran et al\.\(2025\)Osman Tugay Basaran, Hammad Zafar, Martin Kasparick, Falko Dressler, and Slawomir Stańczak\. 2025\.Next\-Gen AI\-on\-RAN: AI\-native, Interoperable, and GPU\-Accelerated Testbed Towards 6G Open\-RAN\. In*ICC 2025\-IEEE International Conference on Communications*\. IEEE, 5362–5367\.
- Chataut et al\.\(2024\)Robin Chataut, Mary Nankya, and Robert Akl\. 2024\.6G networks and the AI revolution—Exploring technologies, applications, and emerging challenges\.*Sensors*24, 6 \(2024\), 1888\.
- Chen et al\.\(2025\)Daihang Chen, Yonghui Liu, Mingyi Zhou, Yanjie Zhao, Haoyu Wang, Shuai Wang, Xiao Chen, Tegawendé F Bissyandé, Jacques Klein, and Li Li\. 2025\.LLM for mobile: An initial roadmap\.*ACM Transactions on Software Engineering and Methodology*34, 5 \(2025\), 1–29\.
- Chen et al\.\(2023\)Na Chen, Jingyu Zhou, Hui Guan, Nan Xu, Sheng Xue, Shuting Li, and Zhiqiong Liu\. 2023\.5G charging mechanism based on dynamic step size\.*IEEE Access*11 \(2023\), 15069–15081\.
- Cohen et al\.\(2024\)Ben Cohen, Emaad Khwaja, Kan Wang, Charles Masson, Elise Ramé, Youssef Doubli, and Othmane Abou\-Amal\. 2024\.Toto: Time series optimized transformer for observability\.*arXiv preprint arXiv:2407\.07874*\(2024\)\.
- Feng et al\.\(2025\)Austin Feng, Andreas Varvarigos, Ioannis Panitsas, Daniela Fernandez, Jinbiao Wei, Yuwei Guo, Jialin Chen, Ali Maatouk, Leandros Tassiulas, and Rex Ying\. 2025\.TelecomTS: A Multi\-Modal Observability Dataset for Time Series and Language Analysis\.*arXiv preprint arXiv:2510\.06063*\(2025\)\.
- He et al\.\(2020\)Jianhua He, Kun Yang, and Hsiao\-Hwa Chen\. 2020\.6G cellular networks and connected autonomous vehicles\.*IEEE Network*35, 4 \(2020\), 255–261\.
- Hu and Zhang \(2020\)Peng Hu and Jinhuan Zhang\. 2020\.5G\-enabled fault detection and diagnostics: How do we achieve efficiency?*IEEE Internet of Things Journal*7, 4 \(2020\), 3267–3281\.
- Kottapalli et al\.\(2025\)Siva Rama Krishna Kottapalli, Karthik Hubli, Sandeep Chandrashekhara, Garima Jain, Sunayana Hubli, Gayathri Botla, and Ramesh Doddaiah\. 2025\.Foundation models for time series: A survey\.*arXiv preprint arXiv:2504\.04011*\(2025\)\.
- Krinkin et al\.\(2020\)Kirill Krinkin, Alexander Vodyaho, Igor Kulikov, and Nataly Zhukova\. 2020\.Models of telecommunications network monitoring based on knowledge graphs\. In*2020 9th Mediterranean Conference on Embedded Computing \(MECO\)*\. IEEE, 1–7\.
- Kumar \(2025\)Samir Kumar\. 2025\.Nokia launches Nokia RAN Digital Twin to turbo‑charge AI\-native 6G, powered by NVIDIA Aerial Omniverse Digital Twin\. In*Nokia Engineering Blog*\.
- Larsson \(2025\)Christofer Larsson\. 2025\.*5G/5G\-Advanced Networks: Planning, Design, and Optimization*\.Academic Press\.
- Lei et al\.\(\[n\. d\.\]\)Zhenyu Lei, Qiong Wu, Jianxiong Dong, Yinhan He, Emily Dodwell, Yushun Dong, and Jundong Li\. \[n\. d\.\]\.Reforming the Mechanism: Editing Reasoning Patterns in LLMs with Circuit Reshaping\. In*The Fourteenth International Conference on Learning Representations*\.
- Lima et al\.\(2023\)João PSH Lima, Álvaro AM de Medeiros, Eduardo P de Aguiar, Edelberto F Silva, Vicente A de Sousa, Marcelo L Nunes, and Alysson L Reis\. 2023\.Deep learning\-based handover prediction for 5G and beyond networks\. In*ICC 2023\-IEEE International Conference on Communications*\. IEEE, 3468–3473\.
- Liu et al\.\(2022\)An Liu, Zhe Huang, Min Li, Yubo Wan, Wenrui Li, Tony Xiao Han, Chenchen Liu, Rui Du, Danny Kai Pin Tan, Jianmin Lu, et al\.2022\.A survey on fundamental limits of integrated sensing and communication\.*IEEE Communications Surveys & Tutorials*24, 2 \(2022\), 994–1034\.
- Liu et al\.\(2025\)Chenghao Liu, Taha Aksu, Juncheng Liu, Xu Liu, Hanshu Yan, Quang Pham, Silvio Savarese, Doyen Sahoo, Caiming Xiong, and Junnan Li\. 2025\.Moirai 2\.0: When less is more for time series forecasting\.*arXiv preprint arXiv:2511\.11698*\(2025\)\.
- Maatouk et al\.\(2024\)Ali Maatouk, Nicola Piovesan, Fadhel Ayed, Antonio De Domenico, and Merouane Debbah\. 2024\.Large language models for telecom: Forthcoming impact on the industry\.*IEEE Communications Magazine*63, 1 \(2024\), 62–68\.
- Nam et al\.\(2014\)Wooseok Nam, Dongwoon Bai, Jungwon Lee, and Inyup Kang\. 2014\.Advanced interference management for 5G cellular networks\.*IEEE Communications Magazine*52, 5 \(2014\), 52–60\.
- Nguyen et al\.\(2021\)Huan X Nguyen, Ramona Trestian, Duc To, and Mallik Tatipamula\. 2021\.Digital twin for 5G and beyond\.*IEEE Communications Magazine*59, 2 \(2021\), 10–15\.
- Pang et al\.\(2026\)Gaoyang Pang, Wanchun Liu, Chentao Yue, Daniel E Quevedo, Karl H Johansson, Branka Vucetic, and Yonghui Li\. 2026\.Toward Wireless Human\-Machine Collaboration in the 6G Era\.*arXiv preprint arXiv:2602\.22662*\(2026\)\.
- Park et al\.\(2022\)Seongmin Park, Sungmoon Kwon, Youngkwon Park, Dowon Kim, and Ilsun You\. 2022\.Session management for security systems in 5G standalone network\.*IEEE Access*10 \(2022\), 73421–73436\.
- Parvez et al\.\(2018\)Imtiaz Parvez, Ali Rahmati, Ismail Guvenc, Arif I Sarwat, and Huaiyu Dai\. 2018\.A survey on low latency towards 5G: RAN, core network and caching solutions\.*IEEE Communications Surveys & Tutorials*20, 4 \(2018\), 3098–3130\.
- Pedroso et al\.\(2025\)Diego Frazatto Pedroso, Luís Almeida, Lucas Eduardo Gulka Pulcinelli, William Akihiro Alves Aisawa, Inês Dutra, and Sarita Mazzini Bruschi\. 2025\.Anomaly detection and root cause analysis in cloud\-native environments using large language models and Bayesian networks\.*IEEE Access*\(2025\)\.
- Pérez\-Romero et al\.\(2015\)Jordi Pérez\-Romero, Oriol Sallent, Ramon Ferrús, and Ramón Agustí\. 2015\.Artificial intelligence\-based 5G network capacity planning and operation\. In*2015 International Symposium on Wireless Communication Systems \(ISWCS\)*\. IEEE, 246–250\.
- Pozza et al\.\(2020\)Matteo Pozza, Patrick K Nicholson, Diego F Lugones, Ashwin Rao, Hannu Flinck, and Sasu Tarkoma\. 2020\.On reconfiguring 5G network slices\.*IEEE Journal on Selected Areas in Communications*38, 7 \(2020\), 1542–1554\.
- Qu et al\.\(2025\)Guanqiao Qu, Qiyuan Chen, Wei Wei, Zheng Lin, Xianhao Chen, and Kaibin Huang\. 2025\.Mobile edge intelligence for large language models: A contemporary survey\.*IEEE Communications Surveys & Tutorials*27, 6 \(2025\), 3820–3860\.
- Ravi and Verma \(2025\)Banoth Ravi and Utkarsh Verma\. 2025\.Spectrum allocation in 5G and beyond intelligent ubiquitous networks\.*International Journal of Network Management*35, 1 \(2025\), e2315\.
- Saleh et al\.\(2025\)Sharief Saleh, Pinjun Zheng, Xing Liu, Hui Chen, Musa Furkan Keskin, Basuki Priyanto, Martin Beale, Yasaman Ettefagh, Gonzalo Seco\-Granados, Tareq Y Al\-Naffouri, et al\.2025\.Integrated 6G TN and NTN localization: Challenges, opportunities, and advancements\.*IEEE Communications Standards Magazine*\(2025\)\.
- Shahid et al\.\(2025\)Adnan Shahid, Adrian Kliks, Ahmed Al\-Tahmeesschi, Ahmed Elbakary, Alexandros Nikou, Ali Maatouk, Ali Mokh, Amirreza Kazemi, Antonio De Domenico, Athanasios Karapantelakis, et al\.2025\.Large\-scale AI in telecom: Charting the roadmap for innovation, scalability, and enhanced digital experiences\.*arXiv preprint arXiv:2503\.04184*\(2025\)\.
- Shrestha et al\.\(2021\)Rakesh Shrestha, Rojeena Bajracharya, and Shiho Kim\. 2021\.6G enabled unmanned aerial vehicle traffic management: A perspective\.*IEEE Access*9 \(2021\), 91119–91136\.
- Sun et al\.\(2024\)Chuanhao Sun, Ujjwal Pawar, Molham Khoja, Xenofon Foukas, Mahesh K\. Marina, and Bozidar Radunovic\. 2024\.SpotLight: Accurate, explainable and efficient anomaly detection for Open RAN\. In*Proceedings of the 30th Annual International Conference on Mobile Computing and Networking*\. ACM\.
- Tataria et al\.\(2021\)Harsh Tataria, Mansoor Shafi, Andreas F Molisch, Mischa Dohler, Henrik Sjöland, and Fredrik Tufvesson\. 2021\.6G wireless systems: Vision, requirements, challenges, insights, and opportunities\.*Proc\. IEEE*109, 7 \(2021\), 1166–1199\.
- Yang et al\.\(2024\)Lei Yang, Shaobo Li, Yizong Zhang, Caichao Zhu, and Zihao Liao\. 2024\.Deep learning\-assisted unmanned aerial vehicle flight data anomaly detection: A review\.*IEEE Sensors Journal*24, 20 \(2024\), 31681–31695\.
- Yu et al\.\(2023\)Hao Yu, Masoud Shokrnezhad, Tarik Taleb, Richard Li, and JaeSeung Song\. 2023\.Toward 6G\-based metaverse: Supporting highly\-dynamic deterministic multi\-user extended reality services\.*IEEE Network*37, 4 \(2023\), 30–38\.
- Zeeshan et al\.\(2025\)Mohammad Zeeshan, Rahul Raj, Amit Anand, Ankur Pandey, Mohammad Tabrez Quasim, Joaquín Torres\-Sospedra, Sudhir Kumar, and Kapal Dev\. 2025\.Knowledge distillation\-based AIoT framework for efficient wireless gesture sensing in B5G/6G networks\.*IEEE Network*\(2025\)\.
- Zhang et al\.\(2025\)Jiamu Zhang, Shaochen Zhong, Andrew Ye, Zirui Liu, Sebastian Zhao, Kaixiong Zhou, Li Li, Soo\-Hyun Choi, Rui Chen, Xia Hu, et al\.2025\.Flexible Group Count Enables Hassle\-Free Structured Pruning\. In*Proceedings of the Computer Vision and Pattern Recognition Conference*\. 4807–4818\.
- Zhang \(2019\)Shunliang Zhang\. 2019\.An overview of network slicing for 5G\.*IEEE Wireless Communications*26, 3 \(2019\), 111–117\.
- Zhou et al\.\(2024\)Hao Zhou, Chengming Hu, Ye Yuan, Yufei Cui, Yili Jin, Can Chen, Haolun Wu, Dun Yuan, Li Jiang, Di Wu, et al\.2024\.Large language model \(LLM\) for telecommunications: A comprehensive survey on principles, key techniques, and opportunities\.*IEEE Communications Surveys & Tutorials*27, 3 \(2024\), 1955–2005\.
