Interpreting Style Representations via Style-Eliciting Prompts
Summary
This paper proposes a framework to interpret style representations by using style-eliciting prompts—natural language instructions that steer LLMs to generate text with specific stylistic attributes. The method outperforms baseline LLM prompting techniques in both describing and imitating writing styles.
View Cached Full Text
Cached at: 06/05/26, 08:07 AM
# Interpreting Style Representations via Style-Eliciting Prompts
Source: [https://arxiv.org/html/2606.05716](https://arxiv.org/html/2606.05716)
###### Abstract
Style representation learning is a powerful tool for authorship analysis and modeling writing style, yet the latent nature of learned representations makes them difficult to interpret\. Recent work has attempted to explain these representations by generating natural language descriptions with large language models \(LLMs\) conditioned on input text\. However, such descriptions are often prone to the LLM’s biases and hallucinations, and they lack an explicit objective and practical utility\. In this work, we propose a novel framework for interpreting style representations through style\-eliciting prompts: natural language instructions designed to steer LLMs to generate text that reflects specific stylistic attributes\. We curate 1,010 distinct style features spanning 26 stylistic categories and construct a dataset by prompting an LLM to generate text conditioned on these features\. Using this data, we train a decoder to generate a style prompt from the style representation of the generated text\. We evaluate our approach on three tasks: \(1\) recovering original style prompts from generated text, \(2\) generating text in the same style using the recovered prompts, and \(3\) steering LLM outputs to match the style of human\-written texts\. Experiments demonstrate that our method consistently outperforms strong baselines that directly prompt LLMs with target text, achieving superior performance in both style description and style imitation\. These results highlight that style\-eliciting prompts can provide a practical and interpretable interface to stylistic information encoded in style representations\.
Interpreting Style Representations via Style\-Eliciting Prompts
Junghwan Kim and David JurgensUniversity of Michigan\{kimjhj,jurgens\}@umich\.edu
## 1Introduction
Writing style is a core dimension of natural language, influencing how messages are interpreted, remembered, and disseminated across various contextsKellyet al\.\([2003](https://arxiv.org/html/2606.05716#bib.bib1)\); Boghratiet al\.\([2023](https://arxiv.org/html/2606.05716#bib.bib5)\)\. To model stylistic variation computationally, recent work has developedstyle representations—vector embeddings designed to encode stylistic propertiesRivera\-Sotoet al\.\([2021](https://arxiv.org/html/2606.05716#bib.bib24)\); Wegmannet al\.\([2022](https://arxiv.org/html/2606.05716#bib.bib25)\); Patelet al\.\([2025](https://arxiv.org/html/2606.05716#bib.bib50)\)\. These representations have proven effective for modeling and comparing writing stylesNeelakanteswaraet al\.\([2024](https://arxiv.org/html/2606.05716#bib.bib28)\); Sotoet al\.\([2024](https://arxiv.org/html/2606.05716#bib.bib29)\); Horvitzet al\.\([2024a](https://arxiv.org/html/2606.05716#bib.bib30),[b](https://arxiv.org/html/2606.05716#bib.bib31)\)\. Nevertheless, their latent nature obscures which stylistic attributes they encode, restricting how users can interact with them for controlled text generation\.
One intuitive approach to creating interpretable style representations is to ask LLMs to describe the input text in natural languagePatelet al\.\([2023](https://arxiv.org/html/2606.05716#bib.bib20)\); Alshomaryet al\.\([2025](https://arxiv.org/html/2606.05716#bib.bib21)\)\. Yet, free\-form LLM\-generated descriptions often overlook important stylistic nuances and can be influenced by model\-specific biases or hallucinations\(Ramnathet al\.,[2025](https://arxiv.org/html/2606.05716#bib.bib19)\)\. More importantly, such descriptions are primarily explanatory rather than functional: it is not obvious how they can be reliably used to reproduce, manipulate, or transfer writing style\.
In this work, we adopt a complementary perspective that emphasizes control as an explanation, building on LLMs’ well\-established capability to follow stylistic instructions when generating textReifet al\.\([2022](https://arxiv.org/html/2606.05716#bib.bib6)\)\. We introduce a framework that interprets neural style representations as human\-language style prompts—explicit natural language instructions that specify stylistic constraints and can be directly followed by LLMs\. Our approach constructs a supervised learning setup in which texts are first generated from known style prompts, after which a decoder is trained to recover those prompts from the style representations of the generated texts\. By grounding interpretation in prompts that are directly usable for generation, this formulation yields an operational interface for stylistic control, supporting applications such as creative writing, personalized messaging, and persona simulationMou and Vechtomova \([2020](https://arxiv.org/html/2606.05716#bib.bib7)\)\.
To support this approach, we build a large\-scale synthetic dataset comprising 1,010 distinct style features organized across 26 stylistic dimensions, including lexical choice, syntactic structure, tone, and rhetorical strategy\. Using these prompts, we generate 1\.8M stylized responses with an LLM, forming paired examples of text and explicit stylistic instructions\. This dataset provides fine\-grained supervision for learning to decode style representations and enables systematic evaluation of both style characterization and controllability\.
We demonstrate the benefits of our framework across three tasks: \(1\) inferring original style prompts from LLM\-generated text, \(2\) producing stylistically similar outputs using inferred style prompts, and \(3\) steering LLMs to emulate the style of non\-synthetic, human\-authored texts\. Across all evaluations, our method consistently surpasses baselines that rely on directly prompting LLMs with target text alone\. Our method achieves substantial gains in style prompt recovery \(76\.0% ROUGE\-1, 21\.7% LaBSE, and 42\.8% LLM\-as\-judge improvements\) and stronger style alignment for style control \(12\.9% and 26\.1% L2 improvement for LLM\-generated and human\-written references, respectively\)\. Together, these results demonstrate that decoding style representations into actionable prompts provides an effective pathway for both analyzing and manipulating writing style\.
Our contributions are summarized as follows: \(1\) We introduce a decoder\-based framework that translates latent style representations into natural language style prompts, enabling interpretable and controllable use of stylistic information\. \(2\) We introduce a new, large\-scale synthetic dataset of 1\.8M stylized texts paired with diverse, compositional style prompts spanning 26 stylistic dimensions\. \(3\) We show that our method substantially outperforms existing baselines on both style prompt recovery and style control tasks\. \(4\) We release our dataset111[https://huggingface\.co/datasets/Blablablab/style\-to\-text](https://huggingface.co/datasets/Blablablab/style-to-text)and code222[https://github\.com/junghwanjkim/style\-decoding](https://github.com/junghwanjkim/style-decoding)to facilitate future research on interpretable and controllable writing style modeling\.
## 2Related Work
Our work connects several research areas, including style representation interpretation, style description, style transfer, and prompt discovery\.
#### Style Representation\.
Originally developed for authorship verification, style representation models are trained to embed texts from the same author close together while separating those from different authorsRivera\-Sotoet al\.\([2021](https://arxiv.org/html/2606.05716#bib.bib24)\); Wegmannet al\.\([2022](https://arxiv.org/html/2606.05716#bib.bib25)\); Patelet al\.\([2025](https://arxiv.org/html/2606.05716#bib.bib50)\)\. These dense vector representations have resulted in significant performance gains for many tasks like authorship attribution over older, more interpretable methods from stylometry\(cf\. Rangel and Rosso,[2019](https://arxiv.org/html/2606.05716#bib.bib2); Stamatatoset al\.,[2022](https://arxiv.org/html/2606.05716#bib.bib3); Bevendorffet al\.,[2024](https://arxiv.org/html/2606.05716#bib.bib4)\)\. Although their training objective focuses on author identity, the resulting representations capture rich stylistic features beyond author\-specific ones\. For instance, controlled stylistic perturbations induce consistent linear shiftsZhu and Jurgens \([2021](https://arxiv.org/html/2606.05716#bib.bib22)\)or clustersWegmannet al\.\([2022](https://arxiv.org/html/2606.05716#bib.bib25)\)in the style representation space, indicating their sensitivity to stylistic changes\. Moreover,Wanget al\.\([2023](https://arxiv.org/html/2606.05716#bib.bib26)\)verifies that various non\-author\-specific styles in the CDS datasetKrishnaet al\.\([2020](https://arxiv.org/html/2606.05716#bib.bib47)\)can be successfully predicted from these representations\.
#### Style Representation Interpretation\.
Downstream applications of style representations—such as authorship analysis and writing style modeling—often require interpretability and transparencyTiersma and Solan \([2002](https://arxiv.org/html/2606.05716#bib.bib10)\); Biber and Conrad \([2019](https://arxiv.org/html/2606.05716#bib.bib11)\)\. To address this need, prior work has attempted to interpret style representations by aligning individual embedding dimensions with stylistic featuresPatelet al\.\([2023](https://arxiv.org/html/2606.05716#bib.bib20)\)or by interpolating between representative examples in a latent style spaceAlshomaryet al\.\([2025](https://arxiv.org/html/2606.05716#bib.bib21)\)\. However, in both cases, the resulting style descriptions are generated by prompting LLMs directly with the input text\. As a result, these descriptions may not faithfully reflect the stylistic information encoded in the style representations, nor are they tied to a concrete, reusable objective for generation or control\. In contrast, our framework defines explicit ground\-truth style prompts that capture the stylistic intent injected into the style representations and can be used to steer subsequent text generation, enabling effective training supervision and well\-defined evaluation\.
#### Style Description\.
Recent studiesHunget al\.\([2023](https://arxiv.org/html/2606.05716#bib.bib16)\); Huanget al\.\([2024](https://arxiv.org/html/2606.05716#bib.bib17)\); Huet al\.\([2024b](https://arxiv.org/html/2606.05716#bib.bib18)\); Ramnathet al\.\([2025](https://arxiv.org/html/2606.05716#bib.bib19)\)have explored generating style descriptions as intermediate steps for LLM\-based authorship verificationStamatatos \([2016](https://arxiv.org/html/2606.05716#bib.bib32)\); Tyoet al\.\([2022](https://arxiv.org/html/2606.05716#bib.bib33)\), which aims to determine whether two documents share an author\. Other workPatelet al\.\([2024](https://arxiv.org/html/2606.05716#bib.bib42)\); Yang and Carpuat \([2025](https://arxiv.org/html/2606.05716#bib.bib43)\)has used style descriptions to support LLM\-driven style transferJinet al\.\([2022](https://arxiv.org/html/2606.05716#bib.bib34)\); Huet al\.\([2022](https://arxiv.org/html/2606.05716#bib.bib35)\); Mukherjeeet al\.\([2024](https://arxiv.org/html/2606.05716#bib.bib36)\), where new text is generated with the content of one input and the style of another\. However, asRamnathet al\.\([2025](https://arxiv.org/html/2606.05716#bib.bib19)\)notes, such descriptions often fail to capture the full range of stylistic variation and are susceptible to biases and hallucinations inherited from the underlying LLM\. Our work addresses these limitations by treating style prompts as ground\-truth descriptions, yielding more grounded, goal\-directed, and operational characterizations of style\.
#### Style Transfer\.
The text style transfer literature has long investigated leveraging stylistic information represented as style vectorsHuet al\.\([2017](https://arxiv.org/html/2606.05716#bib.bib37)\); Shenet al\.\([2017](https://arxiv.org/html/2606.05716#bib.bib38)\); Prabhumoyeet al\.\([2018](https://arxiv.org/html/2606.05716#bib.bib39)\); Xuet al\.\([2020](https://arxiv.org/html/2606.05716#bib.bib40)\); Shenet al\.\([2020](https://arxiv.org/html/2606.05716#bib.bib41)\)\. Recent workHorvitzet al\.\([2024a](https://arxiv.org/html/2606.05716#bib.bib30),[b](https://arxiv.org/html/2606.05716#bib.bib31)\)successfully trains text style transfer models that rely on pretrained style representations as input\. While our setting may appear similar due to its focus on stylistic control, our work differs fundamentally in both goal and formulation\. Rather than paraphrasing a given text to match a target style, our objective is to explicitly describe the target style and use this description to generate new text in that style\. Due to this difference, unlike in style transfer, where meaning preservation is often a central requirement, our setting does not involve preserving the original content; instead, we focus on evaluating the faithfulness of the style description and its utility for stylistic change\.
#### Prompt Discovery\.
The problem of discovering prompts that elicit specific behaviors from LLMs has attracted growing interest, particularly for uncovering harmful or undesirable behaviorsPerezet al\.\([2022](https://arxiv.org/html/2606.05716#bib.bib12)\); Liuet al\.\([2024](https://arxiv.org/html/2606.05716#bib.bib13)\); Honget al\.\([2024](https://arxiv.org/html/2606.05716#bib.bib14)\)\. More recent work extends this paradigm to prompt search for arbitrary, user\-defined objectivesLiet al\.\([2025](https://arxiv.org/html/2606.05716#bib.bib15)\)\. Our study addresses a related but more fine\-grained challenge: discovering prompts that induce specific writing styles—such as tone, sentence structure, or rhetorical form—which has not been explored in prior prompt discovery work\. General prompt discovery methods typically aim to find prompts that cause LLMs to generate a target text, often relying on reinforcement learning to explore the prompt space\. In contrast, our method leverages synthetic supervision to train a decoder efficiently in a fully supervised manner\.
## 3Problem Formulation
We study the interpretation of style representations by mapping them into natural\-language descriptions\. Specifically, we formulate style representation interpretation as the task of decoding style representations into style prompts that can be used to steer LLMs to produce text in the style implicitly described by the representation\. Given a style representation of an input text, the objective is to infer a natural\-language prompt that \(1\) induces LLM generations whose style is consistent with the stylistic intent encoded in the representation and \(2\) provides an interpretable description that meaningfully characterizes the style\. This formulation is motivated by three considerations: \(i\) such prompts are easily understandable by humans, \(ii\) they enable principled evaluation by assessing whether generated text exhibits the target style, and \(iii\) they are directly usable for downstream control of LLM generation\.
Formally, let𝚂\\mathtt\{S\}denote a Style Representation Model \(SRM\) that maps an input textxxto a latent style vector𝐱\\mathbf\{x\}\. Our goal is to learn a decoder𝙳\\mathtt\{D\}that maps𝐱\\mathbf\{x\}to a style promptsswhich both characterizes the stylistic features encoded in𝐱\\mathbf\{x\}and steers a subsequent generationyyto exhibit a style similar to that ofxx\. This objective can be expressed as
argmin𝙳ℓ\(𝐱,𝐲\),\{\\arg\\min\}\_\{\\mathtt\{D\}\}\\quad\\ell\(\\mathbf\{x\},\\mathbf\{y\}\),\(1\)where𝐲=𝚂\(y\)\\mathbf\{y\}=\\mathtt\{S\}\(y\)denotes the style vector of the generated textyy, andℓ\\ellis a vector distance \(e\.g\., L2\)\. Appendix Figure[1\(a\)](https://arxiv.org/html/2606.05716#A2.F1.sf1)illustrates this formulation\.
Directly optimizing Equation[1](https://arxiv.org/html/2606.05716#S3.E1)is infeasible due to the vast and discrete nature of the prompt space\. To address this challenge, we recast the problem as supervised learning using synthetic data\. Specifically, we generate synthetic pairs\(x,s\)\(x,s\)consisting of a stylized textxxand its corresponding style promptss, and train the decoder𝙳\\mathtt\{D\}to map the style vector𝐱=𝚂\(x\)\\mathbf\{x\}=\\mathtt\{S\}\(x\)to the ground\-truth promptss:
argmin𝙳ℒ\(s~,s\),\{\\arg\\min\}\_\{\\mathtt\{D\}\}\\quad\\mathcal\{L\}\(\\tilde\{s\},s\),\(2\)wheres~=𝙳\(𝐱\)\\tilde\{s\}=\\mathtt\{D\}\(\\mathbf\{x\}\)denotes the decoded style prompt, andℒ\\mathcal\{L\}is the token\-level cross\-entropy loss\. This surrogate objective is depicted in Appendix Figure[1\(b\)](https://arxiv.org/html/2606.05716#A2.F1.sf2)\. If𝙳\\mathtt\{D\}perfectly recovers the ground\-truth promptss, then both input textxxand the generated textyyin the original formulation are produced using the same promptss\. Consequently, their style vectors𝐱\\mathbf\{x\}and𝐲\\mathbf\{y\}are closely aligned, thereby minimizing the objective in Equation[1](https://arxiv.org/html/2606.05716#S3.E1)\.
To construct synthetic training pairs\(x,s\)\(x,s\), one possible approach is to use LLMs to describe the style of existing texts\. However, such descriptions may overlook important stylistic nuances and can be affected by model\-specific biases or hallucinations\. Moreover, many texts may not exhibit sufficiently salient stylistic features to support reliable description\. Instead, we begin with diverse style prompts and generate stylized texts using LLMs\. Because LLMs demonstrate strong instruction\-following capabilities for stylistic control, these prompts reliably induce the salient stylistic features expressed in the generated texts\. Furthermore, since the prompts are used directly to generate the texts, they are guaranteed to be operational and can steer the style of LLM generations\.
Figure 1:We visualize the style representations of a subset of our final dataset using t\-SNE\. Generations conditioned on different style prompts form distinct clusters, with stylistically similar clusters appearing closer together, illustrating that style representations admit style prompt recovery\.
## 4Dataset Construction
We construct a large\-scale synthetic dataset of stylistically diverse texts paired with ground\-truth style prompts that explicitly characterize their writing style\. To this end, we generate LLM outputs conditioned on a wide range of style prompts and real\-world questions drawn from public Question\-Answering \(QA\) platforms, ensuring content diversity and facilitating direct comparison with human\-written answers\. The dataset construction process consists of three stages: \(1\) generating a large inventory of concrete style features, \(2\) curating diverse real\-world questions, and \(3\) generating stylized responses conditioned jointly on style prompts and questions\. We describe each stage below\.
#### Style Generation\.
A central component of our dataset is a diverse set of modular style features that capture a broad spectrum of stylistic variation\. These features can be composed into prompts to control multiple stylistic attributes\.
We generate the style feature set using GPT\-4o with a hierarchical prompting strategy\. Directly prompting for a flat list of features leads to substantial redundancy and uneven coverage\. Instead, we first define broad stylistic categories and then populate each category with fine\-grained features\.
Our category design draws on prior work and is further expanded using LLM assistance\. We begin with 12 seed categories derived from existing literatureFisheret al\.\([2024](https://arxiv.org/html/2606.05716#bib.bib49)\); Patelet al\.\([2025](https://arxiv.org/html/2606.05716#bib.bib50)\); Ramnathet al\.\([2025](https://arxiv.org/html/2606.05716#bib.bib19)\)and expand this set to 26 categories using GPT\-4o\. These categories span a wide range of stylistic dimensions, including “Sentence Structure and Syntax,” “Word and Expression Usage,” “Tone and Mood,” and “Readability Level\.” The complete list of categories is shown in Table[A4](https://arxiv.org/html/2606.05716#A7.T4)\.
Each category is populated with style features through LLM generation, followed by manual curation\. For each category, GPT\-4o is prompted to generate 40 concrete, modular, and semantically distinct style features\. We then manually filter redundant or overly similar entries both within and across categories\. The resulting feature set contains 1,010 curated style features spanning diverse categories\. Examples of these features are shown in Figure[1](https://arxiv.org/html/2606.05716#S3.F1), with additional details and curation prompts provided in Appendix[C\.1](https://arxiv.org/html/2606.05716#A3.SS1)\.
#### Question Curation\.
We ground stylized generation in a QA setting to enable diverse outputs under the same style prompt\. This setup supports open\-ended yet content\-grounded responses, allowing models to vary content while adhering to a specified style\. By pairing a single style prompt with different questions, LLMs can generate stylistically consistent but semantically distinct outputs\.
We curate 300,000 real\-world questions from 3 publicly available QA platforms: Reddit, StackExchange, and Yahoo Answers\. For each platform, we select 10 topics333Topics correspond to subreddits for Reddit and sites for StackExchange\.and sample 10,000, 5,000, and 15,000 questions per topic from Reddit, StackExchange, and Yahoo Answers, respectively\. The resulting questions span a broad range of domains, including technical problem solving, personal advice, and opinion\-based discussion\. The full list of topics is provided in Table[A5](https://arxiv.org/html/2606.05716#A7.T5), with further details in Appendix[C\.2](https://arxiv.org/html/2606.05716#A3.SS2)\.
To support evaluation against human\-written references, we also collect corresponding human answers for the curated questions\. We first filter out questions with no human responses, then randomly select one answer per remaining question\.
#### LLM Generation\.
We generate stylized responses by prompting LLMs with both a style prompt and a question\. Each style prompt is constructed by randomly sampling between 1 and 10 style features from the curated feature set and concatenating them into a single instruction\. To avoid conflicting stylistic constraints, we ensure that features within a prompt are drawn from distinct style categories\. The sampling distribution favors prompts with fewer features, enabling the decoder to first learn individual stylistic attributes before generalizing to compositional styles\.
Each question is paired with two distinct style prompts, yielding two stylized responses per question\. To mitigate model\-specific stylistic biases, we generate responses using three different LLMs: Phi\-4, Qwen2\.5\-14B, and OLMo\-2\-13B\. Additional details on prompting and generation settings are provided in Appendix[C\.3](https://arxiv.org/html/2606.05716#A3.SS3)\.
#### Final Dataset\.
Each dataset entry consists of a question, a style prompt, and a stylized response\. Together, these examples provide explicit supervision for training the decoder to recover style prompts from style representations\. The final dataset comprises 1\.8M LLM\-generated responses with 434,535 unique style prompts, covering a wide range of stylistic variations and content domains\. In addition, the dataset includes 300K human\-written responses to support comparison with human\-authored style\.
Figure[1](https://arxiv.org/html/2606.05716#S3.F1)visualizes a subset of our dataset in the style representation space\. Distinct clusters emerge, each corresponding to a different style prompt, with semantically similar prompts forming nearby clusters\. For instance, the prompts “Use professional language only” and “Use compound sentences for clarity” produce overlapping clusters\. Notably, the default LLM style lies close to professional language, while human\-written responses cluster near short, punchy wording\. These patterns validate that the geometry of the style representation space captures stylistic similarity and variation, providing evidence that the representation space admits a decoding into interpretable style prompts\.
## 5Proposed Method
We now describe the architecture and training of our proposed style decoder\. As formulated in Section[3](https://arxiv.org/html/2606.05716#S3), the goal is to learn a decoder model𝙳\\mathtt\{D\}that maps a style vector𝐱\\mathbf\{x\}to a corresponding natural\-language style promptss\.
Our decoder architecture is inspired by continuous prompt tuning, which has been shown to be effective for conditioning frozen LLMs in both NLPLi and Liang \([2021](https://arxiv.org/html/2606.05716#bib.bib51)\); Lesteret al\.\([2021](https://arxiv.org/html/2606.05716#bib.bib52)\)and vision–language modelingTsimpoukelliet al\.\([2021](https://arxiv.org/html/2606.05716#bib.bib53)\)\. In this framework, continuous prompts encode input\- or task\-specific information that guides downstream generation\. We extend this paradigm by encoding stylistic information captured in style vectors into continuous prompts\.
The decoder𝙳\\mathtt\{D\}consists of two components: \(1\) a frozen LLM that interprets the stylistic signal encoded in the style vector, and \(2\) a trainable projection module that maps the style vector into the token embedding space of the LLM\. Because the projection module is the only trainable component, this module must be equipped with sufficient capacity to express a wide range of stylistic signals\. Concretely, the projection is implemented as a three\-layer feedforward network with GeLU activationsHendrycks and Gimpel \([2023](https://arxiv.org/html/2606.05716#bib.bib54)\), producing 20 token embeddings, a choice that we found to work well in preliminary experiments\. To further contextualize decoding, we prepend a natural\-language instruction that explicitly specifies the style description task alongside the projected style vector, enabling the LLM to more effectively leverage its instruction\-following capabilities\. The instruction prompt is provided in Appendix[D\.1](https://arxiv.org/html/2606.05716#A4.SS1)\. The overall architecture is illustrated in Figure[2](https://arxiv.org/html/2606.05716#S5.F2)\.
Figure 2:Our decoder𝙳\\mathtt\{D\}consists of a frozen LLM and a trainable projection module\. The projection module maps the input style vector into the token embedding space of the LLM\. Then, LLM generates a style prompt conditioned on the projected style vector\.#### Implementation Details\.
For the style representation model𝚂\\mathtt\{S\}, we use Mistral\-Nemo\-Instruct\-2407, trained via contrastive learning on author\-labeled data following the state\-of\-the\-art approach ofFincke and Boschee \([2024](https://arxiv.org/html/2606.05716#bib.bib27)\)\. The frozen LLM used in the decoder𝙳\\mathtt\{D\}is Ministral\-8B\-Instruct\.
We train the decoder𝙳\\mathtt\{D\}using the surrogate objective defined in Equation[2](https://arxiv.org/html/2606.05716#S3.E2)\. The dataset is split into training, validation, and test sets with a ratio of 8:1:1\. Training is performed for 5 epochs with a learning rate of 5e\-5 and a batch size of 32\. The best checkpoint is selected based on validation loss, and all reported results in this paper are obtained on the test set444180K LLM responses for Sections[6](https://arxiv.org/html/2606.05716#S6)and[7](https://arxiv.org/html/2606.05716#S7)and 60K human responses for Section[8](https://arxiv.org/html/2606.05716#S8)\. Additional training details are provided in Appendix[D\.2](https://arxiv.org/html/2606.05716#A4.SS2)\.
Figure 3:The style descriptions generated by our decoder match the ground truth style prompts better than the style descriptions directly generated by LLM or randomly selected style prompts in our dataset\. The match is consistently demonstrated in all three metrics: ROUGE\-1 score \(token matching\), LaBSE similarity \(semantic embedding\) and LLM\-as\-judge score \(LLM judgment\)\.
## 6Recovering the Style Prompt
We first evaluate prompt recovery, the task of reconstructing the ground\-truth style prompt from the style representation of a text generated by an LLM conditioned on that prompt\. This task directly corresponds to the surrogate training objective defined in Equation[2](https://arxiv.org/html/2606.05716#S3.E2)\. The goal of this experiment is to assess whether the proposed decoder can effectively recover the original stylistic instruction encoded in the style representation—i\.e\., creating an interpretable representation of the style\.
### 6\.1Experiment Setup
Given a style promptssand the corresponding stylized responsexx, our decoder maps the style representation𝐱\\mathbf\{x\}ofxxto the decoded style prompts~\\tilde\{s\}\. We measure how wells~\\tilde\{s\}matches the ground truthss\.
#### Baselines\.
We compare our approach against four baselines\. The first three are LLM\-based description baselines, in which an LLM is prompted to describe the writing style of a given text in natural language\. Specifically, we adapt prompts from two prior works, STYLLPatelet al\.\([2024](https://arxiv.org/html/2606.05716#bib.bib42)\)and RGYang and Carpuat \([2025](https://arxiv.org/html/2606.05716#bib.bib43)\), that generate style descriptions for style transfer, with only minor modifications necessary for our setting\. Because these prompts were originally designed for style transfer, we additionally introduce a custom prompt baseline tailored to prompt recovery\. To minimize performance differences arising from format mismatch, the custom baseline instructs the LLM to match the length and sentence structure of the desired output\. All description baselines use Ministral\-8B\-Instruct, the same LLM employed in our decoder, ensuring a fair comparison\. The full instruction prompts are provided in Appendix[E\.1](https://arxiv.org/html/2606.05716#A5.SS1)\. The final baseline is random sampling, where a style prompt is drawn uniformly from the same distribution used during dataset construction\. This baseline serves as a naive lower bound on prompt recovery performance\.
There is no straightforward off\-the\-shelf baseline that consumes dense style vectors\. We leave the comparison of different instantiations of our framework: mapping style vector to style prompt as future work\.
#### Metrics\.
We evaluate prompt recovery using three complementary metrics that capture both lexical and semantic similarity\. ROUGE\-1 measures unigram overlap between the decoded prompt and the ground\-truth prompt\. While this metric closely aligns with the training objective, it may underestimate similarity when stylistically equivalent prompts differ in surface wording\. To account for this limitation, we additionally report semantic metrics\. LaBSE similarity computes cosine similarity between sentence embeddings of the decoded and ground\-truth prompts, while LLM\-as\-judge employs an external LLM \(Qwen3\-14B\) to rate prompt similarity on a 0–10 scale\. The evaluation prompt is provided in Appendix[E\.2](https://arxiv.org/html/2606.05716#A5.SS2)\. Importantly, the evaluation model is entirely disjoint from those in dataset generation, decoder training, and baseline methods, ensuring an unbiased assessment\.
### 6\.2Results
As shown in Figure[3](https://arxiv.org/html/2606.05716#S5.F3), our decoder consistently outperforms all baselines across all three evaluation metrics by a significant margin\. This result indicates that the decoder is highly effective at recovering the original style prompt in both lexical agreement and semantic similarity\.
All LLM\-based descriptions consistently perform worse than the random baseline\. LLMs fail to generate style descriptions that capture the underlying stylistic intent expressed in the style prompt\. This is consistent with the observation inBerglundet al\.\([2024](https://arxiv.org/html/2606.05716#bib.bib9)\)that even though an LLM generates B from A, it cannot infer that A could have generated B\. Our custom baseline seems to benefit from output format and length information, but it still performs worse than our decoder\.
Overall, these results demonstrate that decoding directly from style representations yields more accurate reconstructions of stylistic intent than post hoc LLM\-based description methods\. Moreover, they provide strong evidence that the style representation space encodes rich, fine\-grained stylistic information, and that our decoder can effectively translate these latent signals into interpretable natural\-language prompts\.
Example Outputs1\. The author uses unreliable memory style, uses formal vocabulary, uses interjections for surprise,uses high noun density, uses repetition for focus, uses commas to set off interrupters,uses a flowchart\-like logic\.2\. The author uses poetic descriptive style, uses anaphora in sentence beginnings\.3\. The author uses comparative adjectives for comparison, uses minimal supporting explanation,uses antithesis in sentence structure\.Table 1:The example outputs of our decoder\.Example output is shown in Table[1](https://arxiv.org/html/2606.05716#S6.T1)\. From our inspection of randomly sampled examples, we do not observe content leakage—i\.e\., the decoded style is not somehow leaking information about the content of the text that would make inferring its style easier\. Instead, all output follows the format we used to train the decoder, making our evaluative comparisons meaningful\.
## 7Controlling the Writing Style
We next examine whether decoded style prompts can functionally control the style of LLM outputs\. In contrast to prompt reconstruction, this setting does not require recovering the original instruction verbatim\. Instead, success is determined by whether the decoded prompt induces generations with the same stylistic effect as those produced under the ground\-truth prompt\. It therefore serves as a test of whether training with the surrogate objective in Equation[2](https://arxiv.org/html/2606.05716#S3.E2)yields prompts that generalize to effective style control in downstream generation\.
### 7\.1Experiment Setup
Given a style vector𝐱\\mathbf\{x\}and the style prompts~\\tilde\{s\}decoded from𝐱\\mathbf\{x\}, we generate a new textyyconditioned ons~\\tilde\{s\}\. We measure how close the style representation𝐲\\mathbf\{y\}ofyyis to the original𝐱\\mathbf\{x\}\. To avoid information leakage,xxandyyare conditioned on different question contexts\.
#### Baselines\.
We compare our approach against five baselines: four LLM\-based style\-imitation methods and one explicit style\-transfer approach\. For the LLM\-based imitation baselines, we adapt prompting strategies from three prior worksBhandarkaret al\.\([2024](https://arxiv.org/html/2606.05716#bib.bib44)\); Wanget al\.\([2025](https://arxiv.org/html/2606.05716#bib.bib45)\); Jangraet al\.\([2025](https://arxiv.org/html/2606.05716#bib.bib46)\), which are designed to generate responses that mimic the writing style of a given target text\. We modify these prompts minimally to fit our QA setup\. In addition, we again introduce a custom LLM baseline tailored to our setting\. The full prompts are provided in Appendix[E\.1](https://arxiv.org/html/2606.05716#A5.SS1)\. Both our method and the LLM\-based imitation baselines use Ministral\-8B\-Instruct for generation\. As a representative style transfer baseline, we include TinyStylerHorvitzet al\.\([2024b](https://arxiv.org/html/2606.05716#bib.bib31)\), a state\-of\-the\-art method that applies style representations to transform text\. We use the official implementation, which uses its own style representation\. We first generate neutral responses without style conditioning and then apply TinyStyler to transfer them into the target style\.
#### Metrics\.
We assess style control by computing the L2 distance between the style representations of the original and the regenerated text\. Our evaluation uses the same style representation model that encodes inputs for the decoder, since the goal of our framework is to faithfully interpret the style representation space\. The decoded prompts should have a consistent effect on the target style representation space, rather than on obscure general style\.
Figure 4:Our approach yields the smallest L2 distance in style representation space from the ground\-truth stylized output, compared to baseline methods\.
### 7\.2Results
Figure[4](https://arxiv.org/html/2606.05716#S7.F4)shows that our decoder outperforms all baselines by a significant margin\. This result indicates that the inferred prompts successfully translate latent stylistic information into actionable instructions that guide generation behavior\.
The TinyStyler outperforms three adapted LLM\-based style\-imitation baselines, but underperforms our custom baseline\. One plausible limitation of the TinyStyler is architectural: TinyStyler relies on a smaller T5\-based generator, whereas both our decoder and the LLM\-based baselines use LLMs with billions of parameters\. In addition, TinyStyler employs the StyleEmbedding modelWegmannet al\.\([2022](https://arxiv.org/html/2606.05716#bib.bib25)\), while our decoder leverages a substantially larger style representation model trained on a much broader, author\-labeled corpus\.
As an additional robustness test, we repeat the evaluation using two alternative style representations not used in our method, LUARRivera\-Sotoet al\.\([2021](https://arxiv.org/html/2606.05716#bib.bib24)\)and StyleDistancePatelet al\.\([2025](https://arxiv.org/html/2606.05716#bib.bib50)\)\. The results, provided in Appendix[F\.2](https://arxiv.org/html/2606.05716#A6.SS2), show consistent trends across all three style representations, with our approach able to generate new documents that are closer in the vector space, regardless of which representation is used\. Together, these results demonstrating that the strong steering performance of our decoder is not specific to the style representation space used for training it\.
Figure 5:Our decoder outperforms baselines at generating style prompts that induce writing most similar to human\-written text\.
## 8Steering towards Human Style
Finally, we investigate whether the proposed decoder can steer LLM generation toward the writing style of human\-authored text\. This experiment evaluates generalization to natural, non\-synthetic language outside the decoder’s training distribution, which constitutes a more realistic and challenging test of practical style control\.
### 8\.1Experiment Setup
We are given a style vector𝐱\\mathbf\{x\}and the style prompts~\\tilde\{s\}decoded from𝐱\\mathbf\{x\}, as in Section[7](https://arxiv.org/html/2606.05716#S7)\. However, the target textxxis now human\-authored text rather than LLM\-generated text, thereby falling outside the distribution of the synthetic training data\. Therefore, the decoder operates on the input distribution unseen during training, and there is no ground\-truth stylessthat guarantees the generation in the style ofxx\. This setting makes the task more challenging, but generalization to it is essential for the decoder to work in a more practical style control\. We use the same metric and baseline as in Section[7](https://arxiv.org/html/2606.05716#S7)\.
### 8\.2Results
As shown in Figure[5](https://arxiv.org/html/2606.05716#S7.F5), generations guided by decoded style prompts achieve substantially smaller distances to human\-authored references than those produced by any baseline method\. Because no ground\-truth style prompts exist for human\-authored texts, strong performance in this setting provides a challenging test of the proposed framework\. The results indicate that the decoder can effectively interpret input style vectors and map them to appropriate style prompts even when the inputs lie outside the distribution of the synthetic training data\. Moreover, the diversity of the curated style features enables generalization across a broad spectrum of stylistic variation, extending to naturally occurring human writing\. Taken together, the results offer compelling evidence for the robustness and transferability of the decoding framework\.
We repeat our robustness tests by using two alternative style representations LUAR and StyleDistance, to measure whether the generated style\-steered document is similar to the reference human\-authored document’s style\. Results shown in Appendix[G](https://arxiv.org/html/2606.05716#A7)demonstrate a consistent trend with our approach performing best for matching the style\. Together, these results show that the style instructions from our decoder successfully steer style towards human texts\.
## 9Conclusion
We propose a novel framework for interpreting dense style representation vectors by decoding them into natural language descriptions of writing style\. To train a decoder in a supervised setting, our approach leverages synthetic data generated via LLM with human\-readable, compositional style prompts\. Comprehensive evaluations demonstrate that the decoded prompts faithfully reflect the stylistic information encoded in style representations and can effectively guide LLM outputs to match a desired style\. Our method consistently outperforms baselines across multiple metrics, including lexical and semantic match in style prompt recovery and style similarity in style control\.
This work bridges the gap between latent neural representations and interpretable, controllable style attributes, offering a path toward more transparent and steerable NLP systems\. Future directions include extending the method to zero\-shot or unsupervised settings, improving disentanglement between content and style, and applying the decoding framework to other latent attributes such as tone, persona, or rhetorical strategy\.
## Limitations
Our study focuses on the English language\. Since LLMs and style representation models that our framework builds on are less available or less performant in non\-English languages, our method may not generalize to such settings\. Moreover, writing styles in different languages can have different distributions and interactions\. We hope that our work inspires other researchers to generalize our framework to non\-English languages\.
Our dataset is restricted to the online question\-answering domain\. While our decoder demonstrates strong performance in describing writing style within this context, it remains unclear how well it generalizes to other domains such as narrative writing, formal prose, or technical documentation\. Evaluating the method across a broader range of domains is an important direction for future work\.
Additionally, our dataset is generated using three similarly\-sized LLMs\. Although we observe consistent style description quality across these models, this setup does not fully capture the diversity of existing LLMs in terms of scale, architecture, and training data\. To mitigate this, we test our decoder on human\-written text and observe promising results\. Nevertheless, caution is warranted when applying the method to text produced by models that differ significantly from those used during dataset construction, as performance may degrade in unseen generative regimes\.
## Ethical Considerations
This work presents a framework for analyzing the writing style of a given text and inferring a style prompt capable of eliciting similar stylistic characteristics from an LLM\. While this technique can support beneficial applications—such as improved interpretability, stylistic control, and enhanced transparency in LLM\-generated text—it also introduces potential risks\. For example, it could be misused to undermine author anonymity in sensitive contexts or to reverse\-engineer proprietary or private prompt formulations\.
We emphasize that our approach is designed for controlled settings with synthetic supervision and does not aim to de\-anonymize authors or recover confidential prompts\. Nevertheless, we encourage future work to explore safeguards and use restrictions to mitigate misuse, particularly in high\-stakes or privacy\-sensitive applications\.
## Acknowledgments
This research is supported in part by the Office of the Director of National Intelligence \(ODNI\), Intelligence Advanced Research Projects Activity \(IARPA\), via the HIATUS Program contract \#2022\-22072200006\. The views and conclusions contained herein are those of the authors and should not be interpreted as necessarily representing the official policies, either expressed or implied, of ODNI, IARPA, or the U\.S\. Government\. The U\.S\. Government is authorized to reproduce and distribute reprints for governmental purposes notwithstanding any copyright annotation therein\.
## References
- M\. Alshomary, N\. Ri, M\. Apidianaki, A\. Patel, S\. Muresan, and K\. McKeown \(2025\)Latent space interpretation for stylistic analysis and explainable authorship attribution\.InProceedings of the 31st International Conference on Computational Linguistics,O\. Rambow, L\. Wanner, M\. Apidianaki, H\. Al\-Khalifa, B\. D\. Eugenio, and S\. Schockaert \(Eds\.\),Abu Dhabi, UAE,pp\. 1124–1135\.External Links:[Link](https://aclanthology.org/2025.coling-main.75/)Cited by:[§1](https://arxiv.org/html/2606.05716#S1.p2.1),[§2](https://arxiv.org/html/2606.05716#S2.SS0.SSS0.Px2.p1.1)\.
- L\. Berglund, M\. Tong, M\. Kaufmann, M\. Balesni, A\. C\. Stickland, T\. Korbak, and O\. Evans \(2024\)The reversal curse: LLMs trained on “a is b” fail to learn “b is a”\.InThe Twelfth International Conference on Learning Representations,External Links:[Link](https://openreview.net/forum?id=GPKTIktA0k)Cited by:[§6\.2](https://arxiv.org/html/2606.05716#S6.SS2.p2.1)\.
- J\. Bevendorff, X\. B\. Casals, B\. Chulvi, D\. Dementieva, A\. Elnagar, D\. Freitag, M\. Fröbe, D\. Korenčić, M\. Mayerl, A\. Mukherjee,et al\.\(2024\)Overview of pan 2024: multi\-author writing style analysis, multilingual text detoxification, oppositional thinking analysis, and generative ai authorship verification\.InEuropean Conference on Information Retrieval,pp\. 3–10\.Cited by:[§2](https://arxiv.org/html/2606.05716#S2.SS0.SSS0.Px1.p1.1)\.
- A\. Bhandarkar, R\. Wilson, A\. Swarup, and D\. Woodard \(2024\)Emulating author style: a feasibility study of prompt\-enabled text stylization with off\-the\-shelf LLMs\.InProceedings of the 1st Workshop on Personalization of Generative AI Systems \(PERSONALIZE 2024\),A\. Deshpande, E\. Hwang, V\. Murahari, J\. S\. Park, D\. Yang, A\. Sabharwal, K\. Narasimhan, and A\. Kalyan \(Eds\.\),St\. Julians, Malta,pp\. 76–82\.External Links:[Link](https://aclanthology.org/2024.personalize-1.6/)Cited by:[§F\.1](https://arxiv.org/html/2606.05716#A6.SS1.SSS0.Px1),[§F\.1](https://arxiv.org/html/2606.05716#A6.SS1.p1.1),[§7\.1](https://arxiv.org/html/2606.05716#S7.SS1.SSS0.Px1.p1.1)\.
- D\. Biber and S\. Conrad \(2019\)Register, genre, and style\.Cambridge University Press\.Cited by:[§2](https://arxiv.org/html/2606.05716#S2.SS0.SSS0.Px2.p1.1)\.
- R\. Boghrati, J\. Berger, and G\. Packard \(2023\)Style, content, and the success of ideas\.Journal of Consumer Psychology33\(4\),pp\. 688–700\.External Links:[Document](https://dx.doi.org/https%3A//doi.org/10.1002/jcpy.1346),[Link](https://myscp.onlinelibrary.wiley.com/doi/abs/10.1002/jcpy.1346),https://myscp\.onlinelibrary\.wiley\.com/doi/pdf/10\.1002/jcpy\.1346Cited by:[§1](https://arxiv.org/html/2606.05716#S1.p1.1)\.
- S\. Fincke and E\. Boschee \(2024\)Separating style from substance: enhancing cross\-genre authorship attribution through data selection and presentation\.External Links:2408\.05192,[Link](https://arxiv.org/abs/2408.05192)Cited by:[§5](https://arxiv.org/html/2606.05716#S5.SS0.SSS0.Px1.p1.2)\.
- J\. Fisher, S\. Hallinan, X\. Lu, M\. L\. Gordon, Z\. Harchaoui, and Y\. Choi \(2024\)StyleRemix: interpretable authorship obfuscation via distillation and perturbation of style elements\.InProceedings of the 2024 Conference on Empirical Methods in Natural Language Processing,Y\. Al\-Onaizan, M\. Bansal, and Y\. Chen \(Eds\.\),Miami, Florida, USA,pp\. 4172–4206\.External Links:[Link](https://aclanthology.org/2024.emnlp-main.241/),[Document](https://dx.doi.org/10.18653/v1/2024.emnlp-main.241)Cited by:[§4](https://arxiv.org/html/2606.05716#S4.SS0.SSS0.Px1.p3.1)\.
- D\. Hendrycks and K\. Gimpel \(2023\)Gaussian error linear units \(gelus\)\.External Links:1606\.08415,[Link](https://arxiv.org/abs/1606.08415)Cited by:[§5](https://arxiv.org/html/2606.05716#S5.p3.1)\.
- Z\. Hong, I\. Shenfeld, T\. Wang, Y\. Chuang, A\. Pareja, J\. R\. Glass, A\. Srivastava, and P\. Agrawal \(2024\)Curiosity\-driven red\-teaming for large language models\.InThe Twelfth International Conference on Learning Representations,External Links:[Link](https://openreview.net/forum?id=4KqkizXgXU)Cited by:[§2](https://arxiv.org/html/2606.05716#S2.SS0.SSS0.Px5.p1.1)\.
- Z\. Horvitz, A\. Patel, C\. Callison\-Burch, Z\. Yu, and K\. McKeown \(2024a\)ParaGuide: guided diffusion paraphrasers for plug\-and\-play textual style transfer\.InProceedings of the Thirty\-Eighth AAAI Conference on Artificial Intelligence and Thirty\-Sixth Conference on Innovative Applications of Artificial Intelligence and Fourteenth Symposium on Educational Advances in Artificial Intelligence,AAAI’24/IAAI’24/EAAI’24\.External Links:ISBN 978\-1\-57735\-887\-9,[Link](https://doi.org/10.1609/aaai.v38i16.29780),[Document](https://dx.doi.org/10.1609/aaai.v38i16.29780)Cited by:[§1](https://arxiv.org/html/2606.05716#S1.p1.1),[§2](https://arxiv.org/html/2606.05716#S2.SS0.SSS0.Px4.p1.1)\.
- Z\. Horvitz, A\. Patel, K\. Singh, C\. Callison\-Burch, K\. McKeown, and Z\. Yu \(2024b\)TinyStyler: efficient few\-shot text style transfer with authorship embeddings\.InFindings of the Association for Computational Linguistics: EMNLP 2024,Y\. Al\-Onaizan, M\. Bansal, and Y\. Chen \(Eds\.\),Miami, Florida, USA,pp\. 13376–13390\.External Links:[Link](https://aclanthology.org/2024.findings-emnlp.781),[Document](https://dx.doi.org/10.18653/v1/2024.findings-emnlp.781)Cited by:[§1](https://arxiv.org/html/2606.05716#S1.p1.1),[§2](https://arxiv.org/html/2606.05716#S2.SS0.SSS0.Px4.p1.1),[§7\.1](https://arxiv.org/html/2606.05716#S7.SS1.SSS0.Px1.p1.1)\.
- S\. Hu, Y\. Tu, X\. Han, G\. Cui, C\. He, W\. Zhao, X\. Long, Z\. Zheng, Y\. Fang, Y\. Huang, X\. Zhang, Z\. L\. Thai, C\. Wang, Y\. Yao, C\. Zhao, J\. Zhou, J\. Cai, Z\. Zhai, N\. Ding, C\. Jia, G\. Zeng, dahai li, Z\. Liu, and M\. Sun \(2024a\)MiniCPM: unveiling the potential of small language models with scalable training strategies\.InFirst Conference on Language Modeling,External Links:[Link](https://openreview.net/forum?id=3X2L2TFr0f)Cited by:[§D\.2](https://arxiv.org/html/2606.05716#A4.SS2.p1.1)\.
- Y\. Hu, Z\. Hu, C\. W\. Seah, and R\. K\. Lee \(2024b\)InstructAV: instruction fine\-tuning large language models for authorship verification\.InFirst Conference on Language Modeling,External Links:[Link](https://openreview.net/forum?id=0UK8c2kg7c)Cited by:[§2](https://arxiv.org/html/2606.05716#S2.SS0.SSS0.Px3.p1.1)\.
- Z\. Hu, R\. K\. Lee, C\. C\. Aggarwal, and A\. Zhang \(2022\)Text style transfer: a review and experimental evaluation\.SIGKDD Explor\. Newsl\.24\(1\),pp\. 14–45\.External Links:ISSN 1931\-0145,[Link](https://doi.org/10.1145/3544903.3544906),[Document](https://dx.doi.org/10.1145/3544903.3544906)Cited by:[§2](https://arxiv.org/html/2606.05716#S2.SS0.SSS0.Px3.p1.1)\.
- Z\. Hu, Z\. Yang, X\. Liang, R\. Salakhutdinov, and E\. P\. Xing \(2017\)Toward controlled generation of text\.InProceedings of the 34th International Conference on Machine Learning,D\. Precup and Y\. W\. Teh \(Eds\.\),Proceedings of Machine Learning Research, Vol\.70,pp\. 1587–1596\.External Links:[Link](https://proceedings.mlr.press/v70/hu17e.html)Cited by:[§2](https://arxiv.org/html/2606.05716#S2.SS0.SSS0.Px4.p1.1)\.
- B\. Huang, C\. Chen, and K\. Shu \(2024\)Can large language models identify authorship?\.InFindings of the Association for Computational Linguistics: EMNLP 2024,Y\. Al\-Onaizan, M\. Bansal, and Y\. Chen \(Eds\.\),Miami, Florida, USA,pp\. 445–460\.External Links:[Link](https://aclanthology.org/2024.findings-emnlp.26/),[Document](https://dx.doi.org/10.18653/v1/2024.findings-emnlp.26)Cited by:[§2](https://arxiv.org/html/2606.05716#S2.SS0.SSS0.Px3.p1.1)\.
- C\. Hung, Z\. Hu, Y\. Hu, and R\. Lee \(2023\)Who wrote it and why? prompting large\-language models for authorship verification\.InFindings of the Association for Computational Linguistics: EMNLP 2023,H\. Bouamor, J\. Pino, and K\. Bali \(Eds\.\),Singapore,pp\. 14078–14084\.External Links:[Link](https://aclanthology.org/2023.findings-emnlp.937/),[Document](https://dx.doi.org/10.18653/v1/2023.findings-emnlp.937)Cited by:[§2](https://arxiv.org/html/2606.05716#S2.SS0.SSS0.Px3.p1.1)\.
- A\. Jangra, B\. Sarrafzadeh, S\. Cucerzan, A\. de Wynter, and S\. K\. Jauhar \(2025\)Evaluating style\-personalized text generation: challenges and directions\.External Links:2508\.06374,[Link](https://arxiv.org/abs/2508.06374)Cited by:[§F\.1](https://arxiv.org/html/2606.05716#A6.SS1.SSS0.Px3),[§F\.1](https://arxiv.org/html/2606.05716#A6.SS1.p1.1),[§7\.1](https://arxiv.org/html/2606.05716#S7.SS1.SSS0.Px1.p1.1)\.
- D\. Jin, Z\. Jin, Z\. Hu, O\. Vechtomova, and R\. Mihalcea \(2022\)Deep learning for text style transfer: a survey\.Computational Linguistics48\(1\),pp\. 155–205\.External Links:[Link](https://aclanthology.org/2022.cl-1.6/),[Document](https://dx.doi.org/10.1162/coli%5Fa%5F00426)Cited by:[§2](https://arxiv.org/html/2606.05716#S2.SS0.SSS0.Px3.p1.1)\.
- J\. Kelly, J\. Knight, L\. A\. Peck, and G\. Reel \(2003\)Straight/narrative? writing style changes readers’ perceptions of story quality\.Newspaper Research Journal24\(4\),pp\. 118–122\.External Links:[Document](https://dx.doi.org/10.1177/073953290302400412),[Link](https://doi.org/10.1177/073953290302400412),https://doi\.org/10\.1177/073953290302400412Cited by:[§1](https://arxiv.org/html/2606.05716#S1.p1.1)\.
- K\. Krishna, J\. Wieting, and M\. Iyyer \(2020\)Reformulating unsupervised style transfer as paraphrase generation\.InProceedings of the 2020 Conference on Empirical Methods in Natural Language Processing \(EMNLP\),B\. Webber, T\. Cohn, Y\. He, and Y\. Liu \(Eds\.\),Online,pp\. 737–762\.External Links:[Link](https://aclanthology.org/2020.emnlp-main.55/),[Document](https://dx.doi.org/10.18653/v1/2020.emnlp-main.55)Cited by:[§2](https://arxiv.org/html/2606.05716#S2.SS0.SSS0.Px1.p1.1)\.
- B\. Lester, R\. Al\-Rfou, and N\. Constant \(2021\)The power of scale for parameter\-efficient prompt tuning\.InProceedings of the 2021 Conference on Empirical Methods in Natural Language Processing,M\. Moens, X\. Huang, L\. Specia, and S\. W\. Yih \(Eds\.\),Online and Punta Cana, Dominican Republic,pp\. 3045–3059\.External Links:[Link](https://aclanthology.org/2021.emnlp-main.243/),[Document](https://dx.doi.org/10.18653/v1/2021.emnlp-main.243)Cited by:[§5](https://arxiv.org/html/2606.05716#S5.p2.1)\.
- X\. L\. Li, N\. Chowdhury, D\. D\. Johnson, T\. Hashimoto, P\. Liang, S\. Schwettmann, and J\. Steinhardt \(2025\)Eliciting language model behaviors with investigator agents\.InForty\-second International Conference on Machine Learning,External Links:[Link](https://openreview.net/forum?id=AulTigiaMv)Cited by:[§2](https://arxiv.org/html/2606.05716#S2.SS0.SSS0.Px5.p1.1)\.
- X\. L\. Li and P\. Liang \(2021\)Prefix\-tuning: optimizing continuous prompts for generation\.InProceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing \(Volume 1: Long Papers\),C\. Zong, F\. Xia, W\. Li, and R\. Navigli \(Eds\.\),Online,pp\. 4582–4597\.External Links:[Link](https://aclanthology.org/2021.acl-long.353/),[Document](https://dx.doi.org/10.18653/v1/2021.acl-long.353)Cited by:[§5](https://arxiv.org/html/2606.05716#S5.p2.1)\.
- X\. Liu, N\. Xu, M\. Chen, and C\. Xiao \(2024\)AutoDAN: generating stealthy jailbreak prompts on aligned large language models\.InThe Twelfth International Conference on Learning Representations,External Links:[Link](https://openreview.net/forum?id=7Jwpw4qKkb)Cited by:[§2](https://arxiv.org/html/2606.05716#S2.SS0.SSS0.Px5.p1.1)\.
- I\. Loshchilov and F\. Hutter \(2019\)Decoupled weight decay regularization\.InInternational Conference on Learning Representations,External Links:[Link](https://openreview.net/forum?id=Bkg6RiCqY7)Cited by:[§D\.2](https://arxiv.org/html/2606.05716#A4.SS2.p1.1)\.
- L\. Mou and O\. Vechtomova \(2020\)Stylized text generation: approaches and applications\.InProceedings of the 58th Annual Meeting of the Association for Computational Linguistics: Tutorial Abstracts,A\. Savary and Y\. Zhang \(Eds\.\),Online,pp\. 19–22\.External Links:[Link](https://aclanthology.org/2020.acl-tutorials.5/),[Document](https://dx.doi.org/10.18653/v1/2020.acl-tutorials.5)Cited by:[§1](https://arxiv.org/html/2606.05716#S1.p3.1)\.
- S\. Mukherjee, M\. Lango, Z\. Kasner, and O\. Dušek \(2024\)A survey of text style transfer: applications and ethical implications\.External Links:2407\.16737,[Link](https://arxiv.org/abs/2407.16737)Cited by:[§2](https://arxiv.org/html/2606.05716#S2.SS0.SSS0.Px3.p1.1)\.
- A\. Neelakanteswara, S\. Chaudhari, and H\. Zamani \(2024\)RAGs to style: personalizing LLMs with style embeddings\.InProceedings of the 1st Workshop on Personalization of Generative AI Systems \(PERSONALIZE 2024\),A\. Deshpande, E\. Hwang, V\. Murahari, J\. S\. Park, D\. Yang, A\. Sabharwal, K\. Narasimhan, and A\. Kalyan \(Eds\.\),St\. Julians, Malta,pp\. 119–123\.External Links:[Link](https://aclanthology.org/2024.personalize-1.11/)Cited by:[§1](https://arxiv.org/html/2606.05716#S1.p1.1)\.
- A\. Patel, N\. Andrews, and C\. Callison\-Burch \(2024\)Low\-resource authorship style transfer: can non\-famous authors be imitated?\.External Links:2212\.08986,[Link](https://arxiv.org/abs/2212.08986)Cited by:[§E\.1](https://arxiv.org/html/2606.05716#A5.SS1.p1.1),[§2](https://arxiv.org/html/2606.05716#S2.SS0.SSS0.Px3.p1.1),[§6\.1](https://arxiv.org/html/2606.05716#S6.SS1.SSS0.Px1.p1.1)\.
- A\. Patel, D\. Rao, A\. Kothary, K\. McKeown, and C\. Callison\-Burch \(2023\)Learning interpretable style embeddings via prompting LLMs\.InFindings of the Association for Computational Linguistics: EMNLP 2023,H\. Bouamor, J\. Pino, and K\. Bali \(Eds\.\),Singapore,pp\. 15270–15290\.External Links:[Link](https://aclanthology.org/2023.findings-emnlp.1020/),[Document](https://dx.doi.org/10.18653/v1/2023.findings-emnlp.1020)Cited by:[§1](https://arxiv.org/html/2606.05716#S1.p2.1),[§2](https://arxiv.org/html/2606.05716#S2.SS0.SSS0.Px2.p1.1)\.
- A\. Patel, J\. Zhu, J\. Qiu, Z\. Horvitz, M\. Apidianaki, K\. McKeown, and C\. Callison\-Burch \(2025\)StyleDistance: stronger content\-independent style embeddings with synthetic parallel examples\.InProceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies \(Volume 1: Long Papers\),L\. Chiruzzo, A\. Ritter, and L\. Wang \(Eds\.\),Albuquerque, New Mexico,pp\. 8662–8685\.External Links:[Link](https://aclanthology.org/2025.naacl-long.436/),[Document](https://dx.doi.org/10.18653/v1/2025.naacl-long.436),ISBN 979\-8\-89176\-189\-6Cited by:[§F\.2](https://arxiv.org/html/2606.05716#A6.SS2.p1.1),[Appendix G](https://arxiv.org/html/2606.05716#A7.p1.1),[§1](https://arxiv.org/html/2606.05716#S1.p1.1),[§2](https://arxiv.org/html/2606.05716#S2.SS0.SSS0.Px1.p1.1),[§4](https://arxiv.org/html/2606.05716#S4.SS0.SSS0.Px1.p3.1),[§7\.2](https://arxiv.org/html/2606.05716#S7.SS2.p3.1)\.
- E\. Perez, S\. Huang, F\. Song, T\. Cai, R\. Ring, J\. Aslanides, A\. Glaese, N\. McAleese, and G\. Irving \(2022\)Red teaming language models with language models\.InProceedings of the 2022 Conference on Empirical Methods in Natural Language Processing,Y\. Goldberg, Z\. Kozareva, and Y\. Zhang \(Eds\.\),Abu Dhabi, United Arab Emirates,pp\. 3419–3448\.External Links:[Link](https://aclanthology.org/2022.emnlp-main.225/),[Document](https://dx.doi.org/10.18653/v1/2022.emnlp-main.225)Cited by:[§2](https://arxiv.org/html/2606.05716#S2.SS0.SSS0.Px5.p1.1)\.
- S\. Prabhumoye, Y\. Tsvetkov, R\. Salakhutdinov, and A\. W\. Black \(2018\)Style transfer through back\-translation\.InProceedings of the 56th Annual Meeting of the Association for Computational Linguistics \(Volume 1: Long Papers\),I\. Gurevych and Y\. Miyao \(Eds\.\),Melbourne, Australia,pp\. 866–876\.External Links:[Link](https://aclanthology.org/P18-1080/),[Document](https://dx.doi.org/10.18653/v1/P18-1080)Cited by:[§2](https://arxiv.org/html/2606.05716#S2.SS0.SSS0.Px4.p1.1)\.
- S\. Ramnath, K\. Pandey, E\. Boschee, and X\. Ren \(2025\)CAVE: controllable authorship verification explanations\.InProceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies \(Volume 1: Long Papers\),L\. Chiruzzo, A\. Ritter, and L\. Wang \(Eds\.\),Albuquerque, New Mexico,pp\. 8939–8961\.External Links:[Link](https://aclanthology.org/2025.naacl-long.451/),[Document](https://dx.doi.org/10.18653/v1/2025.naacl-long.451),ISBN 979\-8\-89176\-189\-6Cited by:[§1](https://arxiv.org/html/2606.05716#S1.p2.1),[§2](https://arxiv.org/html/2606.05716#S2.SS0.SSS0.Px3.p1.1),[§4](https://arxiv.org/html/2606.05716#S4.SS0.SSS0.Px1.p3.1)\.
- F\. Rangel and P\. Rosso \(2019\)Overview of the 2019 author profiling task at pan\.InCLEF \(Working Notes\),Cited by:[§2](https://arxiv.org/html/2606.05716#S2.SS0.SSS0.Px1.p1.1)\.
- E\. Reif, D\. Ippolito, A\. Yuan, A\. Coenen, C\. Callison\-Burch, and J\. Wei \(2022\)A recipe for arbitrary text style transfer with large language models\.InProceedings of the 60th Annual Meeting of the Association for Computational Linguistics \(Volume 2: Short Papers\),S\. Muresan, P\. Nakov, and A\. Villavicencio \(Eds\.\),Dublin, Ireland,pp\. 837–848\.External Links:[Link](https://aclanthology.org/2022.acl-short.94/),[Document](https://dx.doi.org/10.18653/v1/2022.acl-short.94)Cited by:[§1](https://arxiv.org/html/2606.05716#S1.p3.1)\.
- R\. A\. Rivera\-Soto, O\. E\. Miano, J\. Ordonez, B\. Y\. Chen, A\. Khan, M\. Bishop, and N\. Andrews \(2021\)Learning universal authorship representations\.InProceedings of the 2021 Conference on Empirical Methods in Natural Language Processing,M\. Moens, X\. Huang, L\. Specia, and S\. W\. Yih \(Eds\.\),Online and Punta Cana, Dominican Republic,pp\. 913–919\.External Links:[Link](https://aclanthology.org/2021.emnlp-main.70),[Document](https://dx.doi.org/10.18653/v1/2021.emnlp-main.70)Cited by:[§F\.2](https://arxiv.org/html/2606.05716#A6.SS2.p1.1),[Appendix G](https://arxiv.org/html/2606.05716#A7.p1.1),[§1](https://arxiv.org/html/2606.05716#S1.p1.1),[§2](https://arxiv.org/html/2606.05716#S2.SS0.SSS0.Px1.p1.1),[§7\.2](https://arxiv.org/html/2606.05716#S7.SS2.p3.1)\.
- T\. Shen, T\. Lei, R\. Barzilay, and T\. Jaakkola \(2017\)Style transfer from non\-parallel text by cross\-alignment\.InAdvances in Neural Information Processing Systems,I\. Guyon, U\. V\. Luxburg, S\. Bengio, H\. Wallach, R\. Fergus, S\. Vishwanathan, and R\. Garnett \(Eds\.\),Vol\.30,pp\.\.External Links:[Link](https://proceedings.neurips.cc/paper_files/paper/2017/file/2d2c8394e31101a261abf1784302bf75-Paper.pdf)Cited by:[§2](https://arxiv.org/html/2606.05716#S2.SS0.SSS0.Px4.p1.1)\.
- T\. Shen, J\. Mueller, D\. Barzilay, and T\. Jaakkola \(2020\)Educating text autoencoders: latent representation guidance via denoising\.InProceedings of the 37th International Conference on Machine Learning,H\. D\. III and A\. Singh \(Eds\.\),Proceedings of Machine Learning Research, Vol\.119,pp\. 8719–8729\.External Links:[Link](https://proceedings.mlr.press/v119/shen20c.html)Cited by:[§2](https://arxiv.org/html/2606.05716#S2.SS0.SSS0.Px4.p1.1)\.
- R\. A\. R\. Soto, K\. Koch, A\. Khan, B\. Y\. Chen, M\. Bishop, and N\. Andrews \(2024\)Few\-shot detection of machine\-generated text using style representations\.InThe Twelfth International Conference on Learning Representations,External Links:[Link](https://openreview.net/forum?id=cWiEN1plhJ)Cited by:[§1](https://arxiv.org/html/2606.05716#S1.p1.1)\.
- E\. Stamatatos, M\. Kestemont, K\. Kredens, P\. Pezik, A\. Heini, J\. Bevendorff, B\. Stein, and M\. Potthast \(2022\)Overview of the authorship verification task at pan 2022\.InCEUR workshop proceedings,Vol\.3180,pp\. 2301–2313\.Cited by:[§2](https://arxiv.org/html/2606.05716#S2.SS0.SSS0.Px1.p1.1)\.
- E\. Stamatatos \(2016\)Authorship verification: a review of recent advances\.\.Research in Computing Science123\(1\),pp\. 9–25\.Cited by:[§2](https://arxiv.org/html/2606.05716#S2.SS0.SSS0.Px3.p1.1)\.
- P\. Tiersma and L\. M\. Solan \(2002\)The linguist on the witness stand: forensic linguistics in american courts\.Language78\(2\),pp\. 221–239\.External Links:ISSN 00978507, 15350665,[Link](http://www.jstor.org/stable/3086556)Cited by:[§2](https://arxiv.org/html/2606.05716#S2.SS0.SSS0.Px2.p1.1)\.
- M\. Tsimpoukelli, J\. L\. Menick, S\. Cabi, S\. M\. A\. Eslami, O\. Vinyals, and F\. Hill \(2021\)Multimodal few\-shot learning with frozen language models\.InAdvances in Neural Information Processing Systems,M\. Ranzato, A\. Beygelzimer, Y\. Dauphin, P\.S\. Liang, and J\. W\. Vaughan \(Eds\.\),Vol\.34,pp\. 200–212\.External Links:[Link](https://proceedings.neurips.cc/paper_files/paper/2021/file/01b7575c38dac42f3cfb7d500438b875-Paper.pdf)Cited by:[§5](https://arxiv.org/html/2606.05716#S5.p2.1)\.
- J\. Tyo, B\. Dhingra, and Z\. C\. Lipton \(2022\)On the state of the art in authorship attribution and authorship verification\.arXiv preprint arXiv:2209\.06869\.Cited by:[§2](https://arxiv.org/html/2606.05716#S2.SS0.SSS0.Px3.p1.1)\.
- A\. Wang, C\. Aggazzotti, R\. Kotula, R\. R\. Soto, M\. Bishop, and N\. Andrews \(2023\)Can authorship representation learning capture stylistic features?\.Transactions of the Association for Computational Linguistics11,pp\. 1416–1431\.External Links:ISSN 2307\-387X,[Document](https://dx.doi.org/10.1162/tacl%5Fa%5F00610),[Link](https://doi.org/10.1162/tacl%5C_a%5C_00610),https://direct\.mit\.edu/tacl/article\-pdf/doi/10\.1162/tacl\_a\_00610/2184071/tacl\_a\_00610\.pdfCited by:[§2](https://arxiv.org/html/2606.05716#S2.SS0.SSS0.Px1.p1.1)\.
- Z\. Wang, N\. I\. Tripto, S\. Park, Z\. Li, and J\. Zhou \(2025\)Catch me if you can? not yet: LLMs still struggle to imitate the implicit writing styles of everyday authors\.InFindings of the Association for Computational Linguistics: EMNLP 2025,C\. Christodoulopoulos, T\. Chakraborty, C\. Rose, and V\. Peng \(Eds\.\),Suzhou, China,pp\. 10040–10055\.External Links:[Link](https://aclanthology.org/2025.findings-emnlp.532/),[Document](https://dx.doi.org/10.18653/v1/2025.findings-emnlp.532),ISBN 979\-8\-89176\-335\-7Cited by:[§F\.1](https://arxiv.org/html/2606.05716#A6.SS1.SSS0.Px2),[§F\.1](https://arxiv.org/html/2606.05716#A6.SS1.p1.1),[§7\.1](https://arxiv.org/html/2606.05716#S7.SS1.SSS0.Px1.p1.1)\.
- A\. Wegmann, M\. Schraagen, and D\. Nguyen \(2022\)Same author or just same topic? towards content\-independent style representations\.InProceedings of the 7th Workshop on Representation Learning for NLP,S\. Gella, H\. He, B\. P\. Majumder, B\. Can, E\. Giunchiglia, S\. Cahyawijaya, S\. Min, M\. Mozes, X\. L\. Li, I\. Augenstein, A\. Rogers, K\. Cho, E\. Grefenstette, L\. Rimell, and C\. Dyer \(Eds\.\),Dublin, Ireland,pp\. 249–268\.External Links:[Link](https://aclanthology.org/2022.repl4nlp-1.26),[Document](https://dx.doi.org/10.18653/v1/2022.repl4nlp-1.26)Cited by:[§1](https://arxiv.org/html/2606.05716#S1.p1.1),[§2](https://arxiv.org/html/2606.05716#S2.SS0.SSS0.Px1.p1.1),[§7\.2](https://arxiv.org/html/2606.05716#S7.SS2.p2.1)\.
- P\. Xu, J\. C\. K\. Cheung, and Y\. Cao \(2020\)On variational learning of controllable representations for text without supervision\.InProceedings of the 37th International Conference on Machine Learning,H\. D\. III and A\. Singh \(Eds\.\),Proceedings of Machine Learning Research, Vol\.119,pp\. 10534–10543\.External Links:[Link](https://proceedings.mlr.press/v119/xu20a.html)Cited by:[§2](https://arxiv.org/html/2606.05716#S2.SS0.SSS0.Px4.p1.1)\.
- X\. Yang and M\. Carpuat \(2025\)Steering large language models with register analysis for arbitrary style transfer\.External Links:2505\.00679,[Link](https://arxiv.org/abs/2505.00679)Cited by:[§E\.1](https://arxiv.org/html/2606.05716#A5.SS1.p1.1),[§2](https://arxiv.org/html/2606.05716#S2.SS0.SSS0.Px3.p1.1),[§6\.1](https://arxiv.org/html/2606.05716#S6.SS1.SSS0.Px1.p1.1)\.
- J\. Zhu and D\. Jurgens \(2021\)Idiosyncratic but not arbitrary: learning idiolects in online registers reveals distinctive yet consistent individual styles\.InProceedings of the 2021 Conference on Empirical Methods in Natural Language Processing,M\. Moens, X\. Huang, L\. Specia, and S\. W\. Yih \(Eds\.\),Online and Punta Cana, Dominican Republic,pp\. 279–297\.External Links:[Link](https://aclanthology.org/2021.emnlp-main.25),[Document](https://dx.doi.org/10.18653/v1/2021.emnlp-main.25)Cited by:[§2](https://arxiv.org/html/2606.05716#S2.SS0.SSS0.Px1.p1.1)\.
## Appendix AArtifact for Reproducibility
Table[A3](https://arxiv.org/html/2606.05716#A7.T3)shows the models and their size used in our experiments\.
## Appendix BSupplementary Material for Section[3](https://arxiv.org/html/2606.05716#S3)
Figure[A1](https://arxiv.org/html/2606.05716#A2.F1)illustrates the core problem being addressed\. Instead of the prompt search formulation in Figure[1\(a\)](https://arxiv.org/html/2606.05716#A2.F1.sf1)which is infeasible, we reformulate the task as the supervised prompt recovery \(Figure[1\(b\)](https://arxiv.org/html/2606.05716#A2.F1.sf2)\)\.
\(a\)Prompt Search Formulation in Equation[1](https://arxiv.org/html/2606.05716#S3.E1)\.
\(b\)Prompt Recovery Formulation in Equation[2](https://arxiv.org/html/2606.05716#S3.E2)\.
Figure A1:Our goal is to train a decoder𝙳\\mathtt\{D\}that recovers a style promptsscapable of steering an LLM to generate textyyin the style ofxx\(Figure[1\(a\)](https://arxiv.org/html/2606.05716#A2.F1.sf1)\)\. Because searching the vast prompt space is intractable, we reformulate the task as supervised prompt recovery: Given a style\-response pair\(s,x\)\(s,x\), we match the decoded prompts~=𝙳\(𝐱\)\\tilde\{s\}=\\mathtt\{D\}\(\\mathbf\{x\}\)toss\(Figure[1\(b\)](https://arxiv.org/html/2606.05716#A2.F1.sf2)\)\.
## Appendix CSupplementary Material for Section[4](https://arxiv.org/html/2606.05716#S4)
This section provides additional details on the dataset construction process\.
### C\.1Style Generation
We describe here the procedure used to obtain the style features\. First, we employ theCategory expansion promptbelow to obtain the style categories\. Table[A4](https://arxiv.org/html/2606.05716#A7.T4)lists the stylistic categories in our dataset\. Next, we apply theFeature list promptbelow, in which\[Style Category\]is replaced with the corresponding category name to produce fine\-grained style features\.
#### Category expansion prompt\.
Extend the list of categories for writing styles\. Each category should be distinct and orthogonal from the others\. Provide only new categories\.
‘‘Writing Goal and Intent’’, ‘‘Tone and Mood’’, ‘‘Grade Level’’, ‘‘Sentence Structure and Syntax’’, ‘‘Function Word Usage’’, ‘‘Word and Expression Choice’’, ‘‘Punctuation Usage’’, ‘‘Special Character and Capitalization Usage’’, ‘‘Acronym and Abbreviation Usage’’, ‘‘Formality’’, ‘‘Social and Interpersonal’’, ‘‘Emotional Intensity’’, ‘‘Descriptive Density’’
#### Feature list prompt\.
Your task is to generate a list of 40 writing styles in the \[Style Category\] category\. Answer in JSON format, where each item is a style\.
Detailed Instructions: Each style in the list should be \(1\) distinct and orthogonal to the others\. Similar styles with subtle nuance differences are considered distinct and, therefore, allowed\. \(2\) independent of the content of the text\. \(3\) concrete so that LLMs can implement it when instructed to\. \(4\) minimal so that it cannot be decomposed into multiple styles\. \(5\) a short sentence \(3\-5 words\) in the imperative form that instructs writers to implement the style\.
### C\.2Question Curation
To ensure diversity across content domains, we sample questions stratified by topic\. For each platform, we select the 10 topics with the largest number of available questions\. Table[A5](https://arxiv.org/html/2606.05716#A7.T5)presents the complete list of topics per platform\.
The number of questions sampled per topic is determined proportionally to the total volume of questions available within that platform\. Within each topic, both questions and corresponding human answers are sampled uniformly at random\. On average, human answers in the test set contain 493\.17 characters and 79\.80 words\.
### C\.3LLM Generation
Stylized responses are generated using theStylized text generation promptbelow, where\[max\_words\],\[title\],\[body\], and\[style\]are replaced with the maximum word count, question title, question body, and style prompt, respectively\.
To control generation prompt length, we truncate question titles to 80 tokens and question bodies to 200 tokens\. We format all inputs using the chat template specific to the LLM used for generation\. For decoding, we set the temperature to 1\.0, nucleus sampling threshold \(pp\) to 0\.95, and maximum token count to 200\. The maximum word count in the prompt is matched with this token count\.
#### Stylized text generation prompt\.
\{ "system": "Your task is to answer a question following the provided style instructions\. Your response should clearly show the instructed writing style, but should not directly mention any information about the style instruction\. Do not explain your answer\. Your answer should be within \[max\_words\] words\.", "user": "Answer the following question while adhering to the provided writing style instructions\.\\n\\n\# Question:\\n\[title\]\[body\]\\n\\n\# Style Instructions:\\n\[style\]" \}
## Appendix DSupplementary Material for Section[5](https://arxiv.org/html/2606.05716#S5)
This section provides additional details on our proposed method\.
### D\.1Prompt for Decoding
We employ theStyle decoding promptbelow, where\[max\_words\]is replaced with the maximum word count of 100\. We concatenateprefix,projected style representation, andsuffix, which is then provided into the frozen LLM as an input\.
#### Style decoding prompt\.
\{ "prefix": "Your task is to describe the writing style from the style embedding vectors, in a single sentence, following this structure: ‘The author \[verb\] \[specific technique/trait\], \.\.\.’ You can include 1 to 10 techniques/traits, but should not repeat\. Do not explain your answer\. Your answer should be within \[max\_words\] words\.\\n\\n\#\#\# Style Embedding Vectors: \\n", "suffix": "\\n\\n\# Description: \\n" \}
### D\.2Training Details
All models are implemented with PyTorch\-Lightning and the Huggingface Transformer library\. We use the AdamW optimizerLoshchilov and Hutter \([2019](https://arxiv.org/html/2606.05716#bib.bib55)\)and the WSD learning rate scheduleHuet al\.\([2024a](https://arxiv.org/html/2606.05716#bib.bib56)\)\. Our model training takes 16 hours using 2 A100 GPUs\.
## Appendix ESupplementary Material for Section[6](https://arxiv.org/html/2606.05716#S6)
We provide prompts that our LLM baselines and LLM\-as\-judge evaluations use\.
### E\.1LLM baseline prompts
We show three prompts used for three LLM\-based description baselines: STYLLPatelet al\.\([2024](https://arxiv.org/html/2606.05716#bib.bib42)\), RGYang and Carpuat \([2025](https://arxiv.org/html/2606.05716#bib.bib43)\), and the custom prompt we designed\. For RGYang and Carpuat \([2025](https://arxiv.org/html/2606.05716#bib.bib43)\), style descriptions were generated in two stages; thus, we present two prompts\. We replace\[max\_words\]and\[text\]with the maximum word count and target text, respectively\. The maximum word count is set as 100, matching that of our decoder\.
#### STYLL\.
\{ "system": "You are a forensic linguist who knows how to analyze linguistic and stylometric similarites between texts\.", "user": "List some adjectives, comma\-separated, that describe the writing style of the author of this passage\. Strictly output only the style descriptors without any other content\.\\n\\n\# Passage: \\n\[text\]" \}
#### RG\.
\{ "system": "You are a forensic linguist who knows how to analyze linguistic and stylometric similarites between texts\.", "user": "Analyze the authorship style of this passage in terms of dimensions of register variation according to Douglas Biber\.\\n\\n\# Passage: \\n\[text\]" \}
\{ "system": "You are a forensic linguist who knows how to analyze linguistic and stylometric similarites between texts\.", "user": "List some adjectives, comma\-separated, that describe the writing style of the author of the target text\. Strictly output only the style descriptors without any other content\.\\n\\n\# Style analysis: \\n\[text\]" \}
#### Custom Prompt\.
\{ "system": "Your task is to analyze the writing style of the text and describe the style in a single sentence, following this structure: ‘The author \[verb\] \[specific technique/trait\], \.\.\.’ You can include 1 to 10 techniques/traits, but should not repeat\. Do not explain your answer\. Your answer should be within \[max\_words\] words\.", "user": "Describe the writing style of the text\.\\n\\n\# Text: \\n\[text\]" \}
### E\.2LLM\-as\-judge evaluation prompt
We instruct the evaluating LLM with theLLM\-as\-judge score evaluation promptbelow to compute the LLM\-as\-judge score\. We replace\[pred\]and\[ref\]with the predicted and reference styles, respectively\.
#### LLM\-as\-judge score evaluation prompt\.
You are an expert judge for text similarity\. Given the following two passages, rate their semantic similarity on a scale from 0 \(completely unrelated\) to 10 \(nearly identical in meaning\)\. Do not explain your answer\. Provide the numeric score only\.
Text A: ’\[pred\]’ Text B: ’\[ref\]’
Similarity score \(0–10\):
## Appendix FSupplementary Material for Section[7](https://arxiv.org/html/2606.05716#S7)
We provide prompts that our baselines use and additional evaluation using different style representation models\.
### F\.1LLM baseline prompts
We show four prompts used for four LLM\-based style imitation baselines:Bhandarkaret al\.\([2024](https://arxiv.org/html/2606.05716#bib.bib44)\),Wanget al\.\([2025](https://arxiv.org/html/2606.05716#bib.bib45)\),Jangraet al\.\([2025](https://arxiv.org/html/2606.05716#bib.bib46)\), and the custom prompt we designed\. We replace\[max\_words\],\[title\],\[body\], and\[reference\]with the maximum word count, question title, question body, and the target text, respectively\. The maximum word count is set as 200, matching that of stylized text generation\.
#### Bhandarkaret al\.\([2024](https://arxiv.org/html/2606.05716#bib.bib44)\)\.
\{ "system": "You are an emulator designed to replicate the writing style of a human author\.", "user": "\# Task\\nYour task is to answer the following question while seamlessly integrating with the provided human\-authored snippet\. Strive to make the answer stylistically indistinguishable from the human\-authored text\.\\n\\n\# Instrucions\\nThe goal of this task is to mimic the author’s writing style while paying meticulous attention to lexical richness and diversity, sentence structure, punctuation style, special character style, expressions and idioms, overall tone, emotion and mood, or any other relevant aspect of writing style established by the author\. Your answer should be within \[max\_words\] words\.\\n\\n\# Output Indicator\\nAs output, exclusively return the text completion without any accompanying explanations or comments\.\\n\\n\# Question:\\n\[title\]\[body\]\\n\\n\# Human\-authored Text:\\n\[reference\]" \}
#### Wanget al\.\([2025](https://arxiv.org/html/2606.05716#bib.bib45)\)\.
\{ "system": "You will be given one or more writing samples from a specific author\. Your task is to analyze the author’s style, tone, and voice, then craft an answer that closely mimics their writing based on a provided summary\. Your writing should be around \[max\_words\] words\.", "user": "\#\#\# Author’s Writing Sample\\n\\n\[reference\]\\n\\n\#\#\# Question:\\n\[title\]\[body\]\\n\\n\#\#\# Instructions\\n\\n\- Ensure your writing faithfully replicates the author’s style, including tone, word choices, and sentence structure, etc\. \- Maintain consistency with the author’s voice while accurately reflecting the details of the given summary\. \- Strive to make your writing indistinguishable from the original author’s work\. \- Do not output anything other than the writing\." \}
#### Jangraet al\.\([2025](https://arxiv.org/html/2606.05716#bib.bib46)\)\.
\{ "system": "You are a writing assistant\. Your goal is to address to a user’s query to write a text instance based on their preferences\.\\nThe input would comprise of the following elements enclosed in \|begin INPUT\|\.\.\.\|end INPUT\| \\n \- \|begin USER\_QUERY\|\.\.\.\|end USER\_QUERY\| \- the user query containing the writing task description and instructions on how to generate the OUTPUT\.\\n \- \|begin STYLE\_EXAMPLES\|\.\.\.\|end STYLE\_EXAMPLES\| \- contains the written examples that should be used for writing style, tone and voice inspiration\.\\n \- Generate the response in \|begin OUTPUT\|\.\.\.\|end OUTPUT\|\.\\n \- Depending on the information in input generate the response accordingly\.\\n \- Write the response based on instructions in the USER\_QUERY while taking stylistic inspirations from STYLE\_EXAMPLES\. Responding with generic response when STYLE\_EXAMPLES are present is undesirable, and therefore you should try your best to incorporate the stylistic features while not leaking any information from STYLE\_EXAMPLES into OUTPUT\.", "user": "\|begin INPUT\|\\n \|begin STYLE\_EXAMPLES\|\[reference\]\\n \|end STYLE\_EXAMPLES\|\\n \|begin USER\_QUERY\| Answer the following question while mimicking the writing style of the provided reference text\. Make sure to not generate infactual information that is not present in the INPUT like dates, names, etc\., and instead generate placeholders like \[DATE\], \[NAME\], etc\. Your answer should be within \[max\_words\] words\.\\n\\n\# Question:\\n\[title\]\[body\]\\n\|end USER\_QUERY\|\\n\|end INPUT\|" \}
#### Custom Prompt\.
\{ "system": "Your task is to answer a question mimicking the writing style of the reference text\. Your response should clearly show the writing style that matches that of the reference text\. Do not explain your answer\. Your answer should be within \[max\_words\] words\.", "user": "Answer the following question while mimicking the writing style of the provided reference text\.\\n\\n\# Question:\\n\[title\]\[body\]\\n\\n\# Reference Text:\\n\[reference\]" \}
### F\.2Evaluation with other style representations
As additional robustness tests, we reproduce our experiments using two alternative style representation models\. Figure[A2](https://arxiv.org/html/2606.05716#A6.F2)shows the evaluation in Figure[4](https://arxiv.org/html/2606.05716#S7.F4)with LUARRivera\-Sotoet al\.\([2021](https://arxiv.org/html/2606.05716#bib.bib24)\)and StyleDistancePatelet al\.\([2025](https://arxiv.org/html/2606.05716#bib.bib50)\)style embeddings for documents\. Table[A1](https://arxiv.org/html/2606.05716#A6.T1)presents all numeric values for Figures[4](https://arxiv.org/html/2606.05716#S7.F4)and[A2](https://arxiv.org/html/2606.05716#A6.F2)\.
The trends across different representation models agree\. Since our dataset construction and decoder never use these two additional style representation models, there is no potential circularity from these representations\. These results demonstrate that the strong performance of our decoder is not limited to the style representation used to train it\.
\(a\)LUAR
\(b\)StyleDistance
Figure A2:Our approach yields the smallest L2 distance, measured by other style representations too\.MethodOur EmbeddingLUARStyleDistanceDecoder \(Ours\)26\.076\.016\.82LLM \(Custom\)35\.399\.108\.24Wang et al\. \(2025\)73\.218\.268\.41Jangra et al\. \(2025\)100\.108\.909\.85Bhandarkar et al\. \(2024\)102\.899\.0211\.87TinyStyler49\.9711\.4010\.82
Table A1:The numeric values for evaluation in Section[7](https://arxiv.org/html/2606.05716#S7)\.
## Appendix GSupplementary Material for Section[8](https://arxiv.org/html/2606.05716#S8)
Figure[A3](https://arxiv.org/html/2606.05716#A7.F3)shows the evaluation in Figure[5](https://arxiv.org/html/2606.05716#S7.F5)repeated using two alternative style representation models: LUARRivera\-Sotoet al\.\([2021](https://arxiv.org/html/2606.05716#bib.bib24)\)and StyleDistancePatelet al\.\([2025](https://arxiv.org/html/2606.05716#bib.bib50)\)\. Table[A2](https://arxiv.org/html/2606.05716#A7.T2)presents all numeric values for Figures[5](https://arxiv.org/html/2606.05716#S7.F5)and[A3](https://arxiv.org/html/2606.05716#A7.F3)\.
Again, the trends across different representation models align, demonstrating that our decoder is not limited to the style representation used for training\.
\(a\)LUAR
\(b\)StyleDistance
Figure A3:Our decoder outperforms baselines at generating in human\-like styles, measured by other style representations too\.MethodOur EmbeddingLUARStyleDistanceDecoder \(Ours\)27\.736\.337\.47LLM \(Custom\)37\.549\.399\.79Wang et al\. \(2025\)39\.289\.4610\.88Jangra et al\. \(2025\)42\.019\.279\.64Bhandarkar et al\. \(2024\)35\.539\.318\.94TinyStyler54\.6911\.7714\.38
Table A2:The numeric values for evaluation in Section[8](https://arxiv.org/html/2606.05716#S8)\.Table A3:The LLMs used in our study\.Category\# featuresWriting Goal and Intent40Tone and Mood40Readability Level40Formality40Narrative Perspective and Voice40Social and Interpersonal40Audience Engagement and Interaction40Emotional Intensity38Descriptive Density40Information Density40Logical Structure and Flow39Creativity and Typicality39Abstraction Level39Temporal Focus39Technical and Domain Specificity40Cultural and Regional Influences39Figurative Language Usage38Rhetorical Device Usage38Intertextuality and Allusion38Sentence Structure and Syntax38Visual Formatting and Layout39Function Word Usage38Word and Expression Choice39Punctuation Usage39Special Character and Capitalization Usage31Acronym and Abbreviation Usage39Total1,010
Table A4:The full list of 26 stylistic categories on which we curate style features\.TopicPlatformAdviceRedditAmItheAssholeRedditAskMenRedditAskRedditRedditaskscienceRedditAskWomenRedditexplainlikeimfiveRedditNoStupidQuestionsRedditrelationship\_adviceRedditrelationshipsRedditacademiaStackExchangecookingStackExchangediyStackExchangehistoryStackExchangelawStackExchangemoneyStackExchangephilosophyStackExchangepoliticsStackExchangescifiStackExchangeworkplaceStackExchangebusiness and financeYahoo Answerscomputers and internetYahoo Answerseducation and referenceYahoo Answersentertainment and musicYahoo Answersfamily and relationshipsYahoo AnswershealthYahoo Answerspolitics and governmentYahoo Answersscience and mathematicsYahoo Answerssociety and cultureYahoo AnswerssportsYahoo AnswersTable A5:The full list of 30 topics from which we source our questions\.Similar Articles
PromptPrint: Behavioral Biometrics Through Natural Language Prompting in LLMs
Introduces PromptPrint, a systematic study showing that users' habitual vocabulary and syntax in LLM prompts form a learnable behavioral biometric, with lexical features outperforming semantic encoders and revealing a uniqueness–consistency paradox.
Lightweight Stylistic Consistency Profiling: Robust Detection of LLM-Generated Textual Content for Multimedia Moderation
Proposes LiSCP, a lightweight stylistic consistency profiling method for robust detection of LLM-generated textual content, focusing on feature stability under adversarial manipulation. Achieves superior performance on in-domain and cross-domain detection with notable robustness.
When Does Persona Prompting Actually Help? A Retrieval and Metric Analysis of Expert Role Injection in LLMs
This paper analyzes when persona prompting improves LLM responses, finding that it increases expertise depth at the cost of clarity, with effectiveness varying by domain and question type. The study introduces hybrid retrieval for role selection and advocates for multi-metric evaluation.
Explain Like I'm 5 or Whatever I Choose: Evaluating the Interactive Potential of Language Model Responses
This paper proposes a framework for evaluating LLMs' ability to generate multiple responses to scientific queries at different language complexity levels. The study finds that models often vary complexity inconsistently, with Claude Sonnet 4.5 performing best but only shifting complexity correctly 46% of the time.
Reflective Prompt Tuning through Language Model Function-Calling
Introduces Reflective Prompt Tuning (RPT), a framework that uses LLM function-calling to iteratively diagnose and revise prompts based on systematic error patterns, improving reasoning task performance and calibration.