LLMs with in-context learning for Algorithmic Theoretical Physics

arXiv cs.LG 05/12/26, 04:00 AM Papers
llm theoretical-physics in-context-learning computer-algebra-system cosmology claude
Summary
This paper investigates using Large Language Models, specifically Claude, interfaced with a Computer Algebra System (Maple) to perform algorithmic computations in theoretical physics, such as analyzing cosmological perturbations.
arXiv:2605.08212v1 Announce Type: new Abstract: There is an increasing number of algorithmic computations in theoretical physics. These, while conceptually simple, can nevertheless be time-consuming and contain subtleties that should not be overlooked. Given the recent improvement of Large Language Models (LLM), it is natural to investigate whether LLMs equipped with a computer algebra system (CAS) runtime and sufficiently informative context can reliably carry out these algorithmic tasks. In this work, we interface Claude with Maple, and apply this framework to cosmological perturbations in modified theories of gravity. We demonstrate the current capabilities of this approach, the typical failures, and how the same can be improved. We find that a frontier LLM supplied with worked examples is able to solve most test problems.
Original Article
View Cached Full Text
Cached at: 05/12/26, 07:05 AM
# LLMs with in-context learning for Algorithmic Theoretical Physics
Source: [https://arxiv.org/html/2605.08212](https://arxiv.org/html/2605.08212)
###### Abstract

There is an increasing number of algorithmic computations in theoretical physics\. These, while conceptually simple, can nevertheless be time\-consuming and contain subtleties that should not be overlooked\. Given the recent improvement of Large Language Models \(LLM\), it is natural to investigate whether LLMs equipped with a computer algebra system \(CAS\) runtime and sufficiently informative context can reliably carry out these algorithmic tasks\. In this work, we interface Claude with Maple, and apply this framework to cosmological perturbations in modified theories of gravity\. We demonstrate the current capabilities of this approach, the typical failures, and how the same can be improved\. We find that a frontier LLM supplied with worked examples is able to solve most test problems\.

IPMU26\-0020

Anamaria Hell and Leander Thiele

Kavli IPMU \(WPI\), UTIAS, The University of Tokyo,

and Center for Data\-Driven Discovery,

5\-1\-5 Kashiwanoha, Kashiwa, Chiba 277\-8583, Japan

## 1Introduction

Theoretical physicists deal with a range of tasks in their research\. Between the most mechanistic calculations which have been relegated to numerical and symbolic computer programs, and the art of constructing new theories born out of genuine creativity, lies a large field of “algorithmic computations”\. By this we mean tasks that are sufficiently complex that a general solution in terms of a deterministic computer program is unavailable, while at the same time not being of such difficulty that entirely new methods and ways of thinking are required for their solution\. To bracket algorithmic computations by examples, we mean tasks more complex than individual commands in a computer algebra system, but less complex than the discovery of AdS\-CFT correspondence or certain proofs in Einstein gravity\.

This leaves a large terrain of computations for which a generic recipe exists while the details vary from problem to problem and certain subtleties may require special care\. Perturbative calculations \(like scattering matrices\) in renormalizable QFT, string theory, or EFTs are typical examples\. More familiar to general physicists would be typical problems in classical electrodynamics or perturbation theory in quantum mechanics\.

Such algorithmic computations currently take up much time for many theoretical physicists, even with the help of computer algebra systems \(CAS\)\. At the same time, their relative simplicity implies that their solution using agentic AI might be within reach\. The algorithmic nature and frequency of these problems implies that a large set of worked examples is available and an abstract description of the algorithm can be written down\. In this work, we use both worked examples and the algorithm’s description for in\-context learning and test which contents in the context optimize success rate\.

We thus test in this work the ability of a frontier large language model, interfaced with a CAS session and equipped with an informative context, to solve problems from a specific class of algorithmic computations:identifying the degrees of freedom for cosmological backgrounds in various theories of gravity\. Cosmological perturbations, small ripples in spacetime and matter, give rise to the structure of the Universe that we observe today\(Starobinsky,[1979](https://arxiv.org/html/2605.08212#bib.bib3); Mukhanov and Chibisov,[1981](https://arxiv.org/html/2605.08212#bib.bib4); Chibisov and Mukhanov,[1982](https://arxiv.org/html/2605.08212#bib.bib5); Kodama and Sasaki,[1984](https://arxiv.org/html/2605.08212#bib.bib6); Sasaki,[1986](https://arxiv.org/html/2605.08212#bib.bib7); Mukhanovet al\.,[1992](https://arxiv.org/html/2605.08212#bib.bib8)\)\. They represent the degrees of freedom, the building blocks of theories, which determine their stability and observational consequences\. Identifying them is thus essential, and can be non\-trivial when departing from Einstein’s General Relativity\. Several approaches have been developed with this purpose, including the Hamiltonian analysis with a Dirac\-Bergmann algorithm\(Bergmann,[1949](https://arxiv.org/html/2605.08212#bib.bib9); Dirac,[1950](https://arxiv.org/html/2605.08212#bib.bib10); Anderson and Bergmann,[1951](https://arxiv.org/html/2605.08212#bib.bib11)\), and aFaddeev\-Jackiwapproach\(Faddeev and Jackiw,[1988](https://arxiv.org/html/2605.08212#bib.bib12)\), as well as Lagrangian formalism \(seeHellet al\.\([2026](https://arxiv.org/html/2605.08212#bib.bib14)\)for a general prescription, and references therein\), and a more rigorously mathematical approach founded directly in the equations of motion \(seeHeisenberg \([2025](https://arxiv.org/html/2605.08212#bib.bib13)\)and references therein\)\. In this work, we follow therecipegiven inHellet al\.\([2026](https://arxiv.org/html/2605.08212#bib.bib14)\), which allows for the easiest and most straight\-forward implementation in the context of cosmological perturbations\.

For the purposes of a rigorous study, algorithmic computations of cosmological perturbations have some distinct advantages\. It is easy to construct test problems with research\-grade complexity and with such obscure features that they are impossible to have been encountered in the LLMs training set\. In fact, in our evaluation we sometimes observe the LLM attempting to relate the task to some better\-known problem from the literature\. At the same time, the type of problems have unique and easily checked solutions\.

Our work aims to establish how much theoretical physicists can benefit from automated computation specifically in algorithmic problems\. To this end, we answer the following questions:

1. 1\.What are current capabilities of frontier LLMs equipped with a CAS runtime;
2. 2\.How should the LLM’s context be used;
3. 3\.What are typical failure modes and how can they be mitigated\.

Thanks to the restricted set of problems and our available expert knowledge, we can understand the LLM’s solution strategies beyond simply checking the final answer, thus enabling a complementary qualitative evaluation\. A central aim of this work is to develop practical guidelines for how practitioners can employ their existing worked examples as useful context for LLMs to autonomously tackle similar problems\.

As we demonstrate in Sec\.[2](https://arxiv.org/html/2605.08212#S2), most work on LLMs equipped with a symbolic computation tool has concentrated on mathematics\. Compared to these works, we focus less on the model’s creativity in finding proof strategies and more on its ability to perform relatively routine computations that are simply burdensome for the human physicist\. Note, however, that these computations are not necessarily easy, and the use of a CAS does not guarantee correctness\. Another unusual aspect of this work is the difficulty in defining a locally relevant context: some manipulations need to be carried out with later steps in mind \(for example, solution of background equations may produce roots if done naively, which would lead to problems in subsequent steps\)\. Thus, algorithmic theoretical physics is an interesting testing ground for long contexts\.

The remainder of this paper is structured as follows\. In Sec\.[2](https://arxiv.org/html/2605.08212#S2), we discuss previous works applying LLMs equipped with CAS to mathematics and physics problems\. In Sec\.[3](https://arxiv.org/html/2605.08212#S3), we describe our setup, experiments, and evaluation strategy\. In Sec\.[4](https://arxiv.org/html/2605.08212#S4), we summarize our results both qualitatively and quantitatively\. We conclude in Sec\.[5](https://arxiv.org/html/2605.08212#S5)\.

## 2Related Work

Using structure in an LLM’s response goes back to chain\-of\-thought\(Weiet al\.,[2023](https://arxiv.org/html/2605.08212#bib.bib23)\)\. A natural extension of this idea is to interface the model with tools\(e\.g\., Gaoet al\.,[2023](https://arxiv.org/html/2605.08212#bib.bib15); Chenet al\.,[2023](https://arxiv.org/html/2605.08212#bib.bib16)\)\. Such tool\-aided reasoning has found widespread application in mathematics, thanks to the strict verifiability afforded by programs such as Lean\(e\.g\., Gouet al\.,[2024](https://arxiv.org/html/2605.08212#bib.bib17); Khaitan and Ganesh,[2025](https://arxiv.org/html/2605.08212#bib.bib18); Breenet al\.,[2025](https://arxiv.org/html/2605.08212#bib.bib20); Nezhadet al\.,[2026](https://arxiv.org/html/2605.08212#bib.bib36); Luoet al\.,[2026](https://arxiv.org/html/2605.08212#bib.bib37)\)\. Not surprisingly, tool use is found to be especially helpful for more difficult problmes\(Daset al\.,[2024](https://arxiv.org/html/2605.08212#bib.bib35)\)\. In a similar fashion to this work, in\-context learning\(e\.g\., Donget al\.,[2024](https://arxiv.org/html/2605.08212#bib.bib22)\)has been studied as a way to improve the LLM’s performance in math problems\(e\.g\., Liuet al\.,[2024](https://arxiv.org/html/2605.08212#bib.bib21); Sultanet al\.,[2026](https://arxiv.org/html/2605.08212#bib.bib19); Gaoet al\.,[2026](https://arxiv.org/html/2605.08212#bib.bib27)\)\. In contrast to explicitly fine\-tuned models, general purpose models with useful context are easier to use in a human\-AI collaboration and can be more easily adapted if features in the tools \(e\.g\., Lean strategies\) change\(Breenet al\.,[2025](https://arxiv.org/html/2605.08212#bib.bib20)\)\.

In the context of theoretical physics, using LLMs for automated problem solving has so far been explored less than in mathematics\. Part of the reason may be the difficulty in finding research\-grade but still easily gradeable problems; recently there have been efforts towards constructing such benchmarks\(Chunget al\.,[2025](https://arxiv.org/html/2605.08212#bib.bib28); Panet al\.,[2026](https://arxiv.org/html/2605.08212#bib.bib32)\)and using them to evaluate test\-time scaling\(Gaoet al\.,[2025](https://arxiv.org/html/2605.08212#bib.bib38)\)\. These works find that without dedicated tools the current frontier models are unable to solve most research\-grade benchmark problems\. A recent paper that is similar in spirit to our work isMenzoet al\.\([2026](https://arxiv.org/html/2605.08212#bib.bib30)\): they let an LLM use a specialized Mathematica package to compute Feynman diagrams\. In contrast to our work, their LLM operates in a much more constrained environment: its only real decision is about a small number of binary choices to do with conventions\. Our work is focused on problems that require more actual intelligence to come up with calculation strategies and deal with subtleties\. Much more ambitiously, some recent works\(e\.g\., Agrawalet al\.,[2026](https://arxiv.org/html/2605.08212#bib.bib31)\)attempt to let artificial agents perform the entire pipeline of theoretical physics research\. Others argue that this approach is unlikely to be successful\(Luet al\.,[2026](https://arxiv.org/html/2605.08212#bib.bib29)\)\. We do not aim to replace the human researcher entirely and thus restrict the experiments to algorithmic theoretical physics problems\. Similarly toLuet al\.\([2026](https://arxiv.org/html/2605.08212#bib.bib29)\), we anticipate that equipping the LLM with a CAS is likely useful\.

One of the main aims of this work is to establish how well a current frontier model can use examples or other information provided in the context\. Indeed, it has been noted that long contexts can degrade performance\(e\.g\., Liet al\.,[2024](https://arxiv.org/html/2605.08212#bib.bib40); Liuet al\.,[2023](https://arxiv.org/html/2605.08212#bib.bib41); Jinet al\.,[2024](https://arxiv.org/html/2605.08212#bib.bib42); Duet al\.,[2025](https://arxiv.org/html/2605.08212#bib.bib43); Huanget al\.,[2024](https://arxiv.org/html/2605.08212#bib.bib24)\), even specifically for mathematics\(Liuet al\.,[2024](https://arxiv.org/html/2605.08212#bib.bib21)\)\. These findings suggest that a small set of well\-chosen example problems\(for how to choose, e\.g\., Yanget al\.,[2024](https://arxiv.org/html/2605.08212#bib.bib39)\)should optimize performance\. On the other hand,Zouet al\.\([2025](https://arxiv.org/html/2605.08212#bib.bib26)\)find a mixed picture andAgarwalet al\.\([2024](https://arxiv.org/html/2605.08212#bib.bib25)\)note that more capable modern frontier models are able to better deal with larger contexts\.

## 3Methods

In this section, we give a pedagogical introduction to the considered theoretical physics problems \(Sec\.[3\.1](https://arxiv.org/html/2605.08212#S3.SS1)\), describe our implementation of an LLM equipped with a CAS runtime \(Sec\.[3\.2](https://arxiv.org/html/2605.08212#S3.SS2)\), list the test problems \(Sec\.[3\.3](https://arxiv.org/html/2605.08212#S3.SS3)\), describe our quantitative evaluation method \(Sec\.[3\.4](https://arxiv.org/html/2605.08212#S3.SS4)\), and finally describe the different choices of context we provide to the LLM \(Sec\.[3\.5](https://arxiv.org/html/2605.08212#S3.SS5)\)\.

### 3\.1The basics of dof

In general, the number of degrees of freedom \(dof\) of a physical system is defined by the number of initial conditions to completely determine the Cauchy problem of equations of motion divided by two\. For example, a free particle in 1\+1 dimensions is described by the LagrangianL=12q˙2L=\\frac\{1\}\{2\}\\dot\{q\}^\{2\}where dot is the derivative with respect to time, andqqis the generalized coordinate\. By varying the corresponding actionS=∫𝑑tLS=\\int dtLwith respect to the generalized coordinate, we find the corresponding equation of motionq¨=0\\ddot\{q\}=0\. To completely determine the Cauchy problem, we need to determine two initial conditions – the initial position and velocitiesq\(t0\)q\(t\_\{0\}\)andq˙\(t0\)\\dot\{q\}\(t\_\{0\}\)– which thus gives one dof\. In quantum field theories and theories of gravity one more often studies the dynamics of fields\. In contrast to a free particle, such objects also depend also on the point in space, with a prime example being the scalar fieldϕ\(x→,t\)\\phi\(\\vec\{x\},t\)\. Formally speaking, a theory of a standard free canonical scalar field in four dimensions thus has an infinite number of dof, and so when we state this, we will refer to their number per point in space\-time\.

In flat and cosmological backgrounds, the space\-time has spherical symmetry which gives rise to three well\-known types of perturbations that we classify according to the group of spatial rotations: scalar, vector and tensor modes\. However, if we consider a theory of gravity, where the main quantity in addition to matter is the metric tensor,gμνg\_\{\\mu\\nu\}, not all of its components will be physical degrees of freedom\. For example, general relativity has two dof instead of ten which correspond to all components of the metric tensor, meaning that the remaining ones are not propagating\. While there are many ways to show that this holds, in this work, we focus only on actions that are quadratic in the perturbations around particular backgrounds based on the Lagrangian formulation, and determine the physical modes according to the procedure described in\(Hellet al\.,[2026](https://arxiv.org/html/2605.08212#bib.bib14)\)\.

### 3\.2Implementation

Our experimental setup is relatively simple: the context and problem formulation are input via the LLM’s API, which produces CAS commands to be locally executed in the runtime read\-eval\-print loop \(REPL\), CAS results are appended to the message history and sent back via the API, and so forth until the LLM decides a solution has been found \(or aborts\)\. We choose to work with a single artificial agent since the considered computations are relatively linear and we do not expect a multi\-agent environment would yield substantial improvements\. We call the API directly instead of relying on a ready\-made SDK \(e\.g\., Claude Code\) since reproducible model ability is important for our experiments\. We choose to work with a fixed, typically rather long, context for multiple reasons\. First, it enables us to systematically study the impact of different materials in the context\. Second, frontier models are getting increasingly better with long contexts and have reached impressive capabilities already\(Agarwalet al\.,[2024](https://arxiv.org/html/2605.08212#bib.bib25)\)\. Third, given the specialized nature of the algorithmic computations we are aiming at, we anticipate practitioners to want a simple prescription how to apply automated computation for their own work which would usually rely on private example calculations\. Fourth, the considered computations can despite their algorithmic linear nature benefit from a “look\-ahead”, as discussed in the introduction\.

For the LLM, we use Claude Opus 4\-6, and for the CAS we use Maple\. It is important to turn the human\-oriented printing off on Maple \(prettyprint=0\)\. We find that the LLM relatively frequently restarts to a fresh Maple session when it notoices that it is stuck; we include this for our quantitative analysis below\. Most of our results are obtained withmax\_tokens=1024and thinking turned off; we discuss experiments with thinking turned on in Sec\.[4\.5](https://arxiv.org/html/2605.08212#S4.SS5)\. We run each problem only once because we observe relatively stable behavior when repeating some runs\. To control runtime, we set a maximum limit of 100 turns at which point the REPL gets aborted\. Empirically, we find that the turns limit interrupts the loop in situations that do not seem to be leading to success\.

### 3\.3Problems and restrictions

To analyse the usefulness of Claude in studying the cosmological perturbations, we evaluate it on a set of nine problems, given by the following:

R2FsConsiderMP22R\+βR2\\frac\{M\_\{P\}^\{2\}\}\{2\}R\+\\beta R^\{2\}gravity on a cosmological background, whereMPM\_\{P\}is the Planck mass, andβ\\betais the coupling\. Find the quadratic action for the scalar perturbation around this background, and express it in terms of the propagating modes\. Perform the calculation in the Jordan frame, you are not allowed to use the Einstein frame\.

sRFsConsiderMP22\(R−2Λ\)\+12σ2\(m2\+f\(R\)\)\\frac\{M\_\{P\}^\{2\}\}\{2\}\(R\-2\\Lambda\)\+\\frac\{1\}\{2\}\\sigma^\{2\}\(m^\{2\}\+f\(R\)\)gravity on a cosmological background, whereMPM\_\{P\}is the Planck mass,Λ\\Lambdais the cosmological constant,σ\\sigmais the scalar field,m2m^\{2\}is the mass, andf\(R\)f\(R\)is a function of Ricci scalar\. Find the quadratic action for the scalar perturbations around this background, and express it in terms of the propagating modes\.

sRFvConsiderMP22\(R−2Λ\)\+12σ2\(m2\+f\(R\)\)\\frac\{M\_\{P\}^\{2\}\}\{2\}\(R\-2\\Lambda\)\+\\frac\{1\}\{2\}\\sigma^\{2\}\(m^\{2\}\+f\(R\)\)gravity on a cosmological background, whereMPM\_\{P\}is the Planck mass,Λ\\Lambdais the cosmological constant,σ\\sigmais the scalar field,m2m^\{2\}is the mass, andf\(R\)f\(R\)is a function of Ricci scalar\. Find the quadratic action for the vector perturbations around this background, and express it in terms of the propagating modes\.

sRFtConsiderMP22\(R−2Λ\)\+12σ2\(m2\+f\(R\)\)\\frac\{M\_\{P\}^\{2\}\}\{2\}\(R\-2\\Lambda\)\+\\frac\{1\}\{2\}\\sigma^\{2\}\(m^\{2\}\+f\(R\)\)gravity on a cosmological background, whereMPM\_\{P\}is the Planck mass,Λ\\Lambdais the cosmological constant,σ\\sigmais the scalar field,m2m^\{2\}is the mass, andf\(R\)f\(R\)is a function of Ricci scalar\. Find the quadratic action for the tensor perturbations around this background, and express it in terms of the propagating modes\.

sRMsConsiderMP22\(R−2Λ\)\+12σ2\(m2\+f\(R\)\)\\frac\{M\_\{P\}^\{2\}\}\{2\}\(R\-2\\Lambda\)\+\\frac\{1\}\{2\}\\sigma^\{2\}\(m^\{2\}\+f\(R\)\)gravity on a flat background, whereMPM\_\{P\}is the Planck mass,σ\\sigmais the scalar field,m2m^\{2\}is the mass, andf\(R\)f\(R\)is a function of Ricci scalar\. Find the quadratic action for the scalar perturbations around this background, and express it in terms of the propagating modes\.

sRMtConsiderMP22R\+12σ2\(m2\+f\(R\)\)\\frac\{M\_\{P\}^\{2\}\}\{2\}R\+\\frac\{1\}\{2\}\\sigma^\{2\}\(m^\{2\}\+f\(R\)\)gravity on a flat background, whereMPM\_\{P\}is the Planck mass,σ\\sigmais the scalar field,m2m^\{2\}is the mass, andf\(R\)f\(R\)is a function of Ricci scalar\. Find the quadratic action for the tensor perturbations around this background, and express it in terms of the propagating modes\.

sRi2MsConsiderMP22\(R−2Λ\)\+12σ2\(m2\+ξRμνRμν\)\\frac\{M\_\{P\}^\{2\}\}\{2\}\(R\-2\\Lambda\)\+\\frac\{1\}\{2\}\\sigma^\{2\}\(m^\{2\}\+\\xi R\_\{\\mu\\nu\}R^\{\\mu\\nu\}\)gravity on a flat background, whereMPM\_\{P\}is the Planck mass,σ\\sigmais the scalar field,m2m^\{2\}is the mass, andξ\\xiis the coupling constant\. Find the quadratic action for the scalar perturbations around this background, and express it in terms of the propagating modes\.

sRi2FsConsiderMP22\(R−2Λ\)\+12σ2\(m2\+ξRμνRμν\)\\frac\{M\_\{P\}^\{2\}\}\{2\}\(R\-2\\Lambda\)\+\\frac\{1\}\{2\}\\sigma^\{2\}\(m^\{2\}\+\\xi R\_\{\\mu\\nu\}R^\{\\mu\\nu\}\)gravity on a cosmological background, whereMPM\_\{P\}is the Planck mass,σ\\sigmais the scalar field,m2m^\{2\}is the mass, andξ\\xiis the coupling constant\. Find the quadratic action for the scalar perturbations around this background, and express it in terms of the propagating modes\.

sRi2FtConsiderMP22\(R−2Λ\)\+12σ2\(m2\+ξRμνRμν\)\\frac\{M\_\{P\}^\{2\}\}\{2\}\(R\-2\\Lambda\)\+\\frac\{1\}\{2\}\\sigma^\{2\}\(m^\{2\}\+\\xi R\_\{\\mu\\nu\}R^\{\\mu\\nu\}\)gravity on a cosmological background, whereMPM\_\{P\}is the Planck mass,σ\\sigmais the scalar field,m2m^\{2\}is the mass, andξ\\xiis the coupling constant\. Find the quadratic action for the tensor perturbations around this background, and express it in terms of the propagating modes\.

We note that the above problems are passed in plain text, and only set in an appropriateLaTeXform for readability\. An actual input looks like the following example:

Consider Mp^2/2\*R\+beta\*R^2 gravity on a cosmological background, where Mp is the Planck mass, and beta is the coupling\. Find the quadratic action for the scalar perturbation around this background, and express it in terms of the propagating modes\. Perform the calculation in the Jordan frame, you are not allowed to use the Einstein frame\.

In creating these problems, we only focus on tensor, or scalar\-tensor theories in flat and cosmological backgrounds\. A more comprehensive study including also other type of theories, such as vector, tensor or two\-form theories will be presented in follow\-up work\. Aside from the first problem \(R2Fs\), remaining problems are built on the recently introduced constrained scalar\-field framework111See also\(Hell and Daniel,[2026](https://arxiv.org/html/2605.08212#bib.bib2)\)for a theory with constrained vector field\.\(Hell and Sasaki,[2025](https://arxiv.org/html/2605.08212#bib.bib1)\), ensuring that their analysis was not previously worked out in the literature\. The problems also test Claude on a wide range of issues, commonly found in studying the perturbations, such as finding special background solutions that give rise to multiple branches, resolving the constraints, dealing with higher\-derivatives and reducing their order, as well as analyzing different types of the perturbations\. R2Fs, in contrast, corresponds to the Starobinsky inflation, which is one of the most well\-known models in Cosmology, and serves as a reference point\.

### 3\.4Output evaluation

We base our evaluation of the results on apass/failbasis\. The solutions to the problems can be divided into four main steps: the setup, computing the background equations of motion, finding the second\-order action, and analyzing the perturbations\. Theories that also involve higher\-derivative terms have an additional step in reduction of the derivatives that can happen either in the setup or at the level of perturbations\.

Based on this, we adopt the following rules for the evaluation:

1. 1\.If the setup is wrong, it is a fail\.
2. 2\.If the background equations of motion are not computed, or not properly substituted in the action, and this affects the result, then it is a fail\. Otherwise, if this does not change the result, it is a pass\.
3. 3\.The perturbations, and the reduction of the higher\-order derivatives should be analyzed correctly for a pass\.
4. 4\.If the resulting code obtains a correct final result, but forgets a subtlety, then it is still a pass\.

In order to illustrate the output, we present it for one of the problems in the Appendix[C](https://arxiv.org/html/2605.08212#A3)\.

### 3\.5Contexts

In the spirit of in\-context learning, we supplement our Claude\-Maple interface with four choices of context:

Context “10ex”: A set of 10 solved examples, 60k tokensAs context, we prepare 10 step\-by\-step solved and explained examples that would represent problems of finding propagating dofs of theories of gravity in cosmology and flat space\-time\. For illustration, an explicit presentation of one of these examples is given in the Appendix[A](https://arxiv.org/html/2605.08212#A1)\. Most of these examples are just standard examples, meant to cover different simplifications and Maple\-code when studying scalar, vector or tensor perturbations\. Specially, they do not involve any higher derivatives but are at most quadratic in the fields\.

Context “3broad”: A set of 3 solved examples, 18k tokensWe test the case when the number of examples is overall reduced to three\. This also reduces the overall cost of the project\. To still keep it general enough, we select three representatives, corresponding to the Example 2, Example 3 and Example 8, which is stated in Appendix[A](https://arxiv.org/html/2605.08212#A1)\.

Context “3tailored”: A modified set of 3 solved examples, 24k tokensAs we will see in the results, attempts to some problems with contexts 10ex and 3broad fail with relatively common mistakes\. To examine this further, we keep the Example 3 and Example 8 as part of the input data, and supplement the two with another modified example that is supposed to account for the common mistakes, given in Sec\.[4\.3](https://arxiv.org/html/2605.08212#S4.SS3)\.

Context “instruction”: A general description of the method, 2k tokensFinally, we provide as an input just a general instruction text based on the recent work\(Hellet al\.,[2026](https://arxiv.org/html/2605.08212#bib.bib14)\), and with no example code\. Thisrecipeis given in the Appendix[B](https://arxiv.org/html/2605.08212#A2)\.

## 4Results

R2FssRFssRFvsRFtsRMssRMtsRi2MssRi2FssRi2Ft10ex3broad3tailoredinstructionpassfailFigure 1:Results for the problems depending on the provided context\.R2FssRFssRFvsRFtsRMssRMtsRi2MssRi2FssRi2Ft10ex3broad3tailoredinstruction\|\|492\|\|822\|\|322\|\|571\|\|937\|\|181\|\|373\|\|662\|\|691\|\|752\|\|892\|\|321\|\|372\|\|1006\|\|361\|\|402\|\|381\|\|662\|\|491\|\|1004\|\|301\|\|291\|\|552\|\|191\|\|241\|\|772\|\|411\|\|504\|\|9513\|\|799\|\|7317\|\|708\|\|287\|\|284\|\|809\|\|1007turns \(🌑\)020406080restarts \(×\\times\)02468Figure 2:The lower panel shows the solution length as turns\|\|restarts depending on context \(horizontal\) and problem \(vertical\)\. The upper panel summarizes the table in terms of mean \(solid\) and median \(dashed\)\.In Fig\.[1](https://arxiv.org/html/2605.08212#S4.F1), we visualize the solutions’ evaluation according to problem \(vertical axis\) and context \(horizontal axis\)\. As an additional quantitative evaluation, we also provide the number of turns and number of Maple restarts in Fig\.[2](https://arxiv.org/html/2605.08212#S4.F2)\. The baseline case of a long context \(10ex\) shows 5 correct solutions and 4 incorrect ones\. For the shorter context of representative problems \(3broad\), the success rate is identical but the failures mostly shift \(there is a subtlety in this case to be discussed below\)\. The case of specifically tailored problems \(3tailored\), which is informed by the previous two experiments, shows the highest success rate and is the only one where sRi2Ft is solved correctly\. The 3tailored experiments also display the lowest number of turns and restarts on average, indicating that the deliberately constructed examples help the model find the solution efficiently\. The experiments with an abstract description of therecipe\(instruction\) show the lowest number of successes and the highest average number of turns and restarts\. For a given problem, a higher number of turns and restarts is correlated with lower success rate\.

In the following, we discuss the overview following each of the provided contexts, described in the previous section, followed by a brief comparison of thinking mode for selected problems\.

### 4\.1Context 10ex

Overall, the performance of Claude with the submitted context has a marginal pass, with five problems correctly solved among nine\. The number of turns – sequences in which Claude runs Maple – ranges from the minimal 18 to 93, with an average of 56, while the number of restarts222The minimal number of restarts should be one, corresponding to the start of the code\.is mostly from 1 to 3, with a special exception of 7 restarts in sRMs, and averaging on 2\.

Among the given problems, R2Fs is a very useful reference in confirming that Claude uses not only the examples submitted in the context, but also information fed during training\. In particular, while its task is to study the Starobinsky model in the Jordan frame, it still introduces an external scalar field, rewriting the initial action instead as∫−g\(MP22\+2βσ\)R−βσ2\\int\\sqrt\{\-g\}\\left\(\\frac\{M\_\{P\}^\{2\}\}\{2\}\+2\\beta\\sigma\\right\)R\-\\beta\\sigma^\{2\}which reduces to the original theory by resolving the constraint forσ\\sigma, and substituting it back to the action\. This significantly simplifies the analysis, as it removes all of the higher derivatives acting on the perturbations that would otherwise be present until the order is reduced\.

Throughout the remaining problems, one of the major challenges is the presence of higher\-order derivatives\. They were mostly dealt with through an incorrect substitution, with the aim of reducing the order of the higher derivatives acting on the perturbations\. In particular, the main issue was that Claude forgets to include the constraints via the Lagrange multipliers, and thus reduces the order incorrectly\. This was the case for sRFs, sRi2Fs and sRi2Ft\.

Another challenge is resolving the background equations of motion\. The above test problems were constructed with the intention of having multiple branching solutions, and thus challenging the LLM in this way as well\. Surprisingly, however, different branches were noticed for all but sRMt, with Claude then selecting one of the branches \(usually the more complicated one\)\. The main issue associated to the background equations of motion was to resolve both constraint equation, acceleration equation and the equation for the scalar in such a way that there are no roots appearing in the solutions\. In other words, the challenge is how to pick the variables which would be best to solve the equations for, when substituting them back to the action\. Interestingly, in sRFv and sRFt, Claude also struggled with the background equations of motion, gave up and went to the second\-order action\. Only once there, it went back and studied the background equations, analyzing if they can simplify the resulting action at that point\.

An especially interesting case is sRMs\. It is first correctly solved for the flat background\. However, then, Claude deems the result too trivial, and instead turns to the analysis in curved space\-time\. This is interesting, because in sRMt where the flat background is formulated in the same way everything is solved correctly\.

### 4\.2Context 3broad

In this case, the number of solved problems is again five, although the correctly solved problems themselves do not match with the previous context\. The average number of turns is similar to 10ex\. However, sRMs reaches the 100\-turn limit thus automatically terminating\. The average number of restarts is two, with sRMs being again an exception with 6 restarts overall\.

R2Fs is solved in a very similar way as in the previous case\. Interestingly, Claude then uses the same trick to also solve sRFs, rewriting the action of the problem as

S=∫d4x−g\[\(12σ2f′\(χ\)\+MP22\)R−MP2Λ\+12σ2\(m2\+f\(χ\)−χf′\(χ\)\)\]\.\\begin\{split\}S&=\\int d^\{4\}x\\sqrt\{\-g\}\\left\[\\left\(\\frac\{1\}\{2\}\\sigma^\{2\}f^\{\\prime\}\(\\chi\)\+\\frac\{M\_\{P\}^\{2\}\}\{2\}\\right\)R\-M\_\{P\}^\{2\}\\Lambda\\right\.\\\\ &\\left\.\+\\frac\{1\}\{2\}\\sigma^\{2\}\\left\(m^\{2\}\+f\(\\chi\)\-\\chi f^\{\\prime\}\(\\chi\)\\right\)\\right\]\.\\end\{split\}Strictly speaking, this action is equivalent to the previous one only iff′′\(χ\)≠0f^\{\\prime\\prime\}\(\\chi\)\\neq 0\. However, aside from this subtlety, the problem is now solved correctlyunlike with context 10ex\. Another correctly solved problem is sRi2Fs\. While it has noticed different branching solutions depending on if the constrained scalar vanishes or not, it chooses the simpler one withσ\(t\)=0\\sigma\(t\)=0\. This procedure is equally valid as choosing the arbitrary branch, although it simplifies the analysis significantly\. In addition, we should note that sRFv and sRFt are mostly correctly solved\. However, it should be noted that background equations of motion were overall unsolved/ignored\. Since this still didn’t affect the final expression too much, we graded it with a passing grade, but one should keep in mind that this could have easily led to the wrong result in more complicated scenarios\.

sRMs, sRMt and sRi2Ms primarily fail because Claude understands flat background for cosmological evolution\. However it would have been correct to select the Minkowski space\-time\. In a realistic human\-AI interaction, this issue would have been caught\. In future work, we may adjust the evaluation to account for such cases\. Finally, in sRi2Ft it fails due to incorrect reduction of higher\-order derivative terms\.

### 4\.3Context 3tailored

The primary issues in the previous two contexts are

- •reducing the higher\-derivatives on the level of perturbations,
- •solving background equations of motion,
- •interpretation of the background on which perturbation theory should be performed\.

To counter these issues, in this context we replace the Example 2 with the following example on the pureR2R^\{2\}gravity:

Modified example:ConsiderβR2\\beta R^\{2\}on a cosmological background, whereβ\\betais the coupling constant\. Find the quadratic action for the scalar perturbation around this background in the Jordan frame, and express it in terms of the propagating modes\.

This problem inherently contains the higher\-derivatives in the Jordan frame, which are reduced on the level of perturbations to show that the theory describes one scalar mode on cosmological backgrounds \(in addition to the two tensor ones\), with the aim of countering the first issue found in the previous analyses\. In addition, further explanations and tips on how to solve the background equations of motion in more general cases are added to the context\. With aim of avoiding misinterpretations between flat background and cosmology, we also add: “Note that if the problem said “flat background”, you should assume Minkowski space\-time, and seta\(t\)=1a\(t\)=1too\.” Finally, in the previous sets, we had noticed that the LLM spent many turns manipulating the form of the final action\. To reduce the number of turns, we add an additional instruction:“If you have reached the final action, do not simplify it further\.”

Overall, the performance of this set is significantly improved compared to the previous cases\. Out of nine problems, only two were not solved: sRFs has reached 100 turns, which is why it had stopped\. The LLM also did not perturb the constrained scalar in addition to the metric perturbations\. The latter issue was also present in the sRi2Fs, which was otherwise very close to the correct solution\. In contrast to the previous contexts this context does not include an example where an external scalar field \(such as the constrained scalar\) is perturbed, but only had metric \(scalar\) perturbations in the exercise problems\. Potentially, therefore, one might suspect that by adding an example where this is covered, one would find all of the problems solved\.

On average, the number of turns is 47, while the average number of restarts is 2, with the maximum of 4 in the sRFs\. Notably, in contrast to previous cases, R2Fs, which is well\-known in the literature, is now solved purely in the Jordan frame, with no introduction of another scalar at the level of action, but rather by reducing the higher derivatives on the level of perturbations, as was shown in the example\.

### 4\.4Context instruction

The context “instruction” does not include any code, but instead describes an algorithm to compute the number of degrees of freedom for the given problems\. As a result, the LLM does not analyze any of the problems by fully formulating them in Maple and running there the analysis\. Instead, for some of the problems, it combined the analytical reasoning “by hand”, and a bit of Maple code, up to expanding the Ricci scalar or Ricci tensor in the action, to infer the final form of the perturbations\.

On average, the number of restarts per program is 9, with a maximum of 17 in sRFt, and a minimum of 4 in R2Fs and sRi2Ms\. The number of turns is significantly higher than the rest of the cases, averaging on 67\.

Overall, its performance is significantly lower than the other cases, with only sRFt, sRMt and sRi2Ms getting a passing grade\. What these problems have in common is the actions are particularly simple, as the program evaluated the tensor modes and scalars on flat background, the latter taking with vanishing background value of the scalar field\. However, it should be pointed out that these problems were not solved perfectly\. The program never computed the full set of the background equations of motion\. In addition, in sRFt, the action given in the summary of the computation differed by a \(correct\) factor of1/21/2in contrast to the action that was previously derived\. In the cases of flat background, it understood this as a cosmological background, as was the case for contexts 10ex and 3broad\.

The largest issue among the failed problems is the decision\-making\. For example, in sRFs, it could never decide on the approach, and never performs any computations in Maple\. In fact, the trend of doing minimal Maple computations extends to many other problems, resulting in a fail\. Among them, specially, sRi2Ft reaches the 100 turns which is why its evaluation stops, while in the sRi2Fs it simply gives up after 80 turns\.

### 4\.5Thinking mode: ON

The previous main results are obtained with thinking turned off\. In the following, we present the results when 1024 thinking tokens are allowed\. Overall, we find very little improvement from allowing thinking tokens\.

10ex — sRi2FtOriginally, without thinking, this problem presents an incorrect result, with the mistake taking place almost at the very end when Claude tries to reduce the order of higher\-derivatives for the tensor modes\. With the thinking mode ON, it makes the same mistake, again leading to the wrong result\.

3broad — sRMtClaude again does the same mistake even in this problem – flat background is interpreted as flat FLRW background instead of the Minkowski space\-time, a notion which is well\-known in the literature\.

3tailored — sRFsIn contrast to the previous run, the thinking mode manages to finish the running of the program in 58 steps, with only initial restart\. However, it does exactly the same mistake as in the initial run, forgetting to perturb the constrained scalar in addition to the metric perturbations\. Therefore, the result still remains a fail\.

instruction — sRi2FsFinally, with context containing the algorithm description, Claude does more elaborate computations with the thinking mode\. However, it still does not manage to get to the end result\. Therefore, this case also does not present a change\.

## 5Conclusions

In their daily work, theoretical physicists encounter many problems of algorithmic character\. We explore how well a frontier LLM equipped with a CAS session can handle such problems\. Specifically, we focus on finding the degrees of freedom in modified theories of gravity and utilize completely novel problems for evaluation\.

Equipped with qualitative and quantitative evaluations described before, we can now answer the questions posed in the introduction as follows:

1. 1\.What are current capabilities of frontier LLMs equipped with a CAS runtime?When supplied with worked examples, the LLM is able to competently use the CAS REPL and in a majority of cases solves the test problems correctly\. We evaluate the LLM’s ability on par with a first\-year graduate student\.
2. 2\.How should the LLM’s context be used?We find that worked examples are essential for the LLM to succeed whereas an abstract description of the algorithm is not sufficient\. Pracitioners will typically have worked examples available from their prior work; we recommend to use such examples for in\-context learning\. We find that a smaller set of example problems, chosen for representativeness, is more effective than a large set \(even though in our quantitative evaluation the smaller representative set 3broad introduced a very specific failure mode of misunderstanding the problem\)\. A small, targeted set can mitigate observed failure modes but may introduce new issues\.
3. 3\.What are typical failure modes and how can they be mitigated?One common failure was misinterpretation of the problem \(typically, because the solution of the actual problem was deemed “trivial” – not unlike students in exams\)\. A human in the loop would be able to correct this type of problem\. Other mistakes such as failure to correctly apply the background equations are more worrisome but would be caught by an experienced supervisor\. We find that tailoring the context with hand\-crafted examples that cover observed failure modes can be effective in reducing mistakes\.

While the recommendations to choose the context deliberately and utilize human\-AI collaboration are expected, we also make some unexpected findings\. The test problems were tailored to contain subtleties such as branching solutions\. Our expectation that their existence would be overlooked was wrong: the LLM identified most of the special cases correctly and used some of them to simplify the analysis\. We had also expected that enabling thinking would be more useful, especially for the cases with an abstract description of the algorithm in the context\. However, the model makes little use of the thinking budget and does not improve performance\.

Two aspects of the LLM’s behavior are noteworthy\. First, it shows strong perseverance and goal\-orientedness\. In several examples we observe its pursuing a wrong path, noticing that it is getting nowhere, and thenrestarting the Maple session for a new attempt\. Second, the LLM shows a good ability to consider the problems abstractly\. Sometimes a problem prevents it from correctly solving the background equations of motion, at which point it proceeds with whatever it can accomplish in the rest of the computation while keeping the background solution as an abstract object\. Later it tries to go back and obtain the concrete solution\.

We have only explored one class of algorithmic theoretical physics problems, with a single frontier LLM, and with a single CAS, and a fixed context\. In future work we will expand on these and especially evaluate a RAG\-style setup in which the model can pull specific parts of the example calculations into context\. As we have observed strong performance on the considered test problems, we believe that CAS\-equipped LLMs with in\-context learning will be useful for other algorithmic computations across string theory, quantum field theory, gravity, and cosmology\.

## Acknowledgements

AH would like to thank Hugo Allaire, Elisa Ferreira, Misao Sasaki, Tadayuki Takahashi, and Ying\-Li Zhang for useful discussions\. LT would like to thank Fatih Dinç for advice\. AH is supported by JSPS KAKENHI Grant No\. JP26K17133, and by the CD3 Google Seed grant\. LT is supported by JSPS under KAKENHI 24K22878 and 26K17136 and by the Royal Society under ICA\\R2\\252140\.

## References

- R\. Agarwal, A\. Singh, L\. M\. Zhang, B\. Bohnet, L\. Rosias, S\. Chan, B\. Zhang, A\. Anand, Z\. Abbas, A\. Nova, J\. D\. Co\-Reyes, E\. Chu, F\. Behbahani, A\. Faust, and H\. Larochelle \(2024\)Many\-shot in\-context learning\.External Links:2404\.11018,[Link](https://arxiv.org/abs/2404.11018)Cited by:[§2](https://arxiv.org/html/2605.08212#S2.p3.1),[§3\.2](https://arxiv.org/html/2605.08212#S3.SS2.p1.1)\.
- The fermiacc: agents for particle theory\.External Links:2603\.22538,[Link](https://arxiv.org/abs/2603.22538)Cited by:[§2](https://arxiv.org/html/2605.08212#S2.p2.1)\.
- J\. L\. Anderson and P\. G\. Bergmann \(1951\)Constraints in covariant field theories\.Phys\. Rev\.83,pp\. 1018–1025\.External Links:[Document](https://dx.doi.org/10.1103/PhysRev.83.1018)Cited by:[§1](https://arxiv.org/html/2605.08212#S1.p4.1)\.
- P\. G\. Bergmann \(1949\)Non\-Linear Field Theories\.Phys\. Rev\.75,pp\. 680–685\.External Links:[Document](https://dx.doi.org/10.1103/PhysRev.75.680)Cited by:[§1](https://arxiv.org/html/2605.08212#S1.p4.1)\.
- B\. Breen, M\. D\. Tredici, J\. McCarran, J\. A\. Mijares, W\. W\. Yin, K\. Sulimany, J\. M\. Taylor, F\. H\. L\. Koppens, and D\. Englund \(2025\)Ax\-prover: a deep reasoning agentic framework for theorem proving in mathematics and quantum physics\.External Links:2510\.12787,[Link](https://arxiv.org/abs/2510.12787)Cited by:[§2](https://arxiv.org/html/2605.08212#S2.p1.1)\.
- W\. Chen, X\. Ma, X\. Wang, and W\. W\. Cohen \(2023\)Program of thoughts prompting: disentangling computation from reasoning for numerical reasoning tasks\.External Links:2211\.12588,[Link](https://arxiv.org/abs/2211.12588)Cited by:[§2](https://arxiv.org/html/2605.08212#S2.p1.1)\.
- G\. V\. Chibisov and V\. F\. Mukhanov \(1982\)Galaxy formation and phonons\.Mon\. Not\. Roy\. Astron\. Soc\.200,pp\. 535–550\.Cited by:[§1](https://arxiv.org/html/2605.08212#S1.p4.1)\.
- D\. J\. H\. Chung, Z\. Gao, Y\. Kvasiuk, T\. Li, M\. Münchmeyer, M\. Rudolph, F\. Sala, and S\. C\. Tadepalli \(2025\)Theoretical physics benchmark \(tpbench\) – a dataset and study of ai reasoning capabilities in theoretical physics\.External Links:2502\.15815,[Link](https://arxiv.org/abs/2502.15815)Cited by:[§2](https://arxiv.org/html/2605.08212#S2.p2.1)\.
- D\. Das, D\. Banerjee, S\. Aditya, and A\. Kulkarni \(2024\)MATHSENSEI: a tool\-augmented large language model for mathematical reasoning\.External Links:2402\.17231,[Link](https://arxiv.org/abs/2402.17231)Cited by:[§2](https://arxiv.org/html/2605.08212#S2.p1.1)\.
- P\. A\. M\. Dirac \(1950\)Generalized Hamiltonian dynamics\.Can\. J\. Math\.2,pp\. 129–148\.External Links:[Document](https://dx.doi.org/10.4153/CJM-1950-012-1)Cited by:[§1](https://arxiv.org/html/2605.08212#S1.p4.1)\.
- Q\. Dong, L\. Li, D\. Dai, C\. Zheng, J\. Ma, R\. Li, H\. Xia, J\. Xu, Z\. Wu, T\. Liu, B\. Chang, X\. Sun, L\. Li, and Z\. Sui \(2024\)A survey on in\-context learning\.External Links:2301\.00234,[Link](https://arxiv.org/abs/2301.00234)Cited by:[§2](https://arxiv.org/html/2605.08212#S2.p1.1)\.
- Y\. Du, M\. Tian, S\. Ronanki, S\. Rongali, S\. Bodapati, A\. Galstyan, A\. Wells, R\. Schwartz, E\. A\. Huerta, and H\. Peng \(2025\)Context length alone hurts llm performance despite perfect retrieval\.External Links:2510\.05381,[Link](https://arxiv.org/abs/2510.05381)Cited by:[§2](https://arxiv.org/html/2605.08212#S2.p3.1)\.
- L\. D\. Faddeev and R\. Jackiw \(1988\)Hamiltonian Reduction of Unconstrained and Constrained Systems\.Phys\. Rev\. Lett\.60,pp\. 1692–1694\.External Links:[Document](https://dx.doi.org/10.1103/PhysRevLett.60.1692)Cited by:[§1](https://arxiv.org/html/2605.08212#S1.p4.1)\.
- A\. Gao, C\. Zhang, X\. Zhang, D\. Li, M\. Zhao, F\. Liu, and X\. Zhang \(2026\)Process in\-context learning: enhancing mathematical reasoning via dynamic demonstration insertion\.External Links:2601\.11979,[Link](https://arxiv.org/abs/2601.11979)Cited by:[§2](https://arxiv.org/html/2605.08212#S2.p1.1)\.
- L\. Gao, A\. Madaan, S\. Zhou, U\. Alon, P\. Liu, Y\. Yang, J\. Callan, and G\. Neubig \(2023\)PAL: program\-aided language models\.External Links:2211\.10435,[Link](https://arxiv.org/abs/2211.10435)Cited by:[§2](https://arxiv.org/html/2605.08212#S2.p1.1)\.
- Z\. Gao, T\. Li, Y\. Kvasiuk, S\. C\. Tadepalli, M\. Rudolph, D\. J\. H\. Chung, F\. Sala, and M\. Münchmeyer \(2025\)Test\-time scaling techniques in theoretical physics – a comparison of methods on the tpbench dataset\.External Links:2506\.20729,[Link](https://arxiv.org/abs/2506.20729)Cited by:[§2](https://arxiv.org/html/2605.08212#S2.p2.1)\.
- Z\. Gou, Z\. Shao, Y\. Gong, Y\. Shen, Y\. Yang, M\. Huang, N\. Duan, and W\. Chen \(2024\)ToRA: a tool\-integrated reasoning agent for mathematical problem solving\.External Links:2309\.17452,[Link](https://arxiv.org/abs/2309.17452)Cited by:[§2](https://arxiv.org/html/2605.08212#S2.p1.1)\.
- L\. Heisenberg \(2025\)Counting Degrees of Freedom: A Method Applicable from Scalars to f\(Q\) Gravity and Beyond\.External Links:2509\.18192Cited by:[§1](https://arxiv.org/html/2605.08212#S1.p4.1)\.
- A\. Hell and T\. Daniel \(2026\)Branching Universes\.External Links:2603\.18147Cited by:[footnote 1](https://arxiv.org/html/2605.08212#footnote1)\.
- A\. Hell, E\. G\. M\. Ferreira, D\. Lust, and M\. Sasaki \(2026\)The recipe for the degrees of freedom\.JHEP03,pp\. 235\.External Links:2601\.10288,[Document](https://dx.doi.org/10.1007/JHEP03%282026%29235)Cited by:[§1](https://arxiv.org/html/2605.08212#S1.p4.1),[§3\.1](https://arxiv.org/html/2605.08212#S3.SS1.p2.1),[§3\.5](https://arxiv.org/html/2605.08212#S3.SS5.p5.1)\.
- A\. Hell and M\. Sasaki \(2025\)Accelerating Universe from Constraints\.External Links:2507\.00986Cited by:[§3\.3](https://arxiv.org/html/2605.08212#S3.SS3.p13.1)\.
- X\. Huang, L\. L\. Zhang, K\. Cheng, F\. Yang, and M\. Yang \(2024\)Fewer is more: boosting llm reasoning with reinforced context pruning\.External Links:2312\.08901,[Link](https://arxiv.org/abs/2312.08901)Cited by:[§2](https://arxiv.org/html/2605.08212#S2.p3.1)\.
- B\. Jin, J\. Yoon, J\. Han, and S\. O\. Arik \(2024\)Long\-context llms meet rag: overcoming challenges for long inputs in rag\.External Links:2410\.05983,[Link](https://arxiv.org/abs/2410.05983)Cited by:[§2](https://arxiv.org/html/2605.08212#S2.p3.1)\.
- A\. Khaitan and V\. Ganesh \(2025\)O\-forge: an llm \+ computer algebra framework for asymptotic analysis\.External Links:2510\.12350,[Link](https://arxiv.org/abs/2510.12350)Cited by:[§2](https://arxiv.org/html/2605.08212#S2.p1.1)\.
- H\. Kodama and M\. Sasaki \(1984\)Cosmological Perturbation Theory\.Prog\. Theor\. Phys\. Suppl\.78,pp\. 1–166\.External Links:[Document](https://dx.doi.org/10.1143/PTPS.78.1)Cited by:[§1](https://arxiv.org/html/2605.08212#S1.p4.1)\.
- T\. Li, G\. Zhang, Q\. D\. Do, X\. Yue, and W\. Chen \(2024\)Long\-context llms struggle with long in\-context learning\.External Links:2404\.02060,[Link](https://arxiv.org/abs/2404.02060)Cited by:[§2](https://arxiv.org/html/2605.08212#S2.p3.1)\.
- J\. Liu, Z\. Huang, C\. Wang, X\. Huang, C\. Zhai, and E\. Chen \(2024\)What makes in\-context learning effective for mathematical reasoning: a theoretical analysis\.External Links:2412\.12157,[Link](https://arxiv.org/abs/2412.12157)Cited by:[§2](https://arxiv.org/html/2605.08212#S2.p1.1),[§2](https://arxiv.org/html/2605.08212#S2.p3.1)\.
- N\. F\. Liu, K\. Lin, J\. Hewitt, A\. Paranjape, M\. Bevilacqua, F\. Petroni, and P\. Liang \(2023\)Lost in the middle: how language models use long contexts\.External Links:2307\.03172,[Link](https://arxiv.org/abs/2307.03172)Cited by:[§2](https://arxiv.org/html/2605.08212#S2.p3.1)\.
- S\. Lu, Z\. Jin, T\. J\. Zhang, P\. Kos, J\. I\. Cirac, and B\. Schölkopf \(2026\)Can theoretical physics research benefit from language agents?\.External Links:2506\.06214,[Link](https://arxiv.org/abs/2506.06214)Cited by:[§2](https://arxiv.org/html/2605.08212#S2.p2.1)\.
- H\. Luo, H\. Feng, Q\. Sun, C\. Xu, K\. Zheng, Y\. Wang, T\. Yang, H\. Hu, and Y\. Tang \(2026\)AgentMath: empowering mathematical reasoning for large language models via tool\-augmented agent\.External Links:2512\.20745,[Link](https://arxiv.org/abs/2512.20745)Cited by:[§2](https://arxiv.org/html/2605.08212#S2.p1.1)\.
- T\. Menzo, A\. Roman, G\. T\. Fleming, S\. Gleyzer, K\. T\. Matchev, and S\. Mrenna \(2026\)Agentic diagrammatica: towards autonomous symbolic computation in high energy physics\.External Links:2603\.26990,[Link](https://arxiv.org/abs/2603.26990)Cited by:[§2](https://arxiv.org/html/2605.08212#S2.p2.1)\.
- V\. F\. Mukhanov and G\. V\. Chibisov \(1981\)Quantum Fluctuations and a Nonsingular Universe\.JETP Lett\.33,pp\. 532–535\.Cited by:[§1](https://arxiv.org/html/2605.08212#S1.p4.1)\.
- V\. F\. Mukhanov, H\. A\. Feldman, and R\. H\. Brandenberger \(1992\)Theory of cosmological perturbations\. Part 1\. Classical perturbations\. Part 2\. Quantum theory of perturbations\. Part 3\. Extensions\.Phys\. Rept\.215,pp\. 203–333\.External Links:[Document](https://dx.doi.org/10.1016/0370-1573%2892%2990044-Z)Cited by:[§1](https://arxiv.org/html/2605.08212#S1.p4.1)\.
- S\. B\. Nezhad, Y\. Li, and A\. Agrawal \(2026\)SymCode: a neurosymbolic approach to mathematical reasoning via verifiable code generation\.External Links:2510\.25975,[Link](https://arxiv.org/abs/2510.25975)Cited by:[§2](https://arxiv.org/html/2605.08212#S2.p1.1)\.
- H\. Pan, J\. V\. Roggeveen, E\. Berg, J\. Carrasquilla, D\. Chowdhury, S\. Ganguli, F\. Ghimenti, J\. Hasik, H\. Hunt, H\. Jiang, M\. Kamb, Y\. Kao, E\. Khatami, M\. J\. Lawler, D\. Luo, T\. Neupert, X\. Qi, M\. P\. Brenner, and E\. Kim \(2026\)CMT\-benchmark: a benchmark for condensed matter theory built by expert researchers\.External Links:2510\.05228,[Link](https://arxiv.org/abs/2510.05228)Cited by:[§2](https://arxiv.org/html/2605.08212#S2.p2.1)\.
- M\. Sasaki \(1986\)Large Scale Quantum Fluctuations in the Inflationary Universe\.Prog\. Theor\. Phys\.76,pp\. 1036\.External Links:[Document](https://dx.doi.org/10.1143/PTP.76.1036)Cited by:[§1](https://arxiv.org/html/2605.08212#S1.p4.1)\.
- A\. A\. Starobinsky \(1979\)Spectrum of relict gravitational radiation and the early state of the universe\.JETP Lett\.30,pp\. 682–685\.Cited by:[§1](https://arxiv.org/html/2605.08212#S1.p4.1)\.
- O\. Sultan, E\. Stern, and D\. Shahaf \(2026\)A neuro\-symbolic approach for reliable proof generation with llms: a case study in euclidean geometry\.External Links:2505\.14479,[Link](https://arxiv.org/abs/2505.14479)Cited by:[§2](https://arxiv.org/html/2605.08212#S2.p1.1)\.
- J\. Wei, X\. Wang, D\. Schuurmans, M\. Bosma, B\. Ichter, F\. Xia, E\. Chi, Q\. Le, and D\. Zhou \(2023\)Chain\-of\-thought prompting elicits reasoning in large language models\.External Links:2201\.11903,[Link](https://arxiv.org/abs/2201.11903)Cited by:[§2](https://arxiv.org/html/2605.08212#S2.p1.1)\.
- X\. Yang, J\. Lin, Z\. Wang, and C\. Zhai \(2024\)Learning by analogy: enhancing few\-shot prompting for math word problem solving with computational graph\-based retrieval\.External Links:2411\.16454,[Link](https://arxiv.org/abs/2411.16454)Cited by:[§2](https://arxiv.org/html/2605.08212#S2.p3.1)\.
- K\. Zou, M\. Khalifa, and L\. Wang \(2025\)On many\-shot in\-context learning for long\-context evaluation\.External Links:2411\.07130,[Link](https://arxiv.org/abs/2411.07130)Cited by:[§2](https://arxiv.org/html/2605.08212#S2.p3.1)\.

## Appendix APresentation of the problems for the context 10ex with one solved example

The full list of the problems for which we have presented detailed code with explanations in Maple is given by the following:

Example 1:Consider the w=0 Brans\-Dicke theory with a linear potential on a cosmological background\. Find the quadratic action for the scalar perturbation around this background, and express it in terms of the propagating modes\.

Example 2:Consider a canonical scalar field theory with an arbitrary potential on a cosmological background with Einstein gravity\. Find the quadratic action for the scalar perturbation around this background, and express it in terms of the propagating modes\.

Example 3:Consider a canonical scalar field theory with an arbitrary potential on a cosmological background with Einstein gravity\. Find the quadratic action for the vector perturbation around this background, and express it in terms of the propagating modes\.

Example 4:Consider the action consisting of Einstein gravity with cosmological constant, and a massive vector field that is non\-minimally coupled to gravity through an Einstein term\. Find the tensor modes, in case when the vector field has a vanishing background value, but the space\-time corresponds to the de Sitter Universe\.

Example 5:Consider the Einstein gravity in flat space\-time\. Find the quadratic action for the scalar perturbation around this background, and express it in terms of the propagating modes\.

Example 6:Consider the Einstein gravity with cosmological constant in flat space\-time\. Find the quadratic action for the scalar perturbation around this background, and express it in terms of the propagating modes\.

Example 7:Consider a non\-minimally coupled scalar field to the Ricci scalar, with a sin\-potential\. Express the action in terms of the propagating scalar modes\.

Example 8:Consider Einstein gravity with a cosmological constant, and a scalar field with a quartic potential\. Find the quadratic action for the tensor perturbations around this background, and express it in terms of the propagating modes\.

Example 9:Consider a k\-essence field theory with an arbitrary potential on a cosmological background with Einstein gravity\. Find the quadratic action for the vector perturbation around this background, and express it in terms of the propagating modes\.

Example 10:Consider a k\-essence field theory with an arbitrary potential on a cosmological background with Einstein gravity\. Find the quadratic action for the scalar perturbation around this background, and express it in terms of the propagating modes\.

One should note that while the above formulations are essentially loose, they are supplemented with a solved step\-by\-step code, and thus each of them automatically comes with a defined action\. As an illustration, we present the Example 2:

```
> #Example 2: Consider a canonical scalar field theory with an arbitrary
> potential on a cosmological background with Einstein gravity. Find the
> quadratic action for the scalar perturbation around this background, and
> express it in terms of the propagating modes.
>
> #Step 1: Setup
>
> restart
> interface(prettyprint = 0):
> #Define the background metric with scalar perturbations in the longitudinal
> gauge. Epsilon is the perturbative parameter.
> g_metric := Matrix(4, 4, [[-N(t)^2*(1 + 2*epsilon*phi(t, x)), 0, 0, 0], [0,
> a(t)^2*(1 + 2*epsilon*psi(t, x)), 0, 0], [0, 0, a(t)^2*(1 + 2*epsilon*psi(t,
> x)), 0], [0, 0, 0, a(t)^2*(1 + 2*epsilon*psi(t, x))]]):
> with(Physics):
> ds2 := ((LinearAlgebra[Transpose](<dt, dx, dy, dz>)) . g_metric) . <dt, dx,
> dy, dz>
Typesetting:-mprintslash([(ds2 := -dt^2*N(t)^2*(1+2*epsilon*phi(t,x))+dx^2*a(t)
^2*(1+2*epsilon*psi(t,x))+dy^2*a(t)^2*(1+2*epsilon*psi(t,x))+dz^2*a(t)^2*(1+2*
epsilon*psi(t,x)))],[-dt^2*N(t)^2*(1+2*epsilon*phi(t,x))+dx^2*a(t)^2*(1+2*
epsilon*psi(t,x))+dy^2*a(t)^2*(1+2*epsilon*psi(t,x))+dz^2*a(t)^2*(1+2*epsilon*
psi(t,x))])
> Setup(mathematicalnotation = true, dimension = 4, signature = ‘-+++‘)
[dimension = 4, mathematicalnotation = true, signature = ‘- + + +‘]
> Coordinates(Z = [t, x, y, z])
‘Systems of spacetime coordinates are: {Z = (t, x, y, z)} ‘ {Z}
> Setup(metric = ds2)
_______________________________________________________
Coordinates: [t, x, y,z]. Signature: (- + + +)
_______________________________________________________
Typesetting:-mprintslash([g_[mu,nu] = Matrix(4,4,{(1, 1) = N(t)^2*(-1-2*epsilon
*phi(t,x)), (2, 2) = a(t)^2*(1+2*epsilon*psi(t,x)), (3, 3) = a(t)^2*(1+2*
epsilon*psi(t,x)), (4, 4) = a(t)^2*(1+2*epsilon*psi(t,x))},datatype = anything,
storage = triangular[upper],order = Fortran_order,shape = [symmetric])],[g_[mu,
nu] = Matrix(%id = 36893488152155112260)])
_______________________________________________________
[metric = {(1, 1) =
-N(t)^2*(1+2*epsilon*phi(t,x)), (2, 2) = a(t)^2*(1+2* epsilon*psi(t,x)), (3, 3)
= a(t)^2*(1+2*epsilon*psi(t,x)), (4, 4) = a(t)^2*(1+2 *epsilon*psi(t,x))},
spaceindices = lowercaselatin_is]
>
> #Define the scalar field:
> Sigma:=sigma(t)+epsilon*dSg(t,x)
Typesetting:-mprintslash([(Sigma := sigma(t)+epsilon*dSg(t,x))],[sigma(t)+
epsilon*dSg(t,x)])
> #Define the kinetic term for the scalar field:
> Define(redo, sigma_d[mu] = D_[mu](Sigma, [Z]), minimizetensorcomponents):
Defined objects with tensor properties
> dSig_sq := add(rhs(sigma_d[~mu, Array])[i1]*rhs(sigma_d[mu, Array])[i1], i1 =
> 1 .. 4)
Typesetting:-mprintslash([(dSig_sq := -(diff(sigma(t),t)+epsilon*diff(dSg(t,x),
t))^2/N(t)^2/(1+2*epsilon*phi(t,x))+epsilon^2/a(t)^2/(1+2*epsilon*psi(t,x))*
diff(dSg(t,x),x)^2)],[-(diff(sigma(t),t)+epsilon*diff(dSg(t,x),t))^2/N(t)^2/(1+
2*epsilon*phi(t,x))+epsilon^2/a(t)^2/(1+2*epsilon*psi(t,x))*diff(dSg(t,x),x)^2]
)
>
>
> #Define the action:
> det_g := LinearAlgebra[Determinant](rhs(g_[mu, nu, Array]))
Typesetting:-mprintslash([(det_g := N(t)^2*(-1-2*epsilon*phi(t,x))*a(t)^6*(1+2*
epsilon*psi(t,x))^3)],[N(t)^2*(-1-2*epsilon*phi(t,x))*a(t)^6*(1+2*epsilon*psi(t
,x))^3])
> action:=sqrt(-det_g)*(Mp^2/2*Ricci[scalar]-1/2*dSig_sq-V(Sigma)):
> action_s := simplify(action):
> Assume(0 < N(t), 0 < a(t))
{t::real, N(t)::RealRange(Open(0),infinity), a(t)::RealRange(Open(0),infinity)}
>
> #Step 2: Background equations of motion
>
> #To find the background equations of motion, we vary the action with respect
> to the lapse N, which we then set to 1, and the other variables from the
> ansatz that are not perturbations (multiplied with epsilon). For this we need
> the action that is at zeroth order in perturbations.
>
>
> background_action := simplify(series(action_s, epsilon, 1)):
> background_action1 := convert(convert(background_action, polynom), D):
> lapse_eq := expand(simplify(subs(t1 = t,
> convert(Fundiff(Intc(background_action1, t), N(t1)), D)))):
> a_eq := expand(simplify(subs(t1 = t, convert(Fundiff(Intc(background_action1,
> t), a(t1)), D)))):
> scalar_eq:= expand(simplify(subs(t1 = t,
> convert(Fundiff(Intc(background_action1, t), sigma(t1)), D)))):
>
>
> N := t -> 1
Typesetting:-mprintslash([(N := t -> 1)],[t -> 1])
> #Solve the background equations in terms of the variables. The best way is to
> usually solve them in terms of the highest-order derivatives.
> background_eom := simplify(solve({scalar_eq, lapse_eq, a_eq}, [D[1, 1](a)(t),
> (D@@2)(sigma)(t), V(sigma(t))])[1])
Warning, solve may be ignoring assumptions on the input variables.
Typesetting:-mprintslash([(background_eom := [(D@@2)(a)(t) = 1/2*(2*Mp^2*D(a)(t
)^2-a(t)^2*D(sigma)(t)^2)/Mp^2/a(t), (D@@2)(sigma)(t) = (-D(V)(sigma(t))*a(t)-3
*D(a)(t)*D(sigma)(t))/a(t), V(sigma(t)) = 1/2/a(t)^2*(6*Mp^2*D(a)(t)^2-a(t)^2*D
(sigma)(t)^2)])],[[(D@@2)(a)(t) = 1/2*(2*Mp^2*D(a)(t)^2-a(t)^2*D(sigma)(t)^2)/
Mp^2/a(t), (D@@2)(sigma)(t) = (-D(V)(sigma(t))*a(t)-3*D(a)(t)*D(sigma)(t))/a(t)
, V(sigma(t)) = 1/2/a(t)^2*(6*Mp^2*D(a)(t)^2-a(t)^2*D(sigma)(t)^2)]])
> #Step 3: First-order action
>
> #Expand the action to first order in perturbations, and confirm that it
> vanishes after substituting the background equations of motion
> action_1st_order := series(action_s, epsilon, 2):
> action_1st_order1 := simplify(convert(convert(action_1st_order, polynom), D)):
> simplify(subs(background_eom, subs(tt1 = t, xx1 = t,
> convert(Fundiff(Intc(action_1st_order1, t, x), phi(tt1, xx1)), D))))
0
> simplify(subs(background_eom, subs(tt1 = t, xx1 = t,
> convert(Fundiff(Intc(action_1st_order1, t, x), psi(tt1, xx1)), D))))
0
> simplify(subs(background_eom, subs(tt1 = t, xx1 = t,
> convert(Fundiff(Intc(action_1st_order1, t, x), dSg(tt1, xx1)), D))))
0
> #Since everything is zero, we can proceed to the next step.
>
> #Step 4: Second-order action
>
> #Expand the action to second order in perturbations.
>
> action_2nd_order := series(action_s, epsilon, 3):
> action_2nd_order1 := simplify(convert(coeff(convert(series(action_2nd_order,
> epsilon, 3), polynom), epsilon^2), D)):
>
> #Simplify its form due to spherical symmetry.
> phi := unapply(ph(t)*cos(k*x)/sqrt(Pi), t, x)
Typesetting:-mprintslash([(phi := (t, x) -> 1/Pi^(1/2)*ph(t)*cos(k*x))],[(t, x)
-> 1/Pi^(1/2)*ph(t)*cos(k*x)])
> psi := unapply(ps(t)*cos(k*x)/sqrt(Pi), t, x)
Typesetting:-mprintslash([(psi := (t, x) -> 1/Pi^(1/2)*ps(t)*cos(k*x))],[(t, x)
-> 1/Pi^(1/2)*ps(t)*cos(k*x)])
> dSg := unapply(si(t)*cos(k*x)/sqrt(Pi), t, x)
Typesetting:-mprintslash([(dSg := (t, x) -> 1/Pi^(1/2)*si(t)*cos(k*x))],[(t, x)
-> 1/Pi^(1/2)*si(t)*cos(k*x)])
> action_2nd_order2 := (simplify(int(action_2nd_order1, x = -Pi .. Pi)) assuming
> k::posint):
> action_ready := (simplify(subs(background_eom, action_2nd_order2)) assuming
> k::posint):
> action_ready:
>
>
> #Step 5: Analyze the perturbations
>
> #In this step, we will find the kinetic matrix of the action
>
> fld:=[ph,ps,si]
Typesetting:-mprintslash([(fld := [ph, ps, si])],[[ph, ps, si]])
> action_scalars:=action_ready:
> #Check if there are second-order derivatives. If yes, partially integrate to
> bring it to the form: K_ij*D(field_i)(t)*D(field_j)(t)
>
> for i from 1 to 3 do print(simplify(coeff(action_scalars, D[1,
> 1](fld[i])(t)))): end do: unassign(’i’):
0 -3*Mp^2*(ph(t)-ps(t))*a(t)^3 0
> integrate_by_parts_tt:=proc(act::algebraic,var::algebraic,coo::name) local f,
> t, loc; t := coo; loc := coeff(act,D[1,1](var)(t)); f := act -
> loc*D[1,1](var)(t) - D(var)(t)*convert(diff(loc,t),D); f end proc:
>
> action_scalars2:=simplify(subs(background_eom,
> integrate_by_parts_tt(action_scalars, ps, t))):
> for i from 1 to 3 do print(simplify(coeff(action_scalars2, D[1,
> 1](fld[i])(t)))): end do: unassign(’i’):
0 0 0
>
> #Define the kinetic matrix and check its determinant.
> KM:=Matrix(3): # Kinetic matrix
>
> for i from 1 to 3 do for j from 1 to 3 do if i = j then KM[i, i] :=
> coeff(action_scalars2, D[1](fld)[i](t)^2); else KM[i, j] :=
> coeff(coeff(action_scalars2, D[1](fld[i])(t)), D[1](fld[j])(t)) / 2; end if;
> end do; end do;
> KM
Typesetting:-mprintslash([Matrix(3,3,{(2, 2) = -3*a(t)^3*Mp^2, (3, 3) = 1/2*a(t
)^3},datatype = anything,storage = rectangular,order = Fortran_order,shape = []
)],[Matrix(%id = 36893488152158363268)])
> LinearAlgebra[Determinant](KM)
0
> vecKM:=LinearAlgebra[LinearSolve](KM, <0, 0, 0>)
Typesetting:-mprintslash([(vecKM := Vector[column](3,{1 = _t[1]},datatype =
anything,storage = rectangular,order = Fortran_order,shape = []))],[Vector[
column](%id = 36893488152158365444)])
> #If the determinant of the kinetic matrix is non-zero, stop. You have found
> the final expression. Otherwise, check if the components of vecKM have some
> dependence on the background variables. If yes, make a substitution such that
> this is removed. If no, check the non-zero variable, and integrate out this
> field in the next step.
>
> #Step 6: Integrate out ph
>
> #Check if the action has terms linear in derivatives of ph. If yes, then
> partially integrate.
> simplify(coeff(action_scalars2, D(ph)(t)))
9*Mp^2*(ph(t)-ps(t))*a(t)^2*D(a)(t)
>
> integrate_by_parts_Dt_t:=proc(act::algebraic,var::algebraic,coo::name) local
> f, t, loc; t := coo; loc := coeff(coeff(act,D(var)(t)),var(t));
> f:=act-loc*D(var)(t)*var(t)-1/2*convert(diff(loc,t),D)*var(t)^2; f end proc:
>
> integrate_by_parts_t:=proc(act::algebraic,var::algebraic,coo::name) local f,
> t, loc; t := coo; loc := coeff(act,D(var)(t)); f := act - loc*D(var)(t) -
> var(t)*convert(diff(loc,t),D); f end proc:
>
> action_scalars3:=simplify(subs(background_eom,
> integrate_by_parts_Dt_t(action_scalars2, ph, t))):
> action_scalars4:=simplify(subs(background_eom,
> integrate_by_parts_t(action_scalars3, ph, t))):
>
> #Find the constraint for ph
>
>
> find_constraint:=proc(act::algebraic,var::algebraic,coo::name) local f, t, t1;
> t := coo;
> f:=subs(t1=t,convert(Physics[Fundiff](Physics[Intc](act,t),var(t1)),D)): f end
> proc:
>
> ph_constraint := simplify(find_constraint(action_scalars4, ph, t)):
>
> #If the constraint can be solved for ph, solve it, and replace the value of ph
> with this solution. Otherwise, solve it for the field for which you can solve
> it, and replace that field.
>
> ph_solution := simplify(solve(ph_constraint, ph(t))):
Warning, solve may be ignoring assumptions on the input variables.
> ph := unapply(ph_solution, t)
Typesetting:-mprintslash([(ph := t -> (-6*Mp^2*a(t)*D(a)(t)*D(ps)(t)-2*ps(t)*Mp
^2*k^2+a(t)^2*si(t)*D(V)(sigma(t))+a(t)^2*D(si)(t)*D(sigma)(t))/(-6*Mp^2*D(a)(t
)^2+a(t)^2*D(sigma)(t)^2))],[t -> (-6*Mp^2*a(t)*D(a)(t)*D(ps)(t)-2*ps(t)*Mp^2*k
^2+a(t)^2*si(t)*D(V)(sigma(t))+a(t)^2*D(si)(t)*D(sigma)(t))/(-6*Mp^2*D(a)(t)^2+
a(t)^2*D(sigma)(t)^2)])
>
> #Step 7: Check the resulting action and repeat the procedeure of step 5
>
> action_scalars5:=simplify(subs(background_eom, action_scalars4)):
> fld:=[ps,si]
Typesetting:-mprintslash([(fld := [ps, si])],[[ps, si]])
>
> for i from 1 to 2 do print(simplify(coeff(action_scalars5, D[1,
> 1](fld[i])(t)^2))): end do: unassign(’i’):
0 0
>
> #Kinetic matrix
> KM:=Matrix(2): # Kinetic matrix
>
> for i from 1 to 2 do for j from 1 to 2 do if i = j then KM[i, i] :=
> coeff(action_scalars5, D[1](fld)[i](t)^2); else KM[i, j] :=
> coeff(coeff(action_scalars5, D[1](fld[i])(t)), D[1](fld[j])(t)) / 2; end if;
> end do; end do;
> KM
Typesetting:-mprintslash([Matrix(2,2,{(1, 1) = -3*D(sigma)(t)^2*Mp^2*a(t)^5/(-6
*Mp^2*D(a)(t)^2+a(t)^2*D(sigma)(t)^2), (1, 2) = 3*Mp^2*D(sigma)(t)*D(a)(t)*a(t)
^4/(-6*Mp^2*D(a)(t)^2+a(t)^2*D(sigma)(t)^2), (2, 1) = 3*Mp^2*D(sigma)(t)*D(a)(t
)*a(t)^4/(-6*Mp^2*D(a)(t)^2+a(t)^2*D(sigma)(t)^2), (2, 2) = -3*Mp^2*D(a)(t)^2*a
(t)^3/(-6*Mp^2*D(a)(t)^2+a(t)^2*D(sigma)(t)^2)},datatype = anything,storage =
rectangular,order = Fortran_order,shape = [])],[Matrix(%id =
36893488152227830828)])
> LinearAlgebra[Determinant](KM)
0
> vecKM:=LinearAlgebra[LinearSolve](KM, <0, 0>)
Typesetting:-mprintslash([(vecKM := Vector[column](2,{1 = _t0[1], 2 = 1/D(a)(t)
*a(t)*D(sigma)(t)*_t0[1]},datatype = anything,storage = rectangular,order =
Fortran_order,shape = []))],[Vector[column](%id = 36893488152254155700)])
> #Make substitution so that vecKM components do not depend on the background
>
> si:=unapply(si2(t)+a(t)*D(sigma)(t)*1/D(a)(t)*ps(t),t)
Typesetting:-mprintslash([(si := t -> si2(t)+a(t)*D(sigma)(t)/D(a)(t)*ps(t))],[
t -> si2(t)+a(t)*D(sigma)(t)/D(a)(t)*ps(t)])
> action_scalars6:=simplify(subs(background_eom, action_scalars5)):
>
> fld:=[ps,si2]
Typesetting:-mprintslash([(fld := [ps, si2])],[[ps, si2]])
> for i from 1 to 2 do print(simplify(coeff(action_scalars6, D[1,
> 1](fld[i])(t)^2))): end do: unassign(’i’):
0 0
>
> #Kinetic matrix
> KM:=Matrix(2): # Kinetic matrix
>
> for i from 1 to 2 do for j from 1 to 2 do if i = j then KM[i, i] :=
> coeff(action_scalars6, D[1](fld)[i](t)^2); else KM[i, j] :=
> coeff(coeff(action_scalars6, D[1](fld[i])(t)), D[1](fld[j])(t)) / 2; end if;
> end do; end do;
> KM
Typesetting:-mprintslash([Matrix(2,2,{(2, 2) = -3*Mp^2*D(a)(t)^2*a(t)^3/(-6*Mp^
2*D(a)(t)^2+a(t)^2*D(sigma)(t)^2)},datatype = anything,storage = rectangular,
order = Fortran_order,shape = [])],[Matrix(%id = 36893488152231181836)])
> LinearAlgebra[Determinant](KM)
0
> vecKM:=LinearAlgebra[LinearSolve](KM, <0, 0>)
Typesetting:-mprintslash([(vecKM := Vector[column](2,{1 = _t1[1]},datatype =
anything,storage = rectangular,order = Fortran_order,shape = []))],[Vector[
column](%id = 36893488152231174980)])
>
> #Step 8: Integrate out ps
>
> #Check if the action has terms linear in derivatives of ps. If yes, then
> partially integrate.
> simplify(coeff(action_scalars6, D(ps)(t)))
(3*Mp^2*ps(t)*D(a)(t)^3+2*Mp^2*ps(t)*D(a)(t)*k^2-a(t)^3*ps(t)*D(sigma)(t)*D(V)(
sigma(t))+3*a(t)^2*ps(t)*D(a)(t)*D(sigma)(t)^2-si2(t)*a(t)^2*D(a)(t)*D(V)(sigma
(t)))*a(t)^2/D(a)(t)^2
>
> action_scalars7:=simplify(subs(background_eom,
> integrate_by_parts_Dt_t(action_scalars6, ps, t))):
> action_scalars8:=simplify(subs(background_eom,
> integrate_by_parts_t(action_scalars7, ps, t))):
>
>
> ps_constraint := simplify(find_constraint(action_scalars8, ps, t)):
> ps_solution := simplify(solve(ps_constraint, ps(t))):
Warning, solve may be ignoring assumptions on the input variables.
> ps := unapply(ps_solution, t)
Typesetting:-mprintslash([(ps := t -> 1/4*a(t)*(2*Mp^2*si2(t)*a(t)*D(a)(t)*D(V)
(sigma(t))+6*Mp^2*si2(t)*D(a)(t)^2*D(sigma)(t)+2*Mp^2*D(si2)(t)*a(t)*D(a)(t)*D(
sigma)(t)-si2(t)*a(t)^2*D(sigma)(t)^3)/Mp^4/D(a)(t)/k^2)],[t -> 1/4*a(t)*(2*Mp^
2*si2(t)*a(t)*D(a)(t)*D(V)(sigma(t))+6*Mp^2*si2(t)*D(a)(t)^2*D(sigma)(t)+2*Mp^2
*D(si2)(t)*a(t)*D(a)(t)*D(sigma)(t)-si2(t)*a(t)^2*D(sigma)(t)^3)/Mp^4/D(a)(t)/k
^2])
>
>
> #Step 9: Check the resulting action and repeat the procedeure of step 5
>
> action_scalars9:=simplify(subs(background_eom, action_scalars8)):
> fld:=[si2]
Typesetting:-mprintslash([(fld := [si2])],[[si2]])
>
> for i from 1 to 1 do print(simplify(coeff(action_scalars9, D[1,
> 1](fld[i])(t)^2))): end do: unassign(’i’):
0
>
> #Kinetic matrix
> KM:=Matrix(1): # Kinetic matrix
>
> for i from 1 to 1 do for j from 1 to 1 do if i = j then KM[i, i] :=
> coeff(action_scalars9, D[1](fld)[i](t)^2); else KM[i, j] :=
> coeff(coeff(action_scalars9, D[1](fld[i])(t)), D[1](fld[j])(t)) / 2; end if;
> end do; end do;
> KM
Typesetting:-mprintslash([Matrix(1,1,{(1, 1) = 1/2*a(t)^3},datatype = anything,
storage = rectangular,order = Fortran_order,shape = [])],[Matrix(%id =
36893488152238293108)])
> LinearAlgebra[Determinant](KM)
1/2*a(t)^3
> #Determinant is not vanishing, and therefore you are done. (Assuming that
> there are no higher-order derivatives overall)
>
>
> collect(action_scalars9, [D(si2)(t)^2,D(si2)(t),si2(t)],simplify)
>
1/2*a(t)^3*D(si2)(t)^2-1/2*a(t)^4/Mp^2*si2(t)/D(a)(t)*D(sigma)(t)^2*D(si2)(t)-1
/8*a(t)*(4*Mp^4*a(t)^2*(D@@2)(V)(sigma(t))*D(a)(t)^2+4*Mp^4*D(a)(t)^2*k^2+4*Mp^
2*a(t)^3*D(a)(t)*D(sigma)(t)*D(V)(sigma(t))+6*Mp^2*a(t)^2*D(a)(t)^2*D(sigma)(t)
^2-a(t)^4*D(sigma)(t)^4)/D(a)(t)^2/Mp^4*si2(t)^2
```

## Appendix BThe recipe for the context instructions

Work\-out the perturbations based on the following the general instructions:

The following is VERY IMPORTANT: SET interface\(prettyprint = 0\):

\(A\) Specify the background One of the main reasons to study any theory is to describe physical phenomena, such as, for example, the acceleration of the Universe, black\-holes, or cosmological defects\. These are the solutions to the background equations of motion, found by varying the action with respect to fields\. Notably, depending on the background, the number of degrees of freedom \(dof\) can change\. Therefore, when performing the analysis to calculate the dof, it is best to assume that the background is always satisfied\. One can ensure this by simply assuming that the background equations of motion always hold at any step in the computation\. \(B\) Prepare the perturbations Once you specify the background \(which can also be trivial\), it’s time to perturb around it\. These perturbations will provide the notion of the degrees of freedom\. To find them, there are several preparatory things to consider before studying their corresponding action\. In particular, it is useful to answer the following questions to decide on the best way forward: \(i\) Does your system have any symmetries? Here, by symmetries, we mean what is left after one separates the time and space components\. If then the answer is yes, this might suggest the form of your perturbations, from a general study of components of your field to a much simpler form\. For example, in space\-times that are homogeneous and isotropic, it is possible to do a decomposition of the modes according to the group of spatial rotations, and decompose the fields into scalars, vectors, and tensors\. If the space\-time has broken anisotropies in one direction and three spatial dimensions, one can instead decompose the components of the fields into even and odd modes\.

\(B\) Does your system have any gauge redundancy? If yes, analyzing first how the perturbations are transforming under gauge redundancy could be very beneficial: if one fixes the gauge such that it matches the gauge invariant variables, it can decrease the length of the computation\. Other gauges are possible too, as long as one makes sure that the gauge is fully fixed\. It should be stressed that the above two questions can be avoided – it is perfectly fine if one considers all components of the fields, and does not even fix the gauge\. These two decisions would lead to equivalent results, but, also can make the computation significantly longer\.

\(C\) Expand the action Once one decides on the background and the type of perturbations around it, it is time to study the perturbations themselves\. This can be analyzed either on the level of action or directly on the level of equations of motion\. However, studying them directly from the action is more beneficial, since one can then have a clear view into the conditions under which the degrees of freedom are well behaved, such as the positivity of the factors multiplying their kinetic terms, necessary to avoid the ghost modes\.

In general, one is supposed to expand the action to the required order in perturbations\. If there is a non\-vanishing background for the fields, the leading order corresponds to the equations of motion, which are subsequently satisfied\. Thus, at first order, the corrections vanish\. The kinetic terms for the perturbations are then at second order in the action, and one can also study higher orders to analyze the non\-linear corrections\. Note that this also holds if the background is trivial, eg, including vanishing background values of the fields, together with the Minkowski space\-time\.

\(D\) Analyze the theory Once the action is ready, it is time to analyze the modes\. Let us first assume that there are at most two time derivatives acting on the fields, and that all constraints are holonomic\. Then, essentially, the procedure to find the propagating modes is given by the following: \(i\) Locate the fields that are not propagating – the constrained modes\. \(ii\) Find their corresponding constraint by varying the action with respect to them\. \(iii\) Solve it, and substitute it back into the action\. Note that in the case of non\-linearities, this is usually done perturbatively\. \(iv\) Check if the determinant of the kinetic matrix for the Lagrangian density of fields is vanishing\. If yes, this means that there are more non\-propagating fields\. Locate them in the Lagrangian density\. If all of them appear with the form where they carry terms quadratic in time\-derivatives, this means that one should perform a substitution, which will render one of them to be non\-propagating, and then one can repeat the procedure \(i\) – \(iv\)\. \(v\) If, after repeating the procedure, the determinant of the kinetic matrix is not vanishing, and your system contains no higher derivatives, then you have found the Lagrangian density in terms of the propagating modes\.

\(E\) Higher\-order derivatives The above procedure works for a wide range of theories\. However, one can also easily encounter theories that contain higher\-order derivatives\. Luckily, this does not drastically complicate the analysis\. The key to analyzing it is to reduce the order of derivatives until one reaches maximally two time derivatives per term in the Lagrangian\.

## Appendix CClaude output example

In the following, we present as an example, the Claude output for the sRMt, with the context 10ex\.

```
# +++++++++++++++++ META TRY 1 ++++++++++++++

# ----------------- TURN 1 -----------------

# I’ll solve this step-by-step, following the pattern from the example
# problems.

# ## Step 1: Setup
restart;
interface(prettyprint = 0):

# Define the background metric with tensor perturbations. Epsilon is the perturbative parameter.
g_metric := Matrix(4, 4, [[-N(t)^2, 0,0, 0], [0, a(t)^2, 0, 0], [0, 0,
1/2*a(t)^2*(epsilon*sqrt(2)*tenp(t, x) + 2),
1/2*epsilon*a(t)^2*sqrt(2)*tenc(t, x)], [0, 0,
1/2*epsilon*a(t)^2*sqrt(2)*tenc(t, x), -1/2*a(t)^2*(epsilon*sqrt(2)*tenp(t, x)
- 2)]]):

with(Physics):
ds2 := ((LinearAlgebra[Transpose](<dt, dx, dy, dz>)) . g_metric) . <dt, dx, dy, dz>;
~ ds2 := -dt^2*N(t)^2+dx^2*a(t)^2+dy*(1/2*a(t)^2*(epsilon*2^(1/2)*tenp(t,x)+2)*dy
~ +1/2*epsilon*a(t)^2*2^(1/2)*tenc(t,x)*dz)+dz*(1/2*epsilon*a(t)^2*2^(1/2)*tenc(t
~ ,x)*dy-1/2*a(t)^2*(epsilon*2^(1/2)*tenp(t,x)-2)*dz)
# ----------------- TURN 2 -----------------
Setup(mathematicalnotation = true, dimension = 4, signature = ‘-+++‘);
Coordinates(Z = [t, x, y, z]);
Setup(metric = ds2);
~ [dimension = 4, mathematicalnotation = true, signature = ‘- + + +‘]
~ ‘Systems of spacetime coordinates are: {Z = (t, x, y, z)} ‘
~ {Z}
~ "_______________________________________________________"
~ "Coordinates: [t, x, y, z]. Signature: (- + + +)"
~ "_______________________________________________________"
~ g_[mu,nu] = Matrix(4,4,{(1, 1) = -N(t)^2, (2, 2) = a(t)^2, (3, 3) = 1/2*a(t)^2*
~ (epsilon*2^(1/2)*tenp(t,x)+2), (3, 4) = 1/2*epsilon*a(t)^2*2^(1/2)*tenc(t,x), (
~ 4, 4) = -1/2*a(t)^2*(epsilon*2^(1/2)*tenp(t,x)-2)},datatype = anything,storage
~ = triangular[upper],order = Fortran_order,shape = [symmetric])
~ "_______________________________________________________"
~ [metric = {(1, 1) = -N(t)^2, (2, 2) = a(t)^2, (3, 3) = 1/2*a(t)^2*(epsilon*2^(1
~ /2)*tenp(t,x)+2), (3, 4) = 1/2*epsilon*a(t)^2*2^(1/2)*tenc(t,x), (4, 4) = -1/2*
~ a(t)^2*(epsilon*2^(1/2)*tenp(t,x)-2)}, spaceindices = lowercaselatin_is]
# ----------------- TURN 3 -----------------
# Define the scalar field (background only for tensor perturbations):
Sigma := sigma(t);

# Define the action: Mp^2/2*R + 1/2*sigma^2*(m^2 + f(R))
det_g := LinearAlgebra[Determinant](rhs(g_[mu, nu, Array]));

action := sqrt(-det_g)*(Mp^2/2*Ricci[scalar] + 1/2*Sigma^2*(m^2 + f(Ricci[scalar])));
action_s := simplify(action):
Assume(0 < N(t), 0 < a(t));
~ Sigma := sigma(t)
~ det_g := 1/2*N(t)^2*a(t)^6*(tenp(t,x)^2*epsilon^2+epsilon^2*tenc(t,x)^2-2)
~ action := 1/2*(-2*N(t)^2*a(t)^6*(tenp(t,x)^2*epsilon^2+epsilon^2*tenc(t,x)^2-2)
~ )^(1/2)*(1/4*Mp^2*(16*diff(tenp(t,x),t)*tenc(t,x)^2*tenp(t,x)*diff(a(t),t)*a(t)
~ *N(t)*epsilon^4-8*diff(tenp(t,x),t)*tenc(t,x)*diff(tenc(t,x),t)*tenp(t,x)*a(t)^
~ 2*N(t)*epsilon^4+16*tenc(t,x)*diff(tenc(t,x),t)*tenp(t,x)^2*diff(a(t),t)*a(t)*N
~ (t)*epsilon^4+48*diff(a(t),t)^2*N(t)+48*tenc(t,x)^2*diff(a(t),t)*a(t)*diff(N(t)
~ ,t)*epsilon^2+48*tenp(t,x)^2*diff(a(t),t)*a(t)*diff(N(t),t)*epsilon^2+8*diff(
~ tenp(t,x),t)*tenp(t,x)*a(t)^2*diff(N(t),t)*epsilon^2+8*tenc(t,x)*diff(tenc(t,x)
~ ,t)*a(t)^2*diff(N(t),t)*epsilon^2+3*diff(tenp(t,x),t)^2*tenc(t,x)^2*a(t)^2*N(t)
~ *epsilon^4-4*diff(tenp(t,x),t)*tenp(t,x)^3*a(t)^2*diff(N(t),t)*epsilon^4-12*
~ tenc(t,x)^4*diff(a(t),t)*a(t)*diff(N(t),t)*epsilon^4-4*tenc(t,x)^3*diff(tenc(t,
~ x),t)*a(t)^2*diff(N(t),t)*epsilon^4+24*tenc(t,x)^2*tenp(t,x)^2*diff(a(t),t)^2*N
~ (t)*epsilon^4+3*diff(tenc(t,x),t)^2*tenp(t,x)^2*a(t)^2*N(t)*epsilon^4-12*tenp(t
~ ,x)^4*diff(a(t),t)*a(t)*diff(N(t),t)*epsilon^4-32*diff(tenp(t,x),t)*tenp(t,x)*
~ diff(a(t),t)*a(t)*N(t)*epsilon^2-32*tenc(t,x)*diff(tenc(t,x),t)*diff(a(t),t)*a(
~ t)*N(t)*epsilon^2-4*diff(tenp(t,x),t)*tenc(t,x)^2*tenp(t,x)*a(t)^2*diff(N(t),t)
~ *epsilon^4+16*diff(tenp(t,x),t)*tenp(t,x)^3*diff(a(t),t)*a(t)*N(t)*epsilon^4+16
~ *tenc(t,x)^3*diff(tenc(t,x),t)*diff(a(t),t)*a(t)*N(t)*epsilon^4-24*tenc(t,x)^2*
~ tenp(t,x)^2*diff(a(t),t)*a(t)*diff(N(t),t)*epsilon^4-4*tenc(t,x)*diff(tenc(t,x)
~ ,t)*tenp(t,x)^2*a(t)^2*diff(N(t),t)*epsilon^4+8*tenc(t,x)*tenp(t,x)*diff(tenp(t
~ ,x),x)*diff(tenc(t,x),x)*N(t)^3*epsilon^4+48*a(t)*N(t)*diff(a(t),t $ 2)+tenc(t,
~ x)^2*diff(tenc(t,x),x)^2*N(t)^3*epsilon^4+tenp(t,x)^2*diff(tenp(t,x),x)^2*N(t)^
~ 3*epsilon^4+12*tenc(t,x)^4*diff(a(t),t)^2*N(t)*epsilon^4-3*tenc(t,x)^2*diff(
~ tenp(t,x),x)^2*N(t)^3*epsilon^4+12*tenp(t,x)^4*diff(a(t),t)^2*N(t)*epsilon^4-3*
~ tenp(t,x)^2*diff(tenc(t,x),x)^2*N(t)^3*epsilon^4-6*diff(tenp(t,x),t)^2*a(t)^2*N
~ (t)*epsilon^2-48*tenc(t,x)^2*diff(a(t),t)^2*N(t)*epsilon^2-6*diff(tenc(t,x),t)^
~ 2*a(t)^2*N(t)*epsilon^2-48*tenp(t,x)^2*diff(a(t),t)^2*N(t)*epsilon^2+6*diff(
~ tenp(t,x),x)^2*N(t)^3*epsilon^2+6*diff(tenc(t,x),x)^2*N(t)^3*epsilon^2-48*diff(
~ a(t),t)*a(t)*diff(N(t),t)+24*tenc(t,x)^2*tenp(t,x)^2*a(t)*N(t)*diff(a(t),t $ 2)
~ *epsilon^4+4*tenc(t,x)^2*tenp(t,x)*a(t)^2*N(t)*diff(tenp(t,x),t $ 2)*epsilon^4+
~ 4*tenc(t,x)*tenp(t,x)^2*a(t)^2*N(t)*diff(tenc(t,x),t $ 2)*epsilon^4-8*tenp(t,x)
~ *a(t)^2*N(t)*diff(tenp(t,x),t $ 2)*epsilon^2-48*tenc(t,x)^2*a(t)*N(t)*diff(a(t)
~ ,t $ 2)*epsilon^2-48*tenp(t,x)^2*a(t)*N(t)*diff(a(t),t $ 2)*epsilon^2-8*tenc(t,
~ x)*a(t)^2*N(t)*diff(tenc(t,x),t $ 2)*epsilon^2+12*tenc(t,x)^4*a(t)*N(t)*diff(a(
~ t),t $ 2)*epsilon^4+4*tenc(t,x)^3*a(t)^2*N(t)*diff(tenc(t,x),t $ 2)*epsilon^4-4
~ *tenc(t,x)^2*tenp(t,x)*N(t)^3*diff(tenp(t,x),x $ 2)*epsilon^4-4*tenc(t,x)*tenp(
~ t,x)^2*N(t)^3*diff(tenc(t,x),x $ 2)*epsilon^4+12*tenp(t,x)^4*a(t)*N(t)*diff(a(t
~ ),t $ 2)*epsilon^4+4*tenp(t,x)^3*a(t)^2*N(t)*diff(tenp(t,x),t $ 2)*epsilon^4-a(
~ t)^2*diff(tenp(t,x),t)^2*tenp(t,x)^2*N(t)*epsilon^4-a(t)^2*tenc(t,x)^2*N(t)*
~ diff(tenc(t,x),t)^2*epsilon^4-4*tenc(t,x)^3*N(t)^3*diff(tenc(t,x),x $ 2)*
~ epsilon^4-4*tenp(t,x)^3*N(t)^3*diff(tenp(t,x),x $ 2)*epsilon^4+8*tenc(t,x)*N(t)
~ ^3*diff(tenc(t,x),x $ 2)*epsilon^2+8*tenp(t,x)*N(t)^3*diff(tenp(t,x),x $ 2)*
~ epsilon^2)/a(t)^2/N(t)^3/(tenp(t,x)^2*epsilon^2+epsilon^2*tenc(t,x)^2-2)^2+1/2*
~ sigma(t)^2*(m^2+f(1/2*(16*diff(tenp(t,x),t)*tenc(t,x)^2*tenp(t,x)*diff(a(t),t)*
~ a(t)*N(t)*epsilon^4-8*diff(tenp(t,x),t)*tenc(t,x)*diff(tenc(t,x),t)*tenp(t,x)*a
~ (t)^2*N(t)*epsilon^4+16*tenc(t,x)*diff(tenc(t,x),t)*tenp(t,x)^2*diff(a(t),t)*a(
~ t)*N(t)*epsilon^4+48*diff(a(t),t)^2*N(t)+48*tenc(t,x)^2*diff(a(t),t)*a(t)*diff(
~ N(t),t)*epsilon^2+48*tenp(t,x)^2*diff(a(t),t)*a(t)*diff(N(t),t)*epsilon^2+8*
~ diff(tenp(t,x),t)*tenp(t,x)*a(t)^2*diff(N(t),t)*epsilon^2+8*tenc(t,x)*diff(tenc
~ (t,x),t)*a(t)^2*diff(N(t),t)*epsilon^2+3*diff(tenp(t,x),t)^2*tenc(t,x)^2*a(t)^2
~ *N(t)*epsilon^4-4*diff(tenp(t,x),t)*tenp(t,x)^3*a(t)^2*diff(N(t),t)*epsilon^4-\
~ 12*tenc(t,x)^4*diff(a(t),t)*a(t)*diff(N(t),t)*epsilon^4-4*tenc(t,x)^3*diff(tenc
~ (t,x),t)*a(t)^2*diff(N(t),t)*epsilon^4+24*tenc(t,x)^2*tenp(t,x)^2*diff(a(t),t)^
~ 2*N(t)*epsilon^4+3*diff(tenc(t,x),t)^2*tenp(t,x)^2*a(t)^2*N(t)*epsilon^4-12*
~ tenp(t,x)^4*diff(a(t),t)*a(t)*diff(N(t),t)*epsilon^4-32*diff(tenp(t,x),t)*tenp(
~ t,x)*diff(a(t),t)*a(t)*N(t)*epsilon^2-32*tenc(t,x)*diff(tenc(t,x),t)*diff(a(t),
~ t)*a(t)*N(t)*epsilon^2-4*diff(tenp(t,x),t)*tenc(t,x)^2*tenp(t,x)*a(t)^2*diff(N(
~ t),t)*epsilon^4+16*diff(tenp(t,x),t)*tenp(t,x)^3*diff(a(t),t)*a(t)*N(t)*epsilon
~ ^4+16*tenc(t,x)^3*diff(tenc(t,x),t)*diff(a(t),t)*a(t)*N(t)*epsilon^4-24*tenc(t,
~ x)^2*tenp(t,x)^2*diff(a(t),t)*a(t)*diff(N(t),t)*epsilon^4-4*tenc(t,x)*diff(tenc
~ (t,x),t)*tenp(t,x)^2*a(t)^2*diff(N(t),t)*epsilon^4+8*tenc(t,x)*tenp(t,x)*diff(
~ tenp(t,x),x)*diff(tenc(t,x),x)*N(t)^3*epsilon^4+48*a(t)*N(t)*diff(a(t),t $ 2)+
~ tenc(t,x)^2*diff(tenc(t,x),x)^2*N(t)^3*epsilon^4+tenp(t,x)^2*diff(tenp(t,x),x)^
~ 2*N(t)^3*epsilon^4+12*tenc(t,x)^4*diff(a(t),t)^2*N(t)*epsilon^4-3*tenc(t,x)^2*
~ diff(tenp(t,x),x)^2*N(t)^3*epsilon^4+12*tenp(t,x)^4*diff(a(t),t)^2*N(t)*epsilon
~ ^4-3*tenp(t,x)^2*diff(tenc(t,x),x)^2*N(t)^3*epsilon^4-6*diff(tenp(t,x),t)^2*a(t
~ )^2*N(t)*epsilon^2-48*tenc(t,x)^2*diff(a(t),t)^2*N(t)*epsilon^2-6*diff(tenc(t,x
~ ),t)^2*a(t)^2*N(t)*epsilon^2-48*tenp(t,x)^2*diff(a(t),t)^2*N(t)*epsilon^2+6*
~ diff(tenp(t,x),x)^2*N(t)^3*epsilon^2+6*diff(tenc(t,x),x)^2*N(t)^3*epsilon^2-48*
~ diff(a(t),t)*a(t)*diff(N(t),t)+24*tenc(t,x)^2*tenp(t,x)^2*a(t)*N(t)*diff(a(t),t
~ $ 2)*epsilon^4+4*tenc(t,x)^2*tenp(t,x)*a(t)^2*N(t)*diff(tenp(t,x),t $ 2)*
~ epsilon^4+4*tenc(t,x)*tenp(t,x)^2*a(t)^2*N(t)*diff(tenc(t,x),t $ 2)*epsilon^4-8
~ *tenp(t,x)*a(t)^2*N(t)*diff(tenp(t,x),t $ 2)*epsilon^2-48*tenc(t,x)^2*a(t)*N(t)
~ *diff(a(t),t $ 2)*epsilon^2-48*tenp(t,x)^2*a(t)*N(t)*diff(a(t),t $ 2)*epsilon^2
~ -8*tenc(t,x)*a(t)^2*N(t)*diff(tenc(t,x),t $ 2)*epsilon^2+12*tenc(t,x)^4*a(t)*N(
~ t)*diff(a(t),t $ 2)*epsilon^4+4*tenc(t,x)^3*a(t)^2*N(t)*diff(tenc(t,x),t $ 2)*
~ epsilon^4-4*tenc(t,x)^2*tenp(t,x)*N(t)^3*diff(tenp(t,x),x $ 2)*epsilon^4-4*tenc
~ (t,x)*tenp(t,x)^2*N(t)^3*diff(tenc(t,x),x $ 2)*epsilon^4+12*tenp(t,x)^4*a(t)*N(
~ t)*diff(a(t),t $ 2)*epsilon^4+4*tenp(t,x)^3*a(t)^2*N(t)*diff(tenp(t,x),t $ 2)*
~ epsilon^4-a(t)^2*diff(tenp(t,x),t)^2*tenp(t,x)^2*N(t)*epsilon^4-a(t)^2*tenc(t,x
~ )^2*N(t)*diff(tenc(t,x),t)^2*epsilon^4-4*tenc(t,x)^3*N(t)^3*diff(tenc(t,x),x $
~ 2)*epsilon^4-4*tenp(t,x)^3*N(t)^3*diff(tenp(t,x),x $ 2)*epsilon^4+8*tenc(t,x)*N
~ (t)^3*diff(tenc(t,x),x $ 2)*epsilon^2+8*tenp(t,x)*N(t)^3*diff(tenp(t,x),x $ 2)*
~ epsilon^2)/a(t)^2/N(t)^3/(tenp(t,x)^2*epsilon^2+epsilon^2*tenc(t,x)^2-2)^2)))
~ {t::real, N(t)::RealRange(Open(0),infinity), a(t)::RealRange(Open(0),infinity)}
# ----------------- TURN 4 -----------------
# ## Step 2: Background equations of motion
background_action := simplify(series(action_s, epsilon, 1)):
background_action1 := convert(convert(background_action, polynom), D):
lapse_eq := expand(simplify(subs(t1 = t,
convert(Fundiff(Intc(background_action1, t), N(t1)), D)))):
a_eq := expand(simplify(subs(t1 = t, convert(Fundiff(Intc(background_action1,
t), a(t1)), D)))):
scalar_eq := expand(simplify(subs(t1 = t, convert(Fundiff(Intc(background_action1,
t), sigma(t1)), D)))):
# ----------------- TURN 5 -----------------
N := t -> 1;
~ N := t -> 1
# ----------------- TURN 6 -----------------
# Now for this problem, the background is flat space-time (a(t)=1), so
# let me check if that’s consistent:
a := t -> 1;
# Check background equations
simplify(lapse_eq);
simplify(a_eq);
simplify(scalar_eq);
~ a := t -> 1
~ 1/2*sigma(t)^2*(m^2+f(0))
~ 1/2*(3*m^2+3*f(0))*sigma(t)^2+6*(D@@2)(sigma)(t)*D(f)(0)*sigma(t)+6*D(f)(0)*D(
~ sigma)(t)^2
~ sigma(t)*(m^2+f(0))
# ----------------- TURN 7 -----------------
# For flat background (a=1, N=1), the Ricci scalar R=0.
# The lapse equation gives: 1/2*sigma(t)^2*(m^2+f(0)) = 0
# The scalar equation gives: sigma(t)*(m^2+f(0)) = 0
# These are satisfied if either sigma(t)=0 or m^2+f(0)=0.
# Let’s take sigma(t) as constant sigma0, and m^2 + f(0) = 0.
# Actually, let’s check if sigma=0 works for a_eq too:
simplify(subs(sigma(t)=0, D(sigma)(t)=0, (D@@2)(sigma)(t)=0, a_eq));
~ 0
# ----------------- TURN 8 -----------------
# sigma=0 satisfies all background equations on flat background. Let’s use this.
# But wait - if sigma=0, then the sigma^2 term vanishes and we just have GR at background level.
# However, sigma can have a background value. Let me reconsider.
# For sigma(t) nonzero, we need m^2 + f(0) = 0, which means m^2 = -f(0).
# And the a_eq becomes: 3*sigma^2*(m^2+f(0)) + 6*sigma’’*D(f)(0)*sigma + 6*D(f)(0)*sigma’^2
# With m^2+f(0)=0: 6*D(f)(0)*(sigma*sigma’’ + sigma’^2) = 6*D(f)(0)*d/dt(sigma*sigma’) = 0
# This is getting complex. Let’s try sigma=0 as the simplest flat background.

# Actually, looking at the problem more carefully - it says "flat background"
# With sigma=0, the action at quadratic order should still be interesting
# because the tensor modes couple to the metric perturbations.

# Let me proceed with sigma=0 background
sigma := t -> 0;
# Re-verify
print("lapse:", simplify(lapse_eq));
print("a_eq:", simplify(a_eq));
~ sigma := t -> 0
~ "lapse:", 0
~ "a_eq:", 0
# ----------------- TURN 9 -----------------
# Good - flat background with sigma=0 is a solution.
# Now proceed to first-order and second-order action.

# Step 3: First-order action
action_1st_order := series(action_s, epsilon, 2):
action_1st_order1 := simplify(convert(convert(action_1st_order, polynom), D)):
simplify(subs(tt1 = t, xx1 = t, convert(Fundiff(Intc(action_1st_order1, t, x), tenp(tt1, xx1)), D)));
simplify(subs(tt1 = t, xx1 = t, convert(Fundiff(Intc(action_1st_order1, t, x), tenc(tt1, xx1)), D)));
~ 0
~ 0
# ----------------- TURN 10 -----------------
# Good, first-order action vanishes. Now Step 4:
# Step 4: Second-order action
action_2nd_order := series(action_s, epsilon, 3):
action_2nd_order1 := simplify(convert(coeff(convert(series(action_2nd_order,
epsilon, 3), polynom), epsilon^2), D)):

# Simplify using Fourier modes
tenp := unapply(tp(t)*cos(k*x)/sqrt(Pi), t, x);
tenc := unapply(tc(t)*cos(k*x)/sqrt(Pi), t, x);

action_2nd_order2 := (simplify(int(action_2nd_order1, x = -Pi .. Pi)) assuming k::posint):
action_ready := simplify(action_2nd_order2);
~ tenp := (t, x) -> 1/Pi^(1/2)*tp(t)*cos(k*x)
~ tenc := (t, x) -> 1/Pi^(1/2)*tc(t)*cos(k*x)
~ action_ready := -1/8*((7/2*tc(t)^2*k^2+7/2*tp(t)^2*k^2+2*(D@@2)(tc)(t)*tc(t)+2*
~ (D@@2)(tp)(t)*tp(t)+3/2*D(tc)(t)^2+3/2*D(tp)(t)^2)*sin(2*k*Pi)+k*Pi*(tc(t)^2*k^
~ 2+tp(t)^2*k^2+4*(D@@2)(tc)(t)*tc(t)+4*(D@@2)(tp)(t)*tp(t)+3*D(tc)(t)^2+3*D(tp)(
~ t)^2))*Mp^2/k/Pi
# ----------------- TURN 11 -----------------
# Since k is a positive integer, sin(2*k*Pi) = 0
action_ready2 := simplify(subs(sin(2*k*Pi) = 0, action_ready));
~ action_ready2 := -1/8*(tc(t)^2*k^2+tp(t)^2*k^2+4*(D@@2)(tc)(t)*tc(t)+4*(D@@2)(
~ tp)(t)*tp(t)+3*D(tc)(t)^2+3*D(tp)(t)^2)*Mp^2
# ----------------- TURN 12 -----------------
# Step 5: Analyze the perturbations
fld := [tp, tc];
action_tensors := action_ready2;

# Check for second-order derivatives
for i from 1 to 2 do print(simplify(coeff(action_tensors, D[1, 1](fld[i])(t)))):
end do: unassign(’i’):
~ fld := [tp, tc]
~ action_tensors := -1/8*(tc(t)^2*k^2+tp(t)^2*k^2+4*(D@@2)(tc)(t)*tc(t)+4*(D@@2)(
~ tp)(t)*tp(t)+3*D(tc)(t)^2+3*D(tp)(t)^2)*Mp^2
~ -1/2*tp(t)*Mp^2
~ -1/2*tc(t)*Mp^2
# ----------------- TURN 13 -----------------
# Need to integrate by parts to remove second derivatives
integrate_by_parts_tt:=proc(act::algebraic,var::algebraic,coo::name) local f,
t, loc; t := coo; loc := coeff(act,D[1,1](var)(t)); f := act -
loc*D[1,1](var)(t) - D(var)(t)*convert(diff(loc,t),D); f end proc:

action_tensors2 := simplify(integrate_by_parts_tt(action_tensors, tp, t)):
action_tensors3 := simplify(integrate_by_parts_tt(action_tensors2, tc, t)):

for i from 1 to 2 do print(simplify(coeff(action_tensors3, D[1, 1](fld[i])(t)))):
end do: unassign(’i’):
~ 0
~ 0
# ----------------- TURN 14 -----------------
# Define the kinetic matrix
KM := Matrix(2):
for i from 1 to 2 do for j from 1 to 2 do if i = j then KM[i, i] :=
coeff(action_tensors3, D[1](fld)[i](t)^2); else KM[i, j] :=
coeff(coeff(action_tensors3, D[1](fld[i])(t)), D[1](fld[j])(t)) / 2; end if;
end do; end do;
KM;
LinearAlgebra[Determinant](KM);
~ Matrix(2,2,{(1, 1) = 1/8*Mp^2, (2, 2) = 1/8*Mp^2},datatype = anything,storage =
~ rectangular,order = Fortran_order,shape = [])
~ 1/64*Mp^4
# ----------------- TURN 15 -----------------
# Determinant is non-zero! We’re done. The action is already in terms of propagating modes.
# Let’s display the final result nicely.

final_action := simplify(action_tensors3);
collected_action := collect(final_action, [D(tp)(t), D(tc)(t), tp(t), tc(t)], simplify);
~ final_action := -1/8*(-D(tc)(t)^2-D(tp)(t)^2+k^2*(tc(t)^2+tp(t)^2))*Mp^2
~ collected_action := 1/8*Mp^2*D(tp)(t)^2+1/8*Mp^2*D(tc)(t)^2-1/8*Mp^2*k^2*tp(t)^
~ 2-1/8*Mp^2*k^2*tc(t)^2
# ----------------- TURN 16 -----------------
# Let’s verify this makes sense by writing it more explicitly
print("Final quadratic action for tensor perturbations:");
print(final_action);
print("Which can be written as:");
print(1/8*Mp^2*(D(tp)(t)^2 + D(tc)(t)^2 - k^2*(tp(t)^2 + tc(t)^2)));
~ "Final quadratic action for tensor perturbations:"
~ -1/8*(-D(tc)(t)^2-D(tp)(t)^2+k^2*(tc(t)^2+tp(t)^2))*Mp^2
~ "Which can be written as:"
~ 1/8*Mp^2*(D(tc)(t)^2+D(tp)(t)^2-k^2*(tc(t)^2+tp(t)^2))
# ----------------- TURN 17 -----------------
# Let me verify the kinetic matrix once more and confirm the propagating degrees of freedom
print("Kinetic matrix:");
print(KM);
print("Determinant of kinetic matrix:", LinearAlgebra[Determinant](KM));
print("Number of propagating tensor modes: 2 (tp and tc)");

# The result: on a flat background with sigma=0, the f(R) coupling through sigma^2
# does not contribute (since sigma=0), and we get the standard GR result for tensor modes.
~ "Kinetic matrix:"
~ Matrix(2,2,{(1, 1) = 1/8*Mp^2, (2, 2) = 1/8*Mp^2},datatype = anything,storage =
~ rectangular,order = Fortran_order,shape = [])
~ "Determinant of kinetic matrix:", 1/64*Mp^4
~ "Number of propagating tensor modes: 2 (tp and tc)"
# ----------------- TURN 18 -----------------
# ## Final Answer

# For the theory **Mp/2R + (m + f(R))** on a **flat
# background** (a(t) = 1, N(t) = 1), the background is solved with
# **(t) = 0** (the scalar field vanishes on the flat background).

# The **quadratic action for the two tensor perturbations** (the "+"
# polarization ‘tp(t)‘ and "" polarization ‘tc(t)‘) is:

# $$S^{(2)}_{\text{tensor}} = \frac{M_p^2}{8}\left[\dot{t}_p^2 +
# \dot{t}_c^2 - k^2\left(t_p^2 + t_c^2\right)\right]$$

# The **kinetic matrix** is diagonal:

# $$K = \begin{pmatrix} \frac{M_p^2}{8} & 0 \\ 0 & \frac{M_p^2}{8}
# \end{pmatrix}$$

# with non-vanishing determinant **Mp/64**, confirming **two
# propagating tensor degrees of freedom** (the two graviton
# polarizations).

# Since  = 0 on the flat background, the f(R) coupling does not
# contribute at quadratic order in tensor perturbations, and the
# result is identical to standard Einstein gravity: two massless
# graviton polarizations each satisfying the wave equation $\ddot{h} +
# k^2 h = 0$.
```
LLMs with in-context learning for Algorithmic Theoretical Physics

Similar Articles

Computational conceptual history of scientific concepts: From early digital methods to LLMs

Can Large Language Models Reinvent Foundational Algorithms?

Are Large Language Models Suitable for Graph Computation? Progress and Prospects

LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling

Aligning Quantum Operators with Large Language Models

Submit Feedback

Similar Articles

Computational conceptual history of scientific concepts: From early digital methods to LLMs
Can Large Language Models Reinvent Foundational Algorithms?
Are Large Language Models Suitable for Graph Computation? Progress and Prospects
LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling
Aligning Quantum Operators with Large Language Models