@Liu_zhongxisn: https://x.com/Liu_zhongxisn/status/2057267000137896110

X AI KOLs Timeline 05/21/26, 01:07 AM Tools

codex-app ai-code-assistant workflow file-generation web-development automation newbie-guide

Summary

A practical tutorial for beginners to advanced users on Codex App, detailing how to make AI generate real files (Word/PDF/PPT/Excel), practice the complete web development process through mini-games, and use Playwright to automate business processes. It emphasizes starting from delivering real results rather than just talking about concepts.

https://t.co/vSjgtWxiOc

Original Article

View Cached Full Text

Cached at: 05/21/26, 09:40 PM

Codex App Advanced Practical Guide: From Chat Box to Real Deliverables

Many people use Codex, but in the end they treat it as just a chat box that’s good at writing code.

They get it to write copy, edit titles, organize materials, and run a few small tasks.

But when it comes to real workflows, problems emerge:

Can it actually deliver files? Can it make a webpage that opens? Can it leave behind screenshots, logs, and revision records? Can it push something from requirements all the way to delivery?

The previous article was about going from 0 to 1: first knowing where the entry point is, how to submit requirements, and how to run through a low-risk task.

This article takes it one step further.

This article won’t talk about too many concepts, only three actionable training paths:

How to make Codex deliver real files
How to practice the complete web page flow with a small game
And when you can start touching business automation like Playwright

The first two are better for beginners to practice. Treat the third one as an advanced direction for now.

You don’t need to rush into big projects. The advanced path for Codex is best started with “Can it deliver something real?” rather than “Can it make me money?”

1. Office Deliverables: Don’t Just Let Codex Write Text – Make It Deliver Files

Many people use AI to write proposals, but in the end, they still have to copy it themselves into Word, PPT, and spreadsheets, then slowly adjust the formatting.

Here, don’t just let Codex give you a block of text.

Push it to deliver a file package that can be opened, screenshot, and edited again.

If you regularly make event plans, project reports, course materials, consulting reports, or employee training materials, you can have Codex generate real files directly, instead of just giving you a block of Markdown.

I did a demo task with the theme “Codex App Content Production Workflow.” The prompt was roughly like this:

In this conversation, please do a demo task for an office suite of four items, with the theme “Codex App Content Production Workflow.” Requirements: Generate four types of real files: Word document, PDF file, PPTX presentation, and XLSX spreadsheet; Each file must have actual content, not empty files; After generation, tell me the real path of each file; Try to generate a preview or file list so I can screenshot it for the tutorial; Finally, provide an acceptance checklist to confirm all four files exist, can be opened, and are not placeholder content.

Now, when I judge whether Codex is useful, I don’t look at how beautifully it answers.

I only look at one thing: can it deliver real files, real pages, real screenshots, and real logs?

Now I recommend you verify Codex like this:

Whether the file was actually generated;
Whether the path is clearly written;
Whether the content is placeholders;
Whether the layout is readable after opening;
Whether the spreadsheet has real fields and data;
Whether the PPT can actually play;
Whether the PDF renders correctly.

Once this chain works, many lightweight material production tasks that used to be delegated to copywriters, designers, assistants, and operators can be handed to Codex to generate the first version.

I actually ran through this set of tasks.

Not by letting it say “I’ve generated it” in the chat box, but by checking whether the files actually exist locally, whether they open normally, and whether I could take screenshots. Below are the acceptance screenshots.

Acceptance results:

Word document acceptance screenshot:
PDF file acceptance screenshot:
PPT presentation acceptance screenshot:
Spreadsheet acceptance screenshot:

A note about PPT.

There are currently two approaches worth trying (treat this as a class assignment):

Image-based PPT: Have Codex generate a series of 16:9 visual images, then lay them into PPT, suitable for visual impact.
HTML-based PPT: Write slides using HTML, CSS, and JS, suitable for layout control, animations, screenshots, and screen recording.

At the beginning, don’t worry about which approach is better. Being able to generate a PPT that actually opens is much more important than “which solution is more advanced.”

Then, after three rounds of feedback-based modifications, its boundaries will quickly become clear.

2. Content Production: Turn One Sentence into a Playable Web Page

The second type of practical scenario is content production.

Here, I don’t recommend jumping straight into building a large official website, complex backend, or full app.

A better fit for beginners is a small web game.

For example, Snake, Breakout, or obstacle avoidance.

These things look like play, but they’re especially good for practicing with Codex. Because the result is very intuitive: as soon as the page opens, you immediately know whether it works.

This is much more suitable for getting started than “a webpage that looks advanced but is hard to verify.”

The first time you make a web page, don’t aim for perfection. First, get a version that runs.

The practical benefits of small web games are very real:

Fast feedback: if something looks bad, moves too fast, or has a bug, you can find it within minutes;
High controllability: usually no database, account system, or complex backend required;
Good presentation effect: later, when making tutorials, screenshots, or screen recordings, it’s more engaging than abstract configurations;
Complete chain: requirements, code, running, testing, feedback, optimization – you go through all of them.

I treat it as Codex’s first web page practical case.

Not because Snake itself is important, but because it lets you truly feel one thing: Codex isn’t just capable of writing code snippets – it can turn a vague idea into a runnable page.

You can give the first round of prompts like this:

Please design a Snake game in HTML. Requirements:

Controllable with keyboard arrow keys, press Enter to pause;

Speed increases as the snake’s length grows;

Only one apple appears at a time, random position;

Press Space to accelerate, acceleration should be higher than the current speed level;

Hitting the walls or its own body kills the snake;

Pop retro style, with motion effects and death explosion effects;

Make it a web page that can be opened directly in a browser.

This prompt is already much clearer than “help me make a small game.”

It gives Codex several key constraints: gameplay, keys, speed, failure conditions, visual style, and delivery format.

After the first version comes out, don’t rush to add a bunch of new features.

For the first version, only check four things:

Can the browser open it;
Can the game start;
Do the arrow keys work;
Does death and restart work normally.

Once these four things work, it’s worth adding more later.

You can continue to give feedback based on actual results:

Speed too fast, lower the initial speed;
Colors too ugly, change to more prominent retro contrast;
Start button not obvious, enlarge it and add status hints;
Score display unclear, fix it to the top right corner;
Death effect too weak, add an explosion animation;
If it’s not fun on mobile, explicitly make it a desktop keyboard-only version.

The easiest place to mess up here is the rhythm of changing requirements.

Only change one type of problem at a time.

First, change gameplay, then UI, then add instructions, and finally add screenshots and descriptive text. Don’t let it modify rules, change style, add sound effects, add leaderboards, and add mobile adaptation all at once.

Many people fail when making web pages with Codex the first time, not because Codex is incapable, but because the requirements pile up chaotically.

You need to let it keep moving forward, not tear down and start over every round.

Once the small game runs smoothly, you can conveniently turn it into tutorial material.

For example, have Codex output:

Local file structure;
How to run it;
Screenshot description of the first version;
Revision record for each round;
Common bugs and fixes;
Feature list of the final version.

At this point, it has not only made a webpage.

Later I found that what really saved time wasn’t the small game itself, but the reusable tutorial production process I got as a byproduct.

In the past, when writing this kind of tutorial, the most annoying part wasn’t writing the text, but taking screenshots, arranging steps, adding descriptions, and checking for omissions. Each step wasn’t difficult, but together they were very time-consuming.

Codex is well-suited for handling tasks that are “not complex individually but have a long chain.”

If you frequently repeat a certain type of delivery – whether it’s tutorials, event plans, or listing processes – you can go one step further: turn it into your own Skill.

Not a “write more like me” instruction, but clearly writing out your structural habits:

Start with real people and specific bottlenecks;
Bold the core judgment early;
Break complex methods into numbered steps;
Write clearly the judgment, method, and acceptance for each step;
Avoid neat AI clichés;
End with clear actions, not generic summaries.

One-shot generation often leads to “right direction but wrong tone.”

A workflow approach will be a bit slower, but it accumulates.

3. Business Automation: This Is the Advanced Tutorial – Come Study Here After Understanding the Above

Finally, let’s talk about a more advanced scenario.

Codex is not meant to be put on a pedestal and studied. It’s meant to be used for work.

This isn’t a task for beginners on day one, but you need to know how far Codex can go.

I once talked to an e-commerce business owner. He mainly sells on Pinduoduo, his store is expanding, and back-end listing and maintenance are already dragging down efficiency.

Their pain points are typical:

Large number of product links;
Multiple SKUs under each link;
Thousands of rows of data in spreadsheets;
Hundreds of active product links;
When opening a new store, they need to recreate hundreds of product links;
Each SKU requires filling in stock, group purchase price, spec code, product code, and reference price;
Daily maintenance of price and code changes;
Manually entering data from spreadsheets into the backend is slow and error-prone.

This scenario is perfect for automation.

Because it’s not about creative judgment.

It’s essentially just one sentence:

Stably fill structured data from a spreadsheet into a web backend.

My first thought was Computer Use.

That is, have Codex look at the screen, operate the computer, open the spreadsheet, identify backend fields, and fill in the content.

This approach works for open websites, but it can easily run into problems on backends like Pinduoduo. Those backends detect automated behavior, and Computer Use may not access them smoothly.

The real breakthrough came with the second path: Playwright.

Playwright doesn’t look at the screen and click the mouse. It controls the browser through code, directly finding buttons, input fields, and dropdown menus on the page.

This is better suited for batch tasks.

The complete flow can be broken down like this:

What Codex handles here is more than just “write a script.”

It can read Excel, analyze which columns are useful, organize them into a standard listing table, let you confirm field mappings, and then write the Playwright automation.

When running the first product, it can take a screenshot and pause to let you confirm the page is filled correctly. After confirmation, it runs the batch.

Finally, it can output success logs, failure logs, and error screenshots for easy review later.

Once this kind of thing is stable, it’s not just a one-time efficiency gain.

If this chain stabilizes, the next step is to turn it into an internal company Skill.

From “help me fill in this table once” to “from now on, all listing tasks follow this flow.”

Common Pitfalls

There’s no need to explain scenarios too much, but the pitfalls we’ve encountered must be mentioned in advance.

1. Computer Use Is Not Omnipotent

The current usage environment for Computer Use also has limitations. Not everyone can open Codex and use it directly.

Computer Use is very much like a person sitting in front of a computer, looking at the screen and operating it.

It’s suitable for open websites, document editing, data organization, and screenshot-type tasks.

But if you need to operate e-commerce backends, banking backends, or highly secure platforms, it may not work well. These platforms check browser fingerprints, WebDriver flags, headless mode, login environment, and behavior patterns.

For such backends, I would prioritize Playwright, and try to reuse an already logged-in browser environment as much as possible.

The idea is not to circumvent rules, but to make automation more stable and controllable, with confirmation and logging at every step.

2. Codex Cannot Take Over an Already Open Terminal on Your Machine

Some people might think: can I have Codex operate my locally open Ghostty, Claude Code, or other terminals?

Currently, this path is not smooth.

Codex can see the screen, but it cannot casually take over external terminal process input. The terminal interface is also relatively opaque for automation – cursor, focus, and input state are unstable.

For such tasks, I use a different approach:

If a command can be run in Codex’s built-in terminal, just let it run directly;
If I want to invoke a web AI, let it open the web version;
If I want it to do what Claude Code can do, many times it’s easier to just do it directly in Codex.

3. Only Giving Direction, Not Ideas, Will Lead to Unstable Output

Many people use it like this:

Help me write an article about XX.

Or:

Help me make a nice-looking webpage.

These prompts can be used, but the result often becomes a cliché generic answer.

Codex doesn’t know your knowledge structure, your aesthetic, or where your target audience is stuck.

If you give it direction, it will freely generate.

If you give it ideas, it’s helping you accelerate execution.

A better input should look like this:

What is the core point;
Who is the target audience;
Where are they stuck;
What problem does the article solve;
Roughly what each section should cover;
What shouldn’t be written;
What files need to be delivered at the end;
How to consider the task complete.

This is also why I always recommend building Skills.

The clearer you write, the less room for guesswork.

Final Thoughts

When you truly incorporate Codex into your workflow, the most important thing is not how much you consumed this time, but whether something was left behind.

File templates, acceptance checklists, screenshot records, reusable prompts, automation scripts.

If these are left behind, the next time you won’t start from zero.

This is not to urge you to immediately ramp up.

What beginners really need to consider is: Did this practice leave behind something reusable?

Did the office suite task leave behind file templates and an acceptance checklist? Did the small web game leave behind requirements, code, screenshots, and revision records? Did the image-text tutorial leave behind material structure, step descriptions, and a published version?

As these things accumulate, you will gradually learn what Codex is good at and what it is not.

Codex is not meant to be put on a pedestal and studied.

It’s meant to be used for work.

@Liu_zhongxisn: https://x.com/Liu_zhongxisn/status/2057267000137896110

Codex App Advanced Practical Guide: From Chat Box to Real Deliverables

1. Office Deliverables: Don’t Just Let Codex Write Text – Make It Deliver Files

2. Content Production: Turn One Sentence into a Playable Web Page

3. Business Automation: This Is the Advanced Tutorial – Come Study Here After Understanding the Above

Common Pitfalls

1. Computer Use Is Not Omnipotent

2. Codex Cannot Take Over an Already Open Terminal on Your Machine

3. Only Giving Direction, Not Ideas, Will Lead to Unstable Output

Final Thoughts

Similar Articles

@gengdaJ: https://x.com/gengdaJ/status/2053724702993190917

@wei_wang: https://x.com/wei_wang/status/2057666488530596258

@xiaogaifun: https://x.com/xiaogaifun/status/2064268648601268540

@qloog: Stop calling AI a mere efficiency booster. This OpenAI-endorsed Codex tutorial lets one person do an entire team’s job—iOS app, code, investor deck—end-to-end. Two levers: custom skills (reusable know-how) + automation (exponential speed).

@AmberTreelet: https://x.com/AmberTreelet/status/2057096445610881349

Submit Feedback

Similar Articles

@gengdaJ: https://x.com/gengdaJ/status/2053724702993190917

@wei_wang: https://x.com/wei_wang/status/2057666488530596258

@xiaogaifun: https://x.com/xiaogaifun/status/2064268648601268540

@qloog: Stop calling AI a mere efficiency booster. This OpenAI-endorsed Codex tutorial lets one person do an entire team’s job—iOS app, code, investor deck—end-to-end. Two levers: custom skills (reusable know-how) + automation (exponential speed).

@AmberTreelet: https://x.com/AmberTreelet/status/2057096445610881349