Improve Codex Manipulation Of Notebooks

Some of the issues I've observed

* Codex ends up adding/modifying the wrong notebook
   * I think this happens if you switch tabs
   * I think our toolcalls might implicitly assume you are working with the current notebook so if you switch tabs while codex is working it messes things up

* codex gets cell syntax wrong
   * I think there are enums for cellType that it frequently gets wrong (uses an int not the string or vice versa)
   * This makes modifications very slow because it has to do multiple trials to get it correct

* Poor job searching Google Drive

I think we'd like to move away from multiple tool calls and just have a single toolcall to let codex execute code. We could then build out suitable libraries. These libraries could also be used by users.

# How could we build a suitable sandbox to safely execute agentic code?

* Google Drive - restrict to readonly access via scopes 
* Data exfiltration - Block network access except to Google
* Allow Read/Write to open notebooks - these are just read/writes to indexed DB
   * We could always make them undoable

https://github.com/runmedev/web/blob/main/docs-dev/design/0310_appkernel_sandbox.md



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve Codex Manipulation Of Notebooks #154

How could we build a suitable sandbox to safely execute agentic code?

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Improve Codex Manipulation Of Notebooks #154

Description

How could we build a suitable sandbox to safely execute agentic code?

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions