Update `/generate.py` to handle "thinking" models #1323

srdas · 2025-04-14T23:22:00Z

Fixes Issue #1315

In models that return an entire chain of thought, /generate fails as the notebook file name becomes excessively long, as noted in the issue cited above.

The title returned contains all the thinking about the title, including proposed title text buried within qoutes inside the chain of thought (CoT). This modification extracts the suggested title in the CoT and uses it for the title.

The modification also adds a generic failsafe title in case the CoT fails to return a title. For non-CoT LLMs, the code does not pursue this special handling.

To test:
[1] Set chat model provider to Ollama and the model to huggingface.co/lmstudio-community/DeepCoder-14B-Preview-GGUF:Q6_K
[2] Try the following chat command: /generate write "hi" in the file "allo.txt" in python (this will take some time to run, as the model works through its process of reflection).

As shown:

Result:

Additional tests undertaken:

The chat command /generate add two numbers in python with the same LLM gives the notebook title: Adding Two Numbers in Python - Basic to Advanced.ipynb.
The chat command /generate write "hi" in the file "allo.txt" in python generates a title: Write 'hi' to allo.txt.ipynb. (Here the LLM is deepseek-chat with provider OpenRouter.)

for more information, see https://pre-commit.ci

packages/jupyter-ai/jupyter_ai/chat_handlers/generate.py

* add test catching error with empty touched config.json * handle empty touched config.json, passes test * ensure schema file doesn't have duplicated types * pre-commit

packages/jupyter-ai/jupyter_ai/chat_handlers/generate.py

srdas and others added 2 commits April 14, 2025 16:09

Update /generate.py to handle "thinking" models

286893f

[pre-commit.ci] auto fixes from pre-commit.com hooks

a8acca2

for more information, see https://pre-commit.ci

srdas added the bug Bugs reported by users label Apr 14, 2025

srdas requested a review from dlqqq April 14, 2025 23:31

JGuinegagne reviewed Apr 15, 2025

View reviewed changes

packages/jupyter-ai/jupyter_ai/chat_handlers/generate.py Show resolved Hide resolved

dlqqq and others added 3 commits April 15, 2025 14:07

[3.x] Expand edge case handling in ConfigManager (jupyterlab#1322)

90b730d

* add test catching error with empty touched config.json * handle empty touched config.json, passes test * ensure schema file doesn't have duplicated types * pre-commit

added tests for generate_title

5e724ce

Merge branch 'main' into generate_think

b7bb731

srdas requested a review from JGuinegagne April 15, 2025 21:11

JGuinegagne reviewed Apr 16, 2025

View reviewed changes

packages/jupyter-ai/jupyter_ai/chat_handlers/generate.py Outdated Show resolved Hide resolved

packages/jupyter-ai/jupyter_ai/chat_handlers/generate.py Outdated Show resolved Hide resolved

srdas added 2 commits April 16, 2025 10:45

Update generate.py

c283f35

Update generate.py

8eb4c55

srdas requested a review from JGuinegagne April 16, 2025 18:01

JGuinegagne approved these changes Apr 16, 2025

View reviewed changes

srdas requested review from dlqqq and removed request for dlqqq April 16, 2025 18:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update `/generate.py` to handle "thinking" models #1323

Update `/generate.py` to handle "thinking" models #1323

srdas commented Apr 14, 2025 •

edited

Loading

Update /generate.py to handle "thinking" models #1323

Are you sure you want to change the base?

Update /generate.py to handle "thinking" models #1323

Conversation

srdas commented Apr 14, 2025 • edited Loading

Update `/generate.py` to handle "thinking" models #1323

Update `/generate.py` to handle "thinking" models #1323

srdas commented Apr 14, 2025 •

edited

Loading