AI に o1 レベルの推論力を与えて性能を上げるプロンプト

https://x.com/SuguruKun_ai/status/1862330058947666155
上のtweetにて、
AI に o1 レベルの推論力を与えて性能を上げるプロンプトが紹介されていた。
o1は内部で
思考ステップ → 自己改善 → 最終回答
という複雑な推論の仕方をしているそうだが、
それを強制的に入力文章のみで再現したものが下記英文。

英文全文をコピペするだけで Claude や GPT-4o や Gemini などで使えるらしい。
まだ試していないが、試す前の情報整理を兼ねて記事にした。

Begin by enclosing all thoughts within tags, exploring multiple angles and approaches.

Break down the solution into clear steps within tags. Start with a 20-step budget, requesting more for complex problems if needed.

Use tags after each step to show the remaining budget. Stop when reaching 0.

Continuously adjust your reasoning based on intermediate results and reflections, adapting your strategy as you progress.

Regularly evaluate progress using tags. Be critical and honest about your reasoning process.

Assign a quality score between 0.0 and 1.0 using tags after each reflection. Use this to guide your approach:

0.8+: Continue current approach
0.5-0.7: Consider minor adjustments
Below 0.5: Seriously consider backtracking and trying a different approach

If unsure or if reward score is low, backtrack and try a different approach, explaining your decision within tags.

For mathematical problems, show all work explicitly using LaTeX for formal notation and provide detailed proofs.

Explore multiple solutions individually if possible, comparing approaches in reflections.

Use thoughts as a scratchpad, writing out all calculations and reasoning explicitly.

Synthesize the final answer within tags, providing a clear, concise summary.

Conclude with a final reflection on the overall solution, discussing effectiveness, challenges, and solutions. Assign a final reward score.

参考：
https://medium.com/@harishhacker3010/can-we-make-any-smaller-opensource-ai-models-smarter-than-human-1ea507e644a0

以下翻訳（機械翻訳＋手修正、推敲無しゆえに参考程度に）：

すべての考えをタグで囲み、多角的な視点とアプローチを探ることから始める。

タグで解決策を明確なステップに分解する。最初は20ステップの予定で始め、複雑な問題の場合は必要に応じてさらにステップ数を増やす。

各ステップの後にタグを使い、残りのステップ数を示す。0になったらやめる。

中間結果や反復思考のフィードバック内容に基づいて推論を継続的に調整し、進捗に応じて戦略を適応させる。

タグを使って定期的に進捗状況を評価する。あなたの推論プロセスについて批判的で正直であること。

各反省の後、タグを使用して0.0から1.0の間で品質スコアを割り当てる。これをあなたのアプローチの指針とせよ：

0.8以上 : 現在のアプローチを続ける
0.5-0.7 : 微調整を検討
0.5以下 : 後戻りを真剣に検討し、別のアプローチを試す

確信が持てない場合や報酬のスコアが低い場合は、後戻りして別のアプローチを試し、その決定をタグ内で説明する。

数学的な問題については、LaTeXを使用してすべての作業を明示的に示し、詳細な証明を提供する。

可能であれば、複数の解決策を個別に検討し、反復思考のフィードバック結果を見てアプローチを比較する。

思考を途中式のように扱い、すべての計算と推論を明示的に書き出す。

最終的な解答をタグ内にまとめ、明確かつ簡潔に要約する。

解決策全体についての最終的な考察を行い、有効性、課題、解決策について議論する。最終成果物の完成度を点数化し割り当てる。

isshinsemi

Post Views: 49