Iteration of Thought

Enhancing Large Language Models with Inner Dialogue

The paper "Iteration of Thought: Leveraging Inner Dialogue for Autonomous Large Language Model Reasoning" by Santosh Kumar Radha, Yasamin Nouri Jelyani, Ara Ghukasyan, and Oktay Goktas introduces a novel approach to improving the reasoning capabilities of large language models (LLMs). The key idea? Use an inner dialogue mechanism to iteratively refine responses, enhancing accuracy and reducing the need for human intervention.

🛠️ The IoT Framework

The Iteration of Thought (IoT) framework consists of three core components:

  1. Inner Dialogue Agent (IDA): Generates dynamic, context-sensitive prompts based on the original query and the LLM’s previous responses.

  2. LLM Agent (LLMA): Processes these prompts using its internal knowledge to refine responses.

  3. Iterative Prompting Loop: Facilitates a back-and-forth between the IDA and LLMA until a satisfactory answer is reached or the maximum number of iterations is hit.

🔄 Two Variants of IoT

  • 🧠 Autonomous Iteration of Thought (AIoT): The LLM decides when to stop iterating based on a stopping criterion.

  • 🎯 Guided Iteration of Thought (GIoT): A fixed number of iterations ensures thorough exploration of reasoning paths.

🌟 What Sets IoT Apart?

Unlike methods like Chain of Thought (CoT) or Tree of Thoughts (ToT), IoT dynamically adjusts to evolving contexts. The inner dialogue mechanism allows for more adaptive cross-path exploration, making it efficient by avoiding unnecessary thought generation and discarding.

🔬 Experimental Setup and Results

The IoT framework was tested across various datasets, including:

  • GPQA: For complex reasoning tasks.

  • Game of 24: Explorative problem-solving.

  • Mini Crosswords: Puzzle solving.

  • HotpotQA: Multi-hop question answering.

Results:

  • GPQA: AIoT improved accuracy by 14.11% over the baseline.

  • Game of 24 & Mini Crosswords: GIoT outperformed other methods due to its comprehensive exploration strategy.

  • HotpotQA: AIoT achieved higher Exact Match (EM) and F1 scores compared to CoT and multi-agent frameworks like AgentLite.

✅ Advantages and Limitations

Advantages:

  • ⚡ Dynamic Adaptation: Improves response accuracy and reduces the need for human intervention.

  • 📝 Explainability: Provides a clear trace of the reasoning process.

Limitations:

  • 🛑 Premature Stopping (AIoT): Could lead to incomplete answers.

  • 🔄 Redundant Iterations (GIoT): Can increase computational cost.

  • 💡 Task Complexity: Performance varies depending on the complexity of the task and LLM capabilities.

🏁 Conclusion

The IoT framework introduces an innovative way to enhance LLM reasoning using an inner dialogue mechanism for dynamic, iterative refinement. It outperforms methods like CoT in complex reasoning tasks, offering a balance between efficiency and comprehensive exploration.

Future work could expand the knowledge base of the IDA and integrate specialized LLMs to further boost performance.

Listen to the podcast: https://lnkd.in/gsCrCkx9

🚀 Explore the Paper: Interested in pushing the boundaries of what small language models can achieve? This paper is a must-read.

Subscribe for more insights like this!