Skip to content

Awesome LLM papers, news and projects about learning to reason with LLM, OpenAI o1, reasonning techniques, chain-of-thought (COT), Large Language Model, Straberry

Notifications You must be signed in to change notification settings

jxhuang0508/Awesome-LLM-Reasoning-OpenAI-o1

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 

Repository files navigation

Awesome Learning to Reason with LLMs: From Prompt Engineering to OpenAI o1 A Survey

Awesome Maintenance PR's Welcome visitor badge

This repository explores recent advancements in reasoning with LLMs, starting from conventional methods like Chain-of-Thought (CoT) prompting to the latest innovations, such as OpenAI's learning-based CoT reasoning in the o1 model series. It provides a comprehensive overview of how reasoning with LLMs has evolved, inlcuding recent advancements, key research papers, and discussions from blogs, social media, and expert talks.


Table of Contents


Official OpenAI Documents

  • OpenAI o1 Hub
    • Main page introducing the o1 series, including details about its capabilities, reasoning tasks, and usage limits.
  • OpenAI o1 System Card
    • A document outlining the safety evaluations, potential risks, and mitigations applied to the o1 model​.
  • OpenAI Reasoning Guide
    • A guide focusing on how OpenAI models can enhance reasoning tasks, explaining methods like chain-of-thought reasoning and how models handle complex problem-solving.

News about LLM Reasoning OpenAI o1

Blogs

Talks

Social Media Posts

Twitter

Reddit

Awesome Papers

How to Contribute

  • We welcome every researcher who contributes to this repository. Please feel free to pull requests or contact us if you find any related papers that are not included here.

The process to submit a pull request is as follows:

  • a. Fork the project into your own repository.
  • b. Add the Title, Paper link, Conference, Project/Code link in README.md using the following format:
  |[Title](Paper Link)|Venue|[Code](Code link)|
  • c. Submit the pull request to this branch.

Learning-based Reasoning

Title Venue Code
STaR: Self-Taught Reasoner Bootstrapping Reasoning With Reasoning NeurIPS 2022 Code
Large Language Models Can Self-Improve EMNLP 2023 -
Training Chain-of-Thought via Latent-Variable Inference NeurIPS 2023 -
V-STaR: Training Verifiers for Self-Taught Reasoners COLM 2024 -
Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking arXiv 2024 -
MindStar: Enhancing Math Reasoning in Pre-trained LLMs at Inference Time arXiv 2024 -
Q*: Improving Multi-step Reasoning for LLMs with Deliberative Planning arXiv 2024

Prompt Engineering-based Reasoning

Title Venue Code
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models NeurIPS 2022 -
Large Language Models are Zero-Shot Reasoners NeurIPS 2022 -
Automatic Chain of Thought Prompting in Large Language Models arXiv 2022 Code
Self-Consistency Improves Chain of Thought Reasoning in Language Models ICLR 2023 -
Tree of Thoughts: Deliberate Problem Solving with Large Language Models NeurIPS 2023 Code
Large Language Model Guided Tree-of-Thought arXiv 2024 Code
ART: Automatic multi-step reasoning and tool-use for large language models arXiv 2023 -
Active Prompting with Chain-of-Thought for Large Language Models ACL 2024 Code

About

Awesome LLM papers, news and projects about learning to reason with LLM, OpenAI o1, reasonning techniques, chain-of-thought (COT), Large Language Model, Straberry

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published