
Commit 39244a5

committed on 2025-03-04
1 parent b6df1cc

File tree

2 files changed: +40 -2 lines changed


papers/list.json

Lines changed: 18 additions & 0 deletions
@@ -1,4 +1,22 @@
 [
+{
+"title": "DORY: Deliberative Prompt Recovery for LLM",
+"author": "Lirong Gao et al",
+"year": "2024",
+"topic": "inversion",
+"venue": "Arxiv",
+"description": "This paper introduces a novel approach to recover original prompts from limited outputs of large language models. The authors discover a strong negative correlation between output probability-based uncertainty and prompt recovery success, showing that tokens with lower uncertainty are more likely to have appeared in the original prompt. Building on this insight, DORY recovers prompts through a three-step process: reconstructing a draft from output text, generating hints based on uncertainty, and reducing noise by comparing draft output with actual output. Unlike previous methods, DORY requires only a single LLM without any external resources or model training, making it a cost-effective solution for prompt recovery.",
+"link": "https://arxiv.org/pdf/2405.20657"
+},
+{
+"title": "Weak-to-Strong Reasoning",
+"author": "Yuqing Yang et al",
+"year": "2024",
+"topic": "reasoning",
+"venue": "Arxiv",
+"description": "This paper introduces a progressive learning framework for weak-to-strong reasoning, addressing the challenge of improving large language models (LLMs) without high-quality supervision. The authors demonstrate that naively fine-tuning a stronger model (like Llama2-70b) on outputs from weaker models (like Llama2-7b or Gemma-2b) is insufficient for complex reasoning tasks. Their proposed two-stage approach first uses selective data curation through a \"final answer consistency\" method to identify potentially correct examples, then applies preference optimization that enables the model to learn from contrasting examples. Experiments on mathematical reasoning datasets show substantial improvements over baseline approaches, with the framework proving particularly effective when the strong model learns to distinguish between correct and incorrect reasoning paths.",
+"link": "https://arxiv.org/pdf/2407.13647"
+},
 {
 "title": "The False Promise of Imitating Proprietary LLMs",
 "author": "Arnav Gudibande et al",

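The DORY entry added above summarizes a three-step recovery loop: reconstruct a draft prompt from the output, extract hints from low-uncertainty tokens, then reduce noise by comparing the draft's output with the actual output. Below is a minimal sketch of that loop, assuming a generic `generate` callable that returns generated text plus per-token probabilities; the helper names and prompts are illustrative, not the authors' implementation.

```python
# Hypothetical sketch of DORY-style prompt recovery (draft -> hints -> denoise).
# The `generate` interface and all names here are assumptions for illustration.
from typing import Callable, List, Tuple

# Assumed interface: prompt -> (generated_text, [(token, probability), ...]),
# e.g. from any API that exposes per-token logprobs.
GenerateFn = Callable[[str], Tuple[str, List[Tuple[str, float]]]]


def draft_from_output(output_text: str, generate: GenerateFn) -> str:
    """Step 1: ask the same LLM to reconstruct a draft prompt from its own output."""
    meta_prompt = (
        "The following text was produced by a language model. "
        "Write the instruction that most likely produced it:\n\n" + output_text
    )
    draft, _ = generate(meta_prompt)
    return draft


def hints_from_uncertainty(
    output_tokens: List[Tuple[str, float]], min_prob: float = 0.8
) -> List[str]:
    """Step 2: keep low-uncertainty (high-probability) output tokens as hints,
    since those are the tokens most likely to have appeared in the original prompt."""
    return [tok for tok, prob in output_tokens if prob >= min_prob]


def denoise(draft: str, hints: List[str], output_text: str, generate: GenerateFn) -> str:
    """Step 3: regenerate from the draft, compare against the actual output,
    and ask the model to revise the draft to close the gap."""
    draft_output, _ = generate(draft)
    revise_prompt = (
        f"Draft instruction:\n{draft}\n\n"
        f"Output it produces:\n{draft_output}\n\n"
        f"Target output:\n{output_text}\n\n"
        f"Tokens that likely appear in the true instruction: {', '.join(hints)}\n\n"
        "Revise the draft instruction so that it would produce the target output."
    )
    revised, _ = generate(revise_prompt)
    return revised


def recover_prompt(
    output_text: str, output_tokens: List[Tuple[str, float]], generate: GenerateFn
) -> str:
    draft = draft_from_output(output_text, generate)
    hints = hints_from_uncertainty(output_tokens)
    return denoise(draft, hints, output_text, generate)
```

As in the paper's summary, nothing here requires a second model or any training; the whole loop is driven by one LLM plus its output probabilities.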
papers_read.html

Lines changed: 22 additions & 2 deletions
@@ -16,10 +16,10 @@ <h1>Here's where I keep a list of papers I have read.</h1>
 I typically use this to organize papers I found interesting. Please feel free to do whatever you want with it. Note that this is not every single paper I have ever read, just a collection of ones that I remember to put down.
 </p>
 <p id="paperCount">
-So far, we have read 236 papers. Let's keep it up!
+So far, we have read 238 papers. Let's keep it up!
 </p>
 <small id="searchCount">
-Your search returned 236 papers. Nice!
+Your search returned 238 papers. Nice!
 </small>

 <div class="search-inputs">
@@ -46,6 +46,26 @@ <h1>Here's where I keep a list of papers I have read.</h1>
 </thead>
 <tbody>

+<tr>
+<td>DORY: Deliberative Prompt Recovery for LLM</td>
+<td>Lirong Gao et al</td>
+<td>2024</td>
+<td>inversion</td>
+<td>Arxiv</td>
+<td>This paper introduces a novel approach to recover original prompts from limited outputs of large language models. The authors discover a strong negative correlation between output probability-based uncertainty and prompt recovery success, showing that tokens with lower uncertainty are more likely to have appeared in the original prompt. Building on this insight, DORY recovers prompts through a three-step process: reconstructing a draft from output text, generating hints based on uncertainty, and reducing noise by comparing draft output with actual output. Unlike previous methods, DORY requires only a single LLM without any external resources or model training, making it a cost-effective solution for prompt recovery.</td>
+<td><a href="https://arxiv.org/pdf/2405.20657" target="_blank">Link</a></td>
+</tr>
+
+<tr>
+<td>Weak-to-Strong Reasoning</td>
+<td>Yuqing Yang et al</td>
+<td>2024</td>
+<td>reasoning</td>
+<td>Arxiv</td>
+<td>This paper introduces a progressive learning framework for weak-to-strong reasoning, addressing the challenge of improving large language models (LLMs) without high-quality supervision. The authors demonstrate that naively fine-tuning a stronger model (like Llama2-70b) on outputs from weaker models (like Llama2-7b or Gemma-2b) is insufficient for complex reasoning tasks. Their proposed two-stage approach first uses selective data curation through a &quot;final answer consistency&quot; method to identify potentially correct examples, then applies preference optimization that enables the model to learn from contrasting examples. Experiments on mathematical reasoning datasets show substantial improvements over baseline approaches, with the framework proving particularly effective when the strong model learns to distinguish between correct and incorrect reasoning paths.</td>
+<td><a href="https://arxiv.org/pdf/2407.13647" target="_blank">Link</a></td>
+</tr>
+
 <tr>
 <td>The False Promise of Imitating Proprietary LLMs</td>
 <td>Arnav Gudibande et al</td>
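The Weak-to-Strong Reasoning entry added above describes a first stage that curates training data via "final answer consistency" before a second, preference-optimization stage. Here is a toy sketch of that filtering idea, under the assumption that agreement among sampled weak-model answers serves as a proxy for correctness; the answer-extraction heuristic, field names, and threshold are illustrative, not the paper's exact recipe.

```python
# Toy sketch of "final answer consistency" data curation for weak-to-strong
# training. All details below are assumptions for illustration.
import re
from collections import Counter
from typing import Dict, List, Optional


def extract_final_answer(solution: str) -> Optional[str]:
    """Pull the last number out of a chain-of-thought solution (a common heuristic)."""
    numbers = re.findall(r"-?\d+(?:\.\d+)?", solution)
    return numbers[-1] if numbers else None


def curate_by_consistency(samples: Dict[str, List[str]]) -> List[Dict[str, str]]:
    """For each question, keep one weak-model solution only when a majority of
    the sampled solutions agree on the final answer, treating agreement as a
    proxy for correctness in the absence of gold labels."""
    curated = []
    for question, solutions in samples.items():
        answers = [extract_final_answer(s) for s in solutions]
        votes = Counter(a for a in answers if a is not None)
        if not votes:
            continue
        answer, count = votes.most_common(1)[0]
        if count / len(solutions) >= 0.5:  # majority of samples agree
            # keep one solution that reaches the majority answer for fine-tuning
            chosen = next(s for s, a in zip(solutions, answers) if a == answer)
            curated.append({"question": question, "solution": chosen})
    return curated


if __name__ == "__main__":
    demo = {
        "What is 17 + 25?": [
            "17 + 25 = 42. The answer is 42.",
            "Adding 17 and 25 gives 42.",
            "17 + 25 = 43. The answer is 43.",
        ]
    }
    print(curate_by_consistency(demo))  # keeps a solution agreeing on 42
```

The second stage summarized in the entry, preference optimization over contrasting correct and incorrect reasoning paths, is not shown here.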

0 commit comments
