Replies: 38 comments
-
Code as Reward: Empowering Reinforcement Learning with VLMs |
Beta Was this translation helpful? Give feedback.
-
StarCoder 2 and The Stack v2: The Next Generation |
Beta Was this translation helpful? Give feedback.
-
Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models |
Beta Was this translation helpful? Give feedback.
-
TaskWeaver: A Code-First Agent Framework |
Beta Was this translation helpful? Give feedback.
-
Large World Model (LWM) |
Beta Was this translation helpful? Give feedback.
-
Language Segment-Anything |
Beta Was this translation helpful? Give feedback.
-
EgoCOT: Embodied Chain-of-Thought Dataset for Vision Language Pre-training |
Beta Was this translation helpful? Give feedback.
-
Beta Was this translation helpful? Give feedback.
-
Beta Was this translation helpful? Give feedback.
-
Everything of Thoughts (XoT)triangle: Defying the Law of Penrose Triangle for Thought Generation |
Beta Was this translation helpful? Give feedback.
-
https://towardsdatascience.com/an-overview-of-the-lora-family-515d81134725 An Overview of the LoRA Family. LoRA, DoRA, AdaLoRA, Delta-LoRA, and… | by Dorian Drost | Mar, 2024 |
Beta Was this translation helpful? Give feedback.
-
Beta Was this translation helpful? Give feedback.
-
LlamaIndex |
Beta Was this translation helpful? Give feedback.
-
DeepSeek-VL: Towards Real-World Vision-Language Understanding |
Beta Was this translation helpful? Give feedback.
-
https://blog.research.google/2024/03/chain-of-table-evolving-tables-in.html Chain-of-table: Evolving tables in the reasoning chain for table understanding |
Beta Was this translation helpful? Give feedback.
-
Beta Was this translation helpful? Give feedback.
-
https://blog.research.google/2024/03/screenai-visual-language-model-for-ui.html?m=1 |
Beta Was this translation helpful? Give feedback.
-
https://hal.science/hal-04107105/document Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory |
Beta Was this translation helpful? Give feedback.
-
Grounding Language Plans in Demonstrations Through Counterfactual Perturbations |
Beta Was this translation helpful? Give feedback.
-
LLM Agent Operating System |
Beta Was this translation helpful? Give feedback.
-
Beta Was this translation helpful? Give feedback.
-
Open-World Object Manipulation using Pre-trained Vision-Language Models |
Beta Was this translation helpful? Give feedback.
-
Beta Was this translation helpful? Give feedback.
-
DeliGrasp: Inferring Object Mass, Friction, and Compliance with LLMs for Adaptive and Minimally Deforming Grasp Policies |
Beta Was this translation helpful? Give feedback.
-
InternLM2 Technical Report |
Beta Was this translation helpful? Give feedback.
-
Explorative Inbetweening of Time and Space |
Beta Was this translation helpful? Give feedback.
-
Can LLMs Separate Instructions From Data? And What Do We Even Mean By That? |
Beta Was this translation helpful? Give feedback.
-
Beta Was this translation helpful? Give feedback.
-
A Survey on Game Playing Agents and Large Models: Methods, Applications, and Challenges |
Beta Was this translation helpful? Give feedback.
-
Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Please post interesting papers!
https://huggingface.co/papers
Beta Was this translation helpful? Give feedback.
All reactions