NL2G/TransformingScienceLLMs

Repository files navigation

Transforming Science with Large Language Models

arXiv MIT License Maintenance Contribution Welcome

Welcome to the Transforming Science with Large Language Models repository! This repository is a collection of the most influential papers, AI models, and tools to empower researchers and academics worldwide to conduct their research more efficiently and effectively.

👥 Authors

📢 Updates

👀 Introduction

Science is undergoing a transformation with AI-driven tools assisting researchers at every stage of the research cycle.

Our survey provides a comprehensive overview of LLMs' role in scientific workflows, structured around five key areas: search and summarization, experimentation, unimodal and multimodal content generation, and peer review.

For a detailed introduction, please refer to our survey paper.

📌 Table of Contents

🔍 Literature Search, Summarization, and Comparison

AI-Enhanced Search

| Platform | Search | Recommendations | Collections | Citation Analysis | Trending Analysis | Author Profiles | Visualization Tools | Paper Chat | Idea Generation | Paper Writing | Summarization | Paper Review | Datasets | Code Repositories | LLM Integration | Web API | Personalization | Free |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Elicit | ✔️ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ✔️ | ✔️ | ❌ | ✔️ | ✔️ | ❌ | ❌ | ✔️ | ❌ | ❌ | ✔️❌ |
| OpenScholar | ✔️ | ❌ | ✔️ | ❌ | ❌ | ❌ | ❌ | ✔️ | ❌ | ❌ | ✔️ | ❌ | ❌ | ❌ | ✔️ | ❌ | ❌ | ✔️ |
| Undermind | ✔️ | ❌ | ✔️ | ❌ | ❌ | ❌ | ❌ | ✔️ | ❌ | ❌ | ✔️ | ❌ | ❌ | ❌ | ✔️ | ❌ | ✔️ | ❌ |
| Perplexity | ✔️ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ✔️ | ✔️ | ❌ | ✔️ | ✔️ | ❌ | ❌ | ✔️ | ❌ | ❌ | ✔️❌ |
| Consensus | ✔️ | ❌ | ✔️ | ❌ | ❌ | ❌ | ❌ | ✔️ | ❌ | ❌ | ✔️ | ❌ | ❌ | ❌ | ✔️ | ✔️ | ❌ | ✔️❌ |
| SciSpace | ✔️ | ❌ | ✔️ | ❌ | ❌ | ❌ | ❌ | ✔️ | ✔️ | ❌ | ✔️ | ✔️ | ❌ | ❌ | ✔️ | ❌ | ❌ | ✔️❌ |
| scienceQA | ✔️ | ❌ | ✔️ | ✔️ | ❌ | ❌ | ❌ | ✔️ | ✔️ | ❌ | ✔️ | ✔️ | ❌ | ❌ | ✔️ | ❌ | ❌ | ✔️❌ |
| PaperQA2 | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ✔️ | ❌ | ❌ | ❌ | ❌ | ❌ | ✔️ | ✔️ | ❌ | ❌ | ✔️ |
| Paperguide | ✔️ | ❌ | ✔️ | ❌ | ❌ | ❌ | ❌ | ✔️ | ✔️ | ❌ | ✔️ | ✔️ | ❌ | ❌ | ✔️ | ❌ | ❌ | ✔️❌ |
| HyperWrite | ✔️ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ❌ | ❌ | ✔️ | ❌ | ❌ | ❌ |
| ResearchKick | ✔️ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ❌ | ❌ | ✔️ | ❌ | ✔️ | ❌ |

Graph-Based

| Platform | Search | Recommendations | Collections | Citation Analysis | Trending Analysis | Author Profiles | Visualization Tools | Paper Chat | Idea Generation | Paper Writing | Summarization | Paper Review | Datasets | Code Repositories | LLM Integration | Web API | Personalization | Free |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Connected Papers | ✔️ | ❌ | ✔️ | ❌ | ❌ | ❌ | ✔️ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ✔️❌ |
| ScholarGPS | ✔️ | ❌ | ❌ | ✔️ | ✔️ | ✔️ | ✔️ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ✔️ |
| CiteSpace | ❌ | ❌ | ❌ | ❌ | ✔️ | ❌ | ✔️ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ✔️❌ |
| Sci2 | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ✔️ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ✔️ |
| NLP KG | ✔️ | ❌ | ✔️ | ✔️ | ❌ | ✔️ | ✔️ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ✔️ |
| ORKG ASK | ✔️ | ❌ | ✔️ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ✔️ | ❌ | ❌ | ❌ | ✔️ | ❌ | ❌ | ✔️ |

Paper Chat

| Platform | Search | Recommendations | Collections | Citation Analysis | Trending Analysis | Author Profiles | Visualization Tools | Paper Chat | Idea Generation | Paper Writing | Summarization | Paper Review | Datasets | Code Repositories | LLM Integration | Web API | Personalization | Free |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| ChatGPT | ✔️ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ❌ | ❌ | ✔️ | ✔️ | ❌ | ✔️❌ |
| Claude | ✔️ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ❌ | ❌ | ✔️ | ✔️ | ❌ | ✔️❌ |
| Deepseek | ✔️ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ❌ | ❌ | ✔️ | ✔️ | ❌ | ✔️ |
| Research | ❌ | ❌ | ✔️ | ❌ | ❌ | ❌ | ❌ | ✔️ | ✔️ | ❌ | ✔️ | ✔️ | ❌ | ❌ | ✔️ | ❌ | ❌ | ✔️❌ |
| NotebookLM | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ✔️ | ✔️ | ❌ | ✔️ | ✔️ | ❌ | ❌ | ✔️ | ❌ | ✔️ | ✔️❌ |
| EnagoRead | ✔️ | ❌ | ✔️ | ❌ | ❌ | ❌ | ❌ | ✔️ | ✔️ | ❌ | ✔️ | ✔️ | ❌ | ❌ | ✔️ | ❌ | ✔️ | ✔️❌ |
| DocAnalyzer.AI | ❌ | ❌ | ✔️ | ❌ | ❌ | ❌ | ❌ | ✔️ | ✔️ | ❌ | ✔️ | ✔️ | ❌ | ❌ | ✔️ | ✔️ | ✔️ | ❌ |
| CoralAI | ❌ | ❌ | ✔️ | ❌ | ❌ | ❌ | ❌ | ✔️ | ✔️ | ❌ | ✔️ | ✔️ | ❌ | ❌ | ✔️ | ❌ | ❌ | ✔️❌ |
| ExplainPaper | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ✔️ | ✔️ | ❌ | ✔️ | ✔️ | ❌ | ❌ | ✔️ | ❌ | ❌ | ✔️❌ |
| ChatPDF | ✔️ | ❌ | ✔️ | ❌ | ❌ | ❌ | ❌ | ✔️ | ✔️ | ❌ | ✔️ | ✔️ | ❌ | ❌ | ✔️ | ❌ | ❌ | ❌ |

Recommender

| Platform | Search | Recommendations | Collections | Citation Analysis | Trending Analysis | Author Profiles | Visualization Tools | Paper Chat | Idea Generation | Paper Writing | Summarization | Paper Review | Datasets | Code Repositories | LLM Integration | Web API | Personalization | Free |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Arxiv Sanity | ✔️ | ✔️ | ✔️ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ✔️ | ✔️ |
| Scholar Inbox | ✔️ | ✔️ | ✔️ | ❌ | ✔️ | ❌ | ✔️ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ✔️ | ❌ | ✔️ | ✔️ |
| ResearchTrend.ai | ✔️ | ❌ | ❌ | ❌ | ✔️ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ✔️❌ |
| TrendingPapers | ✔️ | ✔️ | ❌ | ❌ | ✔️ | ❌ | ❌ | ❌ | ❌ | ❌ | ✔️ | ❌ | ❌ | ❌ | ✔️ | ❌ | ✔️ | ✔️ |
| Bytez | ✔️ | ❌ | ❌ | ❌ | ✔️ | ❌ | ❌ | ✔️ | ✔️ | ❌ | ✔️ | ✔️ | ❌ | ❌ | ✔️ | ✔️ | ❌ | ✔️❌ |
| Notesum.ai | ✔️ | ✔️ | ✔️ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ✔️ | ❌ | ❌ | ❌ | ✔️ | ❌ | ✔️ | ✔️❌ |
| Research Rabbit | ✔️ | ❌ | ✔️ | ❌ | ❌ | ❌ | ✔️ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ✔️ |

Search Engines

| Platform | Search | Recommendations | Collections | Citation Analysis | Trending Analysis | Author Profiles | Visualization Tools | Paper Chat | Idea Generation | Paper Writing | Summarization | Paper Review | Datasets | Code Repositories | LLM Integration | Web API | Personalization | Free |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Google Scholar | ✔️ | ✔️ | ✔️ | ✔️ | ❌ | ✔️ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ✔️ | ✔️ |
| Semantic Scholar | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ❌ | ✔️ | ❌ | ❌ | ✔️ | ❌ | ❌ | ❌ | ✔️ | ✔️ | ✔️ | ✔️ |
| Baidu Scholar | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ✔️ | ❌ | ✔️ | ✔️❌ |
| BASE | ✔️ | ❌ | ✔️ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ✔️ | ❌ | ✔️ |
| Internet Archive Scholar | ✔️ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ✔️ | ❌ | ✔️ |
| Scilit | ✔️ | ❌ | ✔️ | ✔️ | ❌ | ✔️ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ✔️ |
| The Lens | ✔️ | ❌ | ✔️ | ❌ | ❌ | ✔️ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ✔️ | ❌ | ✔️❌ |
| Science.gov | ✔️ | ❌ | ❌ | ❌ | ❌ | ❌ | ✔️ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ✔️ |
| Academia.eu | ✔️ | ❌ | ✔️ | ❌ | ❌ | ✔️ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ✔️❌ |
| OpenAlex | ✔️ | ❌ | ❌ | ❌ | ❌ | ✔️ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ✔️ | ❌ | ✔️❌ |
| AceMap | ✔️ | ❌ | ❌ | ✔️ | ✔️ | ✔️ | ✔️ | ❌ | ❌ | ❌ | ❌ | ✔️ | ❌ | ❌ | ❌ | ❌ | ❌ | ✔️ |
| PubTator3 | ✔️ | ❌ | ✔️ | ✔️ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ✔️ | ❌ | ✔️ |

Benchmarks

| Platform | Search | Recommendations | Collections | Citation Analysis | Trending Analysis | Author Profiles | Visualization Tools | Paper Chat | Idea Generation | Paper Writing | Summarization | Paper Review | Datasets | Code Repositories | LLM Integration | Web API | Personalization | Free |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Papers with Code | ✔️ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ✔️ | ✔️ | ❌ | ❌ | ❌ | ✔️ |
| ScienceAgentBench | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ✔️ | ❌ | ✔️ | ✔️ | ✔️ | ❌ | ❌ | ✔️ |
| ORKG Benchmarks | ❌ | ❌ | ❌ | ❌ | ✔️ | ❌ | ✔️ | ❌ | ❌ | ❌ | ❌ | ❌ | ✔️ | ❌ | ❌ | ❌ | ❌ | ✔️ |
| Huggingface | ✔️ | ❌ | ✔️ | ❌ | ✔️ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ✔️ | ✔️ | ❌ | ❌ | ❌ | ✔️❌ |

💡 AI-Driven Scientific Discovery: Ideation, Hypothesis Generation, and Experimentation

Idea Generation

  • The IDEA Challenge 2022 dataset [Dataset]
  • SPACE-IDEAS: A Dataset for Salient Information Detection in Space Innovation [Paper]
  • Nova: An Iterative Planning and Search Approach to Enhance Novelty and Diversity of LLM Generated Ideas [Paper]
  • Chain of Ideas: Revolutionizing Research Via Novel Idea Development with LLM Agents [Paper]
  • Scideator: Human-LLM Scientific Idea Generation Grounded in Research-Paper Facet Recombination [Paper]
  • Many Heads Are Better Than One: Improved Scientific Idea Generation by A LLM-Based Multi-Agent System [Paper]

Hypothesis Generation

  • Large Language Models are Zero Shot Hypothesis Proposers [Paper]
  • Hypothesis Generation with Large Language Models [Paper]
  • Exploring Scientific Hypothesis Generation with Mamba [Paper]
  • Large Language Models for Automated Open-domain Scientific Hypotheses Discovery [Paper]
  • Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation [Paper]
  • Improving Scientific Hypothesis Generation with Knowledge Grounded Large Language Models [Paper]
  • Towards an AI co-scientist [Paper]
  • MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses [Paper]
  • Literature Meets Data: A Synergistic Approach to Hypothesis Generation [Paper]

Automated Experimentation

  • AutoML-GPT: Large Language Model for AutoML [Paper]
  • MLAgentBench: Evaluating Language Agents on Machine Learning Experimentation [Paper]
  • SWE-bench: Can Language Models Resolve Real-World GitHub Issues? [Paper]
  • MLCopilot: Unleashing the Power of Large Language Models in Solving Machine Learning Tasks [Paper]
  • Automatic benchmarking of large multimodal models via iterative experiment programming [Paper]
  • Agent-as-a-Judge: Evaluate Agents with Agents [Paper]
  • ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery [Paper]
  • AutoML-Agent: A Multi-Agent LLM Framework for Full-Pipeline AutoML [Paper]
  • Tree Search for Language Model Agents [Paper]
  • SELA: Tree-Search Enhanced LLM Agents for Automated Machine Learning [Paper]
  • OpenHands: An Open Platform for AI Software Developers as Generalist Agents [Paper]
  • AI agents in chemical research: GVIM - an intelligent research assistant system [Paper]
  • SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains? [Paper]
  • AIDE: AI-Driven Exploration in the Space of Code [Paper]
  • MLGym: A New Framework and Benchmark for Advancing AI Research Agents [Paper]
  • DrugAgent: Automating AI-aided Drug Discovery Programming through LLM Multi-Agent Collaboration [Paper]

📝 Text-based Content Generation

Title

  • PaperRobot: Incremental Draft Generation of Scientific Ideas [Paper]
  • Automatic Title Generation for Text with Pre-trained Transformer Language Model [Paper]
  • Transformers Go for the LOLs: Generating (Humourous) Titles from Scientific Abstracts End-to-End [Paper]

Abstract

  • PaperRobot: Incremental Draft Generation of Scientific Ideas [Paper]
  • Comparing scientific abstracts generated by ChatGPT to real abstracts with detectors and blinded human reviewers [Paper]
  • How trustworthy is ChatGPT? The case of bibliometric analyses [Paper]
  • Can ChatGPT assist authors with abstract writing in medical journals? Evaluating the quality of scientific abstracts generated by ChatGPT and original abstracts [Paper]

Related Work

  • Towards Automated Related Work Summarization [Paper]
  • Neural Related Work Summarization with a Joint Context-driven Attention Mechanism [Paper]
  • ScisummNet: A Large Annotated Corpus and Content-Impact Models for Scientific Paper Summarization with Citation Networks [Paper]
  • PaperRobot: Incremental Draft Generation of Scientific Ideas [Paper]
  • Automatic related work section generation: experiments in scientific document abstracting [Paper]
  • Automatic Generation of Related Work Sections in Scientific Papers: An Optimization Approach [Paper]
  • CORWA: A Citation-Oriented Related Work Annotation Dataset [Paper]
  • Automatic generation of related work through summarizing citations [Paper]
  • CiteBench: A Benchmark for Scientific Citation Text Generation [Paper]
  • ToC-RWG: Explore the Combination of Topic Model and Citation Information for Automatic Related Work Generation [Paper]

Citation

  • Fabrication and errors in the bibliographic citations generated by ChatGPT [Paper]
  • Cited Text Spans for Scientific Citation Text Generation [Paper]
  • Systematic Task Exploration with LLMs: A Study in Citation Text Generation [Paper]
  • Citation: A Key to Building Responsible and Accountable Large Language Models [Paper]
  • Citation-Enhanced Generation for LLM-based Chatbots [Paper]
  • Related Work and Citation Text Generation: A Survey [Paper]

Long Text

  • LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs [Paper]
  • LongReward: Improving Long-context Large Language Models with AI Feedback [Paper]
  • LongEval: A Comprehensive Analysis of Long-Text Generation Through a Plan-based Paradigm [Paper]

Proof-Reading and Paraphrasing

  • Can artificial intelligence help for scientific writing? [Paper]
  • Good Practices for Scientific Article Writing with ChatGPT and Other Artificial Intelligence Language Models [Paper]
  • The role of ChatGPT in scientific communication: writing better scientific review articles [Paper]
  • Using ChatGPT for language editing in scientific articles [Paper]
  • The Ability of ChatGPT in Paraphrasing Texts and Reducing Plagiarism: A Descriptive Analysis [Paper]

Press Release

  • Expertise Style Transfer: A New Task Towards Better Communication between Experts and Laymen [Paper]
  • Making Science Simple: Corpora for the Lay Summarisation of Scientific Literature [Paper]
  • ‘Don’t Get Too Technical with Me’: A Discourse Structure-Based Framework for Automatic Science Journalism [Paper]

🎨 Multimodal Content Generation and Understanding

Scientific Figure Understanding

  • A Diagram is Worth a Dozen Images [Paper]
  • A simple neural network module for relational reasoning [Paper]
  • FigureQA: An Annotated Figure Dataset for Visual Reasoning [Paper]
  • ChartQA: A Benchmark for Question Answering about Charts with Visual and Logical Reasoning [Paper]
  • ChartSumm: A Comprehensive Benchmark for Automatic Chart Summarization of Long and Short Summaries [Paper]
  • Multimodal ArXiv: A Dataset for Improving Scientific Comprehension of Large Vision-Language Models [Paper]
  • SciMMIR: Benchmarking Scientific Multi-modal Information Retrieval [Paper]
  • SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers [Paper]
  • CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs [Paper]
  • ChartAdapter: Large Vision-Language Model for Chart Summarization [Paper]

Scientific Figure Generation

  • Data2Vis: Automatic Generation of Data Visualizations Using Sequence-to-Sequence Recurrent Neural Networks [Paper]
  • ADVISor: Automatic Visualization Answer for Natural-Language Question on Tabular Data [Paper]
  • Sevi: Speech-to-Visualization through Neural Machine Translation [Paper]
  • Chat2VIS: Generating Data Visualizations via Natural Language Using ChatGPT, Codex and GPT-3 Large Language Models [Paper]
  • AutomaTikZ: Text-Guided Synthesis of Scientific Vector Graphics with TikZ [Paper]
  • SciDoc2Diagrammer-MAF: Towards Generation of Scientific Diagrams from Documents guided by Multi-Aspect Feedback Refinement [Paper]
  • Plots Made Quickly: An Efficient Approach for Generating Visualizations from Natural Language Queries [Paper]
  • DiagrammerGPT: Generating Open-Domain, Open-Platform Diagrams via LLM Planning [Paper]
  • DeTikZify: Synthesizing Graphics Programs for Scientific Figures and Sketches with TikZ [Paper]
  • ChartMimic: Evaluating LMM's Cross-Modal Reasoning Capability via Chart-to-Code Generation [Paper]
  • ScImage: How Good Are Multimodal Large Language Models at Scientific Text-to-Image Generation? [Paper]

Scientific Table Understanding

  • ToTTo: A Controlled Table-To-Text Generation Dataset [Paper]
  • SciGen: a Dataset for Reasoning-Aware Text Generation from Scientific Tables [Paper]
  • Towards Table-to-Text Generation with Numerical Reasoning [Paper]
  • SciXGen: A Scientific Paper Dataset for Context-Aware Text Generation [Paper]
  • Structure-Aware Pre-Training for Table-to-Text Generation [Paper]
  • Table-To-Text generation and pre-training with TabT5 [Paper]
  • Few-shot Table-to-text Generation with Prefix-Controlled Generator [Paper]
  • Robust (Controlled) Table-to-Text Generation with Structure-Aware Equivariance Learning [Paper]
  • SORTIE: Dependency-Aware Symbolic Reasoning for Logical Data-to-text Generation [Paper]
  • LoFT: Enhancing Faithfulness and Diversity for Table-to-Text Generation via Logic Form Control [Paper]
  • Arithmetic-Based Pretraining Improving Numeracy of Pretrained Language Models [Paper]
  • Structure-aware Table-to-Text Generation with Prefix-tuning [Paper]
  • Table-to-Text Using Pre-trained Large Language Model and LoRA [Paper]
  • Unifying Structured Data as Graph for Data-to-Text Pre-Training [Paper]
  • Integrating Table Representations into Large Language Models for Improved Scholarly Document Comprehension [Paper]

Scientific Table Generation

  • gTBLS: Generating Tables from Text by Conditional Question Answering [Paper]
  • ArxivDIGESTables: Synthesizing Scientific Literature into Tables using Language Models [Paper]
  • OpenTE: Open-Structure Table Extraction From Text [Paper]
  • Is This a Bad Table? A Closer Look at the Evaluation of Table Generation from Text [Paper]
  • LATTE: Improving Latex Recognition for Tables and Formulae with Iterative Refinement [Paper]

Scientific Slides and Poster Generation

  • SlidesGen: Automatic Generation of Presentation Slides for a Technical Paper Using Summarization [Paper]
  • PPSGen: learning to generate presentation slides for academic papers [Paper]
  • Learning to Generate Posters of Scientific Papers [Paper]
  • Phrase-Based Presentation Slides Generation for Academic Papers [Paper]
  • D2S: Document-to-Slide Generation Via Query-Based Text Summarization [Paper]
  • Towards Topic-Aware Slide Generation For Academic Papers With Unsupervised Mutual Learning [Paper]
  • DOC2PPT: Automatic Presentation Slides Generation from Scientific Documents [Paper]
  • PosterBot: A System for Generating Posters of Scientific Papers with Neural Models [Paper]
  • Presentations by the Humans and For the Humans: Harnessing LLMs for Generating Persona-Aware Slides from Documents [Paper]
  • Enhancing Presentation Slide Generation by LLMs with a Multi-Staged End-to-End Approach [Paper]
  • Presentations are not always linear! GNN meets LLM for Text Document-to-Presentation Transformation with Attribution [Paper]

✅ Peer Review

Analysis of Peer Reviews

  • Argument Mining for Understanding Peer Reviews [Paper]
  • Aspect-based Sentiment Analysis of Scientific Reviews [Paper]
  • APE: Argument Pair Extraction from Peer Review and Rebuttal via Multi-task Learning [Paper]
  • Argument Mining Driven Analysis of Peer-Reviews [Paper]
  • HedgePeer: A Dataset for Uncertainty Detection in Peer Reviews [Paper]
  • PolitePEER: does peer review hurt? A dataset to gauge politeness intensity in the peer reviews [Paper]
  • Automatic Analysis of Substantiation in Scientific Peer Reviews [Paper]
  • Exploring Jiu-Jitsu Argumentation for Writing Peer Review Rebuttals [Paper]

Paper Feedback and Automatic Reviewing

  • DeepSentiPeer: Harnessing Sentiment in Review Texts to Recommend Peer Review Decisions [Paper]
  • Exploring the Potential of GPT-2 for Generating Fake Reviews of Research Papers [Paper]
  • Multi-task Peer-Review Score Prediction [Paper]
  • ReviewRobot: Explainable Paper Review Generation based on Knowledge Synthesis [Paper]
  • PEERAssist: Leveraging on Paper-Review Interactions to Predict Peer Review Decisions [Paper]
  • Can We Automate Scientific Reviewing? [Paper]
  • ReviewerGPT? An Exploratory Study on Using Large Language Models for Paper Reviewing [Paper]
  • GPT4 is Slightly Helpful for Peer-Review Assistance: A Pilot Study [Paper]
  • Can large language models provide useful feedback on research papers? A large-scale empirical analysis [Paper]
  • MARG: Multi-Agent Review Generation for Scientific Papers [Paper]

Scientific Rigour

  • Online software spots genetic errors in cancer papers [Paper]
  • SciScore [Tool]
  • Assessing Scientific Research Papers with Knowledge Graphs [Paper]
  • On the Rigour of Scientific Writing: Criteria, Analysis, and Insights [Paper]

Scientific Claim Verification

  • SciFact-Open: Towards open-domain scientific claim verification [Paper]
  • Scientific Fact-Checking: A Survey of Resources and Approaches [Paper]
  • The Intended Uses of Automated Fact-Checking Artefacts: Why, How and Who [Paper]
  • Missci: Reconstructing Fallacies in Misrepresented Science [Paper]
  • Overview of the Context24 Shared Task on Contextualizing Scientific Claims [Paper]
  • How We Refute Claims: Automatic Fact-Checking through Flaw Identification and Explanation [Paper]
  • Claim Verification in the Age of Large Language Models: A Survey [Paper]
  • Grounding Fallacies Misrepresenting Scientific Publications in Evidence [Paper]

Meta Review Generation

  • Uncertainty-aware machine support for paper reviewing on the interspeech 2019 submission corpus [Paper]
  • A Deep Neural Architecture for Decision-Aware Meta-Review Generation [Paper]
  • Summarizing Multiple Documents with Conversational Structure for Meta-Review Generation [Paper]
  • Scientific Opinion Summarization: Paper Meta-review Generation Dataset, Methods, and Evaluation [Paper]
  • LLMs as Meta-Reviewers' Assistants: A Case Study [Paper]

🚀 End-to-End

  • Scientific discovery in the age of artificial intelligence [Paper]
  • The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery [Paper]

Citation

@article{eger2025transforming,
  title={Transforming Science with Large Language Models: A Survey on AI-assisted Scientific Discovery, Experimentation, Content Generation, and Evaluation},
  author={Eger, Steffen and Cao, Yong and D'Souza, Jennifer and Geiger, Andreas and Greisinger, Christian and Gross, Stephanie and Hou, Yufang and Krenn, Brigitte and Lauscher, Anne and Li, Yizhi and others},
  journal={arXiv preprint arXiv:2502.05151},
  year={2025}
}
