Skip to content

Commit 191bb61

Browse files
authored
Merge pull request #882 from wbq321/main
add blog and bio
2 parents 4efa9c1 + 847c633 commit 191bb61

File tree

3 files changed

+86
-0
lines changed

3 files changed

+86
-0
lines changed

content/authors/wbq321/_index.md

Lines changed: 43 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,43 @@
1+
---
2+
# Display name
3+
title: Baiqiang Wang
4+
5+
# Username (this should match the folder name)
6+
authors:
7+
- wbq321
8+
9+
# Is this the primary user of the site?
10+
superuser: false
11+
12+
# Role/position
13+
role: "Ph.D. Student in CSS, University of Washington"
14+
15+
# Organizations/Affiliations
16+
organizations:
17+
- name: University of Washington
18+
url: "https://www.washington.edu/"
19+
20+
# Short bio (displayed in user profile at end of posts)
21+
bio: Baiqiang Wang is a first-year Ph.D. student at University of Washington. He works on vector database and cryptography.
22+
23+
# Social/Academic Networking
24+
social:
25+
- icon: envelope
26+
icon_pack: fas
27+
link: mailto:wangbaiqiang1234@gmail.com
28+
- icon: github
29+
icon_pack: fab
30+
link: https://github.com/wbq321
31+
- icon: linkedin
32+
icon_pack: fab
33+
link: https://www.linkedin.com/in/baiqiang-wang-205b35224/
34+
35+
# Enter email to display Gravatar (if Gravatar enabled in Config)
36+
email: wangbaiqiang1234@gmail.com
37+
38+
# Organizational groups that you belong to (for People widget)
39+
user_groups:
40+
- 2025 Contributors
41+
---
42+
43+
Baiqiang Wang is a first-year Ph.D. student at University of Washington. His research interests are vector database and cryptography. During OSRE 2025, he is working on enhancing reproducibility in RAG frameworks for scientific workflows.
Loading
Lines changed: 43 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,43 @@
1+
---
2+
title: "Enhancing Reproducibility in RAG Frameworks for Scientific Workflows"
3+
subtitle:
4+
summary: "This project addresses the critical issue of non-determinism in Retrieval-Augmented Generation (RAG) systems. We aim to develop a suite of tools, benchmarks, and best practices to ensure scientific workflows using Large Language Models are reliable, transparent, and reproducible."
5+
authors:
6+
- wbq-321
7+
tags: ["osre25", "reproducibility", "rag", "llm", "ai-for-science"]
8+
categories: ["Project"]
9+
date: 2025-06-25
10+
lastmod: 2025-06-25
11+
featured: false
12+
draft: false
13+
14+
# Featured image
15+
# To use, add an image named `featured.jpg/png` to your page's folder.
16+
# Focal points: Smart, Center, TopLeft, Top, TopRight, Left, Right, BottomLeft, Bottom, BottomRight.
17+
image:
18+
caption: ""
19+
focal_point: "Smart"
20+
preview_only: false
21+
---
22+
23+
Hello, I'm Baiqiang. As part of the [Enhancing Reproducibility in RAG Frameworks for Scientific Workflows](https://ucsc-ospo.github.io/project/osre25/pnnl/llm_rag_reproducibility/) project, I am excited to introduce my work on a crucial challenge in modern computational science. My [proposal](https://www.overleaf.com/read/fcbxtpngdnhw#8cc2c8) under the mentorship of Luanzheng "Lenny" Guo at Pacific Northwest National Laboratory and Dongfang Zhao at the University of Washington aims to enhance the reproducibility of AI-driven scientific workflows.
24+
25+
### The Problem: A Crisis of Confidence in AI for Science
26+
27+
Large Language Models (LLMs) are transforming scientific research, from accelerating literature reviews to generating novel hypotheses. However, their power is matched by their pitfalls: a tendency to "hallucinate" facts and a lack of transparency. Retrieval-Augmented Generation (RAG) was developed as a powerful solution, grounding LLM outputs in factual evidence retrieved from a specific knowledge base (like a database of scientific papers).
28+
29+
But a hidden problem lurks within RAG: **non-determinism**. The very first step of a RAG system—the similarity search that finds relevant documents—can produce different results even when asked the same question. Variations in indexing algorithms, data updates, or even the underlying software can change which documents are retrieved. For science, this is a critical flaw. If an experiment cannot be repeated with the same results, its conclusions cannot be trusted. This project tackles that challenge head-on.
30+
31+
### Our Mission: Forging a Path to Reproducible RAG
32+
33+
This project proposes a comprehensive solution to systematically identify, measure, and mitigate non-determinism in RAG frameworks. Our goal is to empower researchers to build and use AI tools with confidence.
34+
35+
Our approach is built on four key pillars:
36+
37+
1. **Systematic Analysis:** We will conduct a deep dive into popular RAG components (like FAISS, ScaNN, and HNSW) to pinpoint the exact sources of randomness and variability.
38+
2. **Rigorous Benchmarking:** We will develop a public, open-source benchmarking suite using standardized scientific datasets (from PubMed, arXiv, etc.). This will allow anyone to quantitatively measure the reproducibility of their own RAG pipeline using clear metrics like retrieval overlap and rank correlation.
39+
3. **Targeted Enhancements:** Based on our findings, we will implement practical solutions, including:
40+
* Promoting deterministic algorithms and configurations.
41+
* Building robust data versioning and provenance tracking tools (inspired by DVC and Git LFS).
42+
* Creating tools for precise configuration management to capture the entire experimental setup.
43+
4. **Practical Guidance and Open Source Tools:** We will distill our insights into comprehensive documentation, reusable code examples, and best practices. All tools and findings will be contributed back to the open-source community.

0 commit comments

Comments
 (0)