RepoUpstage (Solar)Upstage (Solar)published May 9, 2025seen 5d

UpstageAI/CReSt

Python

Open original ↗

Captured source

source ↗
published May 9, 2025seen 5dcaptured 16hhttp 200method plain

UpstageAI/CReSt

Description: CReSt: A Comprehensive Benchmark for Retrieval-Augmented Generation with Complex Reasoning over Structured Documents

Language: Python

Stars: 5

Forks: 0

Open issues: 0

Created: 2025-05-09T02:00:12Z

Pushed: 2025-07-28T07:41:46Z

Default branch: main

Fork: no

Archived: no

README:

CReSt Benchmark

CReSt: A Comprehensive Benchmark for Retrieval-Augmented Generation with Complex Reasoning over Structured Documents

⚡️ Introduction

CReSt is a benchmark consisting of 2,245 human-annotated examples in English and Korean, designed to capture complex, multi-step RAG scenarios.

You can explore the dataset on Hugging Face at: https://huggingface.co/datasets/upstage/CReSt

📣 Latest Updates

  • [15/05/2025] Release of CReSt code

🚀 Quick Start

1. Clone the repository and install the required dependencies.

git clone git@github.com:UpstageAI/CReSt.git
cd CReSt
pip install -r requirements.txt

2. Copy the .env.example template and rename it to .env. Then, update it with your API keys.

cp .env.example .env

3. Run the script.

python -m scripts.run_evaluation --model $MODEL \
--eval-model gpt-4o \
--method $METHOD \
--dataset upstage/CReSt

📜 License

This benchmark is distributed under the CC-by-NC 4.0.

📝 Citation

If you use this code in your research, please cite:

@inproceedings{khang2025crest,
title={CReSt: A Comprehensive Benchmark for Retrieval-Augmented Generation with Complex Reasoning over Structured Documents},
author={Khang, Minsoo and Park, Sangjun and Hong, Teakgyu and Jung, Dawoon},
booktitle={TBD},
pages={TBD},
year={2025}
}

Notability

notability 3.0/10

Low stars, routine new repo