What does this repo signal mean?

ByteDance (Doubao/Seed) published ByteDance-Seed/DiscoX (Python). This repository signal exposes tooling, eval, infrastructure, or model-adjacent work before it may appear in a launch post. High-signal details: repo ByteDance-Seed/DiscoX · language Python · New repo, low traction, routine.. onlylabs links this event to 1 captured evidence page and 6 related repo signals.

ByteDance (Doubao/Seed) Repo: ByteDance-Seed/DiscoX

Captured source

source ↗

GitHub/github.com/ByteDance-Seed/DiscoX

ByteDance-Seed/DiscoX repository metadata

Source ↗

published Oct 22, 2025seen Jun 5captured Jun 11http 200method plain

ByteDance-Seed/DiscoX

Language: Python

License: NOASSERTION

Stars: 11

Forks: 0

Open issues: 1

Created: 2025-10-22T03:33:53Z

Pushed: 2025-12-17T18:26:13Z

Default branch: main

Fork: no

Archived: no

README:

DISCOX

Project Overview

DISCOX is a benchmark designed to evaluate the performance of LLMs on discourse-level and expert-level translation tasks. Unlike existing benchmarks that primarily focus on isolated sentences or general-domain texts, DISCOX emphasizes the ability of models to maintain discourse coherence, terminological precision, and domain-specific accuracy across long-form professional content.

This benchmark covers multiple domains (e.g., social sciences, natural sciences, humanities, applied disciplines, news, domain-specific scenarios, and literature & arts) with a wide range of translation challenges. It enables fine-grained evaluation of LLM outputs using criteria such as accuracy, fluency, and appropriateness.

Getting Started

1. Install Dependencies

Make sure you are using Python 3.9+. Then install the required packages:

pip install -r requirements.txt

2. Configure Environment Variables

Set up your API key and endpoint in the .env file:

JUDGE_API_KEY=your_judgemodel_api_key_here
JUDGE_API_BASE=your_judgemodel_api_base_here
CANDIDATE_API_KEY=your_candidatemodel_api_key_here
CANDIDATE_API_BASE=your_candidatemodel_api_base_here

3. Run Evaluation Tasks

You can run tasks by specifying the target model and the judge model. For example:

python3 run_tasks.py --model openai/gpt4o-2024-11-20 --judgemodel azure/gemini-2.5-pro

Example Use Case

Model Under Evaluation: openai/gpt4o-2024-11-20
Judge Model: azure/gemini-2.5-pro

This configuration runs translation tasks using GPT-4o and evaluates them with Gemini-2.5-Pro under the Metric-S evaluation framework.

---

License

This project is released under the Apache 2.0 License. See [LICENSE](LICENSE) for details.

Notability

notability 3.0/10

New repo, low traction, routine.