What does this repo signal mean?

Tencent Hunyuan published Tencent-Hunyuan/HY3D-Bench (Python). This repository signal exposes tooling, eval, infrastructure, or model-adjacent work before it may appear in a launch post. High-signal details: repo Tencent-Hunyuan/HY3D-Bench · language Python · New 3D benchmark repo, moderate stars. onlylabs links this event to 1 captured evidence page and 6 related repo signals.

Tencent Hunyuan Repo: Tencent-Hunyuan/HY3D-Bench

Captured source

source ↗

GitHub/github.com/Tencent-Hunyuan/HY3D-Bench

Tencent-Hunyuan/HY3D-Bench repository metadata

Source ↗

published Feb 3, 2026seen Jun 5captured Jun 11http 200method plain

Tencent-Hunyuan/HY3D-Bench

Language: Python

License: NOASSERTION

Stars: 341

Forks: 13

Open issues: 5

Created: 2026-02-03T13:35:57Z

Pushed: 2026-02-27T10:03:22Z

Default branch: main

Fork: no

Archived: no

README:

🔥 News

Feb 04, 2026: 🎉 We release HY3D-Bench - a comprehensive collection of high-quality 3D datasets with 3 DIFFERENT subsets!
✨ Full-level Dataset: 252K+ watertight meshes with multi-view renderings and sampled points
✨ Part-level Dataset: 240K+ objects with fine-grained part decomposition
✨ Synthetic Dataset: 125K+ AI-synthesized objects across 1,252 categories
🚀 Baseline Model: Hunyuan3D-Shape-v2-1 Small (0.8B DiT) trained on our Full-level data

> Join our [Wechat](#) and [Discord](https://discord.gg/dNBrdrGGMa) group to discuss and find help from us.

| Wechat Group | Xiaohongshu | X | Discord | |--------------------------------------------------|-------------------------------------------------------|---------------------------------------------|---------------------------------------------------| | | | | |

---

☯️ HY3D-Bench

HY3D-Bench is a collection of high-quality 3D datasets designed to address the critical limitations of existing 3D repositories. While pioneering large-scale datasets have provided unprecedented volumes of 3D data, their utility is often hampered by significant noise, non-manifold geometry, and lack of structural granularity.

We release three complementary datasets that provide clean, structured, and diverse 3D content for research in computer vision, generative modeling, and robotics.

🎯 Key Features Across All Datasets

✅ Training-Ready Quality: All meshes are watertight, normalized, and cleaned ✅ Standardized Format: Consistent file formats and metadata structure

🔷 Full Dataset - Complete Objects with Watertight Meshes

High-quality holistic 3D objects processed through a professional pipeline to ensure training-ready quality.

What's Included:

Watertight meshes for each object (no holes, manifold geometry)
High-fidelity multi-view renderings (RGB images from standardized camera poses)
Cleaned and normalized geometry ready for stable 3D generation training

Use Cases:

Training 3D generative models (diffusion, GAN, autoregressive)
3D reconstruction benchmarks
Single-view to 3D tasks
Geometric deep learning

Instructions for Use For detailed usage, see [baseline/README.md](baselines/README.md) and [debug_dataloader.py](baselines/debug_scripts/debug_dataloader.py)

🔷 Part Dataset - Structured Part-Level Decomposition

Objects with consistent part-level segmentation and individual part assets.

What's Included:

Original mesh segmentation results (part labels)
Individual part-level watertight meshes (each part as separate clean mesh)
Part-assembly RGB renderings (view-dependent images of assembled parts)

Use Cases:

Part-aware 3D generation
Fine-grained geometric perception
Part-based shape analysis
Robotics manipulation (affordance learning, grasp planning)

Instructions for Use For detailed instructions, see [Part_README.md](part-level_data/Part_README.md)

🔷 Sythetic Dataset - AI-Synthesized Long-Tail Objects

Scalable AIGC-driven synthetic data covering rare and diverse categories.

What's Included:

20 super-categories, 130 categories, 1,252 fine-grained sub-categories
Text-to-3D pipeline outputs (LLM → Diffusion → Image-to-3D reconstruction)
Long-tail object coverage (rare items, specialized categories)

Use Cases:

Training robust models on diverse categories
Data augmentation for underrepresented classes
Zero-shot generalization evaluation
Robotics simulation environments (diverse object libraries)

Generation Pipeline: 1. Text-to-Text: Semantic expansion using Large Language Models 2. Text-to-Image: Visual synthesis via Diffusion Models 3. Image-to-3D: Textured Mesh generation with the most advanced 3D Generative Model

---

📥 Download

🤗 Hugging Face

# Download entire dataset
hf download tencent/HY3D-Bench --repo-type dataset --local-dir "your/local/path"

# Download specific subset, e.g. full.
hf download tencent/HY3D-Bench --repo-type dataset --include "full/**" --local-dir "your/local/path"

| Dataset | Objects | Size | |---------|---------|------| | Full-level | 252K+ | ~11 TB | | Part-level | 240K+ | ~5.0 TB | | Synthetic | 125K+ | ~6.5 TB |

Dataset Structure

HY3D-Bench/
├── full/
│ ├── test/
│ │ ├──images
│ │ ├──sample_points
│ │ └──water_tight_meshes
│ ├── train/ # same subsets as test
│ └── val/ # same subsets as test
├── part/
│ ├── images/ # rendering
│ └── water_tight_meshes # meshes
└── synthetic/
├── glb/ # AI-generated meshes storaged with glb format files
└── img/ # The condition images used to generate meshes

---

🎁 Baseline Model

We train a baseline model Hunyuan3D-Shape-v2-1 Small with the full-level data to evaluate the effectiveness of the full-level dataset.

| Model | Date | Size | Huggingface | |-------|------|------|-------------| | Model_2048tokens | 2026-02-04 | 0.8B | [Download]() | | Model_4096tokens | 2026-02-04 | 0.8B | [Download]() |

🔗 BibTeX

If you found this repository helpful, please cite our reports:

@misc{hunyuan3d2026hy3dbenchgeneration3dassets,
title={HY3D-Bench: Generation of 3D Assets},
author={Team Hunyuan3D and : and Bowen Zhang and Chunchao Guo and Dongyuan Guo and Haolin Liu and Hongyu Yan and Huiwen Shi and Jiaao Yu and Jiachen Xu and Jingwei Huang and Kunhong Li and Lifu Wang and Linus and Penghao Wang and Qingxiang Lin and Ruining Tang and Xianghui Yang and Yang Li and Yirui Guan and Yunfei Zhao and Yunhan Yang and Zeqiang Lai and Zhihao Liang and Zibo Zhao},
year={2026},
eprint={2602.03907},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2602.03907},
}

@article{ma2025p3sam,
title={P3-sam: Native 3d part segmentation},
author={Ma, Changfeng and Li, Yang and Yan, Xinhao and Xu, Jiachen and Yang, Yunhan and Wang, Chunshi and Zhao, Zibo and Guo, Yanwen and Chen, Zhuo and Guo, Chunchao},
journal={arXiv preprint arXiv:2509.06784},
year={2025}
}

@article{yan2025xpart,
title={X-Part: high fidelity and structure coherent shape decomposition},
author={Yan, Xinhao and Xu, Jiachen and Li, Yang and Ma, Changfeng and Yang,...

Excerpt shown — open the source for the full document.

Notability

notability 5.0/10

New 3D benchmark repo, moderate stars