ai21labs/Jamba-tiny-dev
Captured source
source ↗published Sep 3, 2024seen 5dcaptured 14hhttp 200method plainlicense apache-2.0params 319Mdownloads 1181klikes 14
This is a tiny Jamba model used for development, debugging and experimentation over the Jamba architecture.
It has 319M parameters (instead of 52B in Jamba 1.5 Mini (and Jamba v0.1) and 398B in Jamba 1.5 Large), and was trained on ~40B tokens.
It is great for use in unittests since it is a small model (doesn't take long to download) that has valid and non-random outputs. Yet, it did not undergo extensive training and should not be expected to generate high-quality text.
Notability
notability 8.0/10Very high HF downloads indicate strong traction