{"schema_version":"onlylabs.public_analysis.v1","url":"https://onlylabs.fyi/analysis/gmi-cloud","json_url":"https://onlylabs.fyi/analysis/gmi-cloud/analysis.json","evidence_json_url":"https://onlylabs.fyi/analysis/gmi-cloud/evidence.json","generated_at":"2026-06-27T22:29:23.451Z","analysis":{"org_slug":"gmi-cloud","url":"https://onlylabs.fyi/analysis/gmi-cloud","json_url":"https://onlylabs.fyi/analysis/gmi-cloud/analysis.json","evidence_json_url":"https://onlylabs.fyi/analysis/gmi-cloud/evidence.json","dossier_url":"https://onlylabs.fyi/labs/gmi-cloud","org":{"slug":"gmi-cloud","name":"GMI Cloud","category":"neocloud","category_label":"Neocloud","homepage_url":"https://www.gmicloud.ai"},"title":"GMI Cloud analysis","summary":"GMI Cloud is an inference-optimized neocloud building a full-stack platform tightly coupled to NVIDIA's hardware roadmap. The evidence shows a company transitioning from bare-metal GPU provisioning to a managed platform layer: 10 open roles are clustered around a named \"Inference Engine\" product, AgentBox has shipped as an agent marketplace and hosting platform, and every public post ties GMI's infrastructure…","markdown":"## Thesis\nGMI Cloud is an inference-optimized neocloud building a full-stack platform tightly coupled to NVIDIA's hardware roadmap. 
The evidence shows a company transitioning from bare-metal GPU provisioning to a managed platform layer: 10 open roles are clustered around a named \"Inference Engine\" product [P1](https://www.gmicloud.ai/en/company/career)[E4](https://www.gmicloud.ai/en/company/career)[E6](https://www.gmicloud.ai/en/company/career), AgentBox has shipped as an agent marketplace and hosting platform [W5](https://www.gmicloud.ai/en/blog/agentbox-is-live-the-whole-stack-for-production-ai-agents-in-one-place), and every public post ties GMI's infrastructure identity to NVIDIA's GB200/B200/B300 and Vera Rubin cadence [W1](https://www.gmicloud.ai/en/blog/gmi-cloud-brings-nvidia-nemotron-3-ultra-to-developers-on-day-0)[W2](https://www.gmicloud.ai/en/blog/gmi-cloud-supports-the-next-era-of-ai-factories-with-nvidia-vera-rubin). The dual GTM motion — enterprise managed inference via Fireworks AI [W4](https://www.gmicloud.ai/en/blog/building-the-infrastructure-for-production-ai-fireworks-ai-gmi-cloud) plus a founder/developer ecosystem via SCALE accelerator [W6](https://www.gmicloud.ai/en/blog/scale-at-gmi-cohort-1-recap) — signals a land-grab for inference workloads before the neocloud category consolidates.\n\n## Signal desks\n\n### Hiring\n- **Inference Engine commercialization**: Inference Engine Product Manager and BD Manager, Inference Engine — two dedicated roles for a named platform product, both Mountain View [P1](https://www.gmicloud.ai/en/company/career)[E4](https://www.gmicloud.ai/en/company/career)[E6](https://www.gmicloud.ai/en/company/career).\n- **Inference engineering**: Machine Learning Engineer and Machine Learning Engineer (LLM Inference), both Mountain View [P1](https://www.gmicloud.ai/en/company/career)[E7](https://www.gmicloud.ai/en/company/career)[E9](https://www.gmicloud.ai/en/company/career); Infra Engineer – SRE (Kubernetes), US remote [P1](https://www.gmicloud.ai/en/company/career)[E5](https://www.gmicloud.ai/en/company/career).\n- **GTM and content scaling**: 
Solutions Architect (US Sales) [P1](https://www.gmicloud.ai/en/company/career)[E2](https://www.gmicloud.ai/en/company/career), Content & Growth Marketer [P1](https://www.gmicloud.ai/en/company/career)[E8](https://www.gmicloud.ai/en/company/career), Product Management Operations [P1](https://www.gmicloud.ai/en/company/career) — marketing and customer-facing buildout.\n- **Organizational growth**: Talent Acquisition Partner [P1](https://www.gmicloud.ai/en/company/career)[E3](https://www.gmicloud.ai/en/company/career), Technical Program Manager [P1](https://www.gmicloud.ai/en/company/career)[E1](https://www.gmicloud.ai/en/company/career) — scaling headcount and cross-team execution.\n- **Location concentration**: 8 of 10 roles in Mountain View, CA; 2 remote-US (Solutions Architect, SRE) [P1](https://www.gmicloud.ai/en/company/career).\n\n### Forks\nNo cited evidence in this pack.\n\n### Releases\n- **AgentBox** (2026-06-08): Full-stack agent hosting platform and marketplace. Includes ready-to-use agents for code review, retrieval graph construction (S3, SharePoint, Confluence, Notion), and benchmark suites (MMLU, HumanEval). GMI handles server setup, provisioning, and scaling underneath [W5](https://www.gmicloud.ai/en/blog/agentbox-is-live-the-whole-stack-for-production-ai-agents-in-one-place).\n- **Nemotron 3 Ultra Day-0 Access** (2026-06-04): 550B-parameter (55B active) agentic model available on GMI Cloud's GB200/B200/B300 and H200 clusters at BF16, FP8, and NVFP4 precisions. Runs on as few as 2× GB200. 
Optimized for tool use, coding, and deep research [W1](https://www.gmicloud.ai/en/blog/gmi-cloud-brings-nvidia-nemotron-3-ultra-to-developers-on-day-0).\n- **Kimi K2.6 support** (2026-06-04): Available on serverless and dedicated H100/H200 infrastructure with OpenAI-compatible API, automatic request batching, and bare-metal self-hosting at $2.00/hr (H100) / $2.60/hr (H200) [W3](https://www.gmicloud.ai/en/blog/which-gpu-clouds-support-kimi-k2-and-the-latest-open-source-llms).\n\n### Talking\n- **NVIDIA hardware partnership as identity**: Two posts within three days frame GMI Cloud as NVIDIA Reference Architecture-validated [W1](https://www.gmicloud.ai/en/blog/gmi-cloud-brings-nvidia-nemotron-3-ultra-to-developers-on-day-0) and an \"AI-native cloud infrastructure company purpose-built for production AI\" supporting Vera Rubin [W2](https://www.gmicloud.ai/en/blog/gmi-cloud-supports-the-next-era-of-ai-factories-with-nvidia-vera-rubin). The phrase \"reference platform cloud partner\" appears in both the Fireworks AI [W4](https://www.gmicloud.ai/en/blog/building-the-infrastructure-for-production-ai-fireworks-ai-gmi-cloud) and Nemotron [W1](https://www.gmicloud.ai/en/blog/gmi-cloud-brings-nvidia-nemotron-3-ultra-to-developers-on-day-0) posts.\n- **Agentic AI factory narrative**: Vera Rubin [W2](https://www.gmicloud.ai/en/blog/gmi-cloud-supports-the-next-era-of-ai-factories-with-nvidia-vera-rubin) and AgentBox [W5](https://www.gmicloud.ai/en/blog/agentbox-is-live-the-whole-stack-for-production-ai-agents-in-one-place) posts both center \"agentic AI factories\" and \"production AI agents\" — positioning inference, not training, as the strategic surface.\n- **Enterprise customer signaling**: Fireworks AI partnership names Uber, Genspark, and Shopify as downstream inference customers [W4](https://www.gmicloud.ai/en/blog/building-the-infrastructure-for-production-ai-fireworks-ai-gmi-cloud).\n- **Open-source model positioning**: Pricing-forward post markets Kimi K2.6 support 
with transparent GPU pricing and an OpenAI-compatible API [W3](https://www.gmicloud.ai/en/blog/which-gpu-clouds-support-kimi-k2-and-the-latest-open-source-llms).\n- **Founder ecosystem play**: SCALE accelerator Cohort 1 recap targets early-stage founders in agentic AI, multi-model applications, and robotics; the program is equity-free and includes infrastructure and GTM mentorship [W6](https://www.gmicloud.ai/en/blog/scale-at-gmi-cohort-1-recap).\n\n## Shipping\n- **AgentBox**: Launched June 8, 2026, as an agent marketplace with pre-built agents for code review, retrieval, and benchmarks, running on GMI Cloud infrastructure [W5](https://www.gmicloud.ai/en/blog/agentbox-is-live-the-whole-stack-for-production-ai-agents-in-one-place).\n- **Nemotron 3 Ultra**: Day-0 availability on GMI Cloud's GB200/B200/B300/H200 clusters as of June 4, 2026 [W1](https://www.gmicloud.ai/en/blog/gmi-cloud-brings-nvidia-nemotron-3-ultra-to-developers-on-day-0).\n- **Kimi K2.6**: Supported on serverless and dedicated H100/H200 with an OpenAI-compatible API at published per-GPU-hour pricing [W3](https://www.gmicloud.ai/en/blog/which-gpu-clouds-support-kimi-k2-and-the-latest-open-source-llms).\n- **Fireworks AI partnership**: Announced June 2, 2026; GMI Cloud provides inference infrastructure for Fireworks' enterprise platform [W4](https://www.gmicloud.ai/en/blog/building-the-infrastructure-for-production-ai-fireworks-ai-gmi-cloud).\n- **SCALE Cohort 1**: Completed May 2026; Cohort 2 recruiting [W6](https://www.gmicloud.ai/en/blog/scale-at-gmi-cohort-1-recap).\n\n## Research themes\nNo cited evidence of internal model training, fundamental ML research publications, or research organization. 
All cited activity is platform engineering, infrastructure operations, and partnership-driven model hosting, consistent with a neocloud operator rather than a model builder [W1](https://www.gmicloud.ai/en/blog/gmi-cloud-brings-nvidia-nemotron-3-ultra-to-developers-on-day-0)[W3](https://www.gmicloud.ai/en/blog/which-gpu-clouds-support-kimi-k2-and-the-latest-open-source-llms)[W4](https://www.gmicloud.ai/en/blog/building-the-infrastructure-for-production-ai-fireworks-ai-gmi-cloud)[W5](https://www.gmicloud.ai/en/blog/agentbox-is-live-the-whole-stack-for-production-ai-agents-in-one-place). The two MLE roles [P1](https://www.gmicloud.ai/en/company/career)[E7](https://www.gmicloud.ai/en/company/career)[E9](https://www.gmicloud.ai/en/company/career) may indicate applied inference optimization (kernel work, quantization, batching), but no research artifacts are cited to confirm this. The AgentBox benchmark agents run existing suites (MMLU, HumanEval) rather than contributing new eval research [W5](https://www.gmicloud.ai/en/blog/agentbox-is-live-the-whole-stack-for-production-ai-agents-in-one-place).\n\n## Hiring & scaling\nTen active roles across four functions as of June 2026 [P1](https://www.gmicloud.ai/en/company/career):\n- **Engineering (4)**: TPM [E1](https://www.gmicloud.ai/en/company/career), SRE/Kubernetes [E5](https://www.gmicloud.ai/en/company/career), MLE [E9](https://www.gmicloud.ai/en/company/career), MLE (LLM Inference) [E7](https://www.gmicloud.ai/en/company/career)\n- **Product (2)**: Inference Engine PM [E4](https://www.gmicloud.ai/en/company/career), Product Management Operations [P1](https://www.gmicloud.ai/en/company/career)\n- **GTM (3)**: Solutions Architect [E2](https://www.gmicloud.ai/en/company/career), BD Manager (Inference Engine) [E6](https://www.gmicloud.ai/en/company/career), Content & Growth Marketer [E8](https://www.gmicloud.ai/en/company/career)\n- **HR (1)**: Talent Acquisition Partner [E3](https://www.gmicloud.ai/en/company/career)\n\nThe dedicated 
Talent Acquisition Partner [E3](https://www.gmicloud.ai/en/company/career) signals that hiring velocity itself is being scaled. The pairing of an Inference Engine PM [E4](https://www.gmicloud.ai/en/company/career) with a BD Manager for the same product [E6](https://www.gmicloud.ai/en/company/career) indicates a named platform product entering active go-to-market. Geographic center of gravity is Mountain View (8 roles), with infrastructure SRE open to broader US remote [E5](https://www.gmicloud.ai/en/company/career) and Solutions Architect listed as \"US\" [E2](https://www.gmicloud.ai/en/company/career).\n\n## Category implications\n- **Infrastructure strategy**: GMI Cloud is aligning its capital deployment to NVIDIA's hardware roadmap (H200 → Blackwell GB200/B200/B300 → Vera Rubin), betting that inference-optimized instances with NVLink/InfiniBand will differentiate against general-purpose cloud GPUs [W1](https://www.gmicloud.ai/en/blog/gmi-cloud-brings-nvidia-nemotron-3-ultra-to-developers-on-day-0)[W2](https://www.gmicloud.ai/en/blog/gmi-cloud-supports-the-next-era-of-ai-factories-with-nvidia-vera-rubin). NVIDIA Reference Architecture validation [W1](https://www.gmicloud.ai/en/blog/gmi-cloud-brings-nvidia-nemotron-3-ultra-to-developers-on-day-0) and Reference Platform Cloud Partner designation [W4](https://www.gmicloud.ai/en/blog/building-the-infrastructure-for-production-ai-fireworks-ai-gmi-cloud) imply preferential supply-chain access, which is existential in a GPU-constrained market.\n- **Product strategy**: AgentBox [W5](https://www.gmicloud.ai/en/blog/agentbox-is-live-the-whole-stack-for-production-ai-agents-in-one-place) moves GMI Cloud up the stack from IaaS (bare-metal GPU rental) to a managed platform with an agent marketplace — competing with serverless inference endpoints (Together, Fireworks, Modal) while monetizing the underlying compute. 
The \"Inference Engine\" product/BD pair [E4](https://www.gmicloud.ai/en/company/career)[E6](https://www.gmicloud.ai/en/company/career) suggests a parallel managed-API product distinct from the AgentBox marketplace.\n- **GTM implications**: Dual go-to-market in evidence: (1) enterprise managed inference via the Fireworks AI partnership, which brings named customers (Uber, Genspark, Shopify) [W4](https://www.gmicloud.ai/en/blog/building-the-infrastructure-for-production-ai-fireworks-ai-gmi-cloud); (2) developer/founder ecosystem via SCALE accelerator [W6](https://www.gmicloud.ai/en/blog/scale-at-gmi-cohort-1-recap) and transparent open-model pricing [W3](https://www.gmicloud.ai/en/blog/which-gpu-clouds-support-kimi-k2-and-the-latest-open-source-llms). The Content & Growth Marketer hire [E8](https://www.gmicloud.ai/en/company/career) indicates scaling of both content-led growth and the accelerator program.\n- **Hiring implications**: The concentration of inference engineering (2× MLE) [E7](https://www.gmicloud.ai/en/company/career)[E9](https://www.gmicloud.ai/en/company/career), inference product [E4](https://www.gmicloud.ai/en/company/career), and inference BD [E6](https://www.gmicloud.ai/en/company/career) around a single named product suggests the Inference Engine is the near-term commercialization priority, not training, fine-tuning, or data pipeline services.\n- **Thin spots**: No cited evidence for training infrastructure, model fine-tuning services, custom silicon, data pipeline tooling, safety research, or eval framework development. 
The evidence depicts an inference-only neocloud hosting third-party models rather than building or fine-tuning its own [W1](https://www.gmicloud.ai/en/blog/gmi-cloud-brings-nvidia-nemotron-3-ultra-to-developers-on-day-0)[W3](https://www.gmicloud.ai/en/blog/which-gpu-clouds-support-kimi-k2-and-the-latest-open-source-llms)[W4](https://www.gmicloud.ai/en/blog/building-the-infrastructure-for-production-ai-fireworks-ai-gmi-cloud)[W5](https://www.gmicloud.ai/en/blog/agentbox-is-live-the-whole-stack-for-production-ai-agents-in-one-place).\n\n## Traction highlights\n- Enterprise inference customers named via Fireworks AI: Uber, Genspark, Shopify [W4](https://www.gmicloud.ai/en/blog/building-the-infrastructure-for-production-ai-fireworks-ai-gmi-cloud).\n- NVIDIA Reference Platform Cloud Partner (inaugural cohort) [W4](https://www.gmicloud.ai/en/blog/building-the-infrastructure-for-production-ai-fireworks-ai-gmi-cloud); NVIDIA Reference Architecture-validated infrastructure [W1](https://www.gmicloud.ai/en/blog/gmi-cloud-brings-nvidia-nemotron-3-ultra-to-developers-on-day-0).\n- SCALE accelerator operational: Cohort 1 completed, Cohort 2 recruiting [W6](https://www.gmicloud.ai/en/blog/scale-at-gmi-cohort-1-recap).\n- AgentBox launched with multiple pre-built agents and a publisher model for third-party builders [W5](https://www.gmicloud.ai/en/blog/agentbox-is-live-the-whole-stack-for-production-ai-agents-in-one-place).\n- Day-0 availability of NVIDIA Nemotron 3 Ultra positions GMI Cloud as a launch partner for NVIDIA's frontier agentic models [W1](https://www.gmicloud.ai/en/blog/gmi-cloud-brings-nvidia-nemotron-3-ultra-to-developers-on-day-0).\n\n## Sources\n- [P1](https://www.gmicloud.ai/en/company/career) GMI Cloud Careers Page, 2026-06-11\n- [E1](https://www.gmicloud.ai/en/company/career) Technical Program Manager (TPM) job opened, 2026-06-05\n- [E2](https://www.gmicloud.ai/en/company/career) Solutions Architect job opened, 2026-06-05\n- 
[E3](https://www.gmicloud.ai/en/company/career) Talent Acquisition Partner job opened, 2026-06-05\n- [E4](https://www.gmicloud.ai/en/company/career) Inference Engine Product Manager job opened, 2026-06-05\n- [E5](https://www.gmicloud.ai/en/company/career) Infra Engineer – SRE (Kubernetes) job opened, 2026-06-05\n- [E6](https://www.gmicloud.ai/en/company/career) BD Manager, Inference Engine job opened, 2026-06-05\n- [E7](https://www.gmicloud.ai/en/company/career) Machine Learning Engineer (LLM Inference) job opened, 2026-06-05\n- [E8](https://www.gmicloud.ai/en/company/career) Content & Growth Marketer job opened, 2026-06-05\n- [E9](https://www.gmicloud.ai/en/company/career) Machine Learning Engineer job opened, 2026-06-05\n- [W1](https://www.gmicloud.ai/en/blog/gmi-cloud-brings-nvidia-nemotron-3-ultra-to-developers-on-day-0) \"NVIDIA Nemotron 3 Ultra Day-0 Access on GMI Cloud\", 2026-06-04\n- [W2](https://www.gmicloud.ai/en/blog/gmi-cloud-supports-the-next-era-of-ai-factories-with-nvidia-vera-rubin) \"GMI Cloud Supports the Next Era of AI Factories with NVIDIA Vera Rubin\", 2026-06-01\n- [W3](https://www.gmicloud.ai/en/blog/which-gpu-clouds-support-kimi-k2-and-the-latest-open-source-llms) \"Best GPU Cloud for Open-Source LLMs and Kimi K2.6 Models\", 2026-06-04\n- [W4](https://www.gmicloud.ai/en/blog/building-the-infrastructure-for-production-ai-fireworks-ai-gmi-cloud) \"Building the Infrastructure for Production AI: Fireworks AI + GMI Cloud\", 2026-06-02\n- [W5](https://www.gmicloud.ai/en/blog/agentbox-is-live-the-whole-stack-for-production-ai-agents-in-one-place) \"AgentBox is live: the whole stack for production AI agents, in one place\", 2026-06-08\n- [W6](https://www.gmicloud.ai/en/blog/scale-at-gmi-cohort-1-recap) \"SCALE at GMI Cohort 1 Recap\", 
2026-05-14","generated_at":"2026-06-27T19:12:49.47+00:00","citations":[{"url":"https://www.gmicloud.ai/en/company/career","path":null,"label":"gmicloud.ai/en","type":"external"},{"url":"https://www.gmicloud.ai/en/company/career","path":null,"label":"gmicloud.ai/en","type":"external"},{"url":"https://www.gmicloud.ai/en/company/career","path":null,"label":"gmicloud.ai/en","type":"external"},{"url":"https://www.gmicloud.ai/en/blog/agentbox-is-live-the-whole-stack-for-production-ai-agents-in-one-place","path":null,"label":"gmicloud.ai/en","type":"external"},{"url":"https://www.gmicloud.ai/en/blog/gmi-cloud-brings-nvidia-nemotron-3-ultra-to-developers-on-day-0","path":null,"label":"gmicloud.ai/en","type":"external"},{"url":"https://www.gmicloud.ai/en/blog/gmi-cloud-supports-the-next-era-of-ai-factories-with-nvidia-vera-rubin","path":null,"label":"gmicloud.ai/en","type":"external"},{"url":"https://www.gmicloud.ai/en/blog/building-the-infrastructure-for-production-ai-fireworks-ai-gmi-cloud","path":null,"label":"gmicloud.ai/en","type":"external"},{"url":"https://www.gmicloud.ai/en/blog/scale-at-gmi-cohort-1-recap","path":null,"label":"gmicloud.ai/en","type":"external"},{"url":"https://www.gmicloud.ai/en/company/career","path":null,"label":"gmicloud.ai/en","type":"external"},{"url":"https://www.gmicloud.ai/en/company/career","path":null,"label":"gmicloud.ai/en","type":"external"},{"url":"https://www.gmicloud.ai/en/company/career","path":null,"label":"gmicloud.ai/en","type":"external"},{"url":"https://www.gmicloud.ai/en/company/career","path":null,"label":"gmicloud.ai/en","type":"external"},{"url":"https://www.gmicloud.ai/en/company/career","path":null,"label":"gmicloud.ai/en","type":"external"},{"url":"https://www.gmicloud.ai/en/company/career","path":null,"label":"gmicloud.ai/en","type":"external"},{"url":"https://www.gmicloud.ai/en/company/career","path":null,"label":"gmicloud.ai/en","type":"external"},{"url":"https://www.gmicloud.ai/en/blog/which-gpu-clouds-support-kimi-k
2-and-the-latest-open-source-llms","path":null,"label":"gmicloud.ai/en","type":"external"}],"provenance":{"provider":"deepseek","model":"deepseek-v4-pro","workflow":"onlylabs-deepagents-analysis-v3","agent":"deepagents"},"evidence":{"total":16,"pages":1,"events":9,"web":6,"signal_desks":{"forks":0,"repos":0,"hiring":9,"talking":0,"releases":0},"data_radar_lanes":null,"data_radar_matches":null}},"signal_counts":{"total":9,"model_released":0,"release":0,"repo_new":0,"repo_forked":0,"post_published":0,"job_opened":9}}