In a move that signals a major advancement for Japan’s internal technological infrastructure, SoftBank Corp. officially launched its new artificial intelligence computing platform on December 25, 2025. This high-performance installation is built around the NVIDIA GB200 NVL72 system, marking a pivotal moment in the nation’s ability to support the development of next-generation generative AI technologies [1]. By providing the massive compute resources required for modern large-scale models, the platform is positioned as a cornerstone of Japan’s domestic AI capabilities.
The deployment is particularly notable for its integration of advanced thermal management systems. It represents the first commercial use of liquid-cooled NVIDIA GB200 NVL72 systems within Japan [1]. This transition to liquid cooling is not merely a preference but a technical necessity; the Blackwell architecture utilized in these systems generates extreme heat that requires specialized management to maintain energy efficiency. In the high-density data center environments where these units operate, liquid cooling allows for sustained performance that would be difficult to achieve with traditional air-cooled methods [1].
Advanced Architecture and Performance Capabilities
The heart of the SoftBank platform is the NVIDIA GB200 NVL72 architecture, which represents a massive leap in hardware integration. This system functions by connecting 72 Blackwell GPUs into what effectively acts as a single, massive GPU via the NVLink interconnect [1]. This level of hardware synergy provides computing power that was previously restricted to the world’s largest global hyperscalers, allowing SoftBank to offer elite-tier performance to a broader range of domestic users.
The performance metrics associated with this architecture are significant, particularly in the realm of model deployment. The GB200 NVL72 offers a 30x performance increase for Large Language Model (LLM) inference compared to previous generations [1]. This surge in efficiency is critical as the industry shifts from purely training models to deploying them in real-time, high-demand environments. The ability to process inference tasks thirty times faster allows for more responsive applications and lower operational costs for developers using the SoftBank infrastructure.
Strategic Infrastructure and Sovereign AI
SoftBank has strategically located this AI platform in the Kanto region to ensure optimal service for Japan’s primary economic hub. By placing the hardware in close proximity to Tokyo, the company can provide low-latency access to the capital’s dense ecosystem of enterprises and startups [1]. This geographic positioning is vital for real-time AI applications where even millisecond delays can impact the utility of the service.
Beyond latency, the Kanto-based facility is a central component of Japan’s “Sovereign AI” initiatives. By hosting these powerful resources domestically, SoftBank ensures that sensitive data remains within national borders, addressing growing concerns over data residency and security [1]. This infrastructure allows Japanese companies to develop and deploy AI models while adhering to local regulatory standards and maintaining control over their proprietary information.
Collaborative Development and the Japanese Supply Chain
The construction and rollout of this AI infrastructure have been bolstered by a significant collaboration between SoftBank and Fujitsu. By leveraging Fujitsu’s established expertise in high-performance server technology, the two companies have created a robust foundation for the new platform [1]. This partnership is viewed as a strategic move to strengthen the Japanese supply chain for AI hardware.
By developing domestic expertise and infrastructure through this collaboration, Japan is working to reduce its reliance on purely Western infrastructure providers [1]. This move toward self-sufficiency in high-end computing hardware ensures that the Japanese tech sector remains resilient and capable of supporting its own growth without being entirely dependent on external supply chains that may be subject to global volatility.
Market Drivers: The Rise of Agentic AI
The massive capital investments seen in late 2025, including SoftBank’s platform launch, are being driven by a fundamental shift in AI capabilities: the rise of “Agentic AI” [2]. Unlike standard chatbots, Agentic AI refers to systems that can autonomously execute complex, multi-step tasks without constant human intervention. These autonomous agents require a different level of computational support than earlier iterations of generative AI.
SoftBank’s new platform is specifically engineered to handle the continuous and complex processing cycles necessitated by these autonomous agents [1]. Because Agentic AI must maintain context over long sequences of actions and perform real-time reasoning, the high-density compute provided by the GB200 NVL72 is essential. The industry’s focus has moved toward creating systems that can “execute” rather than just “generate,” making high-performance hardware a prerequisite for staying competitive in the 2025 landscape [2].
NVIDIA’s Strategic Market Expansion
The hardware landscape has been further transformed by NVIDIA’s aggressive moves to control the entire AI lifecycle. In late 2025, NVIDIA acquired the AI inference startup Groq for $20 billion [1]. This acquisition was a strategic effort to dominate the inference market with Groq’s Language Processing Unit (LPU) technology, complementing NVIDIA’s existing dominance in the training market with the Blackwell-based GB200 systems.
By owning both the training hardware (GB200) and specialized inference technology (Groq), NVIDIA aims to provide a comprehensive ecosystem for AI development [1]. This vertical integration means that platforms like SoftBank’s are part of a broader global shift where the hardware used to build a model is increasingly integrated with the hardware used to run it, streamlining the path from development to commercial application.
Economic Impacts and the Efficiency Shift
The unprecedented scale of infrastructure spending throughout 2025 has had a profound impact on the global economy and the tech sector’s leadership. The massive capital expenditure (CapEx) required for chips like the GB200 has fueled a historic surge in the wealth of tech leaders [1]. This financial success is a direct reflection of the massive investments companies like SoftBank are making to secure the future of AI compute. The scale of these investments indicates a “winner-takes-all” dynamic where access to the most advanced hardware is a primary differentiator for success.
However, the boom is also being shaped by a new focus on algorithmic efficiency. The release of high-efficiency models, such as DeepSeek, in 2025 has forced hardware providers to reconsider their strategies [1]. While raw power remains essential, there is an increasing demand for platforms that can optimize for both high-performance compute and algorithmic efficiency. SoftBank’s platform is designed to be flexible enough to support these rapidly evolving architectures, ensuring it can accommodate models that prioritize being “lean” just as effectively as those that are “massive” [1].
Conclusion
The launch of SoftBank’s liquid-cooled AI computing platform represents more than just a hardware upgrade; it is a strategic assertion of Japan’s role in the global AI economy. By combining the 30x inference performance of the NVIDIA GB200 NVL72 with the domestic server expertise of Fujitsu, SoftBank has created an environment capable of supporting the most demanding Agentic AI applications [1][2]. As the industry continues to evolve toward autonomous systems and more efficient model architectures, the availability of high-density, low-latency compute in the Kanto region will likely be a defining factor in Japan’s technological sovereignty and its ability to lead in the age of artificial intelligence.






