Taiwan AI Labs Releases BLOOM-zh Traditional Chinese Language Model↗
via Taipei Times
Taiwan AI Labs has released BLOOM-zh, a large language model specifically optimized for Traditional Chinese. Built on the BLOOM architecture with additional pre-training on Taiwanese web data, government documents, and academic literature, the model addresses the performance gap between Simplified and Traditional Chinese in existing multilingual models. The 13B parameter model is available under an open license and shows strong performance on Mandarin comprehension benchmarks.
Analysis — BLOOM-zh is as much a cultural sovereignty play as a technical one — Taiwan needs NLP models that understand Traditional Chinese nuance without mainland training data bias. The government document pre-training is the strategic ingredient.