Taiwan AI corpus grows to 1.1bn data units - News Summed Up

Taiwan AI corpus grows to 1.1bn data units


Taiwan AI corpus grows to 1.1bn data unitsLANGUAGE TOOLS: The dictionaries category of the training corpus consistently ranks among users’ most frequently searched topics, the Ministry of Digital Affairs saidBy Chiu Chiao-chen and Shelley Shan / Staff reportersTaiwan’s Sovereign AI Training Corpus has grown to include more than 1.1 billion tokens just more than a month after its official launch, the Ministry of Digital Affairs said yesterday. The platform initially contained more than 2,000 datasets totaling more than 600 million units of data, also known as tokens, Department of Data Innovation Director-General Chuang Ming-fen (莊明芬) said. Ministry of Digital Affairs officials introduce the Sovereign AI Training Corpus at a news conference in Taipei on Dec. 24 last year. “That shows that people in research institutions, government agencies and the corporate world pay close attention to the high-quality data released by the government to train sovereign AI databases. In related news, the government’s Open Data Platform has attracted about 175.84 million views and 22.27 million downloads more than a decade since its launch.


Source: Taipei Times January 27, 2026 16:05 UTC



Loading...
Loading...
  

Loading...

                           
/* -------------------------- overlay advertisemnt -------------------------- */