Sunway TaihuLight explained

Location:National Supercomputer Center, Wuxi, Jiangsu, China
Dates:June 2016
Memory:1.32 PB (5591 TB/s total bandwidth)
Storage:20 PB
Speed:1.45 GHz (3.06 TFlops single CPU, 105 PFLOPS LINPACK, 125 PFLOPS peak)
Os:Sunway RaiseOS 2.0.5 (based on Linux)
Power:15 MW (LINPACK)
Cost:1.8 billion Yuan (US$273 million)
Purpose:Oil prospecting, life sciences, weather forecast, industrial design, pharmaceutical research

The Sunway TaihuLight (Shénwēi·tàihú zhī guāng) is a Chinese supercomputer which,, is ranked 11th in the TOP500 list,[1] with a LINPACK benchmark rating of 93 petaflops.[2] The name is translated as divine power, the light of Taihu Lake.[3] This is nearly three times as fast as the previous Tianhe-2, which ran at 34 petaflops., it is ranked as the 16th most energy-efficient supercomputer in the Green500,[4] with an efficiency of 6.1 GFlops/watt. It was designed by the National Research Center of Parallel Computer Engineering & Technology (NRCPC) and is located at the National Supercomputing Center in Wuxi in the city of Wuxi, in Jiangsu province, China.[2]

The Sunway TaihuLight was the world's fastest supercomputer for two years, from June 2016 to June 2018, according to the TOP500 lists. The record was surpassed in June 2018 by IBM's Summit.[5] [6] [7]

Architecture

The Sunway TaihuLight utilizes domestically developed semiconductors, including a total of 40,960 Chinese-designed SW26010 manycore 64-bit RISC processors based on the Sunway architecture.[8] Each processor chip contains 256 processing cores, and an additional four auxiliary cores for system management (also RISC cores, just more fully featured) for a total of 10,649,600 CPU cores across the entire system.[8]

The processing cores feature 64 KB of scratchpad memory for data (and 16 KB[8] for instructions) and communicate via a network on a chip, instead of having a traditional cache hierarchy.[9]

Software

The system runs on its own operating system, Sunway RaiseOS 2.0.5, which is based on Linux.[8] The system has its own customized implementation of OpenACC 2.0 to aid the parallelization of code.[10]

Future development

China's first exascale supercomputer was scheduled to enter service by 2020 according to the head of the school of computing at the National University of Defense Technology (NUDT). According to the national plan for the next generation of high performance computers, the country would have develop an exascale computer during the 13th Five-Year-Plan period (2016–2020). The government of Tianjin Binhai New Area, NUDT and the National Supercomputing Center of Tianjin are working on the project.[11] The investment is likely to hit 3 billion yuan ($470.6 million).[12]

See also

External links

31.5486°N 120.248°W

Notes and References

  1. Web site: TOP500 List November 2023 . . 2024-03-08.
  2. Web site: China Tops Supercomputer Rankings with New 93-Petaflop Machine. top500.org. 2016-06-20.
  3. Web site: Rise of the supercomputers: Developing software, talent are key challenges for China . . 2019-01-15. 2018-01-28.
  4. Web site: The Green500 List - June 2017. Green500 .
  5. Web site: TOP500 - November 2017. TOP500. 25 June 2018.
  6. Web site: World's Fastest Supercomputer Now Has Chinese Chip Technology. Clark. Jack. King. Ian. 2016-06-20. Bloomberg.com. 2016-06-20.
  7. Web site: China builds world's most powerful computer. 2016-06-20. BBC News. 2016-06-20.
  8. Web site: Report on the Sunway TaihuLight System. Dongarra. Jack. 2016-06-20. netlib.org. 2016-06-20 . Each CPE Cluster is composed of a Management Processing Element (MPE) which is a 64-bit RISC core which is supporting both user and system modes, a 256-bit vector instructions, 32 KB L1 instruction cache and 32 KB L1 data cache, and a 256KB L2 cache. The Computer Processing Element (CPE) is composed of an 8×8 mesh of 64-bit RISC cores, supporting only user mode, with a 256-bit vector instructions, 16 KB L1 instruction cache and 64 KB Scratch Pad Memory (SPM). [..] Each CPE has a 64 KB local (scratchpad) memory, no cache memory. The local memory is SRAM. There is a 16KB instruction cache. Each of the 4 CPE/MPE clusters has 8 GB of DDR3 memory. So a node has 32 GB of primary memory. Each processor connects to four 128-bit DDR3-2133 memory controllers, with a memory bandwidth of 136.51 GB/s..
  9. Web site: Meet the new world's fastest supercomputer: China's TaihuLight. Lendino. James. 2016-06-20. Extremetech. 2016-06-21. The TOP500 report said that the chip also lacks any traditional L1-L2-L3 cache, and instead has 12KB[sic] of instruction cache and 64KB "local scratchpad" that works sort of like an L1 cache..
  10. Fu. Haohuan. Liao. Junfeng. Yang. Jinzhe. Wang. Lanning. Song. Zhenya. Huang. Xiaomeng. Yang. Chao. Xue. Wei. Liu. Fangfang. Qiao. Fangli. Zhao. Wei. Yin. Xunqiang. Hou. Chaofeng. Zhang. Chenglong. Ge. Wei. Zhang. Jian. Wang. Yangang. Zhou. Chunbo. Yang. Guangwen. 2016. The Sunway TaihuLight Supercomputer: System and Applications. Sci. China Inf. Sci.. 59. 7. 10.1007/s11432-016-5588-7. free.
  11. Web site: China's Exascale Supercomputer Operational by 2020---Chinese Academy of Sciences. english.cas.cn.
  12. Web site: One Billion Billion! Tianhe-3 Exascale Supercomputer Prototype Passes Tests. 27 July 2018. Medium.