User's Manual

Cambricon Technologies, MLU270-F Series Intelligent Processing Card User Manual
1. Product Brief
Figure 1. MLU270-F Series Intelligent Processing Card appearance
The MLU270-F series is purpose-built for AI inference and data center acceleration,
with a high EER (Energy Efficiency Rate).
The SIYUAN 270 ASIC is built on several of Cambricon's innovative processor architecture
technologies. Together with many up-to-date features, it is integrated into a standard FHFL
(full-height, full-length) PCIe card that can be installed in a modern AI PC or server to
greatly extend its AI inference computing power. The MLU270-F series has a moderate TDP of
150 W and delivers up to 4 times the computing power of the previous generation. It can be
widely used in applications such as vision, speech, natural language processing, and
traditional machine learning, among many other AI scenarios, and it raises the EER of an AI
inference platform.
Brand-new Cambricon MLUv02 Architecture
The MLUv02 architecture is not a simple update of the previous generation but a
brand-new design based on a NoC (Network on Chip), which ensures efficient parallel
execution across the 16 NPU clusters within the SIYUAN 270 ASIC. A dedicated hardware
engine compresses the dataflow within the chip, which effectively increases data capacity
and bandwidth. The new architecture supports the INT16, INT8, INT4, FP32, and FP16
precisions and provides the computing power needed by many kinds of neural networks. In
short, the new architecture offers customers both broad generality and high performance.
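As a rough illustration of what the lower-precision integer formats listed above trade
away, the following is a minimal sketch of generic symmetric quantization in plain Python.
It is not Cambricon's toolchain or API, and it does not describe how the SIYUAN 270
hardware quantizes data. The function names and sample weights are hypothetical and chosen
only for this example.

```python
def quantize_symmetric(values, bits=8):
    """Map floats to signed integers of the given bit width using one shared scale.

    Generic illustration only; not a Cambricon API.
    """
    qmax = 2 ** (bits - 1) - 1              # 127 for INT8, 7 for INT4
    max_abs = max(abs(v) for v in values)
    scale = max_abs / qmax if max_abs > 0 else 1.0
    # Round to the nearest integer step and clamp to the symmetric range.
    quantized = [max(-qmax, min(qmax, round(v / scale))) for v in values]
    return quantized, scale


def dequantize(quantized, scale):
    """Recover approximate float values from the integer codes."""
    return [q * scale for q in quantized]


def max_abs_error(values, bits):
    """Largest round-trip error introduced by quantizing at the given width."""
    quantized, scale = quantize_symmetric(values, bits)
    restored = dequantize(quantized, scale)
    return max(abs(a - b) for a, b in zip(values, restored))


if __name__ == "__main__":
    # Hypothetical sample weights, for illustration only.
    weights = [0.8137, -0.4129, 0.0561, -1.2703, 0.3318]
    for bits in (16, 8, 4):
        print(f"INT{bits}: max abs error = {max_abs_error(weights, bits):.6f}")
```

Narrower integer formats use fewer bits per value, which reduces storage and data movement,
at the cost of a larger rounding error. Which precision is acceptable depends on the
network and must be validated per model.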
A further step in inference performance
When INT8 precision is used for AI inference, performance on non-sparse networks is up
to 4 times that of the previous generation. The MLU270-F series delivers an EER up to 40
times that of a typical CPU. It also embeds newly designed encoding/decoding hardware
for video and images, so when the system has to