US Patent:
20210158131, May 27, 2021
Inventors:
- Seattle WA, US
Yizhi Liu - Fremont CA, US
Hongbin Zheng - San Jose CA, US
Jeffrey T. Huynh - San Jose CA, US
Haichen Li - Campbell CA, US
Drazen Borkovic - Los Altos CA, US
Jindrich Zejda - Saratoga CA, US
Richard John Heaton - San Jose CA, US
Randy Renfu Huang - Morgan Hill CA, US
Zhi Chen - Santa Clara CA, US
Yida Wang - Palo Alto CA, US
International Classification:
G06N 3/063
G06N 3/04
Abstract:
Methods and apparatuses for hierarchical partitioning of the operators of a neural network for execution on an acceleration engine are provided. Neural networks are built in machine learning frameworks using neural network operators, which are compiled into executable code for the acceleration engine. New framework-level operators can be developed faster than they can be mapped onto the acceleration engine. To enable neural networks to be executed on an acceleration engine despite this gap, hierarchical partitioning can be used to partition the operators of the neural network. The hierarchical partitioning can identify operators that are supported by a compiler for execution on the acceleration engine, operators to be compiled for execution on a host processor, and operators to be executed on the machine learning framework.
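The three-tier assignment described in the abstract can be illustrated with a minimal sketch. This is not the claimed method. It assumes a network that is a simple linear sequence of operator names, and the support sets, `Target`, `assign_target`, and `partition` are all hypothetical names introduced only for illustration. Each operator goes to the first tier that supports it (acceleration engine, then host processor, then the framework as a fallback), and consecutive operators with the same target are grouped into one segment.

```python
from enum import Enum


class Target(Enum):
    ACCELERATOR = "accelerator"  # compiled for the acceleration engine
    HOST = "host"                # compiled for the host processor
    FRAMEWORK = "framework"      # left to the machine learning framework


# Hypothetical support sets; a real compiler would report these itself.
ACCEL_SUPPORTED = {"conv2d", "matmul", "relu"}
HOST_SUPPORTED = {"softmax", "transpose"}


def assign_target(op):
    """Pick the first tier in the hierarchy that supports the operator."""
    if op in ACCEL_SUPPORTED:
        return Target.ACCELERATOR
    if op in HOST_SUPPORTED:
        return Target.HOST
    return Target.FRAMEWORK


def partition(ops):
    """Group consecutive operators that share a target into segments."""
    segments = []
    for op in ops:
        target = assign_target(op)
        if segments and segments[-1][0] is target:
            # Same target as the previous operator: extend that segment.
            segments[-1][1].append(op)
        else:
            # Target changed: start a new segment.
            segments.append((target, [op]))
    return segments


if __name__ == "__main__":
    network = ["conv2d", "relu", "softmax", "custom_op", "matmul"]
    for target, segment_ops in partition(network):
        print(target.value, segment_ops)
```

Grouping adjacent operators by target keeps the number of hand-offs between the acceleration engine, the host processor, and the framework small. A real network is a graph and not a list, so an actual partitioner would also have to respect data dependencies between operators.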