Application No.: 201811418100.0 Publication No.: CN111221574A Main classification: G06F9/30
Abstract: The invention provides a multi-core parallel processing method for fast transposition of large matrices, comprising the following steps. Step 1: each DSP core uses EDMA to move the sub-matrix it is to process, A_i(N,M), i ∈ [0, x-1], from the large external memory into the SRAM cache. Step 2: the x cores operate in parallel; the CPU transposes the cached data using optimized inline functions to obtain A_i^T(N,M), i ∈ [0, x-1], and the result data is then moved back to the large external memory via EDMA. The invention increases data processing speed.
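The two steps in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: Python threads stand in for the x DSP cores, plain list copies stand in for the EDMA transfers between external memory and SRAM, and `zip` stands in for the optimized inline functions. The names `parallel_transpose`, `transpose_block`, and `num_cores` are hypothetical, and splitting the matrix into contiguous row blocks is an assumption, since the abstract does not say how the sub-matrices A_i are chosen.

```python
from concurrent.futures import ThreadPoolExecutor


def transpose_block(block):
    """Transpose one sub-matrix held in a local buffer (a list of rows)."""
    return [list(col) for col in zip(*block)]


def parallel_transpose(matrix, num_cores):
    """Transpose `matrix` (a list of equal-length rows) with `num_cores` workers.

    Assumption: worker i handles a contiguous block of rows, playing the
    role of sub-matrix A_i in the abstract.
    """
    n_rows = len(matrix)
    n_cols = len(matrix[0]) if n_rows else 0
    # Output buffer, standing in for the large external memory.
    result = [[None] * n_rows for _ in range(n_cols)]
    # Row range [start, stop) assigned to each worker.
    bounds = [
        (i * n_rows // num_cores, (i + 1) * n_rows // num_cores)
        for i in range(num_cores)
    ]

    def worker(bound):
        start, stop = bound
        if start == stop:
            return  # more workers than rows: nothing to do
        # Step 1: copy the sub-matrix into a local buffer (EDMA-in stand-in).
        local = [row[:] for row in matrix[start:stop]]
        # Step 2: transpose locally, then copy back (EDMA-out stand-in).
        local_t = transpose_block(local)
        for c, col in enumerate(local_t):
            result[c][start:stop] = col

    with ThreadPoolExecutor(max_workers=num_cores) as pool:
        list(pool.map(worker, bounds))  # list() surfaces worker exceptions
    return result
```

Each worker writes to a disjoint column range of the output, so no locking is needed; this mirrors the idea that each core moves its own result block back independently.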