Views: 491 | Replies: 0

[General Resources] A TMS320C54x Implementation of Cyclic Redundancy Check Computation

Posted on 2007-10-26 15:08:12
A TMS320C54x Implementation of Cyclic Redundancy Check Computation
【File name】: 071026@52RD_循环冗余校验计算一个TMS320C54x实现方案 .PDF
【Format】: pdf
【Size】: 33K
【Summary】:
Design Problem
In many typical DSP applications, loops account for the majority of the cycles, or
MIPS, consumed. Because of this, loop performance can greatly affect the performance
of the entire application. Many of these loops are nested, with both an inner and an
outer loop. Common examples are FIR and IIR filters, the FFT, and the DCT. To optimize
these nested loops, it is necessary to consider not only inner loop performance but
also outer loop performance, especially when the inner loop count is small for each
execution of the outer loop.
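As a rough illustration of the nested-loop shape described above, here is a minimal C sketch of an FIR filter. The function name `fir_filter`, its parameters, and the data types are assumptions made for this example and do not come from the PDF. Taps are applied in array order over a sliding window of the input.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical FIR filter written as a nested loop.
 * x must hold at least n_out + n_taps - 1 input samples. */
void fir_filter(const short *x, const short *h, long *y,
                size_t n_out, size_t n_taps)
{
    assert(n_taps > 0);
    for (size_t i = 0; i < n_out; i++) {       /* outer loop: one output sample */
        long acc = 0;                           /* outer loop work: clear accumulator */
        for (size_t k = 0; k < n_taps; k++) {   /* inner loop: multiply-accumulate */
            acc += (long)x[i + k] * h[k];
        }
        y[i] = acc;                             /* outer loop work: store the result */
    }
}
```

With a short filter (a small `n_taps`), the inner loop runs only a few times per output sample, so the work done around it in the outer loop becomes a noticeable share of the total.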
One technique used to optimize loops on the highly parallel C6x VelociTI architecture is
software pipelining. This involves initiating new iterations of the loop before previous
iterations have completed, in order to obtain high throughput. This implies that some
cycles are needed to begin executing, or pipe up, each inner loop (the loop prolog) and
some more cycles to pipe down the loop (the loop epilog). These cycles are incurred on
every outer loop iteration, so they can affect performance, especially when the inner
loop count is small. The more deeply pipelined the DSP is, the more cycles are required
for the prolog and epilog.
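The prolog / kernel / epilog structure can be sketched by hand in C. This is only an illustration of the idea with a two-stage pipeline (load, then multiply-accumulate). On a real C6x this kind of scheduling is normally done at the instruction level by the tools, and the function name `dot_pipelined` is an assumption made for this example.

```c
#include <stddef.h>

/* Hypothetical dot product with a hand-written two-stage software pipeline:
 * stage 1 loads a[i] and b[i]; stage 2 multiplies and accumulates the
 * values that were loaded one iteration earlier. */
long dot_pipelined(const short *a, const short *b, size_t n)
{
    long sum = 0;
    if (n == 0) {
        return 0;
    }

    /* Prolog: start iteration 0 (loads only). */
    short la = a[0];
    short lb = b[0];

    /* Kernel: finish iteration i-1 while starting iteration i. */
    for (size_t i = 1; i < n; i++) {
        long prod = (long)la * lb;  /* stage 2 of iteration i-1 */
        la = a[i];                  /* stage 1 of iteration i */
        lb = b[i];
        sum += prod;
    }

    /* Epilog: drain the last iteration still in flight. */
    sum += (long)la * lb;
    return sum;
}
```

The prolog and epilog run once per call, so if this function were the inner loop of a nest, their cost would be paid again on every outer loop iteration.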
Figure 1 shows a simple dot product example (with non-C6x-like single-cycle loads and
multiplies), where inner loop setup is 2 cycles, the prolog is 2 cycles, the epilog is 2
cycles, and the time to execute outer loop instructions is 2 cycles. At the end of cycle 9
there is a branch back to the beginning of the loop setup (Br 1). Thus, 8 cycles of
overhead will be incurred each time this inner loop is executed in an outer loop. As DSPs
move to deeper and deeper pipelines for higher clock speeds, this overhead will increase.
The higher the number of cycles for setup, prolog, epilog, and outer loop instructions,
and the lower the inner loop count, the more overall nested loop performance is reduced.
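The arithmetic in this paragraph can be put into a small cycle model. The sketch below is not from the PDF: the type `loop_overhead`, the function names, and the `kernel_cycles` parameter (cycles spent in the inner loop kernel per outer iteration) are all assumptions introduced for illustration. Only the four 2-cycle overhead figures come from the text.

```c
/* Hypothetical per-outer-iteration overhead, in cycles. */
typedef struct {
    unsigned setup;   /* inner loop setup */
    unsigned prolog;  /* pipe up */
    unsigned epilog;  /* pipe down */
    unsigned outer;   /* outer loop instructions */
} loop_overhead;

/* Overhead cycles paid on every outer loop iteration. */
unsigned overhead_per_outer(const loop_overhead *o)
{
    return o->setup + o->prolog + o->epilog + o->outer;
}

/* Total cycles for the nest, given the assumed kernel cycles per outer iteration. */
unsigned long total_cycles(const loop_overhead *o, unsigned kernel_cycles,
                           unsigned long n_outer)
{
    return n_outer * ((unsigned long)overhead_per_outer(o) + kernel_cycles);
}

/* Fraction of all cycles spent on overhead rather than in the kernel. */
double overhead_fraction(const loop_overhead *o, unsigned kernel_cycles)
{
    unsigned long per_outer = (unsigned long)overhead_per_outer(o) + kernel_cycles;
    if (per_outer == 0) {
        return 0.0;
    }
    return (double)overhead_per_outer(o) / (double)per_outer;
}
```

With the figures from the text (2 cycles each for setup, prolog, epilog, and outer loop instructions), `overhead_per_outer` gives 8. If the kernel also took 8 cycles per outer iteration, half of all cycles would be overhead, and a shorter kernel would make that share larger.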


【Table of contents】:





