#1: The TMS320C54XX DSP uses a modified Harvard architecture with separate program and data buses for high parallelism. It has multiple buses and on-chip memory for efficient data and program flow.
#2: The CPU has a MAC unit, accumulators, ALU, and other components for powerful DSP computing. On-chip peripherals and memory-mapped registers provide efficient I/O.
#3: The architecture utilizes eight buses, on-chip memory blocks, and specialized units like the CSSU to optimize performance of DSP algorithms like Viterbi processing.