Instruction scheduling for clustered VLIW architectures
Visualitza/Obre
Cita com:
hdl:2117/100204
Tipus de documentText en actes de congrés
Data publicació2000
EditorInstitute of Electrical and Electronics Engineers (IEEE)
Condicions d'accésAccés obert
Tots els drets reservats. Aquesta obra està protegida pels drets de propietat intel·lectual i
industrial corresponents. Sense perjudici de les exempcions legals existents, queda prohibida la seva
reproducció, distribució, comunicació pública o transformació sense l'autorització del titular dels drets
Abstract
Clustered VLIW organizations are nowadays a common trend in the design of embedded/DSP processors. In this work we propose a novel modulo scheduling approach for such architectures. The proposed technique performs the cluster assignment and the instruction scheduling in a single pass, which is more effective than doing first the assignment and latter the scheduling. We also show that loop unrolling significantly enhances the performance of the proposed scheduler, especially when the communication channel among clusters is the main performance bottleneck. By selectively unrolling some loops, we can obtain the best performance with the minimum increase in code size. Performance evaluation for the SPECfp95 shows that the clustered architecture achieves about the same IPC (Instructions Per Cycle) as a unified architecture with the same resources. Moreover, when the cycle time is taken into account, a 4-cluster configuration is 3.6 times faster than the unified architecture.
CitacióSánchez, F., González, A. Instruction scheduling for clustered VLIW architectures. A: International Symposium on System Synthesis. "Proceedings: The 13th International Symposium on System Synthesis". Madrid: Institute of Electrical and Electronics Engineers (IEEE), 2000, p. 41-46.
ISBN0-7695-0765-4
Versió de l'editorhttp://ieeexplore.ieee.org/document/874027/
Fitxers | Descripció | Mida | Format | Visualitza |
---|---|---|---|---|
00874027.pdf | 654,3Kb | Visualitza/Obre |