Direct instruction wakeup for out-of-order processors
Document typeConference report
PublisherInstitute of Electrical and Electronics Engineers (IEEE)
Rights accessOpen Access
Instruction queues consume a significant amount of power in high-performance processors, primarily due to instruction wakeup logic access to the queue structures. The wakeup logic delay is also a critical timing parameter. This paper proposes a new queue organization using a small number of successor pointers plus a small number of dynamically allocated full successor bit vectors for cases with a larger number of successors. The details of the new organization are described and it is shown to achieve the performance of CAM-based or full dependency matrix organizations using just one pointer per instruction plus eight full bit vectors. Only two full bit vectors are needed when two successor pointers are stored per instruction. Finally, a design and pre-layout of all critical structures in 70 nm technology was performed for the proposed organization as well as for a CAM-based baseline. The new design is shown to use 1/2 to 1/5th of the baseline instruction queue power, depending on queue size. It is also shown to use significantly less power than the full dependency matrix based design.
CitationRamírez, M., Cristal, A., Veidenbaum, A., Villa, L., Valero, M. Direct instruction wakeup for out-of-order processors. A: International Workshop on Innovative Architecture for Future Generation High-Performance Processors and Systems. "Innovative Architecture for Future Generation High-Performance Processors and Systems: IWIA 2004: 12-14 January 2004, Maui, Hawaii: proceedings". Maui, Hawaii: Institute of Electrical and Electronics Engineers (IEEE), 2004, p. 2-9.