Event Details

Extended Architectural Enhancements for Minimizing Message Delivery Latency on Cache-Less Architectures (eg. Cell BE)

Presenter: Anthony Kroeker
Supervisor: Dr. Nikitas Dimopoulos

Date: Mon, December 19, 2011
Time: 13:30:00 - 00:00:00
Place: ECS 660

ABSTRACT

We propose to reduce the latency of MPI receive operations on cache-less architectures by removing the delay of copying messages when they are first received. This is achieved by copying the messages directly into buffers in the lowest level of the memory hierarchy (e.g. scratchpad memory). The previously proposed solution introduced an Indirection Cache that maps between the receive variables and the buffered message payload locations. This proved somewhat beneficial, but the lookup penalty of the Indirection Cache limited its effectiveness. We therefore propose placing a most-recently-used buffer (i.e. an Indirection Buffer) in front of the Indirection Cache to eliminate this penalty and speed up access. The tests conducted demonstrated that this method was effective, improving over the original method by at least an order of magnitude. Finally, an examination of implementation feasibility showed that the design could be implemented with a small cache and that, even with access times 6x slower than initially assumed, the approach with the Indirection Buffer would still be effective.
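
The lookup path described in the abstract can be illustrated with a small software model. This is only a sketch under stated assumptions, not the hardware design from the thesis: the class name IndirectionModel, the cycle costs, the buffer size, and the least-recently-used eviction policy are all hypothetical choices made for illustration. It shows a small buffer of recent translations being consulted before a slower table that maps receive-variable addresses to buffered payload addresses.

```python
from collections import OrderedDict


class IndirectionModel:
    """Toy model: a small buffer of recent translations in front of a slower table.

    All names and cycle costs here are illustrative assumptions.
    """

    def __init__(self, buffer_entries=2, cache_cost=10, buffer_cost=1):
        self.cache = {}              # receive-variable address -> payload address
        self.buffer = OrderedDict()  # recently used translations, oldest first
        self.buffer_entries = buffer_entries
        self.cache_cost = cache_cost    # assumed cost of a slow-table lookup
        self.buffer_cost = buffer_cost  # assumed cost of a buffer hit
        self.cycles = 0                 # accumulated lookup cost

    def register_message(self, var_addr, payload_addr):
        """Record where a received payload was buffered for a receive variable."""
        self.cache[var_addr] = payload_addr
        self.buffer.pop(var_addr, None)  # discard any stale translation

    def translate(self, var_addr):
        """Return the payload address for var_addr, charging the lookup cost."""
        if var_addr in self.buffer:
            self.buffer.move_to_end(var_addr)  # mark as most recently used
            self.cycles += self.buffer_cost
            return self.buffer[var_addr]
        # Buffer miss: fall back to the slower table.
        self.cycles += self.cache_cost
        payload_addr = self.cache.get(var_addr, var_addr)  # unmapped: no indirection
        self.buffer[var_addr] = payload_addr
        if len(self.buffer) > self.buffer_entries:
            self.buffer.popitem(last=False)  # evict the least recently used entry
        return payload_addr


model = IndirectionModel()
model.register_message(0x1000, 0x8000)
model.translate(0x1000)  # first access misses the buffer and pays the slow cost
model.translate(0x1000)  # repeated access is served from the buffer
print(model.cycles)
```

In this model, repeated accesses to the same receive variable pay the slow lookup only once, which is the effect the Indirection Buffer is meant to achieve; how large the gain is in practice depends on the real access latencies and access patterns, which this sketch does not attempt to reproduce.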