Stream-based Memory Access Specialization for General Purpose Processors

UC Case No. 2019-878

 

SUMMARY:

Researchers led by Zhengrong Wang and Tony Nowatzki from the Computer Science Department at UCLA have created a way to improve computer processing power, speed, and efficiency by optimizing how processors access memory.

 

BACKGROUND:

Computing processing power and speed have benefited from Moore’s Law, where transistors have become smaller so that more of them can fit on a computer chip. However, we have reached a point of diminishing returns where Moore’s Law is no longer practical and transistors can’t get smaller. Therefore, another approach must be employed to make computer processing more powerful and efficient. The instruction set architecture (ISA) is a set of basic instructions of how to access memory that a processors understands. Current ISA’s have instructions for single load/stores in memory and slow down processing performance by constantly having to fetch information from memory to process it with short inefficient requests.  In addition, these are often difficult to prefetch (execute earlier than requested) due to control flow and indirect access. Improvements in ISA’s hold the potential of improving processing power, speed, and efficiency in the wake of diminishing hardware improvements.

 

INNOVATION:

Researchers led by Zhengrong Wang and Tony Nowatzki from the Computer Science Department at UCLA have created a way to improve computer processing power, speed, and efficiency by optimizing how processors access memory. Typically processors are slowed down by having to constantly fetch information from memory, and it is also well-known that memory can take up to 50% of the power on the chip. This group’s invention attacks this by 1) creating a specialized ISA interface for “streams” of memory access that can vectorize (lengthen) memory access even when programs have control flow, 2) decoupling this “stream” access from processing to remove the memory access delay, and 3) create an adaptive and efficient “stream” aware cache to enable smarter cache policies. In a set of test studies, their design led to a 1.53x increase in speed and a 1.31x energy efficiency improvement across industry standard general-purpose processing workloads.

 

POTENTIAL APPLICATIONS:

  • Computer processors
  • Graphic processors

 

ADVANTAGES:

  • Easily implemented
  • Scalable
  • Increased processing speed
  • Increased processing power

 

RELATED MATERIALS:

Patent Information: