UMass Amherst

Design Space


Home
Up

Project Overview

For cost and performance reasons, network processors are implemented as single-chip multiprocessors with on-chip memory and I/O interfaces. An important question is how to configure such a system-on-a-chip. More parallel processors provide more "processing cycles" at the cost of less on-chip memory. This leads to more cache misses, memory stalls, and finally lower processor utilization.

We have developed analytic performance models of network processors that consider a range of system and workload parameters to determine a variety of performance and cost metrics. Our initial model considers the following system parameters:

  • Number and clock speed of processors.
  • Number of hardware threads per processor
  • Size of instruction and data caches.
  • Bandwidth and number of memory channels
  • Bandwidth of I/O channel.
  • Off-chip memory speed

The performance criteria for such a system are:

  • Processing power of the overall system
  • Power consumption
  • Chip area required for an implementation

The results of this work are twofold. For one, optimal configurations for a particular workload and technology parameters can be derived. The second, and probably more interesting, result is the quantitative performance impact when considering different configurations (e.g., how does power consumption change if the number of threads per processor is increased significantly?).

In a recent extension, we have extended the design space exploration to also consider the system topology. This allows the consideration of pipeline width and depth as well as the interconnect between system components and memory interfaces.

Publications

For a complete list of NSL publications, see the publications page.


© 1998-2006 Tilman Wolf. Site Policies.