Basic Course Information:
II. Historical Architectures
III. Instruction Level Parallelism
IV. Cache/Memory Hierarchy
VI. Other Interesting Architectures
Class | Readings (Very approximate, at this point!) |
March 29 | Introduction |
March 31 | H&P 6.1, 6.3, 6.4, 6.5 |
April 5 | H&P 6.6-6.8 |
April 7 |
Finish memory consistency, plus: |
April 12 | Memory Coherence in Shared Virtual Memory Systems
An Evaluation of Memory Consistency Models for Shared-Memory Systems with ILP Processors |
April 14 | Architecture and Applications of the HEP multiprocessor computing system,
The Tera Computer System, Exploiting Choice: Instruction Fetch and Issue on an Implementable Simultaneous Multithreading Processor |
April 19 |
(1) The Case for a Single-Chip Multiprocessor (2) Exploiting ILP, TLP, and DLP with the Polymorphous TRIPS Architecture (3) Single-ISA Heterogeneous Multi-Core Architectures for Multithreaded Workload Performance background, not required: Single-ISA Heterogeneous Multi-Core Architectures: The Potential for Processor Power Reduction |
April 21 | (1)Architecture of the IBM System/360
(2) Parallel Operation in the Control Data 6600 (3) The CRAY-1 Computer System (4) CRAY-1 Computer Technology also read pages 1-16 of the text (pages 1-55 in total) |
April 26 |
(1) Cramming More Components onto Integrated Circuits,
|
April 28 |
(1) Design Tradeoffs for the Alpha EV8 Conditional Branch Predictor |
May 3 | (1) A Comparison of Full and Partial Predicated Execution Support for
ILP Processors
(2) Instruction Issue Logic for High-Performance, Interruptable Pipelined Processors (3) Highly accurate data value prediction using hybrid predictors |
May 5 |
(2) A Scalable Approach to Thread-Level Speculation (3) A Quantitative Assessment of Thread-Level Speculation Techniques |
May 10 | midterm |
May 12 |
(1) Execution-based
Prediction Using Speculative Slices
|
May 17 | (1) Mitosis
Compiler: An Infrastructure for Speculative Threading Based on Pre-Computation
Slices
(2) Threaded Multi-path Execution |
May 19 | (1) The
Alpha 21264 Microprocessor
(2) The Microarchitecture of the Pentium 4 Processor |
May 24 | (1) Improving Direct-Mapped Cache ... (Jouppi)
(2) Organization and Performance of a Two-Level ... (Wang, Baer, Levy) |
May 26 | (1) Trace
Cache: A Low Latency Approach to High Bandwidth Instruction Fetching
(2) Putting the fill unit to work: dynamic optimizations for trace cache microprocessors |
May 31 | (1) Opportunistic Transient-Fault Detection
(2) Design and Evaluation of Hybrid Fault-Detection Systems< |
June 2 | Final |
June 9 | Project Presentations 1-4 p.m., APM 4218 |