openRow, closedRow FCFS/FR-FCFS memory scheduling algorithm Prefetching, (software and hardware prefetching) Prefetching with loop unrolling Prefetch distance 3 prefetch metrics (accuracy, coverage, timeliness) Hardware prefetcher (stream/stride) prefetcher Interrupts/exceptions Speculative executions Precise exception Predicated execution Memory disambiguation problem Load store queue Static instruction scheduling Trace scheduling Hyperblock Superbock Loop unrolling Software pipelining Function inlining VLIW Flynn's Taxonomy (SISD/SIMD/MIMD/MISD) Shared memory system/distributed memory system Message passing machines CMP SMT SMT fetch/rename MSI/MESI coherence Directory based cache coherence system Lock/barrier Implementing lock Memory consistency (understanding the problem itself) GPU execution model (pipeline), scorbording, performance calculation Flit, Direct/indirect connections Crossbar, mesh, butterfly, torus, hybercube, Ring (know the topology) Blocking/nonblocking Store/forward control flow Virtual cut through control flow Power, energy Static power/dynamic power Power reduction technologies DVFS/DVS DTM EDP, EDDP Power gating/clock gating/pipeline gating DMA Memory mapped I/O MTTF, MTTR, Faults (different faults..) RAID (basic concepts) ECC