• When issue_width is 3 and there are two threads are running, how many instructions can we fetch in one cycle? Is it 3 or 3*2?
  • Can we set MAX_THREAD as some numbers?
  • Do we have to worry about memory disambiguation problem in this assignment?
  • What should I use to find out memory instructions? is checking mem_type sufficient or do I have to check opcode?
  • What will be the cache miss penalty? Is this KNOB_DCACHE_HIT_LATENCY+KNOB_MEM_LATENCY or just KNOB_MEM_LATENCY?
  • Do we need to implement load-store forwarding?
  • Do we need to differentiate load and store?