• Do we need to implement load-store forwarding?
  • Do we need to differentiate between loads and stores?
  • Do memory instructions wait op_latency[OP_LD] or op_latency[OP_ST] cycles in the execution stage?
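
To make the last question concrete, here is a minimal sketch of what "waiting op_latency cycles in the execution stage" could look like if the answer is yes. It does not settle any of the questions above. Only `op_latency`, `OP_LD`, and `OP_ST` come from the text; the latency values, `NUM_OPS`, `exec_slot`, `exec_begin`, and `exec_tick` are hypothetical names and placeholders chosen for illustration.

```c
#include <assert.h>
#include <stdbool.h>

/* Opcode classes; only OP_LD and OP_ST appear in the questions above. */
enum { OP_LD, OP_ST, NUM_OPS };

/* Placeholder latencies in cycles (assumed values, not from the source). */
static const int op_latency[NUM_OPS] = { [OP_LD] = 2, [OP_ST] = 1 };

/* Hypothetical record for one instruction in the execution stage. */
typedef struct {
    int op;           /* OP_LD or OP_ST */
    int cycles_left;  /* cycles still to spend in the execution stage */
} exec_slot;

/* Called when an instruction enters the execution stage. */
static void exec_begin(exec_slot *s, int op) {
    assert(op >= 0 && op < NUM_OPS);
    s->op = op;
    s->cycles_left = op_latency[op];
}

/* Called once per simulated cycle; returns true on the cycle
   in which the instruction finishes executing. */
static bool exec_tick(exec_slot *s) {
    assert(s->cycles_left > 0);
    s->cycles_left--;
    return s->cycles_left == 0;
}
```

With these placeholder values, a load that calls `exec_begin` and then `exec_tick` once per cycle completes on its second tick, and a store on its first. Whether loads and stores need separate latencies at all is exactly what the second question asks.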