Skip to content

PC Logic

This section details the implementation of the Program Counter (PC) logic, covering PC_Logic.v, ProgramCounter.v, and the relevant multiplexing logic for the PC adder within RV.v.

Branch Resolution

In Mach-V Version 2, branch and jump instructions are committed in the Mem stage. Moving this logic from Execute to Memory improved timing performance, allowing the design to achieve a clock frequency of 115 MHz.

The microarchitecture consists of two main components:

  1. PC Logic Unit: Determines whether a branch is taken based on ALU flags and branch conditions.
  2. PC Adder: Computes the next PC value based on the current PC, branch targets, and jump addresses.

PC Logic Unit

The PC Logic Unit decides how the next PC value is formed. It uses instruction type and ALU comparison results to generate a 2-bit control signal PCSrc[1:0], which selects the PC update behavior and thus PCSrc[1:0] can indicate whether a branch or jump is taken.

Note

As PCSrc[1:0] controls the next PC value and it is generated in the Mem stage, we say that our branch/jump instructions are "committed" in the Mem stage.

  1. PCS (Input)
    • Encodes the instruction category (sequential, branch, JAL, JALR).
    • Combined with ALU flags to decide whether a control transfer is taken.
  2. ALUFlags[2:0] = {eq, lt, ltu} (Input)
    • Result of comparisons performed by the ALU.
    • Used mainly for conditional branches (e.g., BEQ, BLT, BLTU).
  3. PCSrc[1:0] (Output)
    • Controls how the next PC is computed.

The logic of PC Logic can be summarized in the following table:

PCS Instruction Funct3 PCSrc[1] PCSrc[0]
00 Non control x 0 0
01 beq 000 0 ALUFlags[2]
01 bne 001 0 ALUFlags[2]'
01 blt 100 0 ALUFlags[1]
01 bge 101 0 ALUFlags[1]'
01 bltu 110 0 ALUFlags[0]
01 bgeu 111 0 ALUFlags[0]'
10 jal x 0 1
11 jalr x 1 1

PC Adder

The PC Adder computes the next address (including the branch target or jump address). Because the commitment happens in the Mem stage, the inputs must be sourced correctly to avoid hazards:

  • PC_Base: Selects between the current PC (PCF, PCM) or a register value (RD1M). Note that RD1M is derived from RD1E_Forwarded and latched into the Mem stage pipeline register.
  • PC_Offset: Selects between sequential increment (+4) or the branch offset (+ExtImmM).
PCSrc Base (PC_Base) Offset (PC_Offset) Meaning
00 PCF +4 Sequential execution
01 PCM +ExtImm Taken branch or JAL
10 RD1M +4 JALR (no immediate offset)
11 RD1M +ExtImm JALR with immediate

Warning

Simply delaying the control signals is insufficient for moving the PC logic from Exe stage to Mem stage. The Hazard Unit must also be updated to handle the new branch resolution timing correctly.

1-bit Branch Predictor

To implement a 1-bit branch predictor, the design uses two structures:

The architecture of the 1-bit branch predictor is shown as follows:

1-bit Branch Predictor Block Diagram

1-bit Branch Predictor Block Diagram

The main change is in the PC_In signal selection, I have separated it into two stages:

  1. Speculative Execution (Fetch Stage): The speculative next PC is chosen based on the BHT prediction.
    • If PrPCSrcF= 1 (predicted taken), the speculative next PC is PrBTAF.
    • Else, the speculative next PC is PCPlus4F.
  2. Resolution (Memory Stage): The actual branch outcome is evaluated against the speculative prediction to catch and correct errors.
    • If the BranchMispredictM == 1, the next PC is corrected to PC_ResolvedM, which is the correct next PC based on the actual branch outcome. This is becauase a misprediction occurred due to an error in either:
      • Branch decision (MispredPCSrcM): The CPU guessed the wrong action. This ensures that the instruction was actually a branch/jump (not just a normal instruction like ADD) and that the Taken/Not Taken prediction was correct.
      • Branch target (MispredBTAM): The CPU guessed the wrong destination. This checks if the predicted address matches the actual resolved target address.
    • Otherwise, we use the speculative next PC value from step 1.

Another important thing is how the BranchMispredictM signal affects the Hazard.v unit. So, what is done here is that I use BranchMispredictM to replace the PCSrcM[0] in the non branch predictor unit version.

Why the BHT does not care about the instruction type?

The BHT's only job is to provide a fast, speculative prediction based on the instruction address during the Fetch stage, without needing to know the actual instruction type. If a non-branch instruction is incorrectly predicted as a branch, the Memory stage catches this by comparing the authoritative, decoded PCSrc[0] against the prediction (PrPCSrcM). This mismatch automatically triggers a misprediction flush (BranchMispredictM = 1), seamlessly correcting the PC.

Branch History Table (BHT)

The Branch History Table stores 1-bit PCSrc[0] for any type of instruction.

BHT Block Diagram

Branch History Table (1-bit) Block Diagram

Functionality

  • In the Fetch stage, PCF indexes the BHT to produce PrPCSrcF, which predicts whether the current instruction will take a branch.
  • In the Memory stage, if a branch instruction is resolved and found to be mispredicted, the corresponding BHT entry is updated using PCM.

Inputs

  • PCF: Index used to read the BHT and generate a prediction in the Fetch stage.
  • PCM: Index used to update the BHT when the actual branch outcome is known.
  • PCSrcM: The actual branch outcome (taken or not taken) determined in the Memory stage.
  • WE_PrPCSrc: Write enable signal, asserted only when a branch is mispredicted and the BHT entry must be corrected.

Output

  • PrPCSrcF: Predicted PCSrc value for the instruction in the Fetch stage.

Note

PCSrc[1:0] indicates whether an instruction is a branch/jump instuction and it so, it also indicates whether the branch is taken (1) or not taken (0).

Branch Target Buffer (BTB)

The Branch Target Buffer stores the target address of previously taken branches.

BTB Block Diagram

Branch Target Buffer (1-bit) Block Diagram

Functionality

  • In the Fetch stage, PCF indexes the BTB to obtain PrBTAF, the predicted branch target address.
  • In the Memory stage, if a branch is mispredicted or newly encountered, the BTB entry is updated with the correct target address using PCM.

Inputs

  • PCF: Index used to read the BTB in the Fetch stage.
  • PCM: Index used to update the BTB entry in the Memory stage.
  • BTAM: The actual branch target address computed when the branch is resolved.
  • WE_PrBTA: Write enable signal, asserted only when the BTB needs to be updated.

Output

  • PrBTAF: Predicted branch target address for the instruction in the Fetch stage.