Mgsc2 align by jensen-yan · Pull Request #686 · OpenXiangShan/GEM5

jensen-yan · 2025-12-26T08:54:23Z

for sc test

Summary by CodeRabbit

New Features
- Added eight configurable parameters to control MGSC predictor table behavior and thresholds for fine-grained experimentation.
- Introduced comprehensive statistics collection for improved performance analysis and debugging of branch prediction outcomes.
- Added tracing support for MGSC prediction data to enable detailed performance observation.
Documentation
- Added detailed documentation for MGSC predictor integration, including design points, operational flow, statistics interpretation, and sensitivity testing guidance.

_{✏️ Tip: You can customize this high-level summary in your review settings.}

- Updated calculatePercsum and calculateScaledPercsum documentation for clarity. - Enhanced MgscStats to include new statistics for weight scale sensitivity and prediction correctness.

Change-Id: I5992adb066d5c5c75bd16141564ac55264aa7e0a

Change-Id: Ieaef1b35cd968cb97679b982eab6db54b0b74b4c

- Enhanced the scaling formula in the calculateScaledPercsum function to implement a 4-level granularity for weight adjustments. - The new scaling logic provides more nuanced control over the importance of percsum based on weight ranges. Change-Id: I45dda73bec5d79d3521fec3ce7244c8e3c52dcab

Change-Id: Ib26755301387cbf1ce4b12bb26e988d43a49eea8

coderabbitai · 2025-12-26T08:54:39Z

📝 Walkthrough

Walkthrough

The PR adds comprehensive diagnostic infrastructure for the MGSC predictor including detailed technical documentation, new statistics counters for tracking per-table performance and threshold dynamics, test control switches for selectively enabling/disabling SC tables, and a trace structure for recording MGSC/TAGE prediction data.

Changes

Cohort / File(s)	Summary
Documentation `docs/Gem5_Docs/frontend/mgsc_notes.md`	New technical reference covering MGSC design points, data structures, feature encoding (G, P, L, BW, I/IMLI, Bias), adaptive update rules, statistics interpretation guide, and sensitivity test descriptions with usage guidance.
Header & Interface Updates `src/cpu/pred/btb/btb_mgsc.hh`	MgscPrediction augmented with tage_conf_high/mid/low fields; new recordPredictionStats method declared; test switches introduced (forceUseSC, enableBwTable, enableLTable, enableITable, enableGTable, enablePTable, enableBiasTable, enablePCThreshold); MgscStats expanded with 30\+ new statistics fields tracking per-table weight scale differences, percsum correctness, threshold updates, and SC usage patterns across TAGE confidence levels.
Implementation & Instrumentation `src/cpu/pred/btb/btb_mgsc.cc`	generateSinglePrediction updated to pass tage_conf fields and populate per-table percsums conditionally via enable flags; new recordPredictionStats method centralizes statistics collection; updateSinglePredictor delegates to recordPredictionStats and conditionally updates thresholds based on enablePCThreshold; statistics counters incremented throughout prediction and update paths.
Configuration Parameters `src/cpu/pred/BranchPredictor.py`	Eight new boolean parameters added to BTBMGSC: forceUseSC (default False), enableBwTable/enableLTable/enableITable/enableGTable/enablePTable/enableBiasTable (all default True), enablePCThreshold (default False).
Trace Infrastructure `src/cpu/pred/btb/stream_struct.hh`	New MgscTrace struct extending Record with set() method recording MGSC/TAGE prediction metadata (branch PC, TAGE prediction/confidence levels, per-table percsums, total sum/threshold, SC decision, actual outcome) for trace-based analysis.

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~22 minutes

Possibly related PRs

Add counter for components align #639: Modifies BTBMGSC prediction code and statistics tracking in commit/commitBranch logic, aligned with this PR's diagnostic expansion.

Suggested labels

perf, align-kmhv3

Suggested reviewers

Yakkhini
CJ362ff

Poem

🐰 Hop, skip, and a leap through traces we keep!
Each table weighs in with its metrics deep,
Statistics bloom like carrots in spring,
MGSC's secrets, now observing everything! 🥕✨

🚥 Pre-merge checks | ✅ 1 | ❌ 2

❌ Failed checks (1 warning, 1 inconclusive)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 52.94% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.
Title check	❓ Inconclusive	The title 'Mgsc2 align' is vague and does not clearly convey what changes are being made; it lacks specificity about the actual modifications beyond internal alignment.	Consider a more descriptive title such as 'Add MGSC predictor statistics instrumentation and test controls' or 'Instrument MGSC predictor with weight scaling and threshold tracking' to better reflect the substantial changes.

✅ Passed checks (1 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing touches

📝 Generate docstrings

📜 Recent review details

Configuration used: defaults

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 0edaa15 and 10d3c02.

📒 Files selected for processing (1)

src/cpu/pred/BranchPredictor.py

🧰 Additional context used

🪛 Ruff (0.14.10)

src/cpu/pred/BranchPredictor.py

1150-1150: Param may be undefined, or defined from star imports

(F405)

1151-1151: Param may be undefined, or defined from star imports

(F405)

1152-1152: Param may be undefined, or defined from star imports

(F405)

1153-1153: Param may be undefined, or defined from star imports

(F405)

1154-1154: Param may be undefined, or defined from star imports

(F405)

1155-1155: Param may be undefined, or defined from star imports

(F405)

1156-1156: Param may be undefined, or defined from star imports

(F405)

1157-1157: Param may be undefined, or defined from star imports

(F405)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (2)

GitHub Check: Quick Build, Unit Tests & Smoke Test
GitHub Check: perf_test / XS-GEM5 - Run performance test (spec06-0.3c)

🔇 Additional comments (1)

src/cpu/pred/BranchPredictor.py (1)

1149-1158: LGTM! New test switches for SC table components.

The new parameters provide useful granular control for testing and ablation studies on the MGSC predictor. The default values are sensible: individual tables enabled by default (True), while forceUseSC and enablePCThreshold are False to maintain conservative/experimental behavior.

Regarding the Ruff F405 warnings about Param being undefined or from star imports: these are false positives. The from m5.params import * at line 29 is the standard GEM5 configuration pattern used consistently throughout this file (100+ occurrences of Param.*). No changes needed.

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 1

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

src/cpu/pred/btb/btb_mgsc.hh (1)
505-505: Fix the incorrect header guard comment.

The comment references TAGE but this is the MGSC header file. This appears to be a copy-paste artifact.
🔎 Proposed fix
-#endif  // __CPU_PRED_BTB_TAGE_HH__
+#endif  // __CPU_PRED_BTB_MGSC_HH__

🧹 Nitpick comments (2)

src/cpu/pred/btb/btb_mgsc.cc (1)
1170-1171: Minor formatting inconsistency.

Line 1170 has a space after the comma in the description string, but line 1171 does not. This is a nitpick but maintaining consistency improves readability.
🔎 Proposed fix
-      ADD_STAT(scPredCorrectTageWrong, statistics::units::Count::get(),"number of sc prediction correct and tage wrong"),
-      ADD_STAT(scPredWrongTageCorrect, statistics::units::Count::get(),"number of sc prediction wrong and tage correct"),
+      ADD_STAT(scPredCorrectTageWrong, statistics::units::Count::get(), "number of sc prediction correct and tage wrong"),
+      ADD_STAT(scPredWrongTageCorrect, statistics::units::Count::get(), "number of sc prediction wrong and tage correct"),
src/cpu/pred/btb/btb_mgsc.hh (1)
97-132: Consider refactoring the large constructor signature.

The MgscPrediction constructor now takes 25+ parameters, which is unwieldy and error-prone. Consider grouping related fields into nested structs (e.g., PercsumData, WeightScaleData, IndexData) or using a builder pattern in a future refactor.

Example structure:
struct PercsumData {
    int bw, l, i, g, p, bias;
};
struct WeightScaleData {
    bool bw, l, i, g, p, bias;
};
// Then: MgscPrediction(..., PercsumData percsums, WeightScaleData weightDiffs);

📜 Review details

Configuration used: defaults

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 4ec9fc8 and 48d6304.

📒 Files selected for processing (4)

configs/example/kmhv3.py
docs/Gem5_Docs/frontend/mgsc_notes.md
src/cpu/pred/btb/btb_mgsc.cc
src/cpu/pred/btb/btb_mgsc.hh

🧰 Additional context used

🧬 Code graph analysis (1)

src/cpu/pred/btb/btb_mgsc.hh (2)

src/cpu/pred/btb/btb_tage.hh (2)

taken (282-282)

pred (383-383)

src/cpu/pred/btb/btb_ittage.hh (1)

taken (179-179)

🪛 LanguageTool

docs/Gem5_Docs/frontend/mgsc_notes.md

[uncategorized] ~14-~14: 您的意思是“少"于"”吗？
Context: ...好。这些表使用折叠历史压缩到较短索引位宽，降低端口和存储压力，同时减少与 TAGE 的直接结构耦合。 - 决策与门控：各表计数器 percsum 做线性...

(YU7_YU8)

[uncategorized] ~46-~46: 动词的修饰一般为‘形容词（副词）+地+动词’。您的意思是否是：丰富"地"可
Context: ...E 置信度（高置信错时更快抬阈，低置信错时慢抬），减少误覆盖。 4) 更丰富的可观测性：按表/置信度输出热分支热点，或导出 per-PC 纠错热度。 ## ...

(wb4)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (2)

GitHub Check: Quick Build, Unit Tests & Smoke Test
GitHub Check: perf_test / XS-GEM5 - Run performance test (spec06-0.3c)

🔇 Additional comments (9)

configs/example/kmhv3.py (1)

108-108: LGTM!

Enabling the MGSC predictor aligns with the PR objective for SC testing. The placement within the DecoupledBPUWithBTB configuration block is appropriate.

docs/Gem5_Docs/frontend/mgsc_notes.md (1)

1-54: Good documentation for MGSC integration.

The documentation provides a comprehensive overview of the MGSC predictor design, data structures, statistics, and test cases. The interpretation tips for statistics counters will be particularly helpful for debugging and optimization.
src/cpu/pred/btb/btb_mgsc.cc (4)
191-196: Disabled weight scaling for testing - ensure this is intentional.

The weight scaling logic is completely bypassed by returning percsum directly. This effectively disables a core feature of the MGSC predictor. The comment indicates this is for testing purposes, but this should not be merged to production without re-enabling or adding a configurable flag.

Consider using a parameter to control this behavior instead of hardcoding:
🔎 Suggested approach
// Add a parameter like p.enableWeightScaling in the constructor
// Then use:
int
BTBMGSC::calculateScaledPercsum(int weight, int percsum)
{
    if (!enableWeightScaling)
        return percsum;
    // Original scaling logic here
}
543-628: Good centralization of statistics collection.

The recordPredictionStats function cleanly consolidates all MGSC-related statistics recording. The use of lambdas for recordPercsum and recordConfOutcome reduces code duplication effectively.

652-665: LGTM!

The threshold statistics correctly track when thresholds are being adjusted. The pairing of PC and global threshold stats reflects the actual update behavior at lines 698-701.

332-336: LGTM!

The MgscPrediction constructor call correctly passes the new TAGE confidence fields along with all other required parameters in the proper order.
src/cpu/pred/btb/btb_mgsc.hh (3)

42-44: LGTM!

The new TAGE confidence fields (tage_conf_high, tage_conf_mid, tage_conf_low) are appropriately added to capture the confidence level at prediction time for later use in statistics and update logic.

271-272: LGTM!

The recordPredictionStats declaration is correctly placed in the private section and matches the implementation signature.

412-449: Good statistics instrumentation.

The new statistics fields provide comprehensive observability into MGSC behavior:

Weight scale sensitivity per table type

Raw percsum correctness tracking

Threshold adjustment monitoring

SC usage breakdown by TAGE confidence level

This aligns well with the documentation in mgsc_notes.md.

Change-Id: I69ef16d0b2d973de03e8ccd381ff359bc640d5f1

coderabbitai

Actionable comments posted: 0

🧹 Nitpick comments (3)

src/cpu/pred/btb/btb_mgsc.hh (1)
97-103: Consider using const references for vector parameters.

The constructor takes std::vector parameters by value, which creates copies. For better efficiency, consider passing by const&:
-        MgscPrediction(Addr btb_pc, int total_sum, bool use_mgsc, bool taken, bool taken_before_sc,
-                       bool tage_conf_high, bool tage_conf_mid, bool tage_conf_low, int16_t total_thres,
-                       std::vector<unsigned> bwIndex, std::vector<unsigned> lIndex, std::vector<unsigned> iIndex,
-                       std::vector<unsigned> gIndex, std::vector<unsigned> pIndex, std::vector<unsigned> biasIndex,
+        MgscPrediction(Addr btb_pc, int total_sum, bool use_mgsc, bool taken, bool taken_before_sc,
+                       bool tage_conf_high, bool tage_conf_mid, bool tage_conf_low, int16_t total_thres,
+                       const std::vector<unsigned>& bwIndex, const std::vector<unsigned>& lIndex, const std::vector<unsigned>& iIndex,
+                       const std::vector<unsigned>& gIndex, const std::vector<unsigned>& pIndex, const std::vector<unsigned>& biasIndex,
src/cpu/pred/btb/btb_mgsc.cc (2)
198-203: Weight scaling is disabled for testing - consider adding a configuration flag.

The calculateScaledPercsum function currently ignores the weight parameter and just returns percsum. This disables a significant feature of the SC predictor.

If this is temporary for testing, consider:

Adding a TODO comment with ticket reference

Using a configuration flag (similar to the new table enable flags) to control this behavior
 int
 BTBMGSC::calculateScaledPercsum(int weight, int percsum)
 {
-    return percsum; // disable weight scaling for test
+    // TODO: Re-enable weight scaling after testing
+    // return (weight >= 0) ? percsum * (1 + weight) : percsum / (1 - weight);
+    return percsum;
 }
Would you like me to propose a configuration flag to toggle weight scaling on/off?

278-300: Minor optimization: Skip weight lookup when table is disabled.

When a table is disabled, its weight is still looked up but never used (since scaled_percsum will be 0). Consider:
-    int bw_percsum = enableBwTable ? calculatePercsum(bwTable, bwIndex, bwTableNum, btb_entry.pc) : 0;
-    int bw_weight = findWeight(bwWeightTable, btb_entry.pc);
-    int bw_scaled_percsum = calculateScaledPercsum(bw_weight, bw_percsum);
+    int bw_percsum = 0, bw_weight = 0, bw_scaled_percsum = 0;
+    if (enableBwTable) {
+        bw_percsum = calculatePercsum(bwTable, bwIndex, bwTableNum, btb_entry.pc);
+        bw_weight = findWeight(bwWeightTable, btb_entry.pc);
+        bw_scaled_percsum = calculateScaledPercsum(bw_weight, bw_percsum);
+    }
This would avoid unnecessary memory accesses for disabled tables.

📜 Review details

Configuration used: defaults

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 48d6304 and 8bf3d10.

📒 Files selected for processing (4)

configs/example/kmhv3.py
src/cpu/pred/BranchPredictor.py
src/cpu/pred/btb/btb_mgsc.cc
src/cpu/pred/btb/btb_mgsc.hh

🚧 Files skipped from review as they are similar to previous changes (1)

configs/example/kmhv3.py

🧰 Additional context used

🧬 Code graph analysis (2)

src/cpu/pred/btb/btb_mgsc.hh (3)

src/cpu/pred/btb/btb_tage.hh (2)

taken (282-282)

pred (383-383)

src/cpu/pred/btb/stream_struct.hh (2)

taken (640-668)

taken (704-704)

src/cpu/pred/btb/btb_ittage.hh (1)

taken (179-179)

src/cpu/pred/btb/btb_mgsc.cc (1)

src/cpu/pred/btb/btb_mgsc.hh (7)

entry (169-169)

entry (269-270)

weight (198-198)

total_sum (208-208)

MgscPrediction (66-95)

MgscPrediction (97-132)

pred (271-272)

🪛 Ruff (0.14.10)

src/cpu/pred/BranchPredictor.py

1150-1150: Param may be undefined, or defined from star imports

(F405)

1151-1151: Param may be undefined, or defined from star imports

(F405)

1152-1152: Param may be undefined, or defined from star imports

(F405)

1153-1153: Param may be undefined, or defined from star imports

(F405)

1154-1154: Param may be undefined, or defined from star imports

(F405)

1155-1155: Param may be undefined, or defined from star imports

(F405)

1156-1156: Param may be undefined, or defined from star imports

(F405)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (1)

GitHub Check: perf_test / XS-GEM5 - Run performance test (spec06-0.3c)

🔇 Additional comments (11)

src/cpu/pred/btb/btb_mgsc.hh (4)

42-44: LGTM: TAGE confidence fields added to MgscPrediction.

The new tage_conf_high, tage_conf_mid, and tage_conf_low fields are properly initialized in both constructors, enabling confidence-based SC usage decisions.

Also applies to: 72-74, 109-111

329-336: LGTM: Per-table enable flags provide good configurability.

These configuration flags allow fine-grained control over individual SC tables, which is useful for testing and debugging the contribution of each table to prediction accuracy.

421-458: LGTM: Comprehensive statistics for SC predictor analysis.

The new statistics provide excellent visibility into:

Per-table weight scaling decisions

Per-table percsum correctness

Threshold adaptation behavior

SC usage patterns under different TAGE confidence levels

This will enable detailed analysis of the predictor's behavior.

271-272: LGTM: Centralized statistics recording method.

The recordPredictionStats method centralizes statistics recording logic, improving code organization.

src/cpu/pred/btb/btb_mgsc.cc (6)

55-61: LGTM: New configuration flags properly initialized.

The new per-table enable flags and forceUseSC are correctly initialized from the Params object.

552-637: LGTM: Well-structured statistics recording.

The recordPredictionStats method is well-organized with:

Clear separation of concerns using lambdas (recordPercsum, recordConfOutcome)

Comprehensive coverage of all statistics dimensions

Efficient inline recording

Good use of lambdas to avoid code duplication.

661-674: LGTM: Statistics recording integrated into update path.

Statistics are correctly recorded:

recordPredictionStats is called for all predictions

Threshold increment/decrement stats are only recorded when tables are actually updated

1180-1215: LGTM: New statistics properly registered.

All new statistics are correctly initialized with descriptive names and appropriate units. The grouping by category (weight scaling, percsum, threshold, SC usage by confidence) is logical.

159-177: LGTM: Improved documentation for percsum calculation.

The added comments clearly explain:

Counter value range based on width

The transformation to odd numbers (2*counter + 1)

Sign semantics (positive = taken bias, negative = not-taken bias)

This improves code maintainability.

312-327: The confidence level logic is exhaustive and always assigns at least one flag.

The three confidence flags are mutually exclusive and exhaustively cover all cases:

When pred.mainInfo.found is false, tage_pred_conf_low is always true.

When found and the counter is saturated (abs value = 7), tage_pred_conf_high is true.

When found and the counter is mid-range (abs value = 3 or 5), tage_pred_conf_mid is true.

When found and the counter is near zero (abs value = 1), tage_pred_conf_low is true.

The code in btb_mgsc.cc lines 312-327 is correct and will always find at least one true condition in the if-else chain.

src/cpu/pred/BranchPredictor.py (1)

1149-1157: LGTM: New configuration parameters properly defined.

The new parameters match the C++ declarations:

forceUseSC defaults to False (SC usage based on confidence thresholds)

All table enable flags default to True (all tables enabled by default)

The Ruff warnings about Param being undefined are false positives — Param is imported via from m5.params import * on line 29, which is the standard gem5 configuration pattern.

github-actions · 2025-12-26T09:07:15Z

🚀 Coremark Smoke Test Results

Branch	IPC	Change
Base (`xs-dev`)	`2.1772`	-
This PR	`2.2000`	📈 `+0.0227` (`+1.04%`)

✅ Difftest smoke test passed!

github-actions · 2025-12-26T09:19:26Z

🚀 Coremark Smoke Test Results

Branch	IPC	Change
Base (`xs-dev`)	`2.1772`	-
This PR	`2.1916`	📈 `+0.0144` (`+0.66%`)

✅ Difftest smoke test passed!

XiangShanRobot · 2025-12-26T10:32:58Z

[Generated by GEM5 Performance Robot]
commit: 8bf3d10
workflow: gem5 Align BTB Performance Test(0.3c)

Align BTB Performance

Overall Score

	PR	Master	Diff(%)
Score	17.82	17.53	+1.69 🟢

[Generated by GEM5 Performance Robot]
commit: 8bf3d10
workflow: gem5 Align BTB Performance Test(0.3c)

Align BTB Performance

Overall Score

	PR	Previous Commit	Diff(%)
Score	17.82	17.88	-0.30 🔴

Change-Id: Ia279ea56c55b79261163bee4400befe6014e8c12

XiangShanRobot · 2025-12-26T14:13:06Z

[Generated by GEM5 Performance Robot]
commit: 2cfb00c
workflow: gem5 Align BTB Performance Test(0.3c)

Align BTB Performance

Overall Score

	PR	Master	Diff(%)
Score	4.78	17.53	-72.73 🔴

[Generated by GEM5 Performance Robot]
commit: 2cfb00c
workflow: gem5 Align BTB Performance Test(0.3c)

Align BTB Performance

Overall Score

	PR	Previous Commit	Diff(%)
Score	4.78	17.82	-73.18 🔴

This reverts commit 2cfb00c.

Change-Id: I6ecce2060d5a9dc85312ff87188ee8804a694f2a

coderabbitai

Actionable comments posted: 1

🧹 Nitpick comments (1)

src/cpu/pred/btb/btb_mgsc.hh (1)
97-103: Constructor signature is getting large.

The MgscPrediction constructor now accepts 21 parameters, which makes it difficult to maintain and error-prone to call. Consider using a builder pattern or designated initializers (C++20) for better readability and maintainability.
Example builder pattern approach
struct MgscPredictionBuilder {
    MgscPrediction pred;
    
    MgscPredictionBuilder& setBtbPc(Addr pc) { pred.btb_pc = pc; return *this; }
    MgscPredictionBuilder& setTotalSum(int sum) { pred.total_sum = sum; return *this; }
    // ... other setters ...
    
    MgscPrediction build() { return pred; }
};

📜 Review details

Configuration used: defaults

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 2cfb00c and 8e8378d.

📒 Files selected for processing (4)

configs/example/kmhv3.py
src/cpu/pred/BranchPredictor.py
src/cpu/pred/btb/btb_mgsc.cc
src/cpu/pred/btb/btb_mgsc.hh

🧰 Additional context used

🧬 Code graph analysis (1)

src/cpu/pred/btb/btb_mgsc.hh (2)

src/cpu/pred/btb/btb_tage.hh (2)

taken (282-282)

pred (383-383)

src/cpu/pred/btb/btb_ittage.hh (1)

taken (179-179)

🪛 Ruff (0.14.10)

src/cpu/pred/BranchPredictor.py

1150-1150: Param may be undefined, or defined from star imports

(F405)

1151-1151: Param may be undefined, or defined from star imports

(F405)

1152-1152: Param may be undefined, or defined from star imports

(F405)

1153-1153: Param may be undefined, or defined from star imports

(F405)

1154-1154: Param may be undefined, or defined from star imports

(F405)

1155-1155: Param may be undefined, or defined from star imports

(F405)

1156-1156: Param may be undefined, or defined from star imports

(F405)

1157-1157: Param may be undefined, or defined from star imports

(F405)

🔇 Additional comments (13)

configs/example/kmhv3.py (2)

108-111: Verify the performance impact of forcing SC usage.

Enabling MGSC with forceUseSC = True fundamentally changes branch prediction behavior by always using the statistical corrector. The performance robot reports show significant variance across commits, including a -72% regression in commit 2cfb00c that was later reverted.

Confirm that this configuration matches the intended RTL behavior and that the performance characteristics are acceptable for your testing objectives.

113-120: LGTM: RTL alignment configuration matches stated intent.

The selective table enablement (PTable and BiasTable only, with all other tables disabled) aligns with the PR objective to match RTL behavior. The configuration is internally consistent with the inline comment.

src/cpu/pred/btb/btb_mgsc.hh (3)

42-44: LGTM! TAGE confidence fields properly added.

The three TAGE confidence levels (high, mid, low) are consistently added to the MgscPrediction struct across default initialization, constructor parameters, and member initialization. This allows the SC predictor to adapt its behavior based on TAGE confidence.

Also applies to: 72-74, 109-111

422-459: Comprehensive statistics added for MGSC analysis.

The expanded MgscStats provides excellent observability into:

Per-table weight scale sensitivity

Raw percsum correctness for each table

Threshold update patterns

SC usage patterns under different TAGE confidence levels

This will be valuable for understanding predictor behavior and tuning.

329-337: Enable flag defaults and safety checks are properly implemented.

All enable flags default to True in BranchPredictor.py (except forceUseSC which defaults to False), matching the full production MGSC configuration. The implementation safely handles disabled tables using ternary guards throughout btb_mgsc.cc (e.g., enableBwTable ? calculatePercsum(...) : 0), preventing both uninitialized memory access and divide-by-zero issues. Each flag is documented with clear intent descriptions in BranchPredictor.py.

src/cpu/pred/btb/btb_mgsc.cc (7)

55-62: LGTM! Enable flags properly initialized from parameters.

The constructor correctly initializes all enable flags from the provided parameters, allowing runtime configuration of MGSC table usage.

162-169: Good documentation of percsum calculation.

The updated documentation clearly explains the counter range transformation to odd numbers (2*counter + 1), which avoids zero and provides a clear bias signal. The formula correctly maps the counter range [-32, 31] to percsum range [-63, 63] for 6-bit counters.

Also applies to: 178-178

279-279: Conditional table usage properly implemented.

The percsum calculations correctly check the corresponding enable flags before computing values, returning 0 when disabled. This provides clean control over which tables contribute to the final prediction.

Also applies to: 283-283, 287-287, 291-291, 295-295, 299-299

553-638: Well-structured statistics collection method.

The recordPredictionStats method provides centralized, comprehensive statistics tracking:

SC vs TAGE outcome tracking

Per-table percsum correctness using a helper lambda

Weight scale difference tracking

Confidence-based SC usage patterns

The use of lambdas (recordPercsum and recordConfOutcome) is elegant and reduces code duplication.

668-675: Statistics tracking for threshold updates.

The code correctly increments/decrements threshold statistics based on prediction correctness. However, note that these statistics are always incremented regardless of the enablePCThreshold flag value, while the actual PC threshold update (lines 707-710) is conditional. This is correct behavior since global thresholds are always updated, but the asymmetry is worth noting.

707-710: PC threshold updates properly gated by enable flag.

The PC-indexed threshold table is only updated when enablePCThreshold is true, providing correct conditional behavior. The global threshold (line 713) is always updated, which is appropriate.

313-328: Confidence-based threshold logic verified as correct.

The tiered threshold approach is sound: saturated TAGE predictions (high confidence) use the most lenient threshold (total_thres/2), unsaturated predictions use moderate thresholds, and unpredicted/low-confidence cases use stricter thresholds. The three confidence levels are mutually exclusive by design based on counter saturation state. Fallthrough behavior when no confidence level matches (which shouldn't occur) is intentional.

src/cpu/pred/BranchPredictor.py (1)

1149-1157: MGSC control parameters properly defined.

The new parameters provide runtime configurability for MGSC behavior:

forceUseSC: Defaults to False, allowing normal TAGE confidence-based switching

Table enable flags: Default to True, enabling all tables by default

enablePCThreshold: Defaults to True, enabling PC-indexed thresholds

Default values appear reasonable for production use, with all features enabled except forceUseSC.

Note: The static analysis hints about undefined Param are false positives—Param is correctly imported via from m5.params import * on line 29.

github-actions · 2025-12-29T11:54:20Z

🚀 Coremark Smoke Test Results

Branch	IPC	Change
Base (`xs-dev`)	`2.1772`	-
This PR	`2.1660`	📉 `-0.0112` (`-0.52%`)

✅ Difftest smoke test passed!

XiangShanRobot · 2025-12-29T13:18:29Z

[Generated by GEM5 Performance Robot]
commit: 8e8378d
workflow: gem5 Align BTB Performance Test(0.3c)

Align BTB Performance

Overall Score

	PR	Master	Diff(%)
Score	17.62	17.53	+0.52 🟢

[Generated by GEM5 Performance Robot]
commit: 8e8378d
workflow: gem5 Align BTB Performance Test(0.3c)

Align BTB Performance

Overall Score

	PR	Previous Commit	Diff(%)
Score	17.62	4.78	+268.62 🟢

Change-Id: I2b8451ff6ec9abbda07d71a32a942fb116f0abd9

global + bias + path Change-Id: I58b509b2361a76b681a3862bb3d9bcc140d5ac52

IMLI + global + bias + path Change-Id: I5a9c8470901455db75597a3d0aa07204a008a291

backward + IMLI + global + bias + path Change-Id: I7aabaafa68660f82a75a19e002f0b44376751c79

local + backward + IMLI + global + bias + path Change-Id: I6ec878fc09049b5cab9a840ed7e4b568f39168fd

coderabbitai

Actionable comments posted: 0

🧹 Nitpick comments (1)

src/cpu/pred/btb/stream_struct.hh (1)
801-836: LGTM! MgscTrace structure properly extends Record for MGSC/TAGE tracing.

The structure correctly captures MGSC prediction data including TAGE confidence levels, per-table percsums, and final SC decision. The signed int64_t percsum values are intentionally cast to uint64_t for storage (lines 822-830), preserving the bit pattern for later analysis.

The set() method has 16 parameters, which can be difficult to maintain. Consider grouping related fields into nested structures if this interface becomes unwieldy in practice.
💡 Optional refactor: group related parameters

For example, you could group TAGE confidence fields:
struct TageConfidence {
    uint64_t high;
    uint64_t mid;
    uint64_t low;
};

struct PercsumValues {
    int64_t bw, l, i, g, p, bias;
};

void set(uint64_t branchPC, uint64_t tagePred, 
         const TageConfidence& conf,
         const PercsumValues& percsums,
         int64_t totalSum, int64_t totalThres, 
         uint64_t useSc, uint64_t scPred, uint64_t actualTaken)
However, the current flat structure is acceptable and matches the existing pattern in this codebase.

📜 Review details

Configuration used: defaults

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 8e8378d and 2b42aac.

📒 Files selected for processing (3)

configs/example/kmhv3.py
src/cpu/pred/btb/btb_mgsc.cc
src/cpu/pred/btb/stream_struct.hh

🚧 Files skipped from review as they are similar to previous changes (1)

configs/example/kmhv3.py

🧰 Additional context used

🧬 Code graph analysis (1)

src/cpu/pred/btb/stream_struct.hh (1)

src/sim/clocked_object.hh (1)

curTick (222-222)

🔇 Additional comments (8)

src/cpu/pred/btb/btb_mgsc.cc (8)

55-62: LGTM! Proper initialization of SC control flags.

The new configuration parameters (forceUseSC, per-table enable flags, enablePCThreshold) are properly initialized from the Params object, providing fine-grained control over MGSC/SC behavior.

149-172: LGTM! Trace initialization properly defines MGSC trace schema.

The setTrace() method correctly initializes the MGSC trace with 16 fields that align with the MgscTrace::set() method signature. The implementation is properly guarded with #ifndef UNIT_TEST and checks enableDB before initialization.

188-207: LGTM! Percsum calculation correctly transforms counters to odd numbers.

The formula percsum += (2 * entry + 1) (line 204) properly transforms counter values to odd numbers, avoiding zero contributions. The updated documentation clearly explains the counter range and transformation logic.

305-354: LGTM! Conditional table enabling and SC usage logic are correctly implemented.

The per-table percsum calculations properly respect the enable flags (lines 305, 309, 313, 317, 321, 325), setting contributions to 0 when disabled. The SC usage decision logic (lines 339-354) correctly implements confidence-based thresholding:

forceUseSC bypasses all checks

High TAGE confidence requires |total_sum| > total_thres/2

Mid confidence requires |total_sum| > total_thres/4

Low confidence requires |total_sum| > total_thres/8

This ensures SC only overrides TAGE when it has sufficient confidence, with higher bars for overriding high-confidence TAGE predictions.

579-664: LGTM! Centralized statistics recording with good code structure.

The new recordPredictionStats() method consolidates statistics collection, improving maintainability. The implementation correctly:

Tracks SC vs TAGE outcomes (lines 588-601)

Records per-table percsum correctness using a lambda helper (lines 604-639)

Tracks weight scaling criticality for each table

Records SC usage outcomes bucketed by TAGE confidence levels (lines 642-663)

The use of lambda functions reduces code duplication while maintaining clarity.

688-702: LGTM! Statistics recording and trace writing properly integrated.

The updateSinglePredictor method correctly calls recordPredictionStats() (line 688) and writes trace records (lines 690-702). The trace writing is properly guarded with #ifndef UNIT_TEST and enableDB checks, and all required fields are passed to MgscTrace::set().

708-750: LGTM! Threshold update logic correctly adapts SC usage.

The threshold adjustment logic (lines 708-715) is correct:

Increments when sc_pred_taken != actual_taken (raise the bar after SC errors)

Decrements when SC is correct but confidence is low (lower the bar to use SC more often)

Statistics tracking (pcThresholdInc/Dec, globalThresholdInc/Dec) provides visibility into threshold adaptation. The PC-indexed threshold update (lines 747-750) properly respects the enablePCThreshold flag.

1225-1258: LGTM! Comprehensive MGSC statistics for observability.

The new statistics counters provide excellent visibility into MGSC behavior:

Per-table weight scaling criticality (lines 1225-1230)

Per-table percsum correctness (lines 1232-1243)

Threshold adaptation tracking (lines 1245-1248)

SC usage outcomes bucketed by TAGE confidence (lines 1250-1258)

The statistics align with the recordPredictionStats() method and follow gem5 conventions. This comprehensive instrumentation will be valuable for MGSC performance analysis and tuning.

github-actions · 2026-01-07T03:04:54Z

🚀 Coremark Smoke Test Results

Branch	IPC	Change
Base (`xs-dev`)	`2.1689`	-
This PR	`2.1726`	📈 `+0.0038` (`+0.17%`)

✅ Difftest smoke test passed!

github-actions · 2026-01-07T03:05:56Z

🚀 Coremark Smoke Test Results

Branch	IPC	Change
Base (`xs-dev`)	`2.1689`	-
This PR	`2.1726`	📈 `+0.0038` (`+0.17%`)

✅ Difftest smoke test passed!

github-actions · 2026-01-07T03:13:39Z

🚀 Coremark Smoke Test Results

Branch	IPC	Change
Base (`xs-dev`)	`2.1689`	-
This PR	`2.1794`	📈 `+0.0105` (`+0.49%`)

✅ Difftest smoke test passed!

github-actions · 2026-01-07T03:22:02Z

🚀 Coremark Smoke Test Results

Branch	IPC	Change
Base (`xs-dev`)	`2.1689`	-
This PR	`2.1727`	📈 `+0.0038` (`+0.17%`)

✅ Difftest smoke test passed!

XiangShanRobot · 2026-01-07T04:42:46Z

[Generated by GEM5 Performance Robot]
commit: 2b42aac
workflow: gem5 Align BTB Performance Test(0.3c)

Align BTB Performance

Overall Score

	PR	Master	Diff(%)
Score	17.84	17.48	+2.04 🟢

[Generated by GEM5 Performance Robot]
commit: 2b42aac
workflow: gem5 Align BTB Performance Test(0.3c)

Align BTB Performance

Overall Score

	PR	Previous Commit	Diff(%)
Score	17.84	17.78	+0.32 🟢

Change-Id: I07ece3caa9617ff7c1aa094d00ef9710d476b41e

github-actions · 2026-01-09T03:11:15Z

🚀 Coremark Smoke Test Results

Branch	IPC	Change
Base (`xs-dev`)	`2.1662`	-
This PR	`2.1662`	➡️ `0.0000` (`0.00%`)

✅ Difftest smoke test passed!

XiangShanRobot · 2026-01-09T03:56:57Z

[Generated by GEM5 Performance Robot]
commit: 0edaa15
workflow: gem5 Align BTB Performance Test(0.3c)

Align BTB Performance

Overall Score

	PR	Master	Diff(%)
Score	17.53	17.51	+0.13 🟢

[Generated by GEM5 Performance Robot]
commit: 0edaa15
workflow: gem5 Align BTB Performance Test(0.3c)

Align BTB Performance

Overall Score

	PR	Previous Commit	Diff(%)
Score	17.53	17.84	-1.73 🔴

Change-Id: I102752018085baeae8aaba087b361758e6af2ab4

github-actions · 2026-01-12T02:38:04Z

🚀 Coremark Smoke Test Results

Branch	IPC	Change
Base (`xs-dev`)	`2.1691`	-
This PR	`2.1732`	📈 `+0.0041` (`+0.19%`)

✅ Difftest smoke test passed!

jensen-yan added 5 commits December 26, 2025 16:49

pred: add BTBMGSC prediction statistics

f89181e

- Updated calculatePercsum and calculateScaledPercsum documentation for clarity. - Enhanced MgscStats to include new statistics for weight scale sensitivity and prediction correctness.

bpu: test mgsc score

ad38654

Change-Id: I5992adb066d5c5c75bd16141564ac55264aa7e0a

bpu: add mgsc dosc

65040e7

Change-Id: Ieaef1b35cd968cb97679b982eab6db54b0b74b4c

bpu: disable weight table for test

48d6304

Change-Id: Ib26755301387cbf1ce4b12bb26e988d43a49eea8

coderabbitai Bot reviewed Dec 26, 2025

View reviewed changes

Comment thread docs/Gem5_Docs/frontend/mgsc_notes.md

bpu: test force use SC not use tage

8bf3d10

Change-Id: I69ef16d0b2d973de03e8ccd381ff359bc640d5f1

coderabbitai Bot reviewed Dec 26, 2025

View reviewed changes

bpu: test no tage, only use SC

2cfb00c

Change-Id: Ia279ea56c55b79261163bee4400befe6014e8c12

jensen-yan added 2 commits December 29, 2025 18:36

Revert "bpu: test no tage, only use SC"

ef82824

This reverts commit 2cfb00c.

bpu: align RTL: only enable bias and path, disable pcThreshold

8e8378d

Change-Id: I6ecce2060d5a9dc85312ff87188ee8804a694f2a

coderabbitai Bot reviewed Dec 29, 2025

View reviewed changes

Comment thread src/cpu/pred/btb/btb_mgsc.cc

jensen-yan added 5 commits January 7, 2026 10:57

bpu: add mgsc trace

6b78713

Change-Id: I2b8451ff6ec9abbda07d71a32a942fb116f0abd9

bpu: mgsc,enable global

f0b4bdb

global + bias + path Change-Id: I58b509b2361a76b681a3862bb3d9bcc140d5ac52

bpu: mgsc,enable IMLI

8743853

IMLI + global + bias + path Change-Id: I5a9c8470901455db75597a3d0aa07204a008a291

bpu: mgsc,enable backward

8e7df10

backward + IMLI + global + bias + path Change-Id: I7aabaafa68660f82a75a19e002f0b44376751c79

bpu: mgsc,enable local

2b42aac

local + backward + IMLI + global + bias + path Change-Id: I6ec878fc09049b5cab9a840ed7e4b568f39168fd

coderabbitai Bot reviewed Jan 7, 2026

View reviewed changes

jensen-yan requested review from CaoJiaming776 and Yakkhini January 9, 2026 03:02

bpu: disable sc now, waiting for RTL implementation

0edaa15

Change-Id: I07ece3caa9617ff7c1aa094d00ef9710d476b41e

bpu: remove duplicated params in kmhv3.py

10d3c02

Change-Id: I102752018085baeae8aaba087b361758e6af2ab4

CaoJiaming776 approved these changes Jan 12, 2026

View reviewed changes

jensen-yan merged commit 39cd6af into xs-dev Jan 12, 2026
3 checks passed

jensen-yan deleted the mgsc2-align branch January 12, 2026 02:49

This was referenced Jan 13, 2026

bpu: test old: only enable sc imli table #707

Merged

Sc ut #710

Merged

This was referenced Mar 3, 2026

Mgsc5 align #768

Merged

Bpu add percep perf #774

Open

This was referenced May 7, 2026

cpu: Add imHist-assisted MGSC IMLI path #851

Open

Sc test align #852

Open

coderabbitai Bot mentioned this pull request Jun 12, 2026

[codex] Add standalone SC switch #905

Merged

Uh oh!

Conversation

jensen-yan commented Dec 26, 2025 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

coderabbitai Bot commented Dec 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Estimated code review effort

Possibly related PRs

Suggested labels

Suggested reviewers

Poem

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions Bot commented Dec 26, 2025

🚀 Coremark Smoke Test Results

Uh oh!

github-actions Bot commented Dec 26, 2025

🚀 Coremark Smoke Test Results

Uh oh!

XiangShanRobot commented Dec 26, 2025

Align BTB Performance

Overall Score

Align BTB Performance

Overall Score

Uh oh!

XiangShanRobot commented Dec 26, 2025

Align BTB Performance

Overall Score

Align BTB Performance

Overall Score

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

github-actions Bot commented Dec 29, 2025

🚀 Coremark Smoke Test Results

Uh oh!

XiangShanRobot commented Dec 29, 2025

Align BTB Performance

Overall Score

Align BTB Performance

Overall Score

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions Bot commented Jan 7, 2026

🚀 Coremark Smoke Test Results

Uh oh!

github-actions Bot commented Jan 7, 2026

🚀 Coremark Smoke Test Results

Uh oh!

github-actions Bot commented Jan 7, 2026

🚀 Coremark Smoke Test Results

Uh oh!

github-actions Bot commented Jan 7, 2026

🚀 Coremark Smoke Test Results

Uh oh!

XiangShanRobot commented Jan 7, 2026

Align BTB Performance

Overall Score

Align BTB Performance

Overall Score

Uh oh!

github-actions Bot commented Jan 9, 2026

🚀 Coremark Smoke Test Results

Uh oh!

XiangShanRobot commented Jan 9, 2026

Align BTB Performance

jensen-yan commented Dec 26, 2025 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented Dec 26, 2025 •

edited

Loading