Developments of the base SC13 model for benchmark and MSE considerations

SPRFMO

South Pacific Regional Fisheries Management Organisation
Jack Mackerel Working Group
SCW16/Paper-01
PDF-rendered version | MS Word version
Developments of the base SC13 model for benchmark and MSE considerations

08 April 2026

Abstract

1. This working paper documents the base SC13 jack mackerel assessment model used for advice under the single-stock and two-stock hypotheses, corresponding to model configurations h1_1.14 and h2_1.14. The paper is intended as a bridge between the full technical annex and a future management strategy evaluation (MSE) framework. Its specific purpose is to describe the Joint Jack Mackerel Model (JJM) as implemented for SC13, identify the principal differences between the two advice models, and define a starting point for evaluating simplified alternatives. The immediate justification for beginning this work is twofold: 1) to cover topics in advance that can be discussed most efficiently at the forthcoming planned benchmark and 2) to help address a number of issues related to the MSE identified during the July 2025 MSE workshop. These include extensive time-varying selectivity, dynamic reference points that depend on terminal-year model conditions, and projection assumptions that differ from the assumptions used during model fitting. The SC13 base model therefore serves here as the reference assessment formulation from which reduced alternatives can be developed and tested.

1 Introduction

2. The Joint Jack Mackerel Model has been the core age-structured assessment framework used by SPRFMO for jack mackerel over a series of annual updates and technical workshops. In general terms, the model belongs to the class of forward-projecting statistical catch-at-age models developed for fisheries assessment, with estimated population dynamics linked to catch, abundance indices, and composition data through a likelihood-based framework (D. Fournier & Archibald, 1982; Hilborn & Walters, 1992). The formal technical reference for the accepted SC13 assessment remains Annex 11 of the SC13 report, the Jack Mackerel Technical Annex (SPRFMO Scientific Committee, 2025).

3. Within the present repository, the SC13 assessment update is represented by model series 1.14. The file assessment/README.md documents the sequence of changes leading to this configuration. In summary, the 2025 assessment update began with a base update to data and model structure (1.00) and then proceeded through a sequence of modifications affecting 2025 selectivity, weighting of 2025 age compositions, downweighting of early years of the northern Chile acoustic index, and historical weight-at-age for the Far North fleet. The final advice models were fitted under both stock hypotheses: h1_1.14 for the single-stock hypothesis and h2_1.14 for the two-stock hypothesis.

4. The motivation for documenting these models in a separate working paper is practical. MSE is most useful when the operating model and the management procedure are both transparent, interpretable, and robust to the major uncertainties that matter for management (Butterworth & Punt, 1999; Punt et al., 2016; Siple et al., 2021). The July 2025 MSE workshop concluded that direct transfer of the full SC13 assessment structure into MSE would likely import avoidable complexity. This paper therefore treats h1_1.14 and h2_1.14 as the base assessment formulations that must first be documented clearly before any simplification is attempted.

2 Materials Used for This Paper

5. This paper is based on the contents of the assessment directory, with the following files providing the main technical basis for the description:

  1. assessment/README.md, which documents the model progression to SC13;
  2. doc/annex/annex_text.qmd, which contains the technical annex narrative;
  3. assessment/R/SC13.rmd, which records the SC13 update process and the rationale for the final model;
  4. assessment/R/run_projections.R, which records the projection logic applied to the base model;
  5. assessment/input/1.14.dat, the common data file used by both stock hypotheses;
  6. assessment/config/h1_1.14.ctl and assessment/config/h2_1.14.ctl, which define the stock-specific control structures;
  7. assessment/results/h1_1.14.rep and assessment/results/h2_1.14.rep, which summarize fitted model outputs and likelihood components.

6. The objective here is not to reproduce every detail already contained in the annex. Rather, the objective is to extract from those materials a concise but technically defensible description of the SC13 base model for later MSE use.

3 General Structure of the JJM Used for SC13

3.1 Overview

7. The annex describes the JJM as an explicitly age-structured forward projection model estimated by maximum likelihood. That places it firmly within the standard family of integrated fisheries assessment models derived from statistical catch-at-age theory (D. Fournier & Archibald, 1982; Schnute & Richards, 1995). In practical implementation, the JJM is also a highly parameterized nonlinear model fitted through AD Model Builder, which is consistent with the long use of ADMB in stock assessment applications (D. A. Fournier et al., 2012).

8. For SC13, the model is run over years 1970-2025 and ages 1-12. The common data file assessment/input/1.14.dat specifies four fishery fleets, seven abundance indices, age-composition observations for multiple fisheries and surveys, length compositions for the Far North fleet, annual catches by fleet, and weight-at-age schedules used in both fishery and survey components. The control files then assign those data to either one or two modeled stocks depending on the hypothesis being fitted.

3.2 Data Structure

9. The SC13 base data set contains four fleets:

  1. N_Chile;
  2. SC_Chile_PS;
  3. FarNorth;
  4. Offshore_Trawl.

10. The composition data structure differs by fleet. Age compositions are used for N_Chile, SC_Chile_PS, and Offshore_Trawl, while the FarNorth fleet is fitted to length compositions through the modelled growth relationship. The file assessment/input/1.14.dat indicates that age-composition observations extend from 1980-2025 for the Chilean fleets and 2015-2025 for the offshore fleet, while length-composition observations are available for the Far North fleet from 1980-2025.

11. The same data file also includes seven abundance indices:

  1. Chile_AcousCS;
  2. Chile_AcousN;
  3. Chile_CPUE;
  4. DEPM;
  5. Peru_Acoustic;
  6. Peru_CPUE;
  7. Offshore_CPUE.

12. This is therefore not a sparse index-only assessment. It is a fully integrated model in which trends in biomass and recruitment are informed simultaneously by catch, multiple abundance indices, and composition data, with the latter carrying substantial influence over selectivity and year-class interpretation. The use of composition data at this level also means that weighting, effective sample size, and selectivity specification become major determinants of fit and inference (Francis, 2011).

3.3 Population Dynamics and Observation Model

13. The annex and control files together indicate a standard age-structured population dynamic framework with stock recruitment, natural mortality, growth, maturity, fleet selectivity, survey selectivity, and catchability. The growth component follows the usual fisheries implementation of the growth (Beverton & Holt, 1957). The Far North fleet is modelled through predicted length compositions derived from the age-structured population and the specified growth schedule.

14. Selectivity is an especially important feature of the SC13 implementation. The annex describes selectivity as non-parametric, fishery-specific, and time-varying. The control files confirm this, showing fleet-specific selectivity structures, annual or block changes by fleet, and selectivity penalties applied to both fisheries and surveys. Time-varying selectivity is a familiar response to changes in fishery behavior and sampling structure, but it is also known to complicate interpretation in catch-at-age assessments (Linton & Bence, 2011).

15. Catchability is index-specific and includes shared settings across the two hypotheses. The annex further notes that standardized CPUE series include explicit assumptions about efficiency creep, consistent with broader literature on fleet evolution and changing fishing power (Rousseau et al., 2019).

16. A compact mathematical summary of the core model structure is provided in the appendix so that the main text can stay focused on the SC13 implementation choices rather than on notation.

3.4 Reference Points and Projections

17. The SC13 annex states that MSY, F_{MSY} and B_{MSY} are calculated internally within the JJM using equilibrium calculations based on terminal-year catch shares, selectivity-at-age, and weights-at-age. Accordingly, these reference points are dynamic rather than fixed. The annex also notes that recent advice has used a ten-year average of model-estimated B_{MSY} as the reference biomass quantity for comparison.

18. The projection scripts in assessment/R/run_projections.R add a second layer of structure. Future projections are conducted using fixed terminal selectivity and, for the .ls alternatives, a shorter and more conservative recruitment history. This means the advice process combines a fitted assessment model with a somewhat different projection model. That approach is defensible in annual advice, but it is exactly the kind of multi-stage complexity that MSE seeks to simplify and evaluate explicitly (Punt et al., 2016; Siple et al., 2021).

4 The SC13 Base Advice Models

4.1 Single-Stock Hypothesis: h1_1.14

19. The control file assessment/config/h1_1.14.ctl defines the single-stock advice model. It specifies one stock, one recruitment regime, one maturity schedule, one population weight-at-age schedule, and a single natural mortality value of 0.28. The stock recruitment component uses steepness 0.65 and recruitment variability 0.6, with recruitment years 2000-2022 entering the stock recruitment specification.

20. From a structural point of view, h1_1.14 is the simpler of the two advice models. All fishery and survey information is interpreted as informing a common population. However, the model still retains considerable complexity because selectivity is allowed to vary over time and because the Far North fleet is represented with annual selectivity changes extending back to 1981. In effect, the single-stock hypothesis avoids stock splitting but compensates with a flexible fleet-history interpretation.

21. The fitted report file indicates that the model is strongly informed by composition and selectivity components. The total objective function reported in assessment/results/h1_1.14.rep is 1323.82, with large contributions from fishery age likelihood, fishery length likelihood, fishery selectivity penalties, index likelihood, and survey age likelihood. This confirms that h1_1.14 should not be viewed as a simplified baseline in itself. It is already a sophisticated assessment model.

4.2 Two-Stock Hypothesis: h2_1.14

22. The control file assessment/config/h2_1.14.ctl defines the two-stock advice model. It specifies two stocks, a more complex recruitment assignment, stock-specific biology for the second stock, and different treatment of the Far North observations. In the terminology used in the annex, the two stocks correspond to a southern stock and a far north stock.

23. The most consequential structural difference lies in recruitment and biological assignment. The two-stock control file specifies nregbyStock = 1 2, RecMatrix = 1 2 3, and RegShift = 1999. In practical terms, this means that the northern component is not represented simply as a second stock with otherwise identical rules. It is represented with a different recruitment structure that includes a regime shift. The recruitment year blocks in the file are 2000-2022, 1970-1996, and 2001-2016 for the three recruitment assignments. The model also assigns a higher natural mortality value to the second stock (0.33), along with a separate maturity schedule and separate population weight-at-age schedule.

24. The treatment of the Far North fleet also changes substantially under h2_1.14. Whereas h1_1.14 allows a long annual history of selectivity variation for the Far North fleet, h2_1.14 uses a much simpler regime treatment centered on 2002. This is an important point for later MSE development because the difference between h1 and h2 is not purely a question of stock structure. It is partly a difference in how northern observations are interpreted and regularized.

25. The fitted report file for h2_1.14 indicates a total objective function of 1194.09. As with the single-stock model, substantial contributions arise from fishery age, fishery length, selectivity penalties, and abundance indices. The lower total objective function relative to h1_1.14 does not by itself establish superiority, but it does confirm that the two-stock hypothesis is a comparably rich fitted model rather than a stripped-down alternative.

4.3 Summary Comparison

26. The main structural differences between the two SC13 advice models are summarized in Table 1.

Table 1: Main structural differences between the SC13 single-stock advice model (h1_1.14) and two-stock advice model (h2_1.14).
Feature h1_1.14 h2_1.14
Stock hypothesis Single shared stock Southern stock plus far north stock
Number of modeled stocks 1 2
Data file assessment/input/1.14.dat assessment/input/1.14.dat
Natural mortality 0.28 0.28 and 0.33
Maturity schedule One schedule Stock-specific schedules
Population weight-at-age One schedule Stock-specific schedules
Recruitment structure One curve, 2000-2022 Three assignments with a 1999 regime shift
Far North selectivity Annual variation from 1981 Simplified regime treatment centered on 2002
Projection horizon 20 years 20 years

27. The commonality between the two models is therefore greater than their difference in terms of data inputs and overall assessment machinery. The key differences lie in stock allocation, biology, recruitment structure, and the degree of complexity assigned to the northern part of the system.

5 Base model considerations

28. The July 2025 MSE workshop concluded that the base assessment should probably be more clearly documented and then perhaps simplified for use as an MSE operating model.

29. The SC13 base model uses extensive time-varying selectivity. This practice continued in the 2025 assessment to improve fits for the south-central Chile and offshore fleets. Time-varying selectivity may be necessary in annual assessments, but it can make MSE results difficult to interpret because changes in management performance may reflect selectivity specification as much as underlying stock dynamics (Linton & Bence, 2011).

30. In the base model the reference points are dynamic. B_{MSY} and F_{MSY} typically depend on terminal-year conditions, especially relative catch shares, selectivity-at-age, and weights-at-age. This makes stock status and advice partly conditional on the most recent fitted model state rather than solely on a stable management benchmark. The appropriateness of this as an assessment choice was raised over several meetings and discussions as it complicates the OM design because the benchmark itself changes with the assessment outcome.

31. Similarly, the projection assumptions differ from the fitting assumptions. The projection scripts show that future calculations rely on fixed selectivity and, in some scenarios, a shortened low-productivity recruitment history. This means that advice is generated from a model in which the fitting phase and the projection phase are intentionally different. In MSE, that layering needs to be explicit and deliberate rather than inherited as an assessment convention (Punt et al., 2016; Siple et al., 2021).

32. Additionally stock-structure uncertainty is mixed in with other differences. The contrast between h1_1.14 and h2_1.14 is more than one stock versus two stocks as an issue. The northern selectivity structure, different mortality, different maturity, different population weight-at-age, and different recruitment regime treatment affect things between the model configurations. For MSE, it will be helpful to separate uncertainty about stock structure from uncertainty about how model complexity has been allocated within each hypothesis.

6 Proposed Simplified Alternatives

33. The SC13 base model should be treated as the reference assessment formulation. Simplified alternatives should therefore be judged by how much of the essential behavior of the base model they retain while removing structural complexity that is not needed for MSE.

34. A practical first working set of alternatives is summarized in Table 2. The table highlights how each variant differs from its relevant 1.14 assessment counterpart in data treatment, selectivity treatment, and intended purpose.

Table 2: Summary of the simplified model variants discussed in Section 6.
Model Reference model Stock structure Data changes Selectivity changes Main purpose
h1_2.00 h1_1.14 Single stock Drops Chile_AcousCS, DEPM, and Peru_Acoustic; trims Chile_AcousN to the reduced-index configuration (2.00.dat). No change from h1_1.14 beyond what is implied by the reduced-index fit. Reduced-index baseline sensitivity.
h1_2.01 h1_2.00 Single stock Same reduced-index data as h1_2.00. Adds fleet-specific fishery selectivity blocks for all fisheries. Tests broad fishery selectivity simplification.
h1_2.01.1 h1_2.01 Single stock Same reduced-index data as h1_2.01. Keeps the h1_2.01 block structure except restores annual selectivity variability for the South-Central Chile fishery. Tests whether restoring S-C Chile selectivity flexibility recovers fit within the block-selectivity framework.
h1_2.01.2 h1_2.01.1 Single stock Same reduced-index data as h1_2.01.1. Keeps the h1_2.01.1 structure except replaces annual S-C Chile variability with five S-C Chile blocks starting in 1985, 1998, 2008, and 2016. Tests whether a five-block S-C Chile treatment can retain recent structure without reverting to full annual flexibility.
h1_2.01.3 h1_2.01 Single stock Same reduced-index data as h1_2.01. Keeps the h1_2.01 selectivity structure but adds estimated random walks in index q for the retained indices and constrains q age scaling to ages 2-6. Tests whether time-varying index catchability can recover fit within the block-selectivity formulation.
h1_2.02 h1_2.00 Single stock Same reduced-index data as h1_2.00. Keeps h1_2.00 except harmonizes Far North fishery selectivity to the single 2002 split used in h2_1.14. Isolates Far North selectivity harmonization within the single-stock hypothesis.
h2_2.02 h2_1.14 Two stock Applies the same reduced-index data configuration as 2.00.dat to the two-stock model. Keeps the h2 Far North selectivity treatment and combines it with the reduced-index data configuration. Provides the two-stock reduced-index harmonized comparator for h1_2.02.
h2_2.03 h2_2.02 Two stock Same reduced-index data and harmonized northern treatment as h2_2.02. No selectivity change relative to h2_2.02; instead collapses stock-2 recruitment to a single fitted regime. Tests whether the two-stock simplified framework can use one stock-2 stock-recruitment relationship instead of two regimes.

35. The h2_2.00 serves as the two-stock reduced-index analogue, h2_2.02 frames the harmonized Far North selectivity treatment, and h2_2.03 extends that harmonized two-stock setup by simplifying the stock-2 recruitment structure to a single fitted regime.

6.1 Reduced number of indices

36. There are three indices in the model that are no longer being conducted and generally fit poorly relative to other data. These included the Central-South acoustic survey, the egg-production index, and the Peruvian acoustic surveys. One sensitivity to omitting these data was considered.

37. The resulting sensitivity run is represented here by h1_2.00, which removes Chile_AcousCS, DEPM, and Peru_Acoustic from the single-stock configuration and also trims the retained Chile_AcousN series to years 2000 and later. Relative to h1_1.14, this drops 56 index-year observations in total: 13 from Chile_AcousCS, 9 from DEPM, 28 from Peru_Acoustic, and 6 pre-2000 observations from Chile_AcousN. It also reduces the fitted parameter vector by 44 parameters, from 1865 in h1_1.14 to 1821 in h1_2.00. This comparison is useful because it isolates a specific simplification of the observation model while leaving the broader single-stock population structure unchanged.

Figure 1: Comparison of spawning biomass trajectories from the base single-stock model (h1_1.14) and the reduced-index sensitivity (h1_2.00).
Figure 2: Comparison of recruitment trajectories from the base single-stock model (h1_1.14) and the reduced-index sensitivity (h1_2.00).

38. Relative to h1_1.14, the reduced-index model leaves the broad historical pattern intact but slightly lowers the terminal objective function, from about 1323.8 in h1_1.14 to about 1234.7 in h1_2.00. Terminal-year spawning biomass uncertainty remains very similar between the two fits, although it is slightly higher in the reduced-index run: the 2025 SSB CV is about 0.125 for h1_1.14 and about 0.129 for h1_2.00. As shown in Figure 1 and Figure 2, the series plots are useful as a first diagnostic of whether the omitted indices were materially changing stock perception or mainly adding noise and additional fitting burden. If the trajectories remain broadly similar, then reducing the index set may be a promising simplification for later MSE work.

NoteNote (I. Mosqueira)

39. The existing cjm.oem function in the MSE codebase already accommodates the legacy indices and accounts for the one-year lag in reporting the offshore CPUE. This means that the computational infrastructure for alternative index configurations is already in place, which substantially reduces the implementation burden of running multiple index configurations within the MSE framework.

40. The influence of individual indices on stock perception warrants specific attention alongside the question of which indices to retain. The Chilean acoustic survey (Chile_AcousN) in particular can strongly influence estimates of current stock status and recent recruitment, depending on the time period and the weight given to that series. A leave-one-out analysis — treating the full index set and each successive alternative configuration as separate robustness operating models — would allow MSE to probe sensitivity to individual indices without committing to a single reduced set as the canonical operating model. The leave-one-out approach should therefore be identified as a robustness axis in the MSE operating model grid, complementing the baseline reduced-index run represented by h1_2.00.

6.2 Simplified Selectivity Alternatives

41. Selectivity specification is the most important source of structural uncertainty in the SC13 base model, i and its treatment in the MSE operating model deserves the most careful consideration of any of the simplification choices discussed in this paper. Two related but distinct dimensions of selectivity need to be addressed.

42. Within-model selectivity structure. The first dimension is the selectivity structure used during model fitting. One option is to replace long annual selectivity histories with a limited set of blocks for each fleet — for example a historical block, a modern block, and a current block. This preserves the broad evidence that fishery selectivity has changed over time while avoiding dozens of annual deviations that are difficult to interpret and project forward. Block-based selectivity is recommended as the primary simplified alternative for the MSE operating models. It provides an interpretable and tractable representation of selectivity change without the parameter proliferation of annual non-parametric estimation.

Selectivity Time-Block Reduction by Fleet

43. Using the age-composition tables (Annex Tables 3-5) and the selectivity heatmaps (Annex Figures 37-38), it is possible to define time blocks where the age pattern, as a proxy for selectivity-at-age, is broadly stable for each fleet. The reduction below follows the main regime shifts visible in the data and is broadly consistent with how the current model already treats selectivity in some components.

Fleet 1 — North Chile

44. The following suggested blocks (based on age comps in Table 3 of the Annex) for this fleet are:

  • 1980-1986: Broad dome, peaking around ages 4-6.
  • 1987-1999: Strong shift toward young ages, with ages 1-2 dominant.
  • 2000-2015: Persistent very young selectivity, again centered on ages 1-2.
  • 2016-2025: Transition toward older ages, with ages 4-8 increasing.

45. The main structural breaks for this fleet are around 1987 and 2016, which is also consistent with the increase in mean age shown in Annex Figure 31.

Fleet 2 — South-Central Chile

46. The following suggested blocks (based on age comps in Table 4 of the Annex) for this fleet are:

  • 1980-1989: Peak ages are around 3-5.
  • 1990-2004: Gradual right-shift, with ages 5-8 becoming more important.
  • 2005-2013: Broad and relatively flat selectivity across ages 4-8.
  • 2014-2025: Clear dominance of older fish, especially ages 5-9.

47. These blocks are consistent with the gradual aging trend noted elsewhere in the report, including the increase in mean age over the last decade.

Fleet 3 — Far North

48. The following suggested blocks (based on the length-composition pattern in Table 6 of the Annex, translated to implied age selectivity) for this fleet are:

  • 1980-1995: Stable mid-length pattern corresponding roughly to ages 4-7.
  • 1996-2004: Modest shift toward smaller fish.
  • 2005-2013: Higher variability with mixed modes.
  • 2014-2025: Shift toward larger and therefore older fish.

49. This is notable because the current h2 treatment already uses a simplified two-regime split around 2002, but the data suggest somewhat finer structure could be supported.

Fleet 4 — Offshore trawl

50. The following suggested blocks (based on age comps in Table 5 of the Annex) for this fleet are:

  • 2015-2017: Mixed ages, including some older fish.
  • 2018-2020: Shift toward younger fish, especially ages 2-4.
  • 2021-2025: Strong young-fish dominance, including a pronounced age-2 spike.

51. These changes are consistent with the recent selectivity changes noted in the SC13 sensitivity runs.

Cross-Fleet Simplification

52. If a more parsimonious common structure is preferred, a shared set of broad periods could be:

  • Pre-1990
  • 1990-2004
  • 2005-2015
  • 2016-present

53. This common structure could then be supplemented with:

  • fleet-specific deviations, for example through random effects or a low-rank representation; and
  • extra flexibility for Fleet 4 after 2018, where the short series suggests sharper recent changes.

Key Insight

54. Across fleets, the main broad pattern is a recent shift toward older fish after about 2015, with the offshore fleet as the main exception because it shows more recent spikes in younger selectivity.

55. The clearest global breakpoints appear to be:

  • 1987-1990: collapse period associated with younger fish;
  • around 2000: a stabilization period; and
  • 2015-2016: an aging or structural shift.

56. These broad breakpoints also align with changes in fishing pressure, the stock rebuilding phase, and the model note that recent selectivity favors older fish.

Illustrative fit with selectivity blocks

57. To illustrate the effect of imposing the proposed fleet-specific selectivity blocks, I fitted a reduced-index sensitivity with those block years hard-coded in the fishery selectivity specification. The resulting run is represented here by h1_2.01, which keeps the reduced-index data configuration of h1_2.00 but replaces the long annual selectivity histories with the fleet-specific blocks proposed above. The side-by-side selectivity panels in Figure 3 show directly how the fishery selectivity surface changes when the annual histories in h1_2.00 are replaced by the simplified block structure in h1_2.01.

(a) Model h1_2.00
(b) Model h1_2.01
Figure 3: Estimates of selectivity by fishery over time for Models h1_2.00 and h1_2.01, shown in the same style as Figure 37 of the SC13 Jack Mackerel Technical Annex (SPRFMO Scientific Committee, 2025).
Figure 4: Comparison of spawning biomass trajectories from the reduced-index sensitivity (h1_2.00) and the reduced-index model with fleet-specific selectivity blocks (h1_2.01).
Figure 5: Comparison of recruitment trajectories from the reduced-index sensitivity (h1_2.00) and the reduced-index model with fleet-specific selectivity blocks (h1_2.01).
Figure 6: Comparison of spawning biomass trajectories from the reduced-index sensitivity (h1_2.00) and an intermediate selectivity variant (h1_2.01.1) in which the South-Central Chile fishery retains the annual selectivity variability allowance from h1_2.00.

58. Relative to h1_2.00, the block-selectivity run fits the retained data less well and produces a higher recent stock trajectory. The terminal objective function increases from about 1234.7 in h1_2.00 to about 1544.1 in h1_2.01, while the estimated 2025 spawning biomass increases from about 15.9 kt to about 19.9 kt. Terminal recruitment also rises, from about 16.8 million in h1_2.00 to about 21.4 million in h1_2.01. As shown in Figure 4 and Figure 5, simplifying selectivity in this way is therefore not just a bookkeeping reduction in model complexity; it materially changes stock perception and should be treated as a substantive structural alternative in the MSE operating-model set.

59. The negative-log-likelihood breakdown for the base single-stock model and the two simplified alternatives is shown in Table 3. Relative to h1_1.14, the reduced-index sensitivity lowers the retained index-related likelihood contributions without materially worsening the overall fit, whereas the block-selectivity model reduces the number of estimated parameters more aggressively but at the cost of substantially higher age- and index-fit components.

Table 3: Negative-log-likelihood components and estimated parameter counts for the single-stock base model (h1_1.14), the reduced-index sensitivity (h1_2.00), and the reduced-index model with fishery selectivity blocks (h1_2.01). For index likelihoods, the aggregate and component rows include only the retained comparable series: Acoustic North, Chile CPUE, Peru CPUE, and Offshore CPUE.
Component h1_1.14 h1_2.00 h1_2.01
Catch likelihood 0.9 0.8 0.8
Fishery age compositions 262.9 262.9 549.8
North Chile fishery age comps 113.7 113.2 202.8
S-C Chile fishery age comps 123.3 124.0 307.3
Offshore trawl fishery age comps 25.9 25.7 39.7
Fishery length compositions 471.5 470.2 581.9
Fishery selectivity penalties 331.3 329.7 114.4
Index likelihood 136.5 130.6 258.0
Chile acoustic North index 74.4 73.2 87.7
Chile CPUE index 17.9 17.0 73.5
Peru CPUE index 32.9 29.5 65.3
Offshore CPUE index 11.3 10.9 31.4
Index age compositions 61.5 38.1 37.8
Index selectivity penalties 10.7 6.1 3.9
Recruitment term −3.0 −3.9 −2.6
Fishing mortality penalty 0.0 0.0 0.0
Index q priors 0.2 0.1 0.1
Residual term 0.1 0.1 0.0
Total negative log likelihood 1,323.8 1,234.7 1,544.1
Estimated parameters (count) 1,865 1,821 480
NA 13.8579

60. Viewed in more detail, the reduced-index configuration suggests that dropping the discontinued or older index series generally relaxes pressure on the rest of the fit. When attention is restricted to the retained comparable indices, the aggregate index likelihood declines modestly from 136.5 in h1_1.14 to 130.6 in h1_2.00, and the retained Chile CPUE, Peru CPUE, and Offshore CPUE contributions all improve modestly. Fishery catch, fishery length compositions, and most fishery age-composition terms also stay similar or improve slightly. The main exception is that the Acoustic North term remains large and only weakly improved, so the reduced-index simplification does not remove the tension associated with that series.

61. By contrast, imposing the block selectivity structure in h1_2.01 degrades the fit across a broad range of components rather than concentrating the penalty in a single place. The South-Central Chile and North Chile fishery age-composition terms increase strongly, the aggregate fishery length-composition likelihood worsens, and the retained index contributions rise for Acoustic North, Chile CPUE, Peru CPUE, and Offshore CPUE. In that sense, the selectivity simplification is not merely trading one data source against another; it reduces flexibility in a way that propagates across several of the main observation components simultaneously.

62. The visual fit to the retained and comparable abundance indices is shown in Figure 7. This first comparison focuses on the base model, the reduced-index sensitivity, and the reduced-index model with block selectivity. It shows that h1_2.00 stays broadly close to h1_1.14 for the South-Central Chile CPUE, Peruvian CPUE, and Offshore CPUE series, whereas the additional selectivity simplification in h1_2.01 produces visibly worse fits for several of the retained indices, especially Acoustic North and Peru CPUE.

Figure 7: Observed and fitted index trajectories for h1_1.14, h1_2.00, and h1_2.01. Panels show the retained comparable index series: South-Central Chile CPUE, Acoustic North, Peruvian CPUE, and Offshore CPUE.

63. Projection-period selectivity uncertainty. The second dimension is the selectivity pattern applied during the projection period of the MSE simulation. In the current assessment, projections use fixed terminal selectivity. In MSE, the future selectivity pattern represents a real source of uncertainty, because the fishing gear and targeting behaviour that produced recent selectivities may not persist over a multi-decade evaluation horizon.

NoteNote (I. Mosqueira)

64. Rather than projecting with a single terminal selectivity, the projection period can be populated with a range of average-selectivity vectors — for example averages computed over the last 1, 5, 10, 15, or 20 years of the modelled period. These alternatives span from recent directed behaviour (short averages, capturing more targeted fishing patterns) to longer integration of historical fishery change (long averages). This range directly addresses the concern raised at the July 2025 workshop that projection assumptions differ from fitting assumptions. Preliminary runs with such averaging periods have already been explored and provide a practical basis for this robustness dimension.

65. The block-based formulation should be regarded as the primary simplified alternative. The 1-, 5-, 10-, 15-, and 20-year rolling-average selectivity vectors should then be used as robustness tests to verify that conclusions about management procedure performance are not overly sensitive to the assumption about future selectivity patterns.

66. This hierarchy — block structure as primary, averaging-period variation as robustness — is consistent with the staged MSE roadmap described by Carruthers (2024) and with the more specific jack mackerel staging discussed in SPRFMO Jack Mackerel Working Group (2025), which separates operating-model specification work from robustness testing and management-procedure refinement. It is also consistent with CCSBT practice in the Bali Procedure review cycle.

67. To isolate whether the deterioration in h1_2.01 is being driven mainly by the reduced flexibility assigned to the South-Central Chile fishery, I fitted an intermediate variant, h1_2.01.1, that keeps the fleet-specific block structure for the other fisheries but restores the annual selectivity-variation treatment for the South-Central Chile fleet used in h1_2.00. This intermediate run converged cleanly (nll = 1364.8, maxgrad = 1.1e-4) and sits between h1_2.00 and h1_2.01 in overall fit. The SSB comparison in Figure 6 shows that restoring that one source of selectivity flexibility pulls the stock trajectory partway back toward h1_2.00, although it does not fully recover the reduced-index baseline.

68. The corresponding selectivity surfaces are shown in Figure 8. This comparison makes the construction of h1_2.01.1 explicit: the North Chile, Far North, and Offshore fleets retain the block-based patterns from h1_2.01, while the South-Central Chile fishery reverts to the more flexible annual treatment. The main visual difference between the two panels is therefore concentrated in that one fishery, which is the intended diagnostic for whether the broader changes in h1_2.01 are being driven primarily by S-C Chile selectivity regularization.

(a) Model h1_2.01
(b) Model h1_2.01.1
Figure 8: Estimates of selectivity by fishery over time for Models h1_2.01 and h1_2.01.1, shown in the same style as Figure 37 of the SC13 Jack Mackerel Technical Annex (SPRFMO Scientific Committee, 2025).

69. The more detailed fit comparison is shown in Table 4. Restoring annual flexibility for the South-Central Chile fishery reduces the total negative log likelihood from 1544.1 in h1_2.01 to 1364.8 in h1_2.01.1, while increasing the parameter count from 480 to 870. The largest direct improvement is in the South-Central Chile fishery age-composition term, which falls from about 307.3 to 139.2, close to the h1_2.00 value of 124.0. Aggregate retained-index fit also improves materially, from about 258.0 to 185.9, although fishery length compositions remain much worse than in h1_2.00. Taken together, the figure and table suggest that a substantial part of the penalty in h1_2.01 comes from over-constraining the S-C Chile selectivity history, but not all of it.

Table 4: Negative-log-likelihood components and estimated parameter counts for the reduced-index sensitivity (h1_2.00), the reduced-index model with fishery selectivity blocks (h1_2.01), and the intermediate selectivity variant (h1_2.01.1) in which the South-Central Chile fishery retains the annual selectivity variability allowance. The aggregate and component index rows include only the retained comparable series: Acoustic North, Chile CPUE, Peru CPUE, and Offshore CPUE.
Component h1_2.00 h1_2.01 h1_2.01.1
Catch likelihood 0.8 0.8 1.1
Fishery age compositions 262.9 549.8 374.8
North Chile fishery age comps 113.2 202.8 199.0
S-C Chile fishery age comps 124.0 307.3 139.2
Offshore trawl fishery age comps 25.7 39.7 36.5
Fishery length compositions 470.2 581.9 567.0
Fishery selectivity penalties 329.7 114.4 196.0
Index likelihood 130.6 258.0 185.9
Chile acoustic North index 73.2 87.7 77.0
Chile CPUE index 17.0 73.5 23.3
Peru CPUE index 29.5 65.3 51.3
Offshore CPUE index 10.9 31.4 34.3
Index age compositions 38.1 37.8 35.8
Index selectivity penalties 6.1 3.9 5.0
Recruitment term −3.9 −2.6 −0.9
Fishing mortality penalty 0.0 0.0 0.0
Index q priors 0.1 0.1 0.1
Residual term 0.1 0.0 0.0
Total negative log likelihood 1,234.7 1,544.1 1,364.8
Estimated parameters (count) 1,821 480 870

70. As a further check, I fitted a second South-Central Chile alternative, h1_2.01.2, which starts from h1_2.01.1 but replaces the annual S-C Chile selectivity history with five blocks separated at 1985, 1998, 2008, and 2016. The intention was to preserve the broad pre-1985 period, the late-1980s to 1990s narrowing of the dome, the late-1990s recruitment-regime shift, the 2008-2015 transitional period, and the distinct post-2016 shape without returning to a fully annual treatment. In practice, this five-block version converged cleanly (nll = 1518.5, maxgrad = 9.6e-5) but performed much more like the original block model than like h1_2.01.1.

71. The resulting spawning biomass trajectories are shown in Figure 12. Relative to h1_2.01.1, the five-block S-C Chile variant moves the stock trajectory sharply upward and is closer to h1_2.01 than to the annual-S-C case. Its terminal spawning biomass reaches about 23.8 kt, compared with about 18.2 kt for h1_2.01.1 and 19.9 kt for h1_2.01.

72. The direct selectivity comparison is shown in Figure 11. As intended, the difference between h1_2.01.1 and h1_2.01.2 is confined to the South-Central Chile fishery, where the annual pattern is replaced by the five broader periods. However, the likelihood comparison in Table 5 shows that this coarser representation gives back much of the fit recovered by h1_2.01.1: the total negative log likelihood rises from 1364.8 to 1518.5, the South-Central Chile fishery age-composition term rises from 139.2 to 285.4, and the retained-index total worsens from 185.9 to 239.6. Parameter count falls from 870 to 490, only slightly above the original h1_2.01 value of 480. This indicates that, for the current data, the proposed five-block S-C Chile structure is still too restrictive to reproduce the main benefits of the annual S-C Chile treatment. A simpler follow-up worth testing later would be the suggested four-block alternative with breaks at 1985, 1998, and 2014.

Figure 9: Observed and fitted index trajectories for h1_2.01, h1_2.01.1, h1_2.01.2, and h1_2.01.3. Panels show the retained comparable index series: South-Central Chile CPUE, Acoustic North, Peruvian CPUE, and Offshore CPUE.

73. The more detailed fit comparison, now including the q-random-walk variant, is shown in Table 5. Relative to h1_2.01, the annual-S-C variant h1_2.01.1 recovers a large part of the lost fit, while the five-block version h1_2.01.2 gives much of that improvement back. The q-random-walk version h1_2.01.3 lowers the retained comparable index contribution further to about 163.9, compared with about 258.0 for h1_2.01, 185.9 for h1_2.01.1, and 239.6 for h1_2.01.2, although it does so by shifting additional flexibility into time-varying catchability rather than selectivity. The same contrast appears in terminal-year spawning biomass uncertainty: the 2026 SSB interval is about 15.1-26.1 kt for h1_2.01, narrows slightly to 14.0-23.6 kt for h1_2.01.1, widens substantially to 16.7-33.7 kt for h1_2.01.2, and shifts downward but remains broad at 11.5-23.0 kt for h1_2.01.3. The resulting retained comparable index fits are shown in Figure 9.

Table 5: Negative-log-likelihood components and estimated parameter counts for the original fishery-block model (h1_2.01), the annual S-C Chile variant (h1_2.01.1), the five-block S-C Chile variant (h1_2.01.2), and the q-random-walk variant (h1_2.01.3). The aggregate and component index rows include only the retained comparable series: Acoustic North, Chile CPUE, Peru CPUE, and Offshore CPUE.
Component h1_2.01 h1_2.01.1 h1_2.01.2 h1_2.01.3
Catch likelihood 0.8 1.1 0.6 0.6
Fishery age compositions 549.8 374.8 528.4 521.9
North Chile fishery age comps 202.8 199.0 202.7 196.4
S-C Chile fishery age comps 307.3 139.2 285.4 289.2
Offshore trawl fishery age comps 39.7 36.5 40.3 36.3
Fishery length compositions 581.9 567.0 586.2 574.0
Fishery selectivity penalties 114.4 196.0 124.5 103.9
Index likelihood 258.0 185.9 239.6 163.9
Chile acoustic North index 87.7 77.0 85.8 69.2
Chile CPUE index 73.5 23.3 67.5 38.5
Peru CPUE index 65.3 51.3 60.4 36.3
Offshore CPUE index 31.4 34.3 26.0 19.9
Index age compositions 37.8 35.8 37.4 36.9
Index selectivity penalties 3.9 5.0 4.7 3.2
Recruitment term −2.6 −0.9 −3.1 −5.1
Index q priors 0.1 0.1 0.1 51.5
Residual term 0.0 0.0 0.0 0.0
Total negative log likelihood 1,544.1 1,364.8 1,518.5 1,450.8
Estimated parameters (count) 480 870 490 575

74. The fitted catchability paths for h1_2.01.3 are shown in Figure 10. To place the four indices on a comparable scale, each q series is divided by its own mean over the common shortest fitted window, 2009-2024, which is the full random-walk span for Offshore CPUE. On that normalized scale, the figure emphasizes the relative temporal pattern rather than the absolute q level and shows that the strongest drift is concentrated in the retained index series where the fit gains were largest.

Figure 10: Time trend in fitted catchability for the retained comparable indices in h1_2.01.3, rescaled so each series has mean 1 over 2009-2024, the shortest fitted random-walk window shared across the four series.
(a) Model h1_2.01.1
(b) Model h1_2.01.2
Figure 11: Estimates of selectivity by fishery over time for Models h1_2.01.1 and h1_2.01.2, shown in the same style as Figure 37 of the SC13 Jack Mackerel Technical Annex (SPRFMO Scientific Committee, 2025).
Figure 12: Comparison of spawning biomass trajectories for the original fishery-block model (h1_2.01), the annual S-C Chile variant (h1_2.01.1), and the five-block S-C Chile variant (h1_2.01.2).

6.3 Harmonized Northern Treatment

75. A fourth option is to harmonize how the Far North observations are treated across the single-stock and two-stock hypotheses. The objective would be to ensure that future comparisons of one-stock and two-stock MSE operating models are driven primarily by stock structure rather than by unrelated differences in selectivity regularization.

76. As documented in Section 4, the current difference between h1 and h2 is not purely about stock allocation. It also reflects a very different degree of flexibility in Far North fleet selectivity: h1_1.14 allows annual variation from 1981, while h2_1.14 uses a much simpler two-regime structure centred on 2002. This confound makes it difficult to attribute differences in MSE outcomes between h1 and h2 to stock structure as opposed to selectivity specification.

77. Additional input from Peru suggests that the 2002 change should be viewed as the start of a transition in Far North selectivity rather than as a single instantaneous break. A plausible mechanism is the regulatory shift toward direct human consumption (DHC), associated with changes in fleet composition, freezing capacity, mesh size, fishing areas, and school-selection practice. Under that interpretation, 2002-2013 is a transition period with more variable size modes, while 2014-2025 shows a clearer shift toward larger sizes, with a stronger increase from about 2018 onward.

NoteNote (I. Mosqueira)

78. The harmonization objective should be to align the selectivity structure of the Far North fleet across h1 and h2, while retaining the biological differences that distinguish the two-stock hypothesis. In practice, this means that differences in natural mortality, growth, maturity, and weight-at-age should be preserved in h2 as currently specified. What should be harmonized is the structural form and degree of flexibility allowed in Far North fleet selectivity, so that a comparison of h1 and h2 MSE outcomes reflects stock structure rather than selectivity regularization.

79. One additional area that has been discussed but not yet implemented is explicit uncertainty in northern stock biology, particularly growth. The two-stock hypothesis implies different population dynamics for the northern component, but the current implementation draws on growth parameters estimated from combined data without formal uncertainty in the stock-specific parameters. Uncertainty in northern growth or natural mortality could be incorporated as candidate robustness scenarios in later MSE phases if data become available to support alternative parameterizations.

80. Taken together, this supports allowing Far North selectivity to vary over time as a more realistic representation than a strict two-regime split centred on 2002 (as in h2_1.14). For MSE implementation, a block-based treatment is a practical compromise between realism and tractability. A candidate structure is pre-2002, 2002-2013, and 2014-2025, with an optional additional split near 2018 if diagnostics support it.

81. To examine that issue more directly, I fitted two harmonized variants using the reduced-index data configuration. Model h1_2.02 is h1_2.00 with the Far North fishery selectivity simplified to the same 2002 split used in h2_1.14, while h2_2.02 is h2_1.14 rebuilt on the reduced-index 2.00.dat data set and with the same Far North selectivity treatment. The resulting comparison is assessed using likelihood components (Table 6), retained-index fits (Figure 13), and system-level spawning biomass and recruitment trajectories (Figure 14; Figure 15).

Table 6: Negative-log-likelihood components and estimated parameter counts for the reduced-index single-stock model (h1_2.00), the harmonized single-stock variant (h1_2.02), and the harmonized two-stock reduced-index variant (h2_2.02). The aggregate and component index rows include only the retained comparable series: Acoustic North, Chile CPUE, Peru CPUE, and Offshore CPUE.
Component h1_2.00 h1_2.02 h2_2.02
Catch likelihood 0.8 1.4 1.0
Fishery age compositions 262.9 279.4 246.2
North Chile fishery age comps 113.2 113.8 110.9
S-C Chile fishery age comps 124.0 139.1 110.1
Offshore trawl fishery age comps 25.7 26.5 25.1
Fishery length compositions 470.2 608.9 459.0
Fishery selectivity penalties 329.7 211.5 201.4
Index likelihood 130.6 155.2 134.1
Chile acoustic North index 73.2 72.1 88.0
Chile CPUE index 17.0 20.4 16.6
Peru CPUE index 29.5 50.8 19.2
Offshore CPUE index 10.9 11.9 10.3
Index age compositions 38.1 35.9 60.0
Index selectivity penalties 6.1 5.3 1.2
Recruitment term −3.9 −4.5 8.3
Fishing mortality penalty 0.0 0.0 0.1
Index q priors 0.1 0.1 0.0
Residual term 0.1 0.0 0.2
Total negative log likelihood 1,234.7 1,293.1 1,111.4
Estimated parameters (count) 1,821 1,513 1,604

82. The retained comparable index fits for these harmonized models are shown in Figure 13. Relative to h1_2.00, the single-stock harmonized variant h1_2.02 changes little in the South-Central Chile CPUE, Peru CPUE, and Offshore CPUE panels, while the two-stock harmonized variant h2_2.02 differs more clearly in the Acoustic North fit and in parts of the Chile CPUE trajectory.

Figure 13: Observed and fitted index trajectories for h1_2.00, h1_2.02, and h2_2.02. Panels show the retained comparable index series: South-Central Chile CPUE, Acoustic North, Peruvian CPUE, and Offshore CPUE.

83. To complement the likelihood and retained-index diagnostics, Figure 14 and Figure 15 compare the fitted system-level spawning biomass and recruitment trajectories across the same three models. For h2_2.02, the plotted values are the sum across the two modeled stocks so that the trajectories are directly comparable with the single-stock runs.

Figure 14: System-level spawning biomass trajectories for h1_2.00, h1_2.02, and h2_2.02. For h2_2.02, values are the sum of stock-1 and stock-2 spawning biomass.
Figure 15: System-level recruitment trajectories for h1_2.00, h1_2.02, and h2_2.02. For h2_2.02, values are the sum of stock-1 and stock-2 recruitment.

6.4 Simplified Recruitment Alternatives

84. Another option is to reduce recruitment complexity by using one recruitment relationship per stock hypothesis, or one per stock under the two-stock model, without additional regime splitting unless that feature is shown to be essential for management performance. The preferred approach is therefore to fit a single Beverton-Holt relationship per stock in the simplified operating model, and then treat regime differences as a robustness test rather than a primary structural alternative.

NoteNote (I. Mosqueira)

85. Under h2_1.14, the northern component already includes a regime shift at 1999, and this structural feature should be preserved in that operating model to reflect the interpretation embedded in the current assessment. For the single-stock hypothesis, no formal regime splitting is assumed. Evaluating MSE performance with and without regime structure — by comparing h1 and h2 operating models and by running a version of h1 with a productivity regime break — would clarify how sensitive management procedures are to assumptions about long-run productivity.

86. An important diagnostic concern is that some recent recruitments already exceed the upper envelope of the fitted stock-recruitment curve. This suggests that the fitted Beverton-Holt relationship may be conservative relative to actual recruitment productivity in recent years. Future work should consider whether alternative recruitment distributions, higher ceiling values for recruitment deviations, or a revisited steepness estimate are warranted. Projecting with a recruitment distribution that truncates the upper tail of realized recruitments would be systematically pessimistic about stock productivity, which would in turn bias MSE performance comparisons toward overly cautious management conclusions.

87. To illustrate this simplification directly, I fitted h2_2.03 as a variant of h2_2.02 in which the second stock uses a single stock-recruitment regime spanning the combined years that were previously split across the two stock-2 regimes. The simplified model converged cleanly, with a slightly worse objective function than h2_2.02 (1117.2 versus 1111.4). The fitted stock-recruitment curves for stock 2 are compared in Figure 16.

Figure 16: Stock-2 Beverton-Holt stock-recruitment curves for h2_2.02 and h2_2.03. The left panel shows the two stock-2 recruitment regimes retained in h2_2.02; the right panel shows the single stock-2 regime used in h2_2.03.

88. As in h2_1.14, model h2_2.02 uses two stock-2 recruitment-regime fitting windows for estimating the stock-recruitment relationships, but it intentionally omits some hatch years from those regime vectors. For h2_2.02, the fitting windows are SR_Curve_years_1 = 1970:1996 and SR_Curve_years_2 = 2001:2016, so hatch years 1997-2000 are excluded from the stock-2 SR fitting windows. In principle a year label can be absent either because it is not assigned to a stock-2 SR regime or because label overlap is suppressed, but Figure 16 now plots all years explicitly. The years omitted from regime fitting are shown with yellow label backgrounds, so labels such as 98 now appear as excluded years rather than seeming to be missing by accident.

89. The corresponding stock-2 spawning biomass trajectories are compared in Figure 17. The single-regime formulation leaves the broad stock-2 biomass history similar to h2_2.02, but with a modest upward shift over much of the series. The corresponding stock-2 recruitment estimates are compared in Figure 18. The single-regime formulation changes the fitted recruitment trajectory modestly, with the largest differences occurring in the early years and around some of the higher-recruitment episodes.

Figure 17: Estimated stock-2 spawning biomass trajectories for h2_2.02 and h2_2.03.
Figure 18: Estimated stock-2 recruitment trajectories for h2_2.02 and h2_2.03.

6.5 Fixed-Reference-Point Alternatives

90. An issue also arose regarding fixed biomass and harvest-rate reference points
rather than dynamic reference points estimated from terminal-year conditions. This would make the interpretation of management performance much clearer while still allowing later comparison back to the SC13 assessment convention.

91. The dynamic reference point system in the JJM — where B_{MSY} and F_{MSY} shift with terminal-year selectivity and weight-at-age — is appropriate in the annual assessment context but problematic for MSE. If reference points change every year as a function of the operating model state, it becomes difficult to distinguish genuine changes in stock productivity from artefacts of the simulation design.

92. As an additional diagnostic of selectivity dynamics, the fishery-combined mean selected age can be tracked through time: \bar{a}_t = \frac{\sum_a F_{a,t} a}{\sum_a F_{a,t}}, where F_{a,t} is total fishing mortality at age in year t. The resulting patterns for the selectivity alternatives h1_2.00, h1_2.01, h1_2.01.1, and h1_2.01.2 are shown in Figure 19.

Figure 19: Time trend in fishery-combined mean selected age for h1_2.00, h1_2.01, h1_2.01.1, and h1_2.01.2, computed as \bar{a}_t = \sum_a F_{a,t} a / \sum_a F_{a,t}.

93. The trajectories indicate that h1_2.00 has the highest and strongest increase in mean selected age, h1_2.01 and h1_2.01.2 remain lower and track closely, and h1_2.01.1 is intermediate, recovering part of the increase relative to h1_2.01.

94. The joint pattern between mean selected age and dynamic F_{MSY} for these same models is shown in Figure 20. These phase-space style trajectories help make clear how time-varying fishery selectivity can directly influence recent-year F_{MSY} estimates, rather than reflecting only changes in underlying stock productivity. For MSE-oriented reference-point workflows, this also supports using period means (for example, recent multi-year averages of selectivity and related schedules) as a practical way to stabilize reference-point calculations.

Figure 20: Relationship between mean selected age (x-axis) and dynamic F_{MSY} (y-axis) for h1_2.00, h1_2.01, h1_2.01.1, and h1_2.01.2. Points are colored by year and labeled with year.
NoteNote (I. Mosqueira)

95. If the simplified selectivity alternative adopts block-based selectivity, then reference points should probably be calculated using the selectivity pattern from the final block period for all future projections. This is internally consistent: the operating model should use the same selectivity for calculating F_{MSY} and B_{MSY} as it uses for projecting future mortality. Using the last block’s selectivity as the reference configuration avoids the problem of reference points shifting mid-simulation as selectivity changes, and it is analogous to the fixed-reference-point convention already applied in several RFMO MSE programmes.

96. The recommended approach is therefore to calculate reference points once, using the terminal selectivity block and the current biological schedules, and use those fixed values as management benchmarks throughout the MSE simulation. Sensitivity to the choice of reference period can be examined as part of robustness testing.

7 Summary

97. The evaluations presented here are intended as discussion items leading into the benchmark workshop, but some clearer conclusions now emerge from the fitted alternatives:

  1. The reduced-index simplification is a defensible starting point. Relative to h1_1.14, model h1_2.00 stays close to the SC13 assessment picture while removing the discontinued or less comparable index series. For the retained comparable indices, this reduction generally improves or at least preserves fit, with Acoustic North remaining the main unresolved tension.
  2. A broad fishery-block simplification is too restrictive if applied everywhere at once. Model h1_2.01 worsens the fit across multiple observation types simultaneously, including retained indices, fishery length compositions, and especially the South-Central Chile and North Chile age compositions. That is a stronger degradation than would be desirable for a primary operating-model simplification.
  3. The South-Central Chile fishery is the main driver of that deterioration. Restoring annual selectivity flexibility only for that fleet in h1_2.01.1 recovers much of the lost fit and pulls the spawning biomass trajectory back toward h1_2.00. By contrast, the five-block alternative h1_2.01.2 remains too restrictive: it gives back much of the gain in fit, behaves more like h1_2.01 than h1_2.01.1, and leaves broader terminal spawning-biomass uncertainty.
  4. Allowing time-varying index catchability can recover retained-index fit, but it changes where the flexibility sits. Model h1_2.01.3 gives the best retained comparable index fit among the h1_2.01-based variants and shows that part of the tension can be absorbed by drifting q. However, it does so by moving flexibility from selectivity into time-varying catchability, with a lower terminal spawning-biomass distribution than the other S-C Chile variants. That makes h1_2.01.3 more persuasive as a robustness case than as a primary operating-model simplification.
  5. Far North selectivity harmonization is worthwhile before comparing h1 and h2, and should be judged on dynamics as well as fit. The harmonized single-stock model h1_2.02 stays broadly close to h1_2.00, while h2_2.02 provides a cleaner two-stock comparator because it uses the same reduced-index data treatment and aligned Far North selectivity structure. The h1_2.02 versus h2_2.02 contrast should therefore be interpreted using likelihoods together with spawning biomass and recruitment trajectories, not likelihoods alone.
  6. Simplifying the stock-2 recruitment structure in h2 appears feasible, but it changes both fit and interpretation. Model h2_2.03 converges cleanly and only modestly worsens the objective function relative to h2_2.02, so a single stock-2 relationship looks workable. At the same time, the stock-recruitment comparison now makes clearer that h2_1.14 and h2_2.02 estimated stock-2 relationships using restricted hatch-year windows and omitted hatch years 1997-2000, whereas h2_2.03 removes that gap by fitting one relationship across the combined period. The choice is therefore not only about the number of regimes; it also changes which years are allowed to inform the fitted stock-2 stock-recruitment relationship.
  7. The strongest current operating-model candidates are now better defined. h1_2.00 remains a sensible reduced-data baseline, h1_2.01.1 is the strongest single-stock selectivity simplification tested so far, and h1_2.02 versus h2_2.02 provide the most interpretable harmonized single-stock versus two-stock comparison. h1_2.01.3 is valuable, but primarily as a robustness test for index-standardization and q-variability assumptions.
  8. The mean-selected-age versus F_{MSY} diagnostic adds a clear selectivity signal. The phase-space comparison across h1_2.00, h1_2.01, h1_2.01.1, and h1_2.01.2 shows that shifts in fishery selectivity can move recent-year dynamic F_{MSY} estimates directly, so apparent reference-point changes should not be interpreted as productivity changes alone.
  9. Reference-point stabilization should be treated as a core design choice for MSE. Projection-period selectivity uncertainty should still be explored using averaging-window alternatives, but reference points are likely to be more stable and interpretable if calculated from period means (multi-year averaged selectivity and related schedules) and then fixed for simulation. Stock-2 recruitment structure should be treated as an explicit robustness axis, and northern biology uncertainty and index-standardization choices remain natural later-phase candidates.

98. Taken together, these results suggest that the next stage should focus less on broad additional simplification and more on selecting a compact set of well-motivated operating models and robustness scenarios. The current evidence points toward carrying forward a reduced-index baseline, a single-stock model that preserves South-Central Chile selectivity flexibility, a harmonized two-stock comparator, and targeted robustness cases for q variability, projection selectivity, and alternative stock-2 recruitment structure.

Appendix: Compact Equation Summary

This appendix gives a compact schematic description of the equations underlying the SC13 Joint Jack Mackerel Model. It is intended to summarize the structure used in this working paper rather than to reproduce every implementation detail of the ADMB code or every option described in the technical annex and user guide.

7.1 Notation

Let s index stock, f fishery fleet, i abundance index, a age, l length bin, and y year. Let A denote the plus group age. The main state variable is numbers at age, N_{s,a,y}, at the start of year y.

The biological and fishery quantities used below are:

  1. M_{s,a}: natural mortality;
  2. S^{F}_{f,s,a,y}: fishery selectivity;
  3. S^{I}_{i,s,a,y}: survey or index selectivity;
  4. w^{\mathrm{pop}}_{s,a,y}, w^{\mathrm{catch}}_{f,s,a,y}, and w^{I}_{i,s,a,y}: population, catch, and index weights at age;
  5. m_{s,a}: maturity at age;
  6. q_i: catchability for abundance index i.

7.2 Population Update

For each stock, total mortality at age is

Z_{s,a,y} = M_{s,a} + \sum_f F_{f,s,a,y}.

Recruitment enters age 1:

N_{s,1,y} = R_{s,y}.

For ages a = 1, \ldots, A-2, cohort survival follows

N_{s,a+1,y+1} = N_{s,a,y}\exp(-Z_{s,a,y}).

The plus group is updated as

\begin{aligned} N_{s,A,y+1} &= N_{s,A-1,y}\exp(-Z_{s,A-1,y}) \\ &\quad + N_{s,A,y}\exp(-Z_{s,A,y}). \end{aligned}

Spawning biomass is calculated from mature biomass at the spawning time within the year:

SSB_{s,y} = \sum_a N_{s,a,y}\exp(-\tau Z_{s,a,y}) \, m_{s,a} \, w^{\mathrm{pop}}_{s,a,y}.

where \tau is the fraction of annual mortality occurring before spawning.

7.3 Stock-Recruitment Relationship

The annex states that the JJM uses a stochastic Beverton-Holt relationship. In compact form,

R_{s,y} = \frac{4h_s R_{0,s} SSB_{s,y-\delta}} {B_{0,s}(1-h_s) + SSB_{s,y-\delta}(5h_s - 1)} \exp\left(\varepsilon_{s,y} - \frac{\sigma_{R,s}^2}{2}\right).

where h_s is steepness, R_{0,s} is unfished recruitment, B_{0,s} is unfished spawning biomass, \delta is the lag from spawning to recruitment, and \varepsilon_{s,y} is the annual recruitment deviation.

Under h1_1.14, this relationship is applied to a single stock with one recruitment regime. Under h2_1.14, recruitment is assigned across two stocks with a more complex regime structure, including the 1999 shift described in the control file.

7.4 Fishing Mortality and Catch

The annex describes fishing mortality as the product of fleet-specific selectivity and an annual fishing-intensity term. A compact representation is

F_{f,s,a,y} = f_{f,y} S^{F}_{f,s,a,y}.

where f_{f,y} is the year-specific fleet intensity after incorporating the effort-to-mortality scaling and annual deviations used in the fitted model.

Predicted catch in numbers at age follows the Baranov catch equation:

C^{N}_{f,s,a,y} = N_{s,a,y} \, \frac{F_{f,s,a,y}}{Z_{s,a,y}} \, \left(1-\exp(-Z_{s,a,y})\right).

Predicted catch in biomass for fleet f is then

C^{B}_{f,y} = \sum_s \sum_a C^{N}_{f,s,a,y} \, w^{\mathrm{catch}}_{f,s,a,y}.

7.5 Abundance Indices

For abundance index i, the predicted index is a catchability-scaled vulnerable biomass or abundance quantity:

\begin{aligned} I^{\mathrm{pred}}_{i,y} &= q_i \sum_s \sum_a N_{s,a,y}\exp(-\lambda_i Z_{s,a,y}) \\ &\quad \times S^{I}_{i,s,a,y} \, w^{I}_{i,s,a,y}. \end{aligned}

where \lambda_i places the survey or CPUE observation within the year. This expression is schematic because some indices are closer to numbers-at-age and others to biomass, but it captures the model logic used across the SC13 index components.

7.6 Age and Length Compositions

Predicted fishery or survey age composition is obtained by normalizing predicted numbers-at-age in the relevant observation:

p^{\mathrm{age}}_{g,a,y} = \frac{n^{\mathrm{pred}}_{g,a,y}} {\sum_a n^{\mathrm{pred}}_{g,a,y}}.

where g denotes the fleet or survey being sampled.

For the Far North fleet, predicted age compositions are converted to predicted length compositions through the growth model:

p^{\mathrm{len}}_{f,l,y} = \sum_a p^{\mathrm{age}}_{f,a,y} \, g_{f,l \mid a}.

where g_{f,l \mid a} is the probability of a fish of age a being observed in length bin l, derived from the von Bertalanffy growth specification used by the model.

7.7 Observation Likelihoods

The annex indicates log-normal likelihoods for catches and abundance indices, and multinomial likelihoods for age and length compositions.

For catches and indices, the negative log-likelihood contribution is written schematically as

\ell_{\mathrm{LN}} = \sum_y \frac{\left[\log(O_y)-\log(P_y)\right]^2}{2\sigma_y^2}.

where O_y and P_y are observed and predicted values.

For age or length compositions, with effective sample size n_{g,y},

\ell_{\mathrm{mult}} = -\sum_y n_{g,y} \sum_k o_{g,k,y}\log\left(p_{g,k,y}\right).

where k indexes age or length bins, o_{g,k,y} is the observed proportion, and p_{g,k,y} is the predicted proportion.

The SC13 model uses Francis weighting for age-composition effective sample sizes, so the composition likelihood is strongly affected by those externally specified weights.

7.8 Objective Function

The full estimation criterion combines data likelihoods, priors, and smoothing penalties:

\mathcal{L} = \ell_{\mathrm{catch}} + \ell_{\mathrm{index}} + \ell_{\mathrm{age}} + \ell_{\mathrm{len}} + \ell_{\mathrm{priors}} + \ell_{\mathrm{sel}}.

The final term, \ell_{\mathrm{sel}}, is especially important in SC13 because the model includes extensive penalties on fishery and survey selectivity trajectories to regularize the time-varying non-parametric selectivity schedules.

7.9 Dynamic Reference Points

The annex states that the JJM calculates equilibrium reference points internally using terminal-year catch shares, selectivity-at-age, and weights-at-age. In compact form,

F_{\mathrm{MSY},y} = \operatorname*{arg\,max}_{F} C^{\mathrm{eq}}_y(F).

MSY_y = C^{\mathrm{eq}}_y(F_{\mathrm{MSY},y}).

B_{\mathrm{MSY},y} = B^{\mathrm{eq}}_y(F_{\mathrm{MSY},y}).

where the equilibrium yield and biomass functions are conditional on the terminal-year fishery mix and biological schedules. This is why F_{MSY} MSY, and F_{MSY} vary through time in the SC13 assessment rather than remaining fixed constants.

References

Beverton, R. J. H., & Holt, S. J. (1957). On the dynamics of exploited fish populations (Vol. 19, p. 533). H. M. Stationery Office.
Butterworth, D. S., & Punt, A. E. (1999). Experiences in the evaluation and implementation of management procedures. ICES Journal of Marine Science, 56(6), 985–998. https://doi.org/10.1006/jmsc.1999.0532
Carruthers, T. R. (2024). An updated roadmap for MSE development. Collective Volume of Scientific Papers ICCAT, 81(2), 1–14. https://www.iccat.int/Documents/CVSP/CV081_2024/n_2/CV08102149.pdf
Fournier, D. A., Skaug, H. J., Ancheta, J., Ianelli, J., Magnusson, A., Maunder, M. N., Nielsen, A., & Sibert, J. (2012). AD model builder: Using automatic differentiation for statistical inference of highly parameterized complex nonlinear models. Optimization Methods and Software, 27(2), 233–249. https://doi.org/10.1080/10556788.2011.597854
Fournier, D., & Archibald, C. P. (1982). A general theory for analyzing catch at age data. Canadian Journal of Fisheries and Aquatic Sciences, 39(9), 1195–1207. https://doi.org/10.1139/f82-157
Francis, R. I. C. C. (2011). Data weighting in statistical fisheries stock assessment models. Canadian Journal of Fisheries and Aquatic Sciences, 68(7), 1124–1138. https://doi.org/10.1139/f2011-025
Hilborn, R., & Walters, C. J. (1992). Quantitative fisheries stock assessment: Choice, dynamics and uncertainty. Springer. https://doi.org/10.1007/978-1-4615-3598-0
Linton, B. C., & Bence, J. R. (2011). Catch-at-age assessment in the face of time-varying selectivity. ICES Journal of Marine Science, 68(3), 611–625. https://doi.org/10.1093/icesjms/fsq173
Punt, A. E., Butterworth, D. S., Moor, C. L. de, Oliveira, J. A. A. de, & Haddon, M. (2016). Management strategy evaluation: Best practices. Fish and Fisheries, 17(2), 303–334. https://doi.org/10.1111/faf.12104
Rousseau, Y., Watson, R. A., Blanchard, J. L., & Fulton, E. A. (2019). Evolution of global marine fishing fleets and the response of fished resources. Proceedings of the National Academy of Sciences of the United States of America, 116(25), 12238–12243. https://doi.org/10.1073/pnas.1820344116
Schnute, J. T., & Richards, L. J. (1995). The influence of error on population estimates from catch-age models. Canadian Journal of Fisheries and Aquatic Sciences, 52(10), 2063–2077. https://doi.org/10.1139/f95-800
Siple, M. C., Koehn, L. E., Johnson, K. F., Punt, A. E., Canales, T. M., Carpi, P., Moor, C. L. de, Oliveira, J. A. A. de, Gao, J., Jacobsen, N. S., Lam, M. E., Licandeo, R., Lindegren, M., Ma, S., Óskarsson, G. J., Sanchez-Maroño, S., Smoliński, S., Surma, S., Tian, Y., … Zimmermann, F. (2021). Considerations for management strategy evaluation for small pelagic fishes. Fish and Fisheries, 22(6), 1167–1186. https://doi.org/10.1111/faf.12579
SPRFMO Jack Mackerel Working Group. (2025). Report of the JMWG management strategy evaluation technical workshop (SCW15) (SC13-JM09). South Pacific Regional Fisheries Management Organisation (SPRFMO). https://www.sprfmo.int/assets/Meetings/02-SC/13th-SC-2025/Jack-Mackerel/SC13-JM09-Report-of-the-JMWG-Management-Strategy-Evaluation-Technical-Workshop.pdf
SPRFMO Scientific Committee. (2025). Annex 11: Jack mackerel technical annex. South Pacific Regional Fisheries Management Organisation (SPRFMO). https://sprfmo.int/assets/Meetings/02-SC/13th-SC-2025/SC13-Report-Annex-11_final.pdf