Clemento et al.: Evaluation of a single nucleotide polymorphism baseline for genetic stock identification of Oncorhynchus tshciwytscha 123 
o 
J3 
-d 
Cl 
O 
CL 
TJ 
0 ) 
TO 
E 
True proportion 
True proportion 
True proportion 
Figure 2 
Estimates of mixing proportions from cross-validation over gene copies (CV-GC) and K-fold sim- 
ulations for the 9 most abundant reporting units of Chinook Salmon (Oncorhynchus tshawyts- 
cha) encountered in California fisheries: Central Valley (A) spring, (B) fall, and (C) winter; (D) 
California Coast; (E) Klamath River; (F) North California/South Oregon Coast; (G) Rogue River; 
and (H) Mid Oregon Coast; and (I) North Oregon Coast. The x-axis gives the true proportion 
of fish from each reporting unit, and the y-axis gives the estimated proportion. The dashed 
line is the y=x line. Gray shaded regions give the range between the 5% and 95% quantiles of 
estimates that would be achieved with perfect assignment of fish to a reporting unit (i.e., they 
represent the uncertainty due to the fact that fishery proportions are estimated with a finite 
sample; in our simulations, a sample of 200 fish). The 5% and 95% quantiles of the estimates 
derived from the CV-GC and the K-fold replicates are shown with vertical line segments and 
open diamonds, respectively. Reporting units for which these bars and diamonds coincide with 
the gray region had estimated proportions as accurate as one would expect given unambiguous 
identification of fish to reporting unit. Filled circles and open triangles indicate the mean over 
20,000 CV-GC and 1000 K-fold replicates, respectively. These points fall along the dotted line 
when the estimator is unbiased. 
