The BROKE-West fish dataset (Australian Antarctic Data Centre 2022) originates from the BROKE-West voyage, a survey conducted in a data-poor area of the Southern Ocean in 2006. This dataset forms part of the baseline biodiversity assessment of the Cosmonaut and Cooperation Seas.<br> The dataset was initially published in 2008 as a single flat Darwin Core Archive (DwC-A) occurrence table without extensions. Although the original dataset contained valuable occurrence data, it omitted much of the sampling context, including survey design, target scope, and material relationships. In 2016, the dataset was reused to test the extended MeasurementOrFact (eMoF) extension developed by Ocean Biodiversity Information System (OBIS) (De Pooter et al. 2017), which enabled the inclusion of environmental and biological measurements (Van de Putte 2016). In 2023, the Humboldt Ecological Inventory extension (TDWG Humboldt Extension Task Group and Biodiversity Information Standards (TDWG) 2024) was applied to describe the survey design and scope, linking each trawl to its survey targets and sampling context (Van de Putte et al. 2025).<br> The presentation from which this extended abstract derives (Gan et al. 2025) describes the limitations of DwC-A for this dataset and how DwC-DP overcomes these.<br> While DwC-A remains useful, several concepts cannot be well represented in this format.<br> <br> <br> <br> Marine datasets often contain both community-level measurements (e.g., abundance or total weight of an occurrence) and individual-level measurements (e.g., the standard length of a single fish from that same occurrence). The subsetting relationship is not explicit in DwC-A.<br> <br> <br> <br> Survey context and survey target are flattened into a single Humboldt Extension table. If each survey target combination is recorded as a separate row, survey context information, such as<br> eco:isTaxonomicScopeFullyReported<br> of a<br> dwc:Event<br> , must be repeated for each target.<br> <br> <br> <br> <br> The use of identifiers can be confusing: because the Occurrence extension requires a unique<br> dwc:occurrenceID<br> per row, the sample identifier representing a dwc:MaterialEntity must be used as dwc:occurrenceID. This leads to properties of the material (e.g., dry weight) being linked via eMoF to an Occurrence instead of to the material itself.<br> <br> <br> <br> <br> Although photographs can be linked using<br> dwc:associatedMedia<br> , this approach omits key media metadata, such as the license.<br> <br> <br> <br> Prey items found in the stomach contents of fish from an occurrence are difficult to model in DwC-A.<br> <br> <br> <br> Marine datasets often contain both community-level measurements (e.g., abundance or total weight of an occurrence) and individual-level measurements (e.g., the standard length of a single fish from that same occurrence). The subsetting relationship is not explicit in DwC-A.<br> <br> Survey context and survey target are flattened into a single Humboldt Extension table. If each survey target combination is recorded as a separate row, survey context information, such as<br> eco:isTaxonomicScopeFullyReported<br> of a<br> dwc:Event<br> , must be repeated for each target.<br> <br> <br> The use of identifiers can be confusing: because the Occurrence extension requires a unique<br> dwc:occurrenceID<br> per row, the sample identifier representing a dwc:MaterialEntity must be used as dwc:occurrenceID. This leads to properties of the material (e.g., dry weight) being linked via eMoF to an Occurrence instead of to the material itself.<br> <br> <br> Although photographs can be linked using<br> dwc:associatedMedia<br> , this approach omits key media metadata, such as the license.<br> <br> Prey items found in the stomach contents of fish from an occurrence are difficult to model in DwC-A.<br> <br> Today, the dataset has been fully restructured (Gan and Van de Putte 2025) using the proposed Darwin Core Data Package (DwC-DP) (Darwin Core Maintenance Group 2025). DwC-DP replaces the single-core architecture with a linked-table model that overcomes the limitations of the star schema. This new structure allows an explicit representation of<br> Occurrence<br> and<br> Material<br> , preserving the meaning of<br> dwc:occurrenceID<br> and<br> dwc:materialEntityID<br> . It also supports explicit relationships such as<br> isPartOf<br> and<br> derivedFrom<br> . For example, a prey item with<br> dwc:materialEntityID<br> “AAV3FF_00337_stomach_002_Euphausiid” is<br> dwc:derivedFromMaterialEntityID<br> “AAV3FF_00337” with<br> dwc:derivationType<br> “stomach content of”.<br> <br> This restructuring has revealed the full analytical potential of legacy datasets. The BROKE-West case demonstrates how DwC-DP can represent data that were previously unpublished or too complex to format in DwC-A, such as material-to-material derivations and media linked to materials.<br> The following three examples highlight DwC-DP’s analytical flexibility.<br> <br> <br> <br> Non-detections for an event can be inferred by comparing occurrences to survey targets. If all occurrences of a survey target are fully reported, an event without that survey target indicates a non-detection.<br> <br> <br> <br> Fish length-frequency distributions can be computed directly by linking<br> Material Assertion<br> to<br> Material<br> and then to<br> Event<br> , aligning with the Essential Ocean Variable (EOV) sub-variable Fish Abundance and Distribution (Global Ocean Observing System 2025).<br> <br> <br> <br> Diet composition can be represented through material relationships between predator stomachs and prey taxa — an analysis previously impossible in DwC-A because stomach-content data were unpublished. An OrganismInteraction table was not used since many prey items were unidentifiable or digested and their ingestion time unknown, making the creation of corresponding prey Occurrences difficult. These examples underline how DwC-DP enhances both data representation and the interpretability and reusability of historical datasets.<br> <br> <br> <br> Non-detections for an event can be inferred by comparing occurrences to survey targets. If all occurrences of a survey target are fully reported, an event without that survey target indicates a non-detection.<br> <br> Fish length-frequency distributions can be computed directly by linking<br> Material Assertion<br> to<br> Material<br> and then to<br> Event<br> , aligning with the Essential Ocean Variable (EOV) sub-variable Fish Abundance and Distribution (Global Ocean Observing System 2025).<br> <br> Diet composition can be represented through material relationships between predator stomachs and prey taxa — an analysis previously impossible in DwC-A because stomach-content data were unpublished. An OrganismInteraction table was not used since many prey items were unidentifiable or digested and their ingestion time unknown, making the creation of corresponding prey Occurrences difficult. These examples underline how DwC-DP enhances both data representation and the interpretability and reusability of historical datasets.<br> Adopting DwC-DP brings major benefits but also challenges. It better preserves data meaning, supports richer links between entities, and suits complex marine survey datasets. However, it requires more identifiers, more relational workflows, and careful curation to keep identifiers stable over time.<br> DwC-DP is a major shift toward a relational biodiversity data model that accommodates complex ecological surveys. The BROKE-West case study shows how DwC-DP enriches flat datasets and more accurately represents survey complexity.