<?xml version='1.0' encoding='utf-8'?>
<?xml-stylesheet type="text/xsl" href="/v2/static/oai2.xsl"?>
<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd">
  <responseDate>2026-05-16T04:25:16Z</responseDate>
  <request identifier="oai:figshare.com:article/31833526" metadataPrefix="oai_datacite" verb="GetRecord">https://api.figshare.com/v2/oai</request>
  <GetRecord>
    <record>
      <header>
        <identifier>oai:figshare.com:article/31833526</identifier>
        <datestamp>2026-04-28T17:05:21Z</datestamp>
        <setSpec>category_24190</setSpec>
        <setSpec>category_24238</setSpec>
        <setSpec>category_24274</setSpec>
        <setSpec>portal_549</setSpec>
        <setSpec>item_type_3</setSpec>
        <setSpec>month_year_04_2026</setSpec>
      </header>
      <metadata>
        <resource xmlns="http://datacite.org/schema/kernel-4" xsi:schemaLocation="http://datacite.org/schema/kernel-4 http://schema.datacite.org/meta/kernel-4.3/metadata.xsd">
          <identifier identifierType="DOI">10.5522/04/31833526.v1</identifier>
          <alternateIdentifiers>
            <alternateIdentifier alternateIdentifierType="URL">https://figshare.com/articles/dataset/Functional_group_classification_using_consensus_clustering/31833526</alternateIdentifier>
          </alternateIdentifiers>
          <relatedIdentifiers>
            <relatedIdentifier relatedIdentifierType="URL" relationType="HasPart">https://ndownloader.figshare.com/files/63014614</relatedIdentifier>
            <relatedIdentifier relatedIdentifierType="URL" relationType="HasPart">https://ndownloader.figshare.com/files/63014617</relatedIdentifier>
            <relatedIdentifier relatedIdentifierType="URL" relationType="HasPart">https://ndownloader.figshare.com/files/63014620</relatedIdentifier>
            <relatedIdentifier relatedIdentifierType="URL" relationType="HasPart">https://ndownloader.figshare.com/files/63014623</relatedIdentifier>
            <relatedIdentifier relatedIdentifierType="URL" relationType="HasPart">https://ndownloader.figshare.com/files/63014629</relatedIdentifier>
            <relatedIdentifier relatedIdentifierType="URL" relationType="HasPart">https://ndownloader.figshare.com/files/63014632</relatedIdentifier>
            <relatedIdentifier relatedIdentifierType="URL" relationType="HasPart">https://ndownloader.figshare.com/files/63014641</relatedIdentifier>
          </relatedIdentifiers>
          <creators>
            <creator>
              <creatorName>Maynard, Daniel</creatorName>
              <givenName>Daniel</givenName>
              <familyName>Maynard</familyName>
            </creator>
            <creator>
              <creatorName>Paz Velez, Andrea</creatorName>
              <givenName>Andrea</givenName>
              <familyName>Paz Velez</familyName>
              <nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org">0000-0001-6484-1210</nameIdentifier>
            </creator>
            <creator>
              <creatorName>Ubilla Pavez, Pablo</creatorName>
              <givenName>Pablo</givenName>
              <familyName>Ubilla Pavez</familyName>
            </creator>
          </creators>
          <titles>
            <title><![CDATA[Functional group classification using consensus clustering]]></title>
          </titles>
          <subjects>
            <subject>Computational ecology and phylogenetics</subject>
            <subject>Ecology not elsewhere classified</subject>
            <subject>Plant and fungus systematics and taxonomy</subject>
            <subject>Taxonomy</subject>
            <subject>functional diversity</subject>
            <subject>clustering</subject>
            <subject>functional groups</subject>
            <subject>functional ecology</subject>
            <subject>trees</subject>
          </subjects>
          <dates>
            <date dateType="Created">2026-04-28</date>
            <date dateType="Updated">2026-04-28</date>
          </dates>
          <resourceType resourceTypeGeneral="Dataset">Dataset</resourceType>
          <publicationYear>2026</publicationYear>
          <publisher>University College London</publisher>
          <rightsList>
            <rights rightsURI="https://creativecommons.org/licenses/by/4.0/" rightsIdentifier="CC BY 4.0"/>
            <rights rightsURI="http://purl.org/coar/access_right/c_abf2" rightsIdentifier="open access"/>
          </rightsList>
          <descriptions>
            <description descriptionType="Abstract"><![CDATA[<p dir="ltr"><b><u>Repository description</u></b></p><p dir="ltr">Data and code for "Functional group classification using consensus clustering" (Ubilla Pavez, Paz & Maynard, In Revision, PLOS Computational Biology). This paper presents a consensus clustering method that classifies species into functional groups while accounting for trait uncertainty and trait correlation, using repeated resampling with Gaussian Mixture Models synthesized into a consensus matrix.</p><p dir="ltr">This repository contains the input trait data, taxonomic metadata, species name matching tables, and spatial diversity metrics used in the case study of global tree functional group classification. The consensus clustering pipeline code is available at https://github.com/pabloubilla/tree_clustering/.</p><p dir="ltr"><b><u>File descriptions</u></b></p><p dir="ltr"><b>Estimated_trait_table_with_monos.csv</b></p><p dir="ltr">Species-level trait data for 47,828 tree species across 18 traits (e.g., wood density, leaf area, tree height). Contains both observed and imputed (predicted) values from Maynard et al. (2022, <i>Nature Communications</i>). Each row is a species–trait combination. Columns: <b>accepted_bin</b> (species binomial), <b>fit</b> (fitting method, e.g. "phy" for phylogenetic), <b>LAT</b> and <b>LON</b> (coordinates of observation, NA if imputed), <b>TraitID</b> (numeric trait identifier), <b>trait</b> (full trait name), <b>trait_short</b> (abbreviated trait name), <b>quant</b> (whether predicted using quantile random forest), <b>pred_value</b> (predicted/imputed trait value), <b>obs_value</b> (observed value, NA if unavailable).</p><p dir="ltr"><b>taxonomic_information.csv</b></p><p dir="ltr">Taxonomic classification for each tree species. Columns: <b>genus</b>, <b>family</b>, <b>order</b>, <b>group</b> (Angiosperms or Gymnosperms), <b>accepted_bin</b> (species binomial, used as join key), <b>mono_fern</b> (whether the species is a monocot or fern).</p><p dir="ltr"><b>bgci_v1_3_matched_names.csv</b></p><p dir="ltr">Species name matching table linking names from the Botanic Gardens Conservation International (BGCI) ThreatSearch database (v1.3) to the accepted binomial names used in this study. Columns: <b>TaxonName</b> (original name in BGCI), <b>Author</b> (taxonomic authority), <b>accepted_bin</b> (matched accepted binomial).</p><p dir="ltr"><b>global_tree_search_trees_1_7.csv</b></p><p dir="ltr">Species list from GlobalTreeSearch (v1.7; Beech et al. 2017), a global database of tree species and country distributions maintained by BGCI. Columns: <b>TaxonName</b> (species name), <b>Author</b> (taxonomic authority).</p><p dir="ltr"><b>plot_diversity_metrics/grid_coordinates.csv</b></p><p dir="ltr">Grid cell reference table mapping grid IDs to geographic coordinates. Columns: <b>grid_id</b> (unique identifier), <b>Latitude</b>, <b>Longitude</b>.</p><p dir="ltr"><b>plot_diversity_metrics/Functional_group_results.csv</b></p><p dir="ltr">Functional group diversity metrics calculated per grid cell using the 42 functional groups identified in this study, based on presence–absence data from Paz et al. (2024, <i>Global Ecology and Biogeography</i>). Columns: <b>Latitude</b>, <b>Longitude</b>, <b>nclust</b> (functional group richness, i.e. number of unique groups present), <b>cluster_simpson</b> (functional redundancy, i.e. Simpson's Index applied to functional groups).</p><p dir="ltr"><b>plot_diversity_metrics/Paz_et_al_data.csv</b></p><p dir="ltr">Traditional diversity metrics per grid cell from Paz et al. (2024), used for comparison with the functional group metrics. Columns: <b>Latitude</b>, <b>Longitude</b>, <b>nspec</b> (species richness), <b>raoq</b> (Rao's quadratic entropy, i.e. mean pairwise trait distance), <b>fdr</b> (functional richness, i.e. convex hull volume in trait space).</p><p><br></p>]]></description>
          </descriptions>
        </resource>
      </metadata>
    </record>
  </GetRecord>
</OAI-PMH>
