Title: | Catalogue of the Timber Forest Species of the Peruvian Amazon |
---|---|
Description: | Access the data of the 'Catalogue of the Timber Forest Species of the Peruvian Amazon' Vásquez Martínez, R., & Rojas Gonzáles, R.D.P.(2022)<doi:10.21704/rfp.v37i3.1956>. |
Authors: | Paul E. Santos Andrade [aut, cre] |
Maintainer: | Paul E. Santos Andrade <[email protected]> |
License: | MIT + file LICENSE |
Version: | 0.1.0 |
Built: | 2024-11-26 01:23:18 UTC |
Source: | https://github.com/paulesantos/perutimber |
This function takes a species list and tries to match a name in the Catalogue
of the timber forest species of the Amazon and the Peruvian Yunga, subseting
information for each species. If the name_submitted is a valid name, it will
be the duplicated in accepted_name column, else the accepted_name column will
display the closest name given the maximum distance defined in max_distance
get_perutimber_data(splist, max_distance = 0.2)
get_perutimber_data(splist, max_distance = 0.2)
splist |
A character vector specifying the input taxon, each element
including genus and specific epithet and, potentially, infraspecific rank,
infraspecific name and author name.
Only valid characters are allowed (see |
max_distance |
match when comparing the submitted name with the closest name matches in the species listed in the "Catalogue of the timber forest species of the Amazon and the Peruvian Yunga". The distance used is a generalized Levenshtein distance that indicates the total number of insertions, deletions, and substitutions allowed to match the two names. It can be expressed as an integer or as the fraction of the binomial name. For example, a name with length 10, and a max_distance = 0.1, allow only one change (insertion, deletion, or substitution). A max_distance = 2, allows two changes. |
A table with the accepted name and catalog data of the species.
splist <- c("Euterpe precatoria var. precatorio", "Welfia alfredi", "Hibiscus abelmoschus var. betulifolius") get_perutimber_data(splist)
splist <- c("Euterpe precatoria var. precatorio", "Welfia alfredi", "Hibiscus abelmoschus var. betulifolius") get_perutimber_data(splist)
The 'perutimber_dat' contains information from the Catalogue of the timber forest species of the Amazon and the Peruvian Yunga.
perutimber_data
perutimber_data
A tibble with the following columns:
A character vector. The list of the accepted plant taxa names according to the Taxonomic Name Resolution Service - TNRS.
A character vector. The corresponding author name of the accepted_name, staying empty if the taxonomic_status is "Synonym" or "No opinion".
A character vector. The corresponding family name of the accepted_name.
A character vector. Listed all common names for each species.
A character vector. Information about infraspecific status.
A character vector.
A character vector. The plant height range in meters.
A character vector.
A character vector.
A character vector.
data(perutimber_data) str(perutimber_data)
data(perutimber_data) str(perutimber_data)
The 'perutimber_sps_class' includes all species separeted by genus, epithet, author,
subspecies, variety, and id (position in the
tab_perutimber
).
perutimber_sps_class
perutimber_sps_class
A data.frame.
data(perutimber_sps_class)
data(perutimber_sps_class)
This function takes a species list and tries to match a name in the Catalogue of the timber forest species of the Amazon and the Peruvian Yunga, checking if the name is listed in tha dataset.
search_perutimber(splist, max_distance = 0.2)
search_perutimber(splist, max_distance = 0.2)
splist |
A character vector specifying the input taxon, each element
including genus and specific epithet and, potentially, infraspecific rank,
infraspecific name and author name.
Only valid characters are allowed (see |
max_distance |
match when comparing the submitted name with the closest name matches in the species listed in the "Catalogue of the timber forest species of the Amazon and the Peruvian Yunga". The distance used is a generalized Levenshtein distance that indicates the total number of insertions, deletions, and substitutions allowed to match the two names. It can be expressed as an integer or as the fraction of the binomial name. For example, a name with length 10, and a max_distance = 0.1, allow only one change (insertion, deletion, or substitution). A max_distance = 2, allows two changes. |
A character vector that can have three different output values. The first value, "Present," indicates whether the species name is fully matched with the names listed in the 'Catalogue of the timber forest species of the Amazon and the Peruvian Yunga'. The second value, "P_updated_name," provides a fuzzy matching of species names. If a species name is not listed in the catalogue, the third value returned will be an empty string.
# Search for multiple species vector splist <- c("Euterpe precatoria var. precatorio", "Welfia alfredi", "Hibiscus abelmoschus var. betulifolius") get_perutimber_data(splist) # Search for multiple species data.frame # base df_splist <- data.frame(splist = splist) df_splist$peutimber <- search_perutimber(df_splist$splist)
# Search for multiple species vector splist <- c("Euterpe precatoria var. precatorio", "Welfia alfredi", "Hibiscus abelmoschus var. betulifolius") get_perutimber_data(splist) # Search for multiple species data.frame # base df_splist <- data.frame(splist = splist) df_splist$peutimber <- search_perutimber(df_splist$splist)
The 'tab_perutimber' contains records belonging to all the timber forest species the Amazon and the Peruvian Yunga.
tab_perutimber
tab_perutimber
A tibble with the following columns:
The fixed species id of the input taxon in Catalogue of the timber forest species of the Amazon and the Peruvian Yunga - CTSAPY.
A character vector. The input genus of the corresponding species name listed in CTSAPY.
A character vector. The specific epithet of the corresponding species name listed in CTSAPY.
A character vector. The taxonomic rank: "species","subspecies", "variety", of the corresponding species name listed in CTSAPY.
A character vector. If the indicated rank is below species, the subspecies epithet input of the corresponding species name listed in CTSAPY.
A character vector. description if a taxon is classified as ‘accepted’, ‘synonym’, ‘no opinion’. According to the Taxonomic Name Resolution Service - TNRS.
A character vector. The list of the accepted plant taxa names according to the Taxonomic Name Resolution Service - TNRS.
A character vector. The corresponding family name of the accepted_name.
A character vector. The corresponding author name of the accepted_name, staying empty if the taxonomic_status is "Synonym" or "No opinion".
A character vector. The accepted_name_id according to the Taxonomic Name Resolution Service - TNRS output.
A character vector. The accepted_name information url.
A character vector. The accepted_name information source.
Catalogue of the timber forest species of the Amazon and the Peruvian Yunga. Boyle, B. L., Matasci, N., Mozzherin, D., Rees, T., Barbosa, G. C., Kumar Sajja, R., & Enquist, B. J. (2021). Taxonomic Name Resolution Service, version 5.0. In Botanical Information and Ecology Network. https://tnrs.biendata.org/ , Accessed Mar 21, 2023.
data(tab_perutimber) str(tab_perutimber)
data(tab_perutimber) str(tab_perutimber)
The 'tab_perutimber_position' reports the position (in term of number of rows) of the first three letters (triphthong) for the plant names stored in the variable 'accepted_name' of the table 'tab_perutimber'. This indexing system speeds up of the search on the largest list using the package.
tab_perutimber_position
tab_perutimber_position
A data frame with 399 observations on the following 3 variables.
A character vector. It is the position of the first 3 letters of the species name in the tab_perutimber.
A character vector. First 3 letters of the species name in the tab_perutimber.
A character vector. Corresponding Genus name.
data(tab_perutimber_position) str(tab_perutimber_position)
data(tab_perutimber_position) str(tab_perutimber_position)