Title: | Reviewed Official Classification of Endangered Wild Flora Species in Peru |
---|---|
Description: | Provide users with a convenient way to access and analyze information on endangered plant species in Peru based on `Decreto Supremo N 043-2006-AG - Aprueban categorizacion de especies amenazadas de flora silvestre`<https://sinia.minam.gob.pe/normas/aprueban-categorizacion-especies-amenazadas-flora-silvestre>. |
Authors: | Paul E. Santos Andrade [aut, cre]
|
Maintainer: | Paul E. Santos Andrade <[email protected]> |
License: | MIT + file LICENSE |
Version: | 0.1.1 |
Built: | 2025-03-07 03:18:27 UTC |
Source: | https://github.com/paulesantos/peruflorads43 |
This function takes a species list and tries to match a name in the "DS043-2006-AG:
Aprueban Categorizacion de Especies Amenazadas de Flora Silvestre", subseting
information for each species. If the name_submitted is a valid name, it will
be the duplicated in accepted_name column, else the accepted_name column will
display the closest name given the maximum distance defined in max_distance
category_ds043_2006(splist, max_distance = 0.2)
category_ds043_2006(splist, max_distance = 0.2)
splist |
A character vector specifying the input taxon, each element
including genus and specific epithet and, potentially, infraspecific rank,
infraspecific name and author name.
Only valid characters are allowed (see |
max_distance |
match when comparing the submitted name with the closest name matches in the species listed in the "DS043-2006-AG: Aprueban Categorizacion de Especies Amenazadas de Flora Silvestre". The distance used is a generalized Levenshtein distance that indicates the total number of insertions, deletions, and substitutions allowed to match the two names. It can be expressed as an integer or as the fraction of the binomial name. For example, a name with length 10, and a max_distance = 0.1, allow only one change (insertion, deletion, or substitution). A max_distance = 2, allows two changes. |
A table with the accepted name and catalog data of the species.
splist <- c("Cleistocactus clavispinus", "Welfia alfredi", "Matucana haynei") category_ds043_2006(splist)
splist <- c("Cleistocactus clavispinus", "Welfia alfredi", "Matucana haynei") category_ds043_2006(splist)
Species names list from DS043-2006-AG Aprueban Categorizacion de Especies Amenazadas de Flora Silvestre
ds_043_2006_ag
ds_043_2006_ag
A tibble with the following columns:
A character vector.
A character vector. The list of the accepted plant taxa names according to the Taxonomic Name Resolution Service - TNRS.
A character vector. The corresponding family name of the accepted_name.
DS043-2006-AG: Aprueban Categorizacion de Especies Amenazadas de Flora Silvestre. 13-07-2006
data(ds_043_2006_ag) str(ds_043_2006_ag)
data(ds_043_2006_ag) str(ds_043_2006_ag)
The 'ds43_2006_sps_class' includes all species separeted by genus, epithet, author,
subspecies, variety, and id (position in the
tab_ds43_2006
).
ds43_2006_sps_class
ds43_2006_sps_class
A data.frame.
data(ds43_2006_sps_class)
data(ds43_2006_sps_class)
This function takes a species list and tries to match a name in theCategorizacion de Especies Amenazadas de Flora Silvestre, checking if the name is listed in tha dataset.
search_ds043(splist, max_distance = 0.1)
search_ds043(splist, max_distance = 0.1)
splist |
A character vector specifying the input taxon, each element
including genus and specific epithet and, potentially, infraspecific rank,
infraspecific name and author name.
Only valid characters are allowed (see |
max_distance |
match when comparing the submitted name with the closest name matches in the species listed in the "Categorizacion de Especies Amenazadas de Flora Silvestre". The distance used is a generalized Levenshtein distance that indicates the total number of insertions, deletions, and substitutions allowed to match the two names. It can be expressed as an integer or as the fraction of the binomial name. For example, a name with length 10, and a max_distance = 0.1, allow only one change (insertion, deletion, or substitution). A max_distance = 2, allows two changes. |
A character vector that can have three different output values. The first value, "Present," indicates whether the species name is fully matched with the names listed in the 'Categorizacion de Especies Amenazadas de Flora Silvestre'. The second value, "P_updated_name," provides a fuzzy matching of species names. If a species name is not listed in the catalogue, the third value returned will be an empty string.
# Search for multiple species vector splist <- c("Cleistocactus clavispinus", "Welfia alfredi", "Matucana haynei") search_ds043(splist) # Search for multiple species data.frame # base df_splist <- data.frame(splist = splist) df_splist$peutimber <- search_ds043(df_splist$splist)
# Search for multiple species vector splist <- c("Cleistocactus clavispinus", "Welfia alfredi", "Matucana haynei") search_ds043(splist) # Search for multiple species data.frame # base df_splist <- data.frame(splist = splist) df_splist$peutimber <- search_ds043(df_splist$splist)
The 'tab_ds43_2006' contains records belonging to all the species DS043-200-AG.
tab_ds43_2006
tab_ds43_2006
A tibble with the following columns:
The fixed species id of the input taxon.
A character vector. The input genus of the corresponding species name.
A character vector. The specific epithet of the corresponding species name.
A character vector. The taxonomic rank: "species","subspecies", "variety", of the corresponding species name.
A character vector. If the indicated rank is below species, the subspecies epithet input of the corresponding species name.
A character vector. description if a taxon is classified as ‘accepted’, ‘synonym’, ‘no opinion’. According to the Taxonomic Name Resolution Service - TNRS.
A character vector. The list of the accepted plant taxa names according to the Taxonomic Name Resolution Service - TNRS.
A character vector. The corresponding family name of the accepted_name.
A character vector. The corresponding author name of the accepted_name, staying empty if the taxonomic_status is "Synonym" or "No opinion".
DS043-2006-AG: Aprueban Categorizacion de Especies Amenazadas de Flora Silvestre. 13-07-2006
data(tab_ds43_2006) str(tab_ds43_2006)
data(tab_ds43_2006) str(tab_ds43_2006)
The 'tab_ds43_2006_position' reports the position (in term of number of rows) of the first three letters (triphthong) for the plant names stored in the variable 'accepted_name' of the table 'tab_ds43_2006'. This indexing system speeds up of the search on the largest list using the package.
tab_ds43_2006_position
tab_ds43_2006_position
A data frame with 305 observations on the following 3 variables.
A character vector. It is the position of the first 3 letters of the species name in the tab_ds43_2006.
A character vector. First 3 letters of the species name in the tab_ds43_2006.
A character vector. Corresponding Genus name.
data(tab_ds43_2006_position) str(tab_ds43_2006_position)
data(tab_ds43_2006_position) str(tab_ds43_2006_position)