rsample 1.0.0
CRAN release: 2022-06-24
Fixed how
nested_cv()
handles call objects so variables in the environment can be used when specifying resampling schemes (#81).Updated to testthat 3e (#280) and added better checking for
vfold_cv()
(#293).Finally removed the
gather()
method forrset
objects. Usetidyr::pivot_longer()
instead (#280).Changed
initial_split()
to avoid calling tidyselect twice onstrata
(#296). This fix stopsinitial_split()
from generating messages like:
Note: Using an external vector in selections is ambiguous.
i Use `all_of(strata)` instead of `strata` to silence this message.
i See <https://tidyselect.r-lib.org/reference/faq-external-vector.html>.
- Added better printing methods for initial split objects.
rsample 0.1.1
CRAN release: 2021-11-08
Updated documentation on stratified sampling (#245).
Changed
make_splits()
to an S3 generic, with the original functionality a method forlist
and a new method for dataframes that allows users to create a split from existing analysis & assessment sets (@LiamBlake, #246).Added
validation_time_split()
for a single validation sample taking the first samples for training (@mine-cetinkaya-rundel, #256).Escalated the deprecation of the
gather()
method forrset
objects to a hard deprecation. Usetidyr::pivot_longer()
instead (#257).Changed resample “fingerprint” to hash the indices only rather than the entire resample result (including the data object). This is much faster and will still ensure the same resample for the same original data object (#259).
rsample 0.1.0
CRAN release: 2021-05-08
Fixed how
mc_cv()
,initial_split()
, andvalidation_split()
use theprop
argument to first compute the assessment indices, rather than the analysis indices. This is a minor but breaking change in some situations; the previous implementation could cause an inconsistency in the sizes of the generated analysis and assessment sets when compared to howprop
is documented to function (#217, @issactoast).Fixed problem with creation of
apparent()
(#223) andcaret2rsample()
(#232) resamples.Re-licensed package from GPL-2 to MIT. See consent from copyright holders here.
Attempts to stratify on a
Surv
object now error more informatively (#230).Exposed
pool
argument frommake_strata()
in user-facing resampling functions (#229).Deprecated the
gather()
method forrset
objects in favor oftidyr::pivot_longer()
(#233).Fixed bug in
make_strata()
for numeric variables withNA
values (@brian-j-smith, #236).
rsample 0.0.9
CRAN release: 2021-02-17
New
rset_reconstruct()
, a developer tool to ease creation of new rset subclasses (#210).Added
permutations()
, a function for creating permutation resamples by performing column-wise shuffling (@mattwarkentin, #198).Fixed an issue where empty assessment sets couldn’t be created by
make_splits()
(#188).rset
objects now contain a “fingerprint” attribute that can be used to check to see if the same object uses the same resamples.The
reg_intervals()
function is a convenience function forlm()
,glm()
,survreg()
, andcoxph()
models (#206).A few internal functions were exported so that
rsample
-adjacent packages can use the same underlying code.Changed the inheritance structure for
rsplit
objects from specific to general and simplified the methods for thecomplement()
generic (#216).
rsample 0.0.8
CRAN release: 2020-09-23
New
manual_rset()
for constructing rset objects manually from custom rsplits (tidymodels/tune#273).Three new time based resampling functions have been added:
sliding_window()
,sliding_index()
, andsliding_period()
, which have more flexibility than the pre-existingrolling_origin()
.Correct
alpha
parameter handling for bootstrap CI functions (#179, #184).
rsample 0.0.7
CRAN release: 2020-06-04
Lower threshold for pooling strata to 10% (from 15%) (#149).
The
print()
methods forrsplit
andval_split
objects were adjusted to show"<Analysis/Assess/Total>"
and<Training/Validation/Total>
, respectively.The
drinks
,attrition
, andtwo_class_dat
data sets were removed. They are in themodeldata
package.Compatability with
dplyr
1.0.0.
rsample
0.0.6
CRAN release: 2020-03-31
Added
validation_set()
for making a single resample.Correct the tidy method for bootstraps (#115).
Changes for upcoming `tibble release.
Exported constructors for
rset
andsplit
objects (#40)initial_time_split()
androlling_origin()
now have alag
parameter that ensures that previous data are available so that lagged variables can be calculated. (#135, #136)
rsample
0.0.5
CRAN release: 2019-07-12
- Added three functions to compute different bootstrap confidence intervals.
- A new function (
add_resample_id()
) augments a data frame with columns for the resampling identifier. - Updated
initial_split()
,mc_cv()
,vfold_cv()
,bootstraps()
, andgroup_vfold_cv()
to use tidyselect on the stratification variable. - Updated
initial_split()
,mc_cv()
,vfold_cv()
,bootstraps()
with newbreaks
parameter that specifies the number of bins to stratify by for a numeric stratification variable.
rsample
0.0.4
CRAN release: 2019-01-07
Small maintenence release.
Minor improvements and fixes
-
fill()
was removed per the deprecation warning. - Small changes were made for the new version of
tibble
.
rsample
0.0.3
CRAN release: 2018-11-20
New features
- Added function
initial_time_split()
for ordered initial sampling appropriate for time series data.
Minor improvements and fixes
fill()
has been renamedpopulate()
to avoid a conflict withtidyr::fill()
.Changed the R version requirement to be R >= 3.1 instead of 3.3.3.
The
recipes
-relatedprepper
function was moved to therecipes
package. This makes thersample
install footprint much smaller.rsplit
objects are shown differently inside of a tibble.Moved from the
broom
package to thegenerics
package.
rsample
0.0.2
CRAN release: 2017-11-12
-
initial_split
,training
, andtesting
were added to do training/testing splits prior to resampling. - Another resampling method,
group_vfold_cv
, was added. -
caret2rsample
andrsample2caret
can convertrset
objects to those used bycaret::trainControl
and vice-versa. - A function called
form_pred
can be used to determine the original names of the predictors in a formula orterms
object. - A vignette and a function (
prepper
) were included to facilitate using therecipes
withrsample
. - A
gather
method was added forrset
objects. - A
labels
method was added forrsplit
objects. This can help identify which resample is being used even when the wholerset
object is not available. - A variety of
dplyr
methods were added (e.g.filter
,mutate
, etc) that work without dropping classes or attributes of thersample
objects.