New Version of caret on CRAN
caret
R
feature engineering
genetic algorithms
simulated annealing
A new version of caret is on CRAN.
Some recent features/changes:
- The license was changed to GPL >= 2 to accommodate new code from the
GA
package. - New feature selection functions
gafs
andsafs
were added, along with helper functions and objects, were added. The package HTML was updated to expand more about feature selection. I’ll talk more about these functions in an upcoming blog post. - A reworked version of
nearZerVar
based on code from Michael Benesty was added the old version is now callednzv
that uses less memory and can be used in parallel. sbfControl
now has amultivariate
option where all the predictors are exposed to the scoring function at once.- Several regression simulation functions were added:
SLC14_1
,SLC14_2
,LPH07_1
andLPH07_2
- For the input data
x
totrain
, we now respect the class of the input value to accommodate other data types (such as sparse matrices). - A function
update.rfe
was added.
Recently added models:
- From the
adabag
package, two new models were added:AdaBag
andAdaBoost.M1
. - Weighted subspace random forests from the
wsrf
package was added. - Additional bagged FDA and MARS models were added (model codes
bagFDAGCV
andbagEarthGCV
) were added that use the GCV statistic to prune the model. This leads to memory reductions during training. - Brenton Kenkel added ordered logistic or probit regression to
train
usingmethod = "polr"
fromMASS
- The adaptive mixture discriminant model from the
adaptDA
package - A robust mixture discriminant model from the
robustDA
package was added. - The multi-class discriminant model using binary predictors in the
binda
package was added. - Ensembles of partial least squares models (via the
enpls
package) was added. plsRglm
was added.- From the
kernlab
package, SVM models using string kernels were added:svmBoundrangeString
,svmExpoString
,svmSpectrumString
- The model code for
ada
had a bug fix applied and the code was adapted to use the “sub-model trick” so it should train faster.
(This article was originally posted at http://appliedpredictivemodeling.com
)