Package: VIM 7.1.0

Matthias Templ

VIM: Visualization and Imputation of Missing Values

Provides methods for imputation and visualization of missing values. It includes graphical tools to explore the amount, structure and patterns of missing and/or imputed values, supporting exploratory data analysis and helping to investigate potential missingness mechanisms (details in Alfons, Templ and Filzmoser, <doi:10.1007/s11634-011-0102-y>. The quality of imputations can be assessed visually using a wide range of univariate, bivariate and multivariate plots. The package further provides several imputation methods, including efficient implementations of k-nearest neighbour and hot-deck imputation (Kowarik and Templ 2013, <doi:10.18637/jss.v074.i07>, iterative robust model-based multiple imputation (Templ 2011, <doi:10.1016/j.csda.2011.04.012>; Templ 2023, <doi:10.3390/math11122729>), and machine learning–based approaches such as robust GAM-based multiple imputation (Templ 2024, <doi:10.1007/s11222-024-10429-1>) as well as gradient boosting (XGBoost) and transformer-based methods (Niederhametner et al., <doi:10.1177/18747655251339401>). General background and practical guidance on imputation are provided in the Springer book by Templ (2023) <doi:10.1007/978-3-031-30073-8>.

Authors:Matthias Templ [aut, cre], Alexander Kowarik [aut], Andreas Alfons [aut], Johannes Gussenbauer [aut], Nina Niederhametner [aut], Eileen Vattheuer [aut], Gregor de Cillia [aut], Bernd Prantner [ctb], Wolfgang Rannetbauer [aut]

VIM_7.1.0.tar.gz
VIM_7.1.0.zip(r-4.7)VIM_7.1.0.zip(r-4.6)VIM_7.1.0.zip(r-4.5)
VIM_7.1.0.tgz(r-4.6-x86_64)VIM_7.1.0.tgz(r-4.6-arm64)VIM_7.1.0.tgz(r-4.5-x86_64)VIM_7.1.0.tgz(r-4.5-arm64)
VIM_7.1.0.tar.gz(r-4.7-arm64)VIM_7.1.0.tar.gz(r-4.7-x86_64)VIM_7.1.0.tar.gz(r-4.6-arm64)VIM_7.1.0.tar.gz(r-4.6-x86_64)
VIM_7.1.0.tgz(r-4.6-emscripten)
manual.pdf |manual.html
card.svg |card.png
VIM/json (API)
NEWS

# Install 'VIM' in R:
install.packages('VIM', repos = c('https://statistikat.r-universe.dev', 'https://cloud.r-project.org'))

Bug tracker:https://github.com/statistikat/vim/issues

Uses libs:
  • c++– GNU Standard C++ Library v3
  • openmp– GCC OpenMP (GOMP) support library
Datasets:
  • Animals_na - Animals_na
  • bcancer - Breast cancer Wisconsin data set
  • brittleness - Brittleness index data set
  • chorizonDL - C-horizon of the Kola data with missing values
  • colic - Colic horse data set
  • collisions - Subset of the collision data
  • diabetes - Indian Prime Diabetes Data
  • food - Food consumption
  • kola.background - Background map for the Kola project data
  • pulplignin - Pulp lignin content
  • SBS5242 - Synthetic subset of the Austrian structural business statistics data
  • sleep - Mammal sleep data
  • tao - Tropical Atmosphere Ocean (TAO) project data
  • testdata - Simulated data set for testing purpose
  • toydataMiss - Simulated toy data set for examples
  • wine - Wine tasting and price

On CRAN:

Conda:

hotdeckimputation-methodsmodel-predictionsvisualizationcppopenmp

12.87 score 91 stars 18 packages 3.7k scripts 18k downloads 64 mentions 50 exports 105 dependencies

Last updated from:9d654a3b55. Checks:11 WARNING, 1 ERROR, 1 OK. Indexed: yes.

TargetResultTimeFilesSyslog
linux-devel-arm64WARNING265
linux-devel-x86_64WARNING281
source / vignettesERROR280
linux-release-arm64WARNING251
linux-release-x86_64WARNING262
macos-release-arm64WARNING190
macos-release-x86_64WARNING393
macos-oldrel-arm64WARNING200
macos-oldrel-x86_64WARNING312
windows-develWARNING274
windows-releaseWARNING251
windows-oldrelWARNING254
wasm-releaseOK156

Exports:aggralphablendbarMissbgmapcolormapMisscolormapMissLegendcolSequencecolSequenceHCLcolSequenceRGBcountInfcountNAevaluationgapMissgowerDgrowdotMisshistMisshotdeckimpPCAimputeRobustimputeRobustChaininitialiseirmikNNmapMissmarginmatrixmarginplotmatchImputematrixplotmaxCatmedianSampmosaicMissmsecormsecovnrmsepairsVIMparcoordMisspboxpfcpreparerangerImputeregressionImprugNAsampleCatscattJittscattmatrixMissscattMissspineMisstableMissvimputexgboostImpute

Dependencies:abindbackportsbbotkbootbroomcarcarDatacheckmateclassclicodetoolscolorspacecowplotcpp11data.tableDEoptimRDerivdigestdoBydplyre1071evaluatefarverforecastFormulafracdifffuturefuture.applygenericsggplot2globalsgluegtableisobandjsonlitelabelinglaekenlatticelgrlifecyclelistenvlme4lmtestmagrittrMASSMatrixMatrixModelsmgcvmicrobenchmarkminqamiraimlbenchmlr3mlr3learnersmlr3measuresmlr3miscmlr3pipelinesmlr3tuningmodelrnanonextnlmenloptrnnetnumDerivpalmerpenguinsparadoxparallellypbkrtestpillarpkgconfigproxyPRROCpurrrquantregR6rangerrbibutilsRColorBrewerRcppRcppArmadilloRcppEigenRdpackreformulasrlangrobustbaseS7scalesspSparseMstringistringrsurvivaltibbletidyrtidyselecttimeDateurcautf8uuidvcdvctrsviridisLitewithrxgboostzoo

Readme and manuals

Help Manual

Help pageTopics
Aggregations for missing/imputed valuesaggr plot.aggr print.aggr print.summary.aggr summary.aggr
Alphablending for colorsalphablend
Animals_naAnimals_na
Barplot with information about missing/imputed valuesbarMiss
Breast cancer Wisconsin data setbcancer
Backgound mapbgmap
Brittleness index data setbrittleness
C-horizon of the Kola data with missing valueschorizonDL
Colic horse data setcolic
Subset of the collision datacollisions
Colored map with information about missing/imputed valuescolormapMiss colormapMissLegend
HCL and RGB color sequencescolSequence colSequenceHCL colSequenceRGB
Count number of infinite or missing valuescountInf countNA
Indian Prime Diabetes Datadiabetes
Error performance measuresevaluation msecor msecov nrmse pfc
Food consumptionfood
Missing value gap statisticsgapMiss
Computes the extended Gower distance of two data setsgowerD
Growing dot map with information about missing/imputed valuesbubbleMiss growdotMiss
Histogram with information about missing/imputed valueshistMiss
Hot-Deck Imputationhotdeck
Iterative EM PCA imputationimpPCA
Robust imputationimputeRobust
FUNCTION_TITLEimputeRobustChain
Initialization of missing valuesinitialise
Iterative robust model-based imputation (IRMI)irmi
k-Nearest Neighbour ImputationkNN
Background map for the Kola project datakola.background
Map with information about missing/imputed valuesmapMiss
Marginplot Matrixmarginmatrix
Scatterplot with additional information in the marginsmarginplot
Fast matching/imputation based on categorical variablematchImpute
Matrix plotiimagMiss matrixplot TKRmatrixplot
Aggregation function for a factor variablemaxCat
Aggregation function for a ordinal variablemedianSamp
Mosaic plot with information about missing/imputed valuesmosaicMiss
Scatterplot MatricespairsVIM
Parallel coordinate plot with information about missing/imputed valuesparcoordMiss
Parallel boxplots with information about missing/imputed valuespbox
Transformation and standardizationprepare
Pulp lignin contentpulplignin
Random Forest ImputationrangerImpute
Regression Imputation (via vimpute)regressionImp
Rug representation of missing/imputed valuesrugNA
Random aggregation function for a factor variablesampleCat
Synthetic subset of the Austrian structural business statistics dataSBS5242
Bivariate jitter plotscattJitt
Scatterplot matrix with information about missing/imputed valuesscattmatrixMiss
Scatterplot with information about missing/imputed valuesscattMiss
Mammal sleep datasleep
Spineplot with information about missing/imputed valuesspineMiss
create table with highlighted missings/imputationstableMiss
Tropical Atmosphere Ocean (TAO) project datatao
Simulated data set for testing purposetestdata
Simulated toy data set for examplestoydataMiss
Impute missing values with prefered model, sequentially, with hyperparametertuning and with PMM (if wanted) Need of 'helper_vimpute' scriptvimpute
Wine tasting and pricewine
Xgboost ImputationxgboostImpute