Package: creditmodel 1.3.1

creditmodel: Toolkit for Credit Modeling, Analysis and Visualization

Provides a highly efficient R tool suite for Credit Modeling, Analysis and Visualization.Contains infrastructure functionalities such as data exploration and preparation, missing values treatment, outliers treatment, variable derivation, variable selection, dimensionality reduction, grid search for hyper parameters, data mining and visualization, model evaluation, strategy analysis etc. This package is designed to make the development of binary classification models (machine learning based models as well as credit scorecard) simpler and faster. The references including: 1 Refaat, M. (2011, ISBN: 9781447511199). Credit Risk Scorecard: Development and Implementation Using SAS; 2 Bezdek, James C.FCM: The fuzzy c-means clustering algorithm. Computers & Geosciences (0098-3004),<doi:10.1016/0098-3004(84)90020-7>.

Authors:Dongping Fan [aut, cre]

creditmodel_1.3.1.tar.gz
creditmodel_1.3.1.zip(r-4.5)creditmodel_1.3.1.zip(r-4.4)creditmodel_1.3.1.zip(r-4.3)
creditmodel_1.3.1.tgz(r-4.5-any)creditmodel_1.3.1.tgz(r-4.4-any)creditmodel_1.3.1.tgz(r-4.3-any)
creditmodel_1.3.1.tar.gz(r-4.5-noble)creditmodel_1.3.1.tar.gz(r-4.4-noble)
creditmodel_1.3.1.tgz(r-4.4-emscripten)creditmodel_1.3.1.tgz(r-4.3-emscripten)
creditmodel.pdf |creditmodel.html✨
creditmodel/json (API)

# Install 'creditmodel' in R:

install.packages('creditmodel', repos = c('https://fanhansen.r-universe.dev', 'https://cloud.r-project.org'))

Datasets:

UCICreditCard - UCI Credit Card data
ewm_data - Entropy Weight Method Data
lendingclub - Lending Club data

On CRAN:

This package does not link to any Github/Gitlab/R-forge repository. No issue tracker or development information is available.

3.48 score 4 stars 15 scripts 579 downloads 181 exports 44 dependencies

Last updated 3 years agofrom:a4f0795017. Checks:3 OK, 6 NOTE. Indexed: yes.

Target	Result	Latest binary
Doc / Vignettes	OK	Mar 18 2025
R-4.5-win	NOTE	Mar 18 2025
R-4.5-mac	NOTE	Mar 18 2025
R-4.5-linux	NOTE	Mar 18 2025
R-4.4-win	NOTE	Mar 18 2025
R-4.4-mac	NOTE	Mar 18 2025
R-4.4-linux	NOTE	Mar 18 2025
R-4.3-win	OK	Mar 18 2025
R-4.3-mac	OK	Mar 18 2025

Exports:%alike%%islike%add_variable_process address_varieble analysis_nas analysis_outliers as_percent auc_value avg_x char_cor char_cor_vars char_to_num checking_data city_varieble city_varieble_process cnt_x cohort_plot cohort_table_plot colAllnas colAllzeros colMaxMins color_ramp_palette colSds cor_heat_plot cor_plot cos_sim customer_segmentation cut_equal cv_split data_cleansing data_exploration date_cut de_one_hot_encoding de_percent derived_interval derived_partial_acf derived_pct derived_ts derived_ts_vars digits_num entropy_weight entry_rate_na euclid_dist fast_high_cor_filter feature_selector fuzzy_cluster fuzzy_cluster_means gather_data gbm_filter gbm_params get_auc_ks_lambda get_bins_table get_bins_table_all get_breaks get_breaks_all get_correlation_group get_iv get_iv_all get_logistic_coef get_median get_names get_nas_random get_partial_dependence_plots get_psi get_psi_all get_psi_iv get_psi_iv_all get_psi_plots get_score_card get_shadow_nas get_sim_sign_lambda get_tree_breaks get_x_list high_cor_filter high_cor_selector is_date knn_nas_imp ks_plot ks_psi_plot ks_table ks_table_plot ks_value lasso_filter lift_plot lift_value local_outlier_factor log_trans log_vars loop_function love_color low_variance_filter lr_params lr_params_search lr_vif max_min_norm max_x merge_category min_max_norm min_x model_key_index model_result_plot multi_grid multi_left_join n_char null_blank_na one_hot_encoding outliers_detection outliers_kmeans_lof p_to_score partial_dependence_plot PCA_reduce perf_table plot_colors plot_oot_perf plot_table plot_theme pred_score process_nas process_nas_var process_outliers psi_iv_filter psi_plot quick_as_df ranking_percent_dict ranking_percent_dict_x ranking_percent_proc ranking_percent_proc_x re_code re_name read_data reduce_high_cor_filter remove_duplicated replace_value replace_value_x require_packages rf_params roc_plot rowAll rowAllnas rowAny rowCVs rowMaxMins rowMaxs rowMins rowSds save_data score_distribution_plot score_transfer select_best_breaks select_best_class select_cor_group select_cor_list sim_str split_bins split_bins_all sql_hive_text_parse start_parallel_computing stop_parallel_computing str_match sum_table sum_x term_filter term_idf term_tfidf time_series_proc time_transfer time_variable time_vars_process tnr_value train_lr train_test_split train_xgb training_model var_group_proc variable_process woe_trans woe_trans_all xgb_data xgb_filter xgb_params xgb_params_search

Dependencies:cli codetools colorspace data.table doParallel dplyr fansi farver foreach generics ggplot2 glmnet glue gtable isoband iterators jsonlite labeling lattice lifecycle magrittr MASS Matrix mgcv munsell nlme pillar pkgconfig R6 RColorBrewer Rcpp RcppEigen rlang rpart scales shape survival tibble tidyselect utf8 vctrs viridisLite withr xgboost

Introduction to creditmodel

Rendered fromintroduction.Rmdusingknitr::rmarkdownon Mar 18 2025.

Last update: 2020-11-09
Started: 2019-10-23

Help page	Topics
creditmodel: toolkit for credit modeling and data analysis	creditmodel-package creditmodel
Fuzzy String matching	%alike%
Fuzzy String matching	%islike%
add_variable_process	add_variable_process
address_varieble	address_varieble
missing Analysis	analysis_nas
Outliers Analysis	analysis_outliers
Percent Format	as_percent
auc_value 'auc_value' is for get best lambda required in lasso_filter. This function required in 'lasso_filter'	auc_value
Cramer's V matrix between categorical variables.	char_cor char_cor_vars
character to number	char_to_num
Checking Data	checking_data
city_varieble	city_varieble
Processing of Address Variables	city_varieble_process
cohort_table_plot 'cohort_table_plot' is for ploting cohort(vintage) analysis table.	cohort_plot cohort_table_plot
Correlation Heat Plot	cor_heat_plot
Correlation Plot	cor_plot
cos_sim	cos_sim
Customer Segmentation	customer_segmentation
Generating Initial Equal Size Sample Bins	cut_equal
Stratified Folds	cv_split
Data Cleaning	data_cleansing
Data Exploration	data_exploration
Date Time Cut Point	date_cut
Recovery One-Hot Encoding	de_one_hot_encoding
Recovery Percent Format	de_percent
derived_interval	derived_interval
derived_partial_acf	derived_partial_acf
derived_pct	derived_pct
Derivation of Behavioral Variables	derived_ts derived_ts_vars
Number of digits	digits_num
Entropy Weight Method	entropy_weight
Max Percent of missing Value	entry_rate_na
euclid_dist	euclid_dist
Functions of xgboost feval	eval_auc eval_ks eval_lift eval_tnr
Entropy Weight Method Data	ewm_data
high_cor_filter	fast_high_cor_filter high_cor_filter
Feature Selection Wrapper	feature_selector
Fuzzy Cluster means.	fuzzy_cluster fuzzy_cluster_means
gather or aggregate data	gather_data
Select Features using GBM	gbm_filter
GBM Parameters	gbm_params
get_auc_ks_lambda 'get_auc_ks_lambda' is for get best lambda required in lasso_filter. This function required in 'lasso_filter'	get_auc_ks_lambda
Table of Binning	get_bins_table get_bins_table_all
Generates Best Breaks for Binning	get_breaks get_breaks_all
get_correlation_group	get_correlation_group select_cor_group select_cor_list
Calculate Information Value (IV) 'get_iv' is used to calculate Information Value (IV) of an independent variable. 'get_iv_all' can loop through IV for all specified independent variables.	get_iv get_iv_all
get logistic coef	get_logistic_coef
get central value.	get_median
Get Variable Names	get_names
get_nas_random	get_nas_random
Calculate Population Stability Index (PSI) 'get_psi' is used to calculate Population Stability Index (PSI) of an independent variable. 'get_psi_all' can loop through PSI for all specified independent variables.	get_psi get_psi_all
Calculate IV & PSI	get_psi_iv get_psi_iv_all
Plot PSI(Population Stability Index)	get_psi_plots psi_plot
Score Card	get_score_card
get_shadow_nas	get_shadow_nas
get_sim_sign_lambda 'get_sim_sign_lambda' is for get Best lambda required in lasso_filter. This function required in 'lasso_filter'	get_sim_sign_lambda
Getting the breaks for terminal nodes from decision tree	get_tree_breaks
Get X List.	get_x_list
Compare the two highly correlated variables	high_cor_selector
is_date	is_date
Imputate nas using KNN	knn_nas_imp
ks_table & plot	ks_psi_plot ks_table ks_table_plot model_key_index
ks_value	ks_value
Variable selection by LASSO	lasso_filter
Lending Club data	lendingclub
lift_value	lift_value
local_outlier_factor 'local_outlier_factor' is function for calculating the lof factor for a data set using knn This function is not intended to be used by end user.	local_outlier_factor
Logarithmic transformation	log_trans log_vars
Loop Function. #' 'loop_function' is an iterator to loop through	loop_function
love_color	love_color
Filtering Low Variance Variables	low_variance_filter
Logistic Regression & Scorecard Parameters	lr_params lr_params_search
Variance-Inflation Factors	lr_vif
Max Min Normalization	max_min_norm
Merge Category	merge_category
Min Max Normalization	min_max_norm
model result plots 'model_result_plot' is a wrapper of following: 'perf_table' is for generating a model performance table. 'ks_plot' is for K-S. 'roc_plot' is for ROC. 'lift_plot' is for Lift Chart. 'score_distribution_plot' is for ploting the score distribution.	ks_plot lift_plot model_result_plot perf_table roc_plot score_distribution_plot
Arrange list of plots into a grid	multi_grid
multi_left_join	multi_left_join
The length of a string.	n_char
Encode NAs	null_blank_na
One-Hot Encoding	one_hot_encoding
Outliers Detection 'outliers_detection' is for outliers detecting using Kmeans and Local Outlier Factor (lof)	outliers_detection
Entropy	e_ij p_ij
prob to socre	p_to_score
partial_dependence_plot	get_partial_dependence_plots partial_dependence_plot
PCA Dimension Reduction	PCA_reduce
Plot Colors	color_ramp_palette plot_colors
plot_oot_perf 'plot_oot_perf' is for ploting performance of cross time samples in the future	plot_oot_perf
plot_table	plot_table
plot_theme	plot_theme
pred_score	pred_score
missing Treatment	process_nas process_nas_var
Outliers Treatment	outliers_kmeans_lof process_outliers
Variable reduction based on Information Value & Population Stability Index filter	psi_iv_filter
List as data.frame quickly	quick_as_df
Ranking Percent Process	ranking_percent_dict ranking_percent_dict_x ranking_percent_proc ranking_percent_proc_x
re_code 're_code' search for matches to argument pattern within each element of a character vector:	re_code
Rename	re_name
Read data	check_data_format read_data
Filtering highly correlated variables with reduce method	reduce_high_cor_filter
Remove Duplicated Observations	remove_duplicated
Replace Value	replace_value replace_value_x
Packages required and intallment	require_packages
Random Forest Parameters	rf_params
Functions for vector operation.	avg_x cnt_x colAllnas colAllzeros colMaxMins colSds max_x min_x rowAll rowAllnas rowAny rowCVs rowMaxMins rowMaxs rowMins rowSds sum_x
Save data	save_data
Score Transformation	score_transfer
Generates Best Binning Breaks	select_best_breaks select_best_class
sim_str	sim_str
split_bins	split_bins
Split bins all	split_bins_all
Automatic production of hive SQL	sql_hive_text_parse
Parallel computing and export variables to global Env.	start_parallel_computing
Stop parallel computing	stop_parallel_computing
string match #' 'str_match' search for matches to argument pattern within each element of a character vector:	str_match
Summary table	sum_table
TF-IDF	term_filter term_idf term_tfidf
Process time series data	time_series_proc
Time Format Transfering	time_transfer
time_variable	time_variable
Processing of Time or Date Variables	time_vars_process
tnr_value	tnr_value
Trainig LR model	train_lr
Train-Test-Split	train_test_split
Training XGboost	train_xgb
Training model	training_model
UCI Credit Card data	UCICreditCard
Process group numeric variables	var_group_proc
variable_process	variable_process
WOE Transformation	woe_trans woe_trans_all
XGboost data	xgb_data
Select Features using XGB	xgb_filter
XGboost Parameters	xgb_params xgb_params_search

Package: creditmodel 1.3.1

creditmodel: Toolkit for Credit Modeling, Analysis and Visualization

Introduction to creditmodel

Citation

Readme and manuals

Help Manual

Usage by other packages (reverse dependencies)