BaseEvaluatorWrapper

Bases: ModelExtractor, ABC

Base class for wrappers handling model evaluation processes.

This class serves as a foundational structure for evaluator wrappers, offering methods to initialize, prepare, and evaluate models according to specified parameters. It provides core functionality to streamline evaluation, feature importance analysis, patient inference, and jackknife resampling.

Inherits

ModelExtractor: Loads configuration parameters and handles model extraction.
ABC: Specifies abstract methods that must be implemented by subclasses.

Parameters:

Name	Type	Description	Default
`learners_dict`	`Dict`	Dictionary containing models and their metadata.	required
`criterion`	`str`	Criterion for selecting models (e.g., 'f1', 'brier_score').	required
`aggregate`	`bool`	Whether to aggregate metrics.	required
`verbose`	`bool`	Controls the verbosity of the evaluation process.	required
`random_state`	`int`	Random state for resampling.	required
`path`	`Path`	Path to the directory containing processed data files.	required

Attributes:

Name	Type	Description
`learners_dict`	`Dict`	Holds learners and metadata.
`criterion`	`str`	Evaluation criterion to select the optimal model.
`aggregate`	`bool`	Indicates if metrics should be aggregated.
`verbose`	`bool`	Flag controlling logging verbosity.
`random_state`	`int`	Random state for resampling.
`model`	`object`	Best-ranked model for the given criterion.
`encoding`	`str`	Encoding type, either 'one_hot' or 'target'.
`learner`	`str`	The learner associated with the best model.
`task`	`str`	Task associated with the model ('pocketclosure', 'improve', etc.).
`factor`	`Optional[float]`	Resampling factor if applicable.
`sampling`	`Optional[str]`	Resampling strategy used (e.g., 'smote').
`classification`	`str`	Classification type ('binary' or 'multiclass').
`dataloader`	`ProcessedDataLoader`	Data loader and transformer.
`resampler`	`Resampler`	Resampling strategy for training and testing.
`df`	`DataFrame`	Loaded dataset.
`df_processed`	`DataFrame`	Processed dataset.
`train_df`	`DataFrame`	Training data after splitting.
`test_df`	`DataFrame`	Test data after splitting.
`X_train`	`DataFrame`	Training features.
`y_train`	`Series`	Training labels.
`X_test`	`DataFrame`	Test features.
`y_test`	`Series`	Test labels.
`base_target`	`Optional[ndarray]`	Baseline target for evaluations.
`baseline`	`Baseline`	Baseline class for model analysis.
`evaluator`	`ModelEvaluator`	Evaluator for model metrics and feature importance.
`inference_engine`	`ModelInference`	Model inference manager.
`trainer`	`Trainer`	Trainer for model evaluation and optimization.

Inherited Properties

criterion (str): Retrieves or sets current evaluation criterion for model selection. Supports 'f1', 'brier_score', and 'macro_f1'.
model (object): Retrieves best-ranked model dynamically based on the current criterion. Recalculates when criterion is updated.
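The interplay between these two properties can be illustrated with a minimal, self-contained sketch. It is not the library's implementation: the class `CriterionSketch`, the score dictionary, and the model names are hypothetical, and the ranking rule (lowest Brier score wins, highest F1 wins) is an assumption based on how those metrics are normally read.

```python
from typing import Dict


class CriterionSketch:
    """Hypothetical stand-in showing a criterion setter and a derived model."""

    SUPPORTED = ("f1", "brier_score", "macro_f1")

    def __init__(self, scores: Dict[str, Dict[str, float]], criterion: str) -> None:
        self._scores = scores
        self.criterion = criterion  # goes through the validating setter

    @property
    def criterion(self) -> str:
        return self._criterion

    @criterion.setter
    def criterion(self, value: str) -> None:
        if value not in self.SUPPORTED:
            raise ValueError(f"Unsupported criterion: {value!r}")
        self._criterion = value

    @property
    def model(self) -> str:
        # Computed on every access, so it always reflects the current criterion.
        scores = self._scores
        if self._criterion == "brier_score":
            # Assumption: a lower Brier score ranks higher.
            return min(scores, key=lambda name: scores[name]["brier_score"])
        return max(scores, key=lambda name: scores[name][self._criterion])


# Made-up scores, for illustration only.
example_scores = {
    "model_a": {"f1": 0.80, "macro_f1": 0.70, "brier_score": 0.20},
    "model_b": {"f1": 0.75, "macro_f1": 0.78, "brier_score": 0.12},
}

selector = CriterionSketch(example_scores, criterion="f1")
best_by_f1 = selector.model

selector.criterion = "brier_score"  # updating the criterion changes the result
best_by_brier = selector.model
```

Because `model` is derived on access rather than stored, there is no cached value that can go stale after the criterion is changed.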

Abstract Methods

wrapped_evaluation: Performs model evaluation and generates specified plots.
evaluate_cluster: Performs clustering and calculates Brier scores.
evaluate_feature_importance: Computes feature importance using specified methods.
average_over_splits: Aggregates metrics over multiple splits for model robustness.
wrapped_patient_inference: Runs inference on individual patient data.
wrapped_jackknife: Executes jackknife resampling on patient data for confidence interval estimation.
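Since these methods are abstract, a subclass has to implement all of them before it can be instantiated. The sketch below shows that mechanism using only Python's `abc` module. The class names are hypothetical, only two of the six methods are reproduced to keep it short, and the signatures and return values are placeholders rather than the real ones.

```python
from abc import ABC, abstractmethod


class EvaluatorWrapperSketch(ABC):
    """Hypothetical, reduced stand-in for an abstract evaluator wrapper."""

    @abstractmethod
    def wrapped_evaluation(self) -> str:
        """Evaluate the model (placeholder signature)."""

    @abstractmethod
    def wrapped_jackknife(self) -> str:
        """Run jackknife resampling (placeholder signature)."""


class IncompleteWrapper(EvaluatorWrapperSketch):
    # wrapped_jackknife is missing, so this class remains abstract.
    def wrapped_evaluation(self) -> str:
        return "evaluated"


class CompleteWrapper(EvaluatorWrapperSketch):
    def wrapped_evaluation(self) -> str:
        return "evaluated"

    def wrapped_jackknife(self) -> str:
        return "jackknifed"


try:
    IncompleteWrapper()
    incomplete_instantiated = True
except TypeError:
    # Python refuses to instantiate a class with unimplemented abstract methods.
    incomplete_instantiated = False

wrapper = CompleteWrapper()
```

The check happens at instantiation time, so a missing method surfaces as soon as the wrapper is created rather than when the method is first called.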