[FEAT] Preserve Trained Model #90

Open
opened 2025-05-29 18:22:14 +00:00 by nuluh · 0 comments
nuluh commented 2025-05-29 18:22:14 +00:00 (Migrated from github.com)

Problem Statement

The current implementation of the train_and_evaluate_model function does not preserve the trained model objects after training. This limits the ability to recall or reuse trained models later in the workflow.

Proposed Solution

Modify the train_and_evaluate_model function and the loop iterating through models to store trained models in a dictionary for later use. Here is the proposed solution:

# Define models for sensor1
models_sensor1 = {
    "SVM": SVC(),
    "XGBoost": XGBClassifier()
}

results_sensor1 = []
trained_models = {}  # Dictionary to store trained models for later use

for name, model in models_sensor1.items():
    res = train_and_evaluate_model(model, name, "sensor1", x_train1, y_train, x_test1, y_test)
    results_sensor1.append(res)
    trained_models[f"{name}_sensor1"] = res["trained_model"]  # Save the trained model
    print(f"{name} on sensor1: Accuracy = {res['accuracy']:.2f}%")

# Later in your code, you can recall the models for prediction:
# Example: making predictions with the trained SVM model
# new_predictions = trained_models["SVM_sensor1"].predict(new_data)

Alternatives Considered

None at the moment, but alternatives might include using external serialization tools such as joblib or pickle to save and load models on disk instead of (or in addition to) keeping them in an in-memory dictionary.
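A minimal sketch of the joblib alternative, using a small stand-in model (the file name is illustrative):

```python
import joblib
from sklearn.svm import SVC

# Fit a small example model (stand-in for any trained estimator)
model = SVC()
model.fit([[0], [1], [2], [3]], [0, 0, 1, 1])

# Persist the fitted estimator to disk...
joblib.dump(model, "svm_sensor1.joblib")

# ...and reload it later, in the same or a different session, without re-training
restored = joblib.load("svm_sensor1.joblib")
print(restored.predict([[0], [3]]))
```

Unlike the in-memory dictionary, serialized models survive interpreter restarts, which makes this the better option for long-term storage.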

Component

Python Source Code

Priority

High (significantly improves workflow)

Implementation Ideas

  • Use Python dictionaries to store trained models with identifiable keys.
  • Ensure models are serialized if needed for long-term storage.
  • Extend the train_and_evaluate_model function to optionally include serialization logic.

Expected Benefits

  • Enables reuse of trained models across multiple workflows.
  • Enhances modularity and maintainability of the code.
  • Improves the ability to experiment and test trained models without re-training.

Additional Context

N/A

Reference: nuluh/thesis#90