From feb3c8534069fbca0ac02b87eea08ad43336a239 Mon Sep 17 00:00:00 2001
From: nuluh
Date: Mon, 12 Aug 2024 23:04:18 +0700
Subject: [PATCH] feat(test): Add script for processing raw data, building features, and training model

This commit adds a new script `start.sh` that automates the process of
processing raw data, building features, and training a model. The script
uses Python scripts from the `src` directory to perform these tasks. The
processed data is saved in the `data/processed` directory, the feature
matrix is saved in the `data/features` directory, and the trained model
is saved in the `models` directory.

The purpose of these changes is to streamline the data processing and
model training workflow, making it easier to reproduce and iterate on
the results.
---
 code/start.sh | 10 ++++++++++
 1 file changed, 10 insertions(+)
 create mode 100644 code/start.sh

diff --git a/code/start.sh b/code/start.sh
new file mode 100644
index 0000000..5e52603
--- /dev/null
+++ b/code/start.sh
@@ -0,0 +1,10 @@
+#!/bin/bash
+
+# Process raw data
+python src/data/process_dataset.py data/raw/ data/processed/
+
+# Build features
+python src/features/build_features.py data/processed/processed_data.csv data/features/feature_matrix.npz
+
+# Train model
+python src/models/train_model.py data/features/feature_matrix.npz models/svm_model.pkl
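
Possible follow-up, not part of this patch: as written, `start.sh` keeps going if an earlier step fails, so a failed processing step could still be followed by feature building and training on stale files. The sketch below runs the same three commands with fail-fast options and a preview mode. The names `run`, `pipeline`, and `DRY_RUN` are hypothetical, and the `mkdir -p` line assumes the Python scripts do not create their own output directories, which the patch does not say.

```shell
#!/bin/bash
# Sketch only: same three steps as start.sh, with fail-fast behaviour added.
# Exit on the first failing command, on unset variables, and on pipe failures.
set -euo pipefail

# Hypothetical helper: with DRY_RUN=1, print the command instead of running it.
run() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Hypothetical wrapper around the three steps from the patch.
pipeline() {
    # Assumption: output directories may not exist yet.
    run mkdir -p data/processed data/features models

    # Process raw data
    run python src/data/process_dataset.py data/raw/ data/processed/

    # Build features
    run python src/features/build_features.py data/processed/processed_data.csv data/features/feature_matrix.npz

    # Train model
    run python src/models/train_model.py data/features/feature_matrix.npz models/svm_model.pkl
}

# Preview the commands without executing them.
DRY_RUN=1 pipeline
```

Replacing the last line with a plain `pipeline` call would run the steps for real, from the directory that contains `src/` and `data/`.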