Commit Graph

15 Commits

Author SHA1 Message Date
nuluh
3860f2cc5b fix(docs): The readme.md should belong to raw data since the script is intended to simulate raw data that coming from accelerometer sensors instead of processed data that should be generated by simulating frequency domain data instead. 2024-08-18 10:34:22 +07:00
nuluh
553140fe3c feat(script): add zero-padding to CSV filenames and change the output generated csv as raw data in raw folder 2024-08-17 19:51:42 +07:00
nuluh
1c23edf098 feat: Add .pyc files to .gitignore 2024-08-17 11:41:18 +07:00
nuluh
d0db65011d style 2024-08-17 11:39:46 +07:00
nuluh
565de5d3a8 refactor(notebooks): Move relative import of FeatureExtraction to "Print Time-domain Feature" section for better context 2024-08-17 11:12:43 +07:00
nuluh
6783cfeb3f docs(readme): Improve data README.md explanation
Update the README.md file in the data/processed directory to provide clearer instructions on how to load the data from the desired Dx_TESTy.csv file. This change enhances the usability of the data files for analysis.
2024-08-15 09:46:50 +07:00
nuluh
153e8cb109 feat(data): Initialize dummy data
- Create a Python script to generate CSV files in a structured folder hierarchy under `data/processed` with specific damage levels and tests.
- Add a `.gitignore` file to exclude CSV files from Git tracking, enhancing data privacy and reducing repository size.
- Include a `README.md` in the `data` directory to clearly document the directory structure, file content, and their intended use for clarity and better usability.

Closes #7
2024-08-14 23:26:06 +07:00
nuluh
feb3c85340 feat(test): Add script for processing raw data, building features, and training model
This commit adds a new script `start.sh` that automates the process of processing raw data, building features, and training a model. The script uses Python scripts from the `src` directory to perform these tasks. The processed data is saved in the `data/processed` directory, the feature matrix is saved in the `data/features` directory, and the trained model is saved in the `models` directory.

The purpose of these changes is to streamline the data processing and model training workflow, making it easier to reproduce and iterate on the results.
2024-08-12 23:04:18 +07:00
nuluh
52b458605f feat: Add time-domain feature extraction functionality
This commit adds code to the `03_feature_extraction.ipynb` notebook to print time-domain features. The features include mean, max, peak, peak-to-peak, RMS, variance, standard deviation, power, crest factor, form factor, pulse indicator, margin, kurtosis, and skewness. The features are calculated using the `FeatureExtractor` class and displayed in a pandas DataFrame.
2024-08-12 20:31:05 +07:00
nuluh
fe457f7427 feat: Add python.analysis.extraPaths setting to VSCode
This commit adds the "python.analysis.extraPaths" setting to the VSCode settings.json file. The setting includes the "./code/src/features" directory as an additional path for Python analysis. This change improves the analysis capabilities within the VSCode environment.

Closes #3
2024-08-12 19:59:49 +07:00
nuluh
72bc0f5f91 feat(test): add script for testing FeatureExtractor with mockup data
Introduce a new testing script that generates mockup data and applies the FeatureExtractor class to calculate and display features. This test script assists in verifying the functionality of the feature extraction methods with controlled input data.
2024-08-12 19:46:42 +07:00
nuluh
a401d620eb feat(features): integrate time-domain feature extraction into data pipeline
- Implement FeatureExtractor class in time_domain_features.py for calculating statistical features from dataset columns.
- Create build_features.py script to automate feature extraction from processed data and save results in a structured format.
- Adjust build_features.py to read processed data, utilize FeatureExtractor, and save feature matrix.

This update supports enhanced analysis capabilities within the thesis-project structure, allowing for more sophisticated data processing and model training stages.

Closes #1
2024-08-12 19:45:19 +07:00
nuluh
7d39176e27 feat: Add initial time domain feature extraction class
The code changes add a new file `time_domain_features.py` that contains a `FeatureExtractor` class. This class calculates various time domain features for a given dataset. The features include mean, max, peak, peak-to-peak, RMS, variance, standard deviation, power, crest factor, form factor, pulse indicator, margin, kurtosis, and skewness.

The class takes a file path as input and reads the data from a CSV file. It assumes the data to analyze is in the first column. The calculated features are stored in a dictionary.

The commit message suggests that the purpose of the changes is to add a new class for time domain feature extraction.
2024-08-12 12:37:55 +07:00
nuluh
208f019d12 initial commit generate directory tree 2024-08-11 20:24:14 +07:00
Panuluh
0014196b1c Initial commit 2024-07-25 20:28:04 +07:00