Need Script to Generate CSV Files and Manage .gitignore #7

2024-08-14T11:09:31Z

nuluh commented

(Migrated from github.com)

Description

We need a Python script that automatically generates dummy CSV files in a specific folder structure for testing purposes. Each CSV should contain a timestamp column and a value column, and the files should be stored in a hierarchical folder structure under the data directory.

Requirements

1. CSV Generation:
   - Each CSV file should have two columns: Time and Value.
   - Time should be a timestamp with millisecond precision.
   - Value should be a random float.
   - Generate ~10 rows per CSV.
   - The folder structure should include a main directory (data), with subdirectories for raw and processed. The processed directory should further include directories for different damage levels (DAMAGE_1 to DAMAGE_5), each containing 10 test CSV files (TEST1 to TEST10).
2. Folder and File Naming:
   - Processed files should be saved as Dx_TESTy.csv where x is the damage number and y is the test number.
3. .gitignore Configuration:
   - Ensure that all CSV files are ignored by Git to prevent them from being pushed to the repository.
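
The requirements above could be sketched roughly as follows. This is only one possible approach, not the script that was eventually written for this issue: the function name `generate_dummy_data`, its parameters, the 10 ms spacing between rows, and the `YYYY-MM-DD HH:MM:SS.mmm` timestamp format are all assumptions, since the issue only asks for millisecond precision.

```python
import csv
import random
from datetime import datetime, timedelta
from pathlib import Path


def generate_dummy_data(root="data", n_damage=5, n_tests=10, n_rows=10, seed=None):
    """Create <root>/raw and <root>/processed/DAMAGE_x/Dx_TESTy.csv (hypothetical helper)."""
    rng = random.Random(seed)  # seeded generator so test data can be reproduced
    root = Path(root)
    (root / "raw").mkdir(parents=True, exist_ok=True)

    written = []
    for damage in range(1, n_damage + 1):
        damage_dir = root / "processed" / f"DAMAGE_{damage}"
        damage_dir.mkdir(parents=True, exist_ok=True)
        for test in range(1, n_tests + 1):
            path = damage_dir / f"D{damage}_TEST{test}.csv"
            start = datetime.now()
            with path.open("w", newline="") as fh:
                writer = csv.writer(fh)
                writer.writerow(["Time", "Value"])
                for i in range(n_rows):
                    # Assumption: rows are spaced 10 ms apart.
                    stamp = start + timedelta(milliseconds=10 * i)
                    # timespec="milliseconds" keeps three fractional digits.
                    time_text = stamp.isoformat(sep=" ", timespec="milliseconds")
                    writer.writerow([time_text, rng.random()])
            written.append(path)
    return written
```

Calling `generate_dummy_data()` from the repository root would create 5 damage directories with 10 files each, 50 CSV files in total, plus an empty `data/raw` directory.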

Example Folder Structure

├───data
│   ├───processed
│   │   └───DAMAGE_1
│   │           D1_TEST1.csv
│   │
│   └───raw

Expected Outcome

- A Python script that sets up the described folder structure and populates it with the specified CSV files.
- A .gitignore file configured to ignore all CSV files.
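
For the .gitignore part, a small helper along these lines might be enough. It is a sketch under assumptions: the name `ensure_csv_ignored` and the choice of the `*.csv` pattern are illustrative, and the issue does not say whether the rule should be added by hand or by the script.

```python
from pathlib import Path


def ensure_csv_ignored(gitignore=".gitignore", pattern="*.csv"):
    """Append `pattern` to the .gitignore file unless it is already listed (hypothetical helper)."""
    path = Path(gitignore)
    text = path.read_text() if path.exists() else ""
    if pattern in (line.strip() for line in text.splitlines()):
        return False  # rule already present, nothing to change
    # Start the new rule on its own line if the file lacks a trailing newline.
    prefix = "" if text == "" or text.endswith("\n") else "\n"
    with path.open("a") as fh:
        fh.write(f"{prefix}{pattern}\n")
    return True
```

A pattern without a slash, such as `*.csv`, matches files at any depth, so it also covers everything under data/processed. Note that .gitignore only affects untracked files; any CSV already committed would need to be untracked separately, for example with `git rm --cached`.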

This setup will facilitate the generation and management of test data without cluttering our repository with large data files.

Reference: nuluh/thesis#7