[BUG] STFT CSV export has incorrect shape and column format #43


2025-04-19T16:12:36Z

nuluh commented


(Migrated from github.com)

Bug Description

The notebook exports STFT data to CSV with incorrect dimensions and column names. The exported CSV has 11 rows and 263,169 columns instead of the expected 2,565 rows and 513 columns. Since 263,169 = 513 × 513, each exported row may be a flattened 513 × 513 matrix, which would point to a problem in how the STFT output is reshaped or concatenated before export.

Steps to Reproduce

1. Run the STFT transformation notebook on raw accelerometer data.
2. Export the results to CSV using the current code.
3. Load the CSV and inspect its shape:

import pandas as pd
data = pd.read_csv("stft_data1_1.csv")
data.info()

Expected Behavior

The CSV file should have the correct dimensions matching the STFT output:

- Row count: 2,565 entries (time frames)
- Column count: 513 columns (frequency bins)
- Column names follow the format "Freq_X.XX" for each frequency bin
- Total memory usage of around 10.0 MB

As confirmed by previous working exports:

data.info()
# <class 'pandas.core.frame.DataFrame'>
# RangeIndex: 2565 entries, 0 to 2564
# Columns: 513 entries, Freq_0.00 to Freq_512.00
# dtypes: float64(513)
# memory usage: 10.0 MB
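
For illustration, the expected layout can be sketched as follows. This is a minimal sketch, not the notebook's actual export code: the helper name `stft_to_dataframe`, the random stand-in data, and the assumption that the frequency bins sit at 0, 1, ..., 512 (inferred only from the column names `Freq_0.00` to `Freq_512.00` above) are all hypothetical.

```python
import numpy as np
import pandas as pd


def stft_to_dataframe(magnitude, freqs):
    """Hypothetical helper: label a (time frames x frequency bins) array."""
    magnitude = np.asarray(magnitude)
    if magnitude.ndim != 2:
        raise ValueError(f"expected a 2-D array, got {magnitude.ndim} dimension(s)")
    if magnitude.shape[1] != len(freqs):
        raise ValueError(
            f"{magnitude.shape[1]} columns do not match {len(freqs)} frequency bins"
        )
    # One column per frequency bin, named in the "Freq_X.XX" format.
    columns = [f"Freq_{f:.2f}" for f in freqs]
    return pd.DataFrame(magnitude, columns=columns)


# Stand-in data with the expected shape (assumption: real STFT magnitudes
# would be used here instead of random values).
rng = np.random.default_rng(0)
magnitude = rng.random((2565, 513))
freqs = np.arange(513, dtype=float)  # assumed bin frequencies 0.0 ... 512.0

df = stft_to_dataframe(magnitude, freqs)
```

Writing `df` with `df.to_csv(path, index=False)` keeps the named header row, so reading the file back with `pd.read_csv` should reproduce the 2565 × 513 layout.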

Actual Behavior

The CSV file has completely incorrect dimensions:

- Row count: only 11 entries
- Column count: 263,169 columns
- Column names are floating-point numbers instead of the expected "Freq_X.XX" format
- Memory usage is higher, at 22.1 MB

data.info()
# <class 'pandas.core.frame.DataFrame'>
# RangeIndex: 11 entries, 0 to 10
# Columns: 263169 entries, 0.0005546800602091881 to 0.0009552282034601476
# dtypes: float64(263169)
# memory usage: 22.1 MB
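
One possible reading of these numbers, offered as a hypothesis rather than a confirmed diagnosis: 263,169 equals 513 × 513, so each row may be a flattened 513 × 513 matrix, and the float-valued column names would be consistent with a file written without a header row, whose first data row `pd.read_csv` then treats as the header. The sketch below reproduces both effects on a toy 4 × 4 example. The helper `unflatten_rows` is hypothetical, and the row-major layout it assumes may not match the notebook.

```python
import io

import numpy as np
import pandas as pd

# The reported column count is the square of the expected bin count.
assert 513 * 513 == 263_169


def unflatten_rows(flat, n_bins):
    """Hypothetical helper: split each flattened row into rows of n_bins values."""
    flat = np.asarray(flat)
    if flat.ndim != 2 or flat.shape[1] % n_bins != 0:
        raise ValueError("row length is not a multiple of n_bins")
    # Row-major reshape: assumes each matrix was flattened row by row.
    return flat.reshape(-1, n_bins)


# Toy stand-in: 3 matrices of 4 x 4 values, each flattened into one row of 16.
n_bins = 4
flat = np.arange(3 * n_bins * n_bins, dtype=float).reshape(3, n_bins * n_bins)

# Write without a header row, as a bare array dump would.
csv_text = pd.DataFrame(flat).to_csv(index=False, header=False)

# Default read: the first data row is consumed as the column names.
default_read = pd.read_csv(io.StringIO(csv_text))

# Reading with header=None keeps every row as data.
headerless_read = pd.read_csv(io.StringIO(csv_text), header=None)

restored = unflatten_rows(headerless_read.to_numpy(), n_bins)
```

If this hypothesis holds for the real file, reading it with `header=None` would show one more row than `data.info()` reports. Whether the recovered axes are time × frequency or the transpose would still need to be checked against the notebook.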

Error Logs

No error messages are displayed during execution, but the resulting data structure is clearly incorrect.
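
Because the export fails silently, a read-back check after writing the CSV could surface the problem early. The following is a minimal sketch under stated assumptions: the function name `find_export_problems` and the `Freq_` prefix check are illustrative and not part of the existing notebook.

```python
import io

import numpy as np
import pandas as pd


def find_export_problems(csv_source, expected_rows, expected_cols, prefix="Freq_"):
    """Hypothetical check: read a CSV back and list layout problems found."""
    df = pd.read_csv(csv_source)
    problems = []
    if df.shape != (expected_rows, expected_cols):
        problems.append(
            f"shape is {df.shape}, expected {(expected_rows, expected_cols)}"
        )
    bad_names = [c for c in df.columns if not str(c).startswith(prefix)]
    if bad_names:
        problems.append(f"{len(bad_names)} column name(s) lack the '{prefix}' prefix")
    return problems


# Well-formed toy export: 5 rows, 3 named frequency columns.
good = pd.DataFrame(np.zeros((5, 3)), columns=["Freq_0.00", "Freq_1.00", "Freq_2.00"])
good_problems = find_export_problems(io.StringIO(good.to_csv(index=False)), 5, 3)

# Malformed toy export: flattened rows written without a header row.
bad = pd.DataFrame(np.arange(12.0).reshape(2, 6))
bad_csv = bad.to_csv(index=False, header=False)
bad_problems = find_export_problems(io.StringIO(bad_csv), 5, 3)
```

For the file in this report, the call would take the CSV path with `expected_rows=2565` and `expected_cols=513`, and a non-empty result could be turned into an exception in the notebook.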

Component

Jupyter Notebook

Version/Commit

2868101

Environment

No response

Additional Context

No response



Reference: nuluh/thesis#43