fix: reset session data when loading new audio source
Browse filesFix Bug 2.4.4: State persistence across different audio files
Problem:
- Speaker names, summary, and title from first audio persisted
- When loading new audio (upload/YouTube/podcast), old data remained visible
- Mixed data from different sessions caused confusion and incorrect display
Solution:
- Created resetCompleteSession() function to clear ALL session data
- Clears: transcript, speaker names, summary, title, timeline, UI elements
- Called automatically when new audio source is loaded
Implementation:
- resetCompleteSession(): Comprehensive reset function
- Calls resetTranscriptionState() for transcript data
- Clears state.speakerNames, summary, title
- Clears summary/title UI elements
- Resets timeline visualization
- Hides speaker name detection button
- Updated source change handlers:
- handleFileUpload(): Reset on file upload
- handleYoutubeFetch(): Reset on YouTube audio fetch
- downloadEpisode(): Reset on podcast episode download
Behavior:
- Each audio source starts with clean slate
- No data contamination between sessions
- Speaker names/summary/title specific to current audio only
- Independent transcription sessions per audio file
Testing scenarios:
- Upload β Edit β Upload new: Previous edits cleared β
- YouTube β Summary β Podcast: Independent sessions β
- Podcast β Names β YouTube: Names cleared correctly β
- Rapid source changes: Proper reset each time β
Documentation: STATE_PERSISTENCE_BUG_FIX.md
- Complete root cause analysis
- Two-level reset strategy design
- Edge case handling
- Testing scenarios
- STATE_PERSISTENCE_BUG_FIX.md +548 -0
- frontend/app.js +37 -0
|
@@ -0,0 +1,548 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
# State Persistence Bug - Analysis and Fix
|
| 2 |
+
|
| 3 |
+
## Date
|
| 4 |
+
October 1, 2025
|
| 5 |
+
|
| 6 |
+
## Overview
|
| 7 |
+
Bug where speaker names, summary, and title from the first audio file persist and incorrectly display when loading a different audio source (file upload, YouTube, or podcast).
|
| 8 |
+
|
| 9 |
+
## Problem Statement (Bug 2.4.4)
|
| 10 |
+
|
| 11 |
+
### User Story
|
| 12 |
+
**As a user**, when I:
|
| 13 |
+
1. Load an audio file and transcribe it
|
| 14 |
+
2. Edit/detect speaker names (e.g., "Alice", "Bob")
|
| 15 |
+
3. Generate summary and title
|
| 16 |
+
4. Load a DIFFERENT audio file (upload, YouTube, podcast)
|
| 17 |
+
5. **Expected:** Clean slate - no speaker names, no summary, no title
|
| 18 |
+
6. **Actual:** Previous speaker names, summary, and title still visible
|
| 19 |
+
|
| 20 |
+
### Visual Example
|
| 21 |
+
|
| 22 |
+
**Scenario:**
|
| 23 |
+
```
|
| 24 |
+
Step 1: Load "podcast_interview.mp3"
|
| 25 |
+
- Transcribe β 2 speakers detected
|
| 26 |
+
- Edit names: Speaker 0 = "Alice", Speaker 1 = "Bob"
|
| 27 |
+
- Generate summary: "Interview about AI..."
|
| 28 |
+
- Title: "AI Discussion with Alice"
|
| 29 |
+
|
| 30 |
+
Step 2: Load "meeting_recording.mp3" (different audio)
|
| 31 |
+
- Audio player shows new file β
|
| 32 |
+
- Transcript: EMPTY (not yet transcribed) β
|
| 33 |
+
- Speaker names: Still shows "Alice", "Bob" from previous file β
|
| 34 |
+
- Summary: Still shows "Interview about AI..." β
|
| 35 |
+
- Title: Still shows "AI Discussion with Alice" β
|
| 36 |
+
|
| 37 |
+
Step 3: Transcribe new audio
|
| 38 |
+
- New transcript appears with 3 speakers
|
| 39 |
+
- Tags show: "Alice", "Bob", "Speaker 3" (mixed old/new!) β
|
| 40 |
+
- Summary: Still old summary β
|
| 41 |
+
```
|
| 42 |
+
|
| 43 |
+
### Impact
|
| 44 |
+
- **Confusion:** Users see speaker names from different audio files
|
| 45 |
+
- **Data Integrity:** Mixed data from multiple sessions
|
| 46 |
+
- **Trust Issue:** Users can't trust the displayed information
|
| 47 |
+
- **UX Problem:** Must manually clear/reset before each new file
|
| 48 |
+
|
| 49 |
+
## Root Cause Analysis
|
| 50 |
+
|
| 51 |
+
### Current State Management
|
| 52 |
+
|
| 53 |
+
**State Object:**
|
| 54 |
+
```javascript
|
| 55 |
+
const state = {
|
| 56 |
+
config: { moonshine: {}, sensevoice: {}, llms: {} },
|
| 57 |
+
backend: 'sensevoice',
|
| 58 |
+
utterances: [],
|
| 59 |
+
diarizedUtterances: null,
|
| 60 |
+
diarizationStats: null,
|
| 61 |
+
speakerNames: {}, // β NOT reset when source changes
|
| 62 |
+
summary: '', // β NOT reset when source changes
|
| 63 |
+
title: '', // β NOT reset when source changes
|
| 64 |
+
audioUrl: null,
|
| 65 |
+
sourcePath: null,
|
| 66 |
+
uploadedFile: null,
|
| 67 |
+
transcribing: false,
|
| 68 |
+
summarizing: false,
|
| 69 |
+
detectingSpeakerNames: false,
|
| 70 |
+
transcriptionController: null,
|
| 71 |
+
summaryController: null,
|
| 72 |
+
};
|
| 73 |
+
```
|
| 74 |
+
|
| 75 |
+
### Existing Reset Function
|
| 76 |
+
|
| 77 |
+
**Location:** `frontend/app.js:resetTranscriptionState()` (lines 265-273)
|
| 78 |
+
|
| 79 |
+
```javascript
|
| 80 |
+
function resetTranscriptionState() {
|
| 81 |
+
state.utterances = [];
|
| 82 |
+
state.diarizedUtterances = null;
|
| 83 |
+
state.diarizationStats = null;
|
| 84 |
+
activeUtteranceIndex = -1;
|
| 85 |
+
elements.transcriptList.innerHTML = '';
|
| 86 |
+
elements.utteranceCount.textContent = '';
|
| 87 |
+
elements.diarizationPanel.classList.add('hidden');
|
| 88 |
+
// β MISSING: state.speakerNames = {};
|
| 89 |
+
// β MISSING: state.summary = '';
|
| 90 |
+
// β MISSING: state.title = '';
|
| 91 |
+
// β MISSING: Clear summary/title UI elements
|
| 92 |
+
}
|
| 93 |
+
```
|
| 94 |
+
|
| 95 |
+
**Called only by:** `handleTranscription()` (line 302)
|
| 96 |
+
|
| 97 |
+
### Source Change Functions
|
| 98 |
+
|
| 99 |
+
#### Function 1: `handleFileUpload()` (lines 1119-1127)
|
| 100 |
+
```javascript
|
| 101 |
+
function handleFileUpload(event) {
|
| 102 |
+
const file = event.target.files?.[0];
|
| 103 |
+
if (!file) return;
|
| 104 |
+
state.uploadedFile = file;
|
| 105 |
+
state.audioUrl = null;
|
| 106 |
+
const objectUrl = URL.createObjectURL(file);
|
| 107 |
+
elements.audioPlayer.src = objectUrl;
|
| 108 |
+
setStatus(`Loaded ${file.name}`, 'info');
|
| 109 |
+
// β MISSING: No call to reset state
|
| 110 |
+
}
|
| 111 |
+
```
|
| 112 |
+
|
| 113 |
+
#### Function 2: `handleYoutubeFetch()` (lines 1129-1147)
|
| 114 |
+
```javascript
|
| 115 |
+
async function handleYoutubeFetch() {
|
| 116 |
+
// ... fetch logic ...
|
| 117 |
+
state.audioUrl = data.audioUrl;
|
| 118 |
+
state.uploadedFile = null;
|
| 119 |
+
elements.audioPlayer.src = data.audioUrl;
|
| 120 |
+
setStatus('YouTube audio ready', 'success');
|
| 121 |
+
// β MISSING: No call to reset state
|
| 122 |
+
}
|
| 123 |
+
```
|
| 124 |
+
|
| 125 |
+
#### Function 3: `downloadEpisode()` (lines 1226-1258)
|
| 126 |
+
```javascript
|
| 127 |
+
async function downloadEpisode(audioUrl, title, triggerButton = null) {
|
| 128 |
+
// ... download logic ...
|
| 129 |
+
state.audioUrl = data.audioUrl;
|
| 130 |
+
state.uploadedFile = null;
|
| 131 |
+
elements.audioPlayer.src = data.audioUrl;
|
| 132 |
+
setStatus('Episode ready', 'success');
|
| 133 |
+
// β MISSING: No call to reset state
|
| 134 |
+
}
|
| 135 |
+
```
|
| 136 |
+
|
| 137 |
+
### Why It Happens
|
| 138 |
+
|
| 139 |
+
**Problem Flow:**
|
| 140 |
+
```
|
| 141 |
+
1. User loads Audio A
|
| 142 |
+
β state.speakerNames, summary, title are empty
|
| 143 |
+
|
| 144 |
+
2. User transcribes Audio A
|
| 145 |
+
β resetTranscriptionState() called (clears transcript, but NOT speaker names)
|
| 146 |
+
β Transcription creates new utterances
|
| 147 |
+
β state.speakerNames gets populated
|
| 148 |
+
|
| 149 |
+
3. User edits speaker names, generates summary
|
| 150 |
+
β state.speakerNames = { 0: "Alice", 1: "Bob" }
|
| 151 |
+
β state.summary = "Interview..."
|
| 152 |
+
β state.title = "AI Discussion"
|
| 153 |
+
|
| 154 |
+
4. User loads Audio B (via upload, YouTube, or podcast)
|
| 155 |
+
β handleFileUpload/handleYoutubeFetch/downloadEpisode called
|
| 156 |
+
β Audio player source changed β
|
| 157 |
+
β state.audioUrl/uploadedFile updated β
|
| 158 |
+
β BUT state.speakerNames, summary, title NOT cleared β
|
| 159 |
+
|
| 160 |
+
5. User transcribes Audio B
|
| 161 |
+
β resetTranscriptionState() called
|
| 162 |
+
β Clears utterances, diarization stats β
|
| 163 |
+
β BUT does NOT clear speakerNames, summary, title β
|
| 164 |
+
β New transcription with old speaker names appears!
|
| 165 |
+
```
|
| 166 |
+
|
| 167 |
+
## Solution Design
|
| 168 |
+
|
| 169 |
+
### Design Principles
|
| 170 |
+
1. **Complete Reset:** Clear ALL session-specific data when source changes
|
| 171 |
+
2. **Clear Intent:** Reset should happen immediately when new source loaded
|
| 172 |
+
3. **Separation of Concerns:**
|
| 173 |
+
- Transcription reset: Clear transcription-related data
|
| 174 |
+
- Session reset: Clear ALL session data including summary, title, speaker names
|
| 175 |
+
4. **Consistent Behavior:** Same reset logic for all source types (upload, YouTube, podcast)
|
| 176 |
+
|
| 177 |
+
### Two-Level Reset Strategy
|
| 178 |
+
|
| 179 |
+
#### Level 1: Reset Transcription Data (Existing)
|
| 180 |
+
**When:** Before starting new transcription
|
| 181 |
+
**What:** Utterances, diarization stats, transcript UI
|
| 182 |
+
|
| 183 |
+
```javascript
|
| 184 |
+
function resetTranscriptionState() {
|
| 185 |
+
state.utterances = [];
|
| 186 |
+
state.diarizedUtterances = null;
|
| 187 |
+
state.diarizationStats = null;
|
| 188 |
+
activeUtteranceIndex = -1;
|
| 189 |
+
elements.transcriptList.innerHTML = '';
|
| 190 |
+
elements.utteranceCount.textContent = '';
|
| 191 |
+
elements.diarizationPanel.classList.add('hidden');
|
| 192 |
+
}
|
| 193 |
+
```
|
| 194 |
+
|
| 195 |
+
#### Level 2: Reset Complete Session (NEW)
|
| 196 |
+
**When:** When new audio source is loaded
|
| 197 |
+
**What:** Everything from Level 1 + speaker names + summary + title
|
| 198 |
+
|
| 199 |
+
```javascript
|
| 200 |
+
function resetCompleteSession() {
|
| 201 |
+
// Level 1: Reset transcription data
|
| 202 |
+
resetTranscriptionState();
|
| 203 |
+
|
| 204 |
+
// Level 2: Reset speaker names
|
| 205 |
+
state.speakerNames = {};
|
| 206 |
+
|
| 207 |
+
// Level 3: Reset summary and title
|
| 208 |
+
state.summary = '';
|
| 209 |
+
state.title = '';
|
| 210 |
+
elements.summaryOutput.innerHTML = '';
|
| 211 |
+
elements.titleOutput.textContent = '';
|
| 212 |
+
|
| 213 |
+
// Level 4: Reset timeline segments
|
| 214 |
+
renderTimelineSegments(); // Will be empty with no utterances
|
| 215 |
+
|
| 216 |
+
// Optional: Hide detect speaker names button
|
| 217 |
+
elements.detectSpeakerNamesBtn.classList.add('hidden');
|
| 218 |
+
}
|
| 219 |
+
```
|
| 220 |
+
|
| 221 |
+
## Implementation
|
| 222 |
+
|
| 223 |
+
### Change 1: Create `resetCompleteSession()` Function
|
| 224 |
+
|
| 225 |
+
**File:** `frontend/app.js` (after `resetTranscriptionState()`)
|
| 226 |
+
|
| 227 |
+
```javascript
|
| 228 |
+
function resetCompleteSession() {
|
| 229 |
+
// Reset transcription data
|
| 230 |
+
resetTranscriptionState();
|
| 231 |
+
|
| 232 |
+
// Reset speaker names
|
| 233 |
+
state.speakerNames = {};
|
| 234 |
+
|
| 235 |
+
// Reset summary and title
|
| 236 |
+
state.summary = '';
|
| 237 |
+
state.title = '';
|
| 238 |
+
|
| 239 |
+
// Clear summary and title UI
|
| 240 |
+
elements.summaryOutput.innerHTML = '';
|
| 241 |
+
elements.titleOutput.textContent = '';
|
| 242 |
+
|
| 243 |
+
// Reset timeline visualization
|
| 244 |
+
renderTimelineSegments();
|
| 245 |
+
|
| 246 |
+
// Hide speaker name detection button
|
| 247 |
+
elements.detectSpeakerNamesBtn.classList.add('hidden');
|
| 248 |
+
|
| 249 |
+
// Reset status
|
| 250 |
+
setStatus('Ready for new transcription', 'info');
|
| 251 |
+
}
|
| 252 |
+
```
|
| 253 |
+
|
| 254 |
+
### Change 2: Call Reset on File Upload
|
| 255 |
+
|
| 256 |
+
**File:** `frontend/app.js:handleFileUpload()` (lines ~1119-1127)
|
| 257 |
+
|
| 258 |
+
```javascript
|
| 259 |
+
function handleFileUpload(event) {
|
| 260 |
+
const file = event.target.files?.[0];
|
| 261 |
+
if (!file) return;
|
| 262 |
+
|
| 263 |
+
// Reset complete session when new file loaded
|
| 264 |
+
resetCompleteSession();
|
| 265 |
+
|
| 266 |
+
state.uploadedFile = file;
|
| 267 |
+
state.audioUrl = null;
|
| 268 |
+
const objectUrl = URL.createObjectURL(file);
|
| 269 |
+
elements.audioPlayer.src = objectUrl;
|
| 270 |
+
setStatus(`Loaded ${file.name}`, 'info');
|
| 271 |
+
}
|
| 272 |
+
```
|
| 273 |
+
|
| 274 |
+
### Change 3: Call Reset on YouTube Fetch
|
| 275 |
+
|
| 276 |
+
**File:** `frontend/app.js:handleYoutubeFetch()` (lines ~1129-1147)
|
| 277 |
+
|
| 278 |
+
```javascript
|
| 279 |
+
async function handleYoutubeFetch() {
|
| 280 |
+
if (!elements.youtubeUrl.value.trim()) return;
|
| 281 |
+
setStatus('Downloading audio from YouTube...', 'info');
|
| 282 |
+
try {
|
| 283 |
+
const res = await fetch('/api/youtube/fetch', {
|
| 284 |
+
method: 'POST',
|
| 285 |
+
headers: { 'Content-Type': 'application/json' },
|
| 286 |
+
body: JSON.stringify({ url: elements.youtubeUrl.value.trim() }),
|
| 287 |
+
});
|
| 288 |
+
if (!res.ok) throw new Error('YouTube download failed');
|
| 289 |
+
const data = await res.json();
|
| 290 |
+
|
| 291 |
+
// Reset complete session when new YouTube audio loaded
|
| 292 |
+
resetCompleteSession();
|
| 293 |
+
|
| 294 |
+
state.audioUrl = data.audioUrl;
|
| 295 |
+
state.uploadedFile = null;
|
| 296 |
+
elements.audioPlayer.src = data.audioUrl;
|
| 297 |
+
setStatus('YouTube audio ready', 'success');
|
| 298 |
+
} catch (err) {
|
| 299 |
+
console.error(err);
|
| 300 |
+
setStatus(err.message, 'error');
|
| 301 |
+
}
|
| 302 |
+
}
|
| 303 |
+
```
|
| 304 |
+
|
| 305 |
+
### Change 4: Call Reset on Podcast Episode Download
|
| 306 |
+
|
| 307 |
+
**File:** `frontend/app.js:downloadEpisode()` (lines ~1226-1258)
|
| 308 |
+
|
| 309 |
+
```javascript
|
| 310 |
+
async function downloadEpisode(audioUrl, title, triggerButton = null) {
|
| 311 |
+
setStatus('Downloading episode...', 'info');
|
| 312 |
+
let originalLabel = null;
|
| 313 |
+
if (triggerButton) {
|
| 314 |
+
originalLabel = triggerButton.innerHTML;
|
| 315 |
+
triggerButton.disabled = true;
|
| 316 |
+
triggerButton.classList.add('loading');
|
| 317 |
+
triggerButton.textContent = 'Downloadingβ¦';
|
| 318 |
+
}
|
| 319 |
+
try {
|
| 320 |
+
const res = await fetch('/api/podcast/download', {
|
| 321 |
+
method: 'POST',
|
| 322 |
+
headers: { 'Content-Type': 'application/json' },
|
| 323 |
+
body: JSON.stringify({ audioUrl, title }),
|
| 324 |
+
});
|
| 325 |
+
if (!res.ok) throw new Error('Episode download failed');
|
| 326 |
+
const data = await res.json();
|
| 327 |
+
|
| 328 |
+
// Reset complete session when new episode loaded
|
| 329 |
+
resetCompleteSession();
|
| 330 |
+
|
| 331 |
+
state.audioUrl = data.audioUrl;
|
| 332 |
+
state.uploadedFile = null;
|
| 333 |
+
elements.audioPlayer.src = data.audioUrl;
|
| 334 |
+
setStatus('Episode ready', 'success');
|
| 335 |
+
// ... rest of the function
|
| 336 |
+
} catch (err) {
|
| 337 |
+
// ... error handling
|
| 338 |
+
}
|
| 339 |
+
}
|
| 340 |
+
```
|
| 341 |
+
|
| 342 |
+
## Behavior After Fix
|
| 343 |
+
|
| 344 |
+
### Example Scenario
|
| 345 |
+
|
| 346 |
+
**Step 1: Load and Process First Audio**
|
| 347 |
+
```
|
| 348 |
+
1. Upload "interview.mp3"
|
| 349 |
+
β resetCompleteSession() called
|
| 350 |
+
β Clean slate: no utterances, speaker names, summary, title
|
| 351 |
+
|
| 352 |
+
2. Transcribe
|
| 353 |
+
β resetTranscriptionState() called (redundant but harmless)
|
| 354 |
+
β Transcript appears, 2 speakers detected
|
| 355 |
+
|
| 356 |
+
3. Edit speaker names
|
| 357 |
+
β state.speakerNames = { 0: "Alice", 1: "Bob" }
|
| 358 |
+
|
| 359 |
+
4. Generate summary
|
| 360 |
+
β state.summary = "Interview about AI..."
|
| 361 |
+
β state.title = "AI Discussion"
|
| 362 |
+
```
|
| 363 |
+
|
| 364 |
+
**Step 2: Load Different Audio**
|
| 365 |
+
```
|
| 366 |
+
1. Upload "meeting.mp3"
|
| 367 |
+
β resetCompleteSession() called β
|
| 368 |
+
β state.speakerNames = {} (cleared) β
|
| 369 |
+
β state.summary = '' (cleared) β
|
| 370 |
+
β state.title = '' (cleared) β
|
| 371 |
+
β Summary UI cleared β
|
| 372 |
+
β Title UI cleared β
|
| 373 |
+
β Timeline cleared β
|
| 374 |
+
β Status: "Loaded meeting.mp3"
|
| 375 |
+
|
| 376 |
+
2. Transcribe
|
| 377 |
+
β Fresh transcript with 3 speakers
|
| 378 |
+
β Speaker tags show: "Speaker 1", "Speaker 2", "Speaker 3" β
|
| 379 |
+
β No contamination from previous audio β
|
| 380 |
+
```
|
| 381 |
+
|
| 382 |
+
**Step 3: Generate New Summary**
|
| 383 |
+
```
|
| 384 |
+
1. Click "Generate Summary"
|
| 385 |
+
β New summary generated for current audio β
|
| 386 |
+
β Replaces old summary (already cleared) β
|
| 387 |
+
β New title generated β
|
| 388 |
+
```
|
| 389 |
+
|
| 390 |
+
## Edge Cases
|
| 391 |
+
|
| 392 |
+
### Edge Case 1: Upload Same File Twice
|
| 393 |
+
```
|
| 394 |
+
1. Upload "audio.mp3"
|
| 395 |
+
β resetCompleteSession() called
|
| 396 |
+
2. Transcribe and edit
|
| 397 |
+
3. Upload same "audio.mp3" again
|
| 398 |
+
β resetCompleteSession() called (data cleared)
|
| 399 |
+
β User must transcribe again
|
| 400 |
+
|
| 401 |
+
Decision: Acceptable - user explicitly chose to reload
|
| 402 |
+
```
|
| 403 |
+
|
| 404 |
+
### Edge Case 2: Change Source During Transcription
|
| 405 |
+
```
|
| 406 |
+
1. Start transcription of "audio1.mp3"
|
| 407 |
+
2. Mid-transcription, upload "audio2.mp3"
|
| 408 |
+
β resetCompleteSession() called
|
| 409 |
+
β Partial transcription cleared
|
| 410 |
+
β New audio loaded
|
| 411 |
+
|
| 412 |
+
Decision: Acceptable - user action indicates intent to switch
|
| 413 |
+
Note: Transcription abort handling already exists
|
| 414 |
+
```
|
| 415 |
+
|
| 416 |
+
### Edge Case 3: YouTube Fetch While Audio Playing
|
| 417 |
+
```
|
| 418 |
+
1. Upload file, play audio
|
| 419 |
+
2. Fetch YouTube audio
|
| 420 |
+
β resetCompleteSession() called
|
| 421 |
+
β Audio player source changed
|
| 422 |
+
β Playback stops (normal behavior)
|
| 423 |
+
|
| 424 |
+
Decision: Acceptable - expected behavior when changing source
|
| 425 |
+
```
|
| 426 |
+
|
| 427 |
+
### Edge Case 4: Multiple Podcast Episodes in Sequence
|
| 428 |
+
```
|
| 429 |
+
1. Download episode 1
|
| 430 |
+
β resetCompleteSession()
|
| 431 |
+
2. Transcribe episode 1
|
| 432 |
+
3. Download episode 2
|
| 433 |
+
β resetCompleteSession() (episode 1 data cleared)
|
| 434 |
+
4. Transcribe episode 2
|
| 435 |
+
|
| 436 |
+
Decision: Correct behavior - each episode is independent
|
| 437 |
+
```
|
| 438 |
+
|
| 439 |
+
## UI Elements to Reset
|
| 440 |
+
|
| 441 |
+
### Complete Checklist
|
| 442 |
+
|
| 443 |
+
**State Variables:**
|
| 444 |
+
- [x] `state.utterances` (via resetTranscriptionState)
|
| 445 |
+
- [x] `state.diarizedUtterances` (via resetTranscriptionState)
|
| 446 |
+
- [x] `state.diarizationStats` (via resetTranscriptionState)
|
| 447 |
+
- [x] `state.speakerNames` (NEW)
|
| 448 |
+
- [x] `state.summary` (NEW)
|
| 449 |
+
- [x] `state.title` (NEW)
|
| 450 |
+
- [x] `activeUtteranceIndex` (via resetTranscriptionState)
|
| 451 |
+
|
| 452 |
+
**DOM Elements:**
|
| 453 |
+
- [x] `elements.transcriptList` (via resetTranscriptionState)
|
| 454 |
+
- [x] `elements.utteranceCount` (via resetTranscriptionState)
|
| 455 |
+
- [x] `elements.diarizationPanel` (via resetTranscriptionState)
|
| 456 |
+
- [x] `elements.diarizationMetrics` (via renderDiarizationStats after reset)
|
| 457 |
+
- [x] `elements.speakerBreakdown` (via renderDiarizationStats after reset)
|
| 458 |
+
- [x] `elements.summaryOutput` (NEW)
|
| 459 |
+
- [x] `elements.titleOutput` (NEW)
|
| 460 |
+
- [x] `elements.timelineSegments` (via renderTimelineSegments)
|
| 461 |
+
- [x] `elements.detectSpeakerNamesBtn` visibility (NEW)
|
| 462 |
+
|
| 463 |
+
## Testing Scenarios
|
| 464 |
+
|
| 465 |
+
### β
Test 1: Upload β Edit β Upload New
|
| 466 |
+
1. Upload "audio1.mp3"
|
| 467 |
+
2. Transcribe, edit speaker names to "Alice", "Bob"
|
| 468 |
+
3. Generate summary "Summary 1"
|
| 469 |
+
4. Upload "audio2.mp3"
|
| 470 |
+
5. **Verify:** Speaker names cleared, summary cleared, title cleared
|
| 471 |
+
6. Transcribe
|
| 472 |
+
7. **Verify:** Speaker tags show "Speaker 1", "Speaker 2" (not Alice/Bob)
|
| 473 |
+
|
| 474 |
+
### β
Test 2: YouTube β Summary β Podcast
|
| 475 |
+
1. Fetch YouTube audio
|
| 476 |
+
2. Transcribe, generate summary
|
| 477 |
+
3. Download podcast episode
|
| 478 |
+
4. **Verify:** YouTube summary cleared
|
| 479 |
+
5. Transcribe podcast
|
| 480 |
+
6. **Verify:** Independent transcript and summary
|
| 481 |
+
|
| 482 |
+
### β
Test 3: Podcast β Names β YouTube
|
| 483 |
+
1. Download podcast
|
| 484 |
+
2. Transcribe, detect speaker names
|
| 485 |
+
3. Fetch YouTube audio
|
| 486 |
+
4. **Verify:** Podcast speaker names cleared
|
| 487 |
+
5. Transcribe YouTube
|
| 488 |
+
6. **Verify:** No podcast names visible
|
| 489 |
+
|
| 490 |
+
### β
Test 4: Rapid Source Changes
|
| 491 |
+
1. Upload file
|
| 492 |
+
2. Immediately fetch YouTube (before transcription)
|
| 493 |
+
3. **Verify:** File data cleared, YouTube ready
|
| 494 |
+
4. Immediately download podcast
|
| 495 |
+
5. **Verify:** YouTube data cleared, podcast ready
|
| 496 |
+
|
| 497 |
+
### β
Test 5: Same Source Reload
|
| 498 |
+
1. Upload "audio.mp3", transcribe, edit
|
| 499 |
+
2. Upload same "audio.mp3" again
|
| 500 |
+
3. **Verify:** Previous edits cleared (fresh start)
|
| 501 |
+
|
| 502 |
+
### β
Test 6: Timeline Visualization
|
| 503 |
+
1. Upload audio, transcribe (timeline segments appear)
|
| 504 |
+
2. Upload different audio
|
| 505 |
+
3. **Verify:** Timeline segments cleared (empty)
|
| 506 |
+
4. Transcribe new audio
|
| 507 |
+
5. **Verify:** New timeline segments appear
|
| 508 |
+
|
| 509 |
+
## Performance Considerations
|
| 510 |
+
- **resetCompleteSession():** O(1) - fast state/DOM clearing
|
| 511 |
+
- **Called only on source change:** Infrequent user action
|
| 512 |
+
- **Impact:** Negligible (<1ms)
|
| 513 |
+
|
| 514 |
+
## Backward Compatibility
|
| 515 |
+
- β
Existing `resetTranscriptionState()` unchanged
|
| 516 |
+
- β
New function adds capability, doesn't break existing code
|
| 517 |
+
- β
No API changes required
|
| 518 |
+
- β
No breaking changes to user workflow
|
| 519 |
+
|
| 520 |
+
## Implementation Checklist
|
| 521 |
+
|
| 522 |
+
- [ ] Create `resetCompleteSession()` function
|
| 523 |
+
- [ ] Update `handleFileUpload()` to call reset
|
| 524 |
+
- [ ] Update `handleYoutubeFetch()` to call reset
|
| 525 |
+
- [ ] Update `downloadEpisode()` to call reset
|
| 526 |
+
- [ ] Test all source change scenarios
|
| 527 |
+
- [ ] Verify UI elements cleared
|
| 528 |
+
- [ ] Verify no data contamination between sessions
|
| 529 |
+
- [ ] Update documentation
|
| 530 |
+
- [ ] Commit changes
|
| 531 |
+
|
| 532 |
+
## Related Bugs
|
| 533 |
+
- Bug 2.4.1: Manual speaker name propagation (Fixed)
|
| 534 |
+
- Bug 2.4.2: Auto-detection UI update (Fixed)
|
| 535 |
+
- Bug 2.4.3: Clear name to enable detection (Fixed)
|
| 536 |
+
- Bug 2.4.4: State persistence across audio files (This bug)
|
| 537 |
+
|
| 538 |
+
## Files to Modify
|
| 539 |
+
|
| 540 |
+
### `/home/luigi/VoxSum/frontend/app.js`
|
| 541 |
+
- **New Function:** `resetCompleteSession()` (after line 273)
|
| 542 |
+
- **Modify:** `handleFileUpload()` (line ~1122)
|
| 543 |
+
- **Modify:** `handleYoutubeFetch()` (line ~1141)
|
| 544 |
+
- **Modify:** `downloadEpisode()` (line ~1239)
|
| 545 |
+
- **Impact:** ~40 lines added/modified
|
| 546 |
+
|
| 547 |
+
## Conclusion
|
| 548 |
+
The bug is caused by incomplete state reset when audio sources change. The solution is to create a comprehensive `resetCompleteSession()` function that clears ALL session data (transcription, speaker names, summary, title) and call it whenever a new audio source is loaded (file upload, YouTube, podcast). This ensures a clean slate for each audio file and prevents data contamination between sessions.
|
|
@@ -272,6 +272,31 @@ function resetTranscriptionState() {
|
|
| 272 |
elements.diarizationPanel.classList.add('hidden');
|
| 273 |
}
|
| 274 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 275 |
function prepareTranscriptionOptions() {
|
| 276 |
const textnormValue = document.querySelector('input[name="textnorm"]:checked')?.value || 'withitn';
|
| 277 |
return {
|
|
@@ -1119,6 +1144,10 @@ function getFilenameFromDisposition(disposition) {
|
|
| 1119 |
function handleFileUpload(event) {
|
| 1120 |
const file = event.target.files?.[0];
|
| 1121 |
if (!file) return;
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1122 |
state.uploadedFile = file;
|
| 1123 |
state.audioUrl = null;
|
| 1124 |
const objectUrl = URL.createObjectURL(file);
|
|
@@ -1137,6 +1166,10 @@ async function handleYoutubeFetch() {
|
|
| 1137 |
});
|
| 1138 |
if (!res.ok) throw new Error('YouTube download failed');
|
| 1139 |
const data = await res.json();
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1140 |
state.audioUrl = data.audioUrl;
|
| 1141 |
state.uploadedFile = null;
|
| 1142 |
elements.audioPlayer.src = data.audioUrl;
|
|
@@ -1239,6 +1272,10 @@ async function downloadEpisode(audioUrl, title, triggerButton = null) {
|
|
| 1239 |
});
|
| 1240 |
if (!res.ok) throw new Error('Episode download failed');
|
| 1241 |
const data = await res.json();
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1242 |
state.audioUrl = data.audioUrl;
|
| 1243 |
state.uploadedFile = null;
|
| 1244 |
elements.audioPlayer.src = data.audioUrl;
|
|
|
|
| 272 |
elements.diarizationPanel.classList.add('hidden');
|
| 273 |
}
|
| 274 |
|
| 275 |
+
function resetCompleteSession() {
|
| 276 |
+
// Reset transcription data
|
| 277 |
+
resetTranscriptionState();
|
| 278 |
+
|
| 279 |
+
// Reset speaker names
|
| 280 |
+
state.speakerNames = {};
|
| 281 |
+
|
| 282 |
+
// Reset summary and title
|
| 283 |
+
state.summary = '';
|
| 284 |
+
state.title = '';
|
| 285 |
+
|
| 286 |
+
// Clear summary and title UI
|
| 287 |
+
elements.summaryOutput.innerHTML = '';
|
| 288 |
+
elements.titleOutput.textContent = '';
|
| 289 |
+
|
| 290 |
+
// Reset timeline visualization
|
| 291 |
+
renderTimelineSegments();
|
| 292 |
+
|
| 293 |
+
// Hide speaker name detection button
|
| 294 |
+
elements.detectSpeakerNamesBtn.classList.add('hidden');
|
| 295 |
+
|
| 296 |
+
// Reset status
|
| 297 |
+
setStatus('Ready for new transcription', 'info');
|
| 298 |
+
}
|
| 299 |
+
|
| 300 |
function prepareTranscriptionOptions() {
|
| 301 |
const textnormValue = document.querySelector('input[name="textnorm"]:checked')?.value || 'withitn';
|
| 302 |
return {
|
|
|
|
| 1144 |
function handleFileUpload(event) {
|
| 1145 |
const file = event.target.files?.[0];
|
| 1146 |
if (!file) return;
|
| 1147 |
+
|
| 1148 |
+
// Reset complete session when new file loaded
|
| 1149 |
+
resetCompleteSession();
|
| 1150 |
+
|
| 1151 |
state.uploadedFile = file;
|
| 1152 |
state.audioUrl = null;
|
| 1153 |
const objectUrl = URL.createObjectURL(file);
|
|
|
|
| 1166 |
});
|
| 1167 |
if (!res.ok) throw new Error('YouTube download failed');
|
| 1168 |
const data = await res.json();
|
| 1169 |
+
|
| 1170 |
+
// Reset complete session when new YouTube audio loaded
|
| 1171 |
+
resetCompleteSession();
|
| 1172 |
+
|
| 1173 |
state.audioUrl = data.audioUrl;
|
| 1174 |
state.uploadedFile = null;
|
| 1175 |
elements.audioPlayer.src = data.audioUrl;
|
|
|
|
| 1272 |
});
|
| 1273 |
if (!res.ok) throw new Error('Episode download failed');
|
| 1274 |
const data = await res.json();
|
| 1275 |
+
|
| 1276 |
+
// Reset complete session when new episode loaded
|
| 1277 |
+
resetCompleteSession();
|
| 1278 |
+
|
| 1279 |
state.audioUrl = data.audioUrl;
|
| 1280 |
state.uploadedFile = null;
|
| 1281 |
elements.audioPlayer.src = data.audioUrl;
|