# Enhanced Crypto Data Tracker - New Features

## 🚀 Overview

This document describes the major enhancements added to the crypto data tracking system, including unified configuration management, advanced scheduling, real-time updates via WebSockets, and comprehensive data persistence.

## ✨ New Features

### 1. Unified Configuration Loader

**File:** `backend/services/unified_config_loader.py`

The unified configuration loader automatically imports and manages all API sources from JSON configuration files at the project root.

**Features:**
- Loads from multiple JSON config files:
  - `crypto_resources_unified_2025-11-11.json` (200+ APIs)
  - `all_apis_merged_2025.json`
  - `ultimate_crypto_pipeline_2025_NZasinich.json`
- Automatic API key extraction
- Category-based organization
- Update type classification (realtime, periodic, scheduled)
- Schedule management for each API
- Import/Export functionality

**Usage:**
```python
from backend.services.unified_config_loader import UnifiedConfigLoader

loader = UnifiedConfigLoader()

# Get all APIs
all_apis = loader.get_all_apis()

# Get APIs by category
market_data_apis = loader.get_apis_by_category('market_data')

# Get APIs by update type
realtime_apis = loader.get_realtime_apis()
periodic_apis = loader.get_periodic_apis()

# Add custom API
loader.add_custom_api({
    'id': 'custom_api',
    'name': 'Custom API',
    'category': 'custom',
    'base_url': 'https://api.example.com',
    'update_type': 'periodic',
    'enabled': True
})
```
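
The loader's internals are not shown in this document. As a rough sketch of how several JSON files could be merged into one registry and grouped by category, assuming each file holds a top-level list of API entries with `id` and `category` keys (the real files' schema may differ, and `merge_api_configs` and `group_by_category` are hypothetical helpers, not part of `UnifiedConfigLoader`):

```python
import json
from collections import defaultdict
from pathlib import Path


def merge_api_configs(paths):
    """Merge API entries from several JSON files, keyed by 'id'.

    Assumes each file contains a JSON list of dicts with an 'id' key.
    Missing files are skipped; later files override earlier ones.
    """
    merged = {}
    for path in map(Path, paths):
        if not path.exists():
            continue
        with path.open(encoding="utf-8") as fh:
            for entry in json.load(fh):
                merged[entry["id"]] = entry
    return merged


def group_by_category(apis):
    """Group merged entries by 'category' (default: 'uncategorized')."""
    groups = defaultdict(list)
    for api in apis.values():
        groups[api.get("category", "uncategorized")].append(api["id"])
    return dict(groups)
```

Letting later files override earlier ones is one possible conflict policy; a real loader might instead merge fields or report duplicates.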

### 2. Enhanced Scheduling System

**File:** `backend/services/scheduler_service.py`

Advanced scheduler that manages periodic and real-time data updates with automatic error handling and retry logic.

**Features:**
- **Periodic Updates:** Schedule APIs to update at specific intervals
- **Real-time Updates:** WebSocket connections for instant data
- **Scheduled Updates:** Less frequent updates for Hugging Face and other resources
- **Smart Retry:** Automatic interval adjustment on failures
- **Callbacks:** Register callbacks for data updates
- **Force Updates:** Manually trigger immediate updates

**Update Types:**
- `realtime` (0s interval): WebSocket - always connected
- `periodic` (60s interval): Regular polling for market data
- `scheduled` (3600s interval): Hourly updates for HF models/datasets
- `daily` (86400s interval): Once per day
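
The mapping from update type to interval can be written down as a small lookup. This is a minimal sketch using the defaults listed above; `resolve_interval` is a hypothetical helper, and treating a missing `update_type` as `periodic` is an assumption rather than documented behavior:

```python
# Default intervals in seconds, taken from the list above.
DEFAULT_INTERVALS = {
    "realtime": 0,       # WebSocket, always connected
    "periodic": 60,      # regular polling
    "scheduled": 3600,   # hourly
    "daily": 86400,      # once per day
}


def resolve_interval(api_config):
    """Return the update interval (seconds) for one API entry.

    An explicit 'interval' wins; otherwise the default for the entry's
    'update_type' is used ('periodic' when absent, which is an assumption).
    """
    if "interval" in api_config:
        return int(api_config["interval"])
    update_type = api_config.get("update_type", "periodic")
    try:
        return DEFAULT_INTERVALS[update_type]
    except KeyError:
        raise ValueError(f"unknown update_type: {update_type!r}") from None
```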

**Usage:**
```python
from backend.services.scheduler_service import SchedulerService

scheduler = SchedulerService(config_loader, db_manager)

# Start scheduler
await scheduler.start()

# Update schedule
scheduler.update_task_schedule('coingecko', interval=120, enabled=True)

# Force update
success = await scheduler.force_update('coingecko')

# Register callback
def on_data_update(api_id, data):
    print(f"Data updated for {api_id}")

scheduler.register_callback('coingecko', on_data_update)

# Get task status
status = scheduler.get_task_status('coingecko')

# Export schedules
scheduler.export_schedules('schedules_backup.json')
```
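
The "Smart Retry" behavior is described only as automatic interval adjustment on failures. One common way to do this is exponential backoff with a cap, sketched below. The `TaskSchedule` class, the doubling factor, and the one-hour cap are all assumptions for illustration and may not match what `scheduler_service.py` actually does:

```python
from dataclasses import dataclass


@dataclass
class TaskSchedule:
    """Hypothetical per-task state; not the real scheduler's data model."""

    base_interval: float          # configured interval in seconds
    max_interval: float = 3600.0  # cap for backed-off intervals (assumed)
    failures: int = 0             # consecutive failures so far

    @property
    def current_interval(self):
        # Double the interval for each consecutive failure, up to the cap.
        return min(self.base_interval * (2 ** self.failures), self.max_interval)

    def record_failure(self):
        self.failures += 1

    def record_success(self):
        # A successful update restores the configured interval.
        self.failures = 0
```

Backing off keeps a failing or rate-limited API from being hit at full frequency, while resetting on success returns the task to its configured schedule.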

### 3. Data Persistence Service

**File:** `backend/services/persistence_service.py`

Comprehensive data persistence with multiple export formats and automatic backups.

**Features:**
- In-memory caching for quick access
- Historical data tracking (configurable limit)
- Export to JSON and CSV formats
- Automatic backups
- Database integration (SQLAlchemy)
- Data cleanup utilities

**Usage:**
```python
from backend.services.persistence_service import PersistenceService

persistence = PersistenceService(db_manager)

# Save data
await persistence.save_api_data(
    'coingecko',
    {'price': 50000},
    metadata={'category': 'market_data'}
)

# Get cached data
data = persistence.get_cached_data('coingecko')

# Get history
history = persistence.get_history('coingecko', limit=100)

# Export to JSON
await persistence.export_to_json('export.json', include_history=True)

# Export to CSV
await persistence.export_to_csv('export.csv', flatten=True)

# Create backup
backup_file = await persistence.backup_all_data()

# Restore from backup
await persistence.restore_from_backup(backup_file)

# Cleanup old data (7 days)
removed = await persistence.cleanup_old_data(days=7)
```
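
The `flatten=True` option suggests that nested records are turned into flat columns before being written as CSV. A minimal sketch of that idea with the standard library follows; the dotted column names and the `flatten` and `records_to_csv` helpers are assumptions, not the service's actual output format:

```python
import csv
import io


def flatten(record, parent_key="", sep="."):
    """Flatten nested dicts: {'a': {'b': 1}} becomes {'a.b': 1}."""
    flat = {}
    for key, value in record.items():
        full_key = f"{parent_key}{sep}{key}" if parent_key else str(key)
        if isinstance(value, dict):
            flat.update(flatten(value, full_key, sep))
        else:
            flat[full_key] = value
    return flat


def records_to_csv(records):
    """Render a list of (possibly nested) dicts as CSV text."""
    rows = [flatten(r) for r in records]
    # Union of all columns, in first-seen order.
    fieldnames = list(dict.fromkeys(k for row in rows for k in row))
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)  # columns missing from a row are left empty
    return buffer.getvalue()
```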

### 4. Real-time WebSocket Service

**File:** `backend/services/websocket_service.py`

WebSocket service for real-time bidirectional communication between backend and frontend.

**Features:**
- Connection management with client tracking
- Subscription-based updates (specific APIs or all)
- Real-time notifications for:
  - API data updates
  - System status changes
  - Schedule modifications
- Request-response patterns for data queries
- Heartbeat/ping-pong for connection health

**WebSocket Message Types:**

**Client → Server:**
- `subscribe`: Subscribe to specific API updates
- `subscribe_all`: Subscribe to all updates
- `unsubscribe`: Unsubscribe from API
- `get_data`: Request cached data
- `get_all_data`: Request all cached data
- `get_schedule`: Request schedule information
- `update_schedule`: Update schedule configuration
- `force_update`: Force immediate API update
- `ping`: Heartbeat

**Server → Client:**
- `connected`: Welcome message with client ID
- `api_update`: API data updated
- `status_update`: System status changed
- `schedule_update`: Schedule modified
- `subscribed`: Subscription confirmed
- `data_response`: Data query response
- `schedule_response`: Schedule query response
- `pong`: Heartbeat response
- `error`: Error occurred

**Usage:**

**Frontend JavaScript:**
```javascript
// Connect
const ws = new WebSocket('ws://localhost:8000/api/v2/ws');

// Subscribe to all updates
ws.send(JSON.stringify({ type: 'subscribe_all' }));

// Subscribe to specific API
ws.send(JSON.stringify({
    type: 'subscribe',
    api_id: 'coingecko'
}));

// Request data
ws.send(JSON.stringify({
    type: 'get_data',
    api_id: 'coingecko'
}));

// Update schedule
ws.send(JSON.stringify({
    type: 'update_schedule',
    api_id: 'coingecko',
    interval: 120,
    enabled: true
}));

// Force update
ws.send(JSON.stringify({
    type: 'force_update',
    api_id: 'coingecko'
}));

// Handle messages
ws.onmessage = (event) => {
    const message = JSON.parse(event.data);

    switch (message.type) {
        case 'api_update':
            console.log(`${message.api_id} updated:`, message.data);
            break;
        case 'status_update':
            console.log('Status:', message.status);
            break;
    }
};
```
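
On the server side, subscription-based updates come down to tracking which clients want which API. The sketch below shows that bookkeeping in plain Python, independent of any WebSocket library. `SubscriptionRegistry` is a hypothetical class, and the reply payload fields, the `"*"` marker, and the `unsubscribed` reply type are assumptions that go beyond the message types listed above:

```python
from collections import defaultdict


class SubscriptionRegistry:
    """Tracks which clients should receive updates for which API.

    Hypothetical helper for illustration; the real service may differ.
    """

    def __init__(self):
        self._by_api = defaultdict(set)  # api_id -> set of client ids
        self._all = set()                # clients subscribed to everything

    def handle(self, client_id, message):
        """Apply a client message and return the reply to send back."""
        msg_type = message.get("type")
        if msg_type == "subscribe":
            self._by_api[message["api_id"]].add(client_id)
            return {"type": "subscribed", "api_id": message["api_id"]}
        if msg_type == "subscribe_all":
            self._all.add(client_id)
            return {"type": "subscribed", "api_id": "*"}
        if msg_type == "unsubscribe":
            self._by_api[message["api_id"]].discard(client_id)
            return {"type": "unsubscribed", "api_id": message["api_id"]}
        if msg_type == "ping":
            return {"type": "pong"}
        return {"type": "error", "message": f"unknown type: {msg_type!r}"}

    def recipients(self, api_id):
        """Clients that should receive an 'api_update' for api_id."""
        return self._all | self._by_api.get(api_id, set())
```

A broadcaster would call `recipients(api_id)` whenever the scheduler reports new data and send an `api_update` message to each returned client.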

### 5. Integrated Backend API

**File:** `backend/routers/integrated_api.py`

Comprehensive REST API that combines all services.

**Endpoints:**

**Configuration:**
- `GET /api/v2/config/apis` - Get all configured APIs
- `GET /api/v2/config/apis/{api_id}` - Get specific API
- `GET /api/v2/config/categories` - Get all categories
- `GET /api/v2/config/apis/category/{category}` - Get APIs by category
- `POST /api/v2/config/apis` - Add custom API
- `DELETE /api/v2/config/apis/{api_id}` - Remove API
- `GET /api/v2/config/export` - Export configuration

**Scheduling:**
- `GET /api/v2/schedule/tasks` - Get all scheduled tasks
- `GET /api/v2/schedule/tasks/{api_id}` - Get specific task
- `PUT /api/v2/schedule/tasks/{api_id}` - Update schedule
- `POST /api/v2/schedule/tasks/{api_id}/force-update` - Force update
- `GET /api/v2/schedule/export` - Export schedules

**Data:**
- `GET /api/v2/data/cached` - Get all cached data
- `GET /api/v2/data/cached/{api_id}` - Get cached data for API
- `GET /api/v2/data/history/{api_id}` - Get historical data
- `GET /api/v2/data/statistics` - Get storage statistics

**Export/Import:**
- `POST /api/v2/export/json` - Export to JSON
- `POST /api/v2/export/csv` - Export to CSV
- `POST /api/v2/export/history/{api_id}` - Export API history
- `GET /api/v2/download?file={path}` - Download exported file
- `POST /api/v2/backup` - Create backup
- `POST /api/v2/restore` - Restore from backup

**Status:**
- `GET /api/v2/status` - System status
- `GET /api/v2/health` - Health check

**Cleanup:**
- `POST /api/v2/cleanup/cache` - Clear cache
- `POST /api/v2/cleanup/history` - Clear history
- `POST /api/v2/cleanup/old-data` - Remove old data
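
As an illustration of calling the scheduling endpoints listed above from Python, the sketch below builds the requests with the standard library without sending them. The paths come from the endpoint list; the JSON body fields (`interval`, `enabled`) are an assumption borrowed from the WebSocket `update_schedule` message, and the helper names are hypothetical:

```python
import json
import urllib.request
from urllib.parse import quote

BASE_URL = "http://localhost:8000/api/v2"


def build_schedule_update(api_id, interval, enabled=True, base_url=BASE_URL):
    """Build (but do not send) a PUT request for a task's schedule.

    The body schema is assumed; check http://localhost:8000/docs for the
    real one.
    """
    body = json.dumps({"interval": interval, "enabled": enabled}).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url}/schedule/tasks/{quote(api_id, safe='')}",
        data=body,
        method="PUT",
        headers={"Content-Type": "application/json"},
    )


def build_force_update(api_id, base_url=BASE_URL):
    """Build a POST request that forces an immediate update."""
    return urllib.request.Request(
        url=f"{base_url}/schedule/tasks/{quote(api_id, safe='')}/force-update",
        data=b"",
        method="POST",
    )
```

With the server running, either request can be sent with `urllib.request.urlopen(request)`.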

### 6. Enhanced Server

**File:** `enhanced_server.py`

Production-ready server with all services integrated.

**Features:**
- Automatic service initialization on startup
- Graceful shutdown with final backup
- Comprehensive logging
- CORS support
- Static file serving
- Multiple dashboard routes

**Run the server:**
```bash
python enhanced_server.py
```

**Access points:**
- Main Dashboard: http://localhost:8000/
- Enhanced Dashboard: http://localhost:8000/enhanced_dashboard.html
- API Documentation: http://localhost:8000/docs
- WebSocket: ws://localhost:8000/api/v2/ws

### 7. Enhanced Dashboard UI

**File:** `enhanced_dashboard.html`

Modern, interactive dashboard with real-time updates and full control over the system.

**Features:**
- **Real-time Updates:** WebSocket connection with live data
- **Export Controls:** One-click export to JSON/CSV
- **Backup Management:** Create/restore backups
- **Schedule Configuration:** Adjust update intervals per API
- **Force Updates:** Trigger immediate updates
- **System Statistics:** Live monitoring of system metrics
- **Activity Log:** Real-time activity feed
- **API Management:** View and control all API sources

## 🔧 Installation & Setup

### Prerequisites
```bash
pip install fastapi uvicorn websockets pandas httpx sqlalchemy
```

### Directory Structure
```
crypto-dt-source/
├── backend/
│   ├── routers/
│   │   └── integrated_api.py
│   └── services/
│       ├── unified_config_loader.py
│       ├── scheduler_service.py
│       ├── persistence_service.py
│       └── websocket_service.py
├── database/
│   ├── models.py
│   └── db_manager.py
├── data/
│   ├── exports/
│   └── backups/
├── crypto_resources_unified_2025-11-11.json
├── all_apis_merged_2025.json
├── ultimate_crypto_pipeline_2025_NZasinich.json
├── enhanced_server.py
└── enhanced_dashboard.html
```

### Running the Enhanced Server

1. **Start the server:**
```bash
python enhanced_server.py
```

2. **Access the dashboard:**
   - Open browser to http://localhost:8000/enhanced_dashboard.html

3. **Monitor logs:**
   - Server logs show all activities
   - WebSocket connections
   - Data updates
   - Errors and warnings

## 📊 Configuration

### Scheduling Configuration

Edit schedules via:
1. **Web UI:** Click "Configure Schedule" in enhanced dashboard
2. **API:** Use PUT /api/v2/schedule/tasks/{api_id}
3. **Code:** Call `scheduler.update_task_schedule()`

### Update Types

Configure `update_type` in API configuration:
- `realtime`: WebSocket connection (instant updates)
- `periodic`: Regular polling (default: 60s)
- `scheduled`: Less frequent updates (default: 3600s)
- `daily`: Once per day (default: 86400s)

### Data Retention

Configure in `persistence_service.py`:
```python
max_history_per_api = 1000  # Keep last 1000 records per API
```
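
A per-API history limit like this is often implemented with a bounded deque, which discards the oldest record automatically once the limit is reached. The following is a minimal sketch of that technique; `HistoryStore` is a hypothetical class and not necessarily how `persistence_service.py` stores its history:

```python
from collections import defaultdict, deque

MAX_HISTORY_PER_API = 1000  # mirrors max_history_per_api above


class HistoryStore:
    """Keeps the most recent records per API in memory."""

    def __init__(self, max_per_api=MAX_HISTORY_PER_API):
        # deque(maxlen=n) silently drops the oldest item once it is full.
        self._history = defaultdict(lambda: deque(maxlen=max_per_api))

    def append(self, api_id, record):
        self._history[api_id].append(record)

    def get_history(self, api_id, limit=100):
        """Return up to `limit` most recent records, oldest first."""
        records = self._history.get(api_id, ())
        return list(records)[-limit:] if limit > 0 else []
```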

Cleanup old data:
```bash
# Quote the URL so the shell does not interpret the "?" in the query string
curl -X POST "http://localhost:8000/api/v2/cleanup/old-data?days=7"
```

## 🔐 Security Notes

- API keys are stored securely in config files
- Keys are masked in exports (shown as ***)
- Database uses SQLite with proper permissions
- CORS configured for security
- WebSocket connections tracked and managed
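
Masking keys in exports can be done by walking the configuration and replacing sensitive fields before serialization. This is a sketch under assumptions: the set of field names treated as sensitive and the `mask_secrets` helper are illustrative and are not taken from the project's code:

```python
# Field names treated as secrets (an assumed list; extend as needed).
SENSITIVE_KEYS = {"key", "api_key", "apikey", "token", "secret", "password"}


def mask_secrets(value, mask="***"):
    """Return a copy of `value` with sensitive string fields replaced."""
    if isinstance(value, dict):
        masked = {}
        for k, v in value.items():
            if str(k).lower() in SENSITIVE_KEYS and isinstance(v, str):
                masked[k] = mask
            else:
                masked[k] = mask_secrets(v, mask)
        return masked
    if isinstance(value, list):
        return [mask_secrets(item, mask) for item in value]
    return value
```

Because the function builds new containers instead of editing in place, the live configuration keeps its real keys while the exported copy shows `***`.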

## 🚀 Performance

- **In-memory caching:** Fast data access
- **Async operations:** Non-blocking I/O
- **Concurrent updates:** Parallel API calls
- **Connection pooling:** Efficient database access
- **Smart retry logic:** Automatic error recovery
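
Concurrent updates with non-blocking I/O are typically expressed with `asyncio.gather`. The sketch below runs several fetch coroutines in parallel with a concurrency limit and isolates failures per API. `update_all` and the shape of its result are hypothetical and shown only to illustrate the pattern:

```python
import asyncio


async def update_all(fetchers, max_concurrency=10):
    """Run all fetch coroutines concurrently and collect results per API.

    `fetchers` maps api_id -> zero-argument async callable. Failures are
    captured per API instead of aborting the whole batch.
    """
    semaphore = asyncio.Semaphore(max_concurrency)

    async def run_one(api_id, fetch):
        async with semaphore:  # limit how many fetches run at once
            try:
                return api_id, {"ok": True, "data": await fetch()}
            except Exception as exc:  # keep one bad API from failing the rest
                return api_id, {"ok": False, "error": str(exc)}

    pairs = await asyncio.gather(*(run_one(a, f) for a, f in fetchers.items()))
    return dict(pairs)
```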

## 📝 Examples

### Example 1: Setup and Start
```python
import asyncio

from backend.services.unified_config_loader import UnifiedConfigLoader
from backend.services.scheduler_service import SchedulerService
from backend.services.persistence_service import PersistenceService


async def main():
    # Initialize services
    config = UnifiedConfigLoader()
    persistence = PersistenceService()
    scheduler = SchedulerService(config)

    # Start scheduler (start() is a coroutine, so it must be awaited)
    await scheduler.start()


asyncio.run(main())
```

### Example 2: Export Data
```python
# Export all data to JSON
await persistence.export_to_json('all_data.json', include_history=True)

# Export specific APIs to CSV
await persistence.export_to_csv('market_data.csv', api_ids=['coingecko', 'binance'])
```

### Example 3: Custom API
```python
# Add custom API
config.add_custom_api({
    'id': 'my_custom_api',
    'name': 'My Custom API',
    'category': 'custom',
    'base_url': 'https://api.myservice.com/data',
    'auth': {'type': 'apiKey', 'key': 'YOUR_KEY'},
    'update_type': 'periodic',
    'interval': 300
})
```

## 🐛 Troubleshooting

### WebSocket Not Connecting
- Check server is running
- Verify URL: `ws://localhost:8000/api/v2/ws`
- Check browser console for errors
- Ensure no firewall blocking WebSocket

### Data Not Updating
- Check scheduler is running: GET /api/v2/status
- Verify API is enabled in schedule
- Check logs for errors
- Force update: POST /api/v2/schedule/tasks/{api_id}/force-update

### Export Fails
- Ensure `data/exports/` directory exists
- Check disk space
- Verify pandas is installed

## 📚 API Documentation

Full API documentation available at: http://localhost:8000/docs

## 🙏 Credits

Enhanced features developed for comprehensive crypto data tracking with real-time updates, advanced scheduling, and data persistence.