File size: 3,488 Bytes
52b3331
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
2025-07-04 17:13:41,696 - trainer_MyTestExp - INFO - info:66 - πŸ“„ Loading config from experiments/MyTestExp/config.json
2025-07-04 17:13:41,697 - trainer_MyTestExp - INFO - info:66 - βœ… Custom configuration loaded and merged
2025-07-04 17:13:41,697 - trainer_MyTestExp - INFO - info:66 - πŸ”§ Configuration: {
  "data": {
    "tickers": [
      "AAPL",
      "MSFT",
      "GOOGL",
      "AMZN",
      "TSLA"
    ],
    "period": "5y",
    "interval": "1d",
    "use_sp500": false
  },
  "environment": {
    "initial_balance": 10000,
    "transaction_cost": 0.001,
    "max_position_size": 1.0,
    "lookback_window": 60,
    "reward_type": "return"
  },
  "training": {
    "algorithm": "PPO",
    "total_timesteps": 500000,
    "learning_rate": 0.0003,
    "batch_size": 64,
    "n_epochs": 10,
    "gamma": 0.99,
    "eval_freq": 1000,
    "n_eval_episodes": 5,
    "save_freq": 10000,
    "seed": 42
  }
}
2025-07-04 17:13:41,699 - trainer_MyTestExp - INFO - info:66 - πŸ’Ύ Configuration saved to experiments/MyTestExp/config.json
2025-07-04 17:13:41,699 - trainer_MyTestExp - INFO - info:66 - 🎯 Trainer initialized for experiment: MyTestExp
2025-07-04 17:13:41,700 - trainer_MyTestExp - INFO - info:66 - πŸ“ Experiment directory: experiments/MyTestExp
2025-07-04 17:13:41,895 - trainer_MyTestExp - INFO - info:66 - ⏱️  Starting Environment Creation...
2025-07-04 17:13:41,895 - trainer_MyTestExp - INFO - info:66 - πŸͺ Environment configuration:
2025-07-04 17:13:41,896 - trainer_MyTestExp - INFO - info:66 -   initial_balance: 10000
2025-07-04 17:13:41,897 - trainer_MyTestExp - INFO - info:66 -   transaction_cost: 0.001
2025-07-04 17:13:41,898 - trainer_MyTestExp - INFO - info:66 -   max_position_size: 1.0
2025-07-04 17:13:41,899 - trainer_MyTestExp - INFO - info:66 -   lookback_window: 60
2025-07-04 17:13:41,899 - trainer_MyTestExp - INFO - info:66 -   reward_type: return
2025-07-04 17:13:41,900 - trainer_MyTestExp - INFO - info:66 - Creating environments: |β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘| 20.0% (1/5)
2025-07-04 17:13:41,903 - trainer_MyTestExp - INFO - info:66 - Creating environments: |β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘| 40.0% (2/5)
2025-07-04 17:13:41,905 - trainer_MyTestExp - INFO - info:66 - Creating environments: |β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘| 60.0% (3/5)
2025-07-04 17:13:41,907 - trainer_MyTestExp - INFO - info:66 - Creating environments: |β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‘β–‘β–‘β–‘β–‘β–‘| 80.0% (4/5)
2025-07-04 17:13:41,910 - trainer_MyTestExp - INFO - info:66 - Creating environments: |β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 100.0% (5/5)
2025-07-04 17:13:41,912 - trainer_MyTestExp - INFO - info:66 - βœ… Created environments for 5 stocks
2025-07-04 17:13:41,912 - trainer_MyTestExp - INFO - info:66 - ⏱️  Environment Creation completed in 0:00:00.017660
2025-07-04 17:13:41,914 - trainer_MyTestExp - INFO - info:66 - ⏱️  Starting Model Evaluation...
2025-07-04 17:13:41,914 - trainer_MyTestExp - INFO - info:66 - Loading model from experiments/MyTestExp/models/final_model.zip
2025-07-04 17:13:41,959 - trainer_MyTestExp - INFO - info:66 - Evaluating AMZN: |β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘| 20.0% (1/5)