463465810cz commited on Jul 17, 2023

Commit

8cb8316

1 Parent(s): 3cda643

ICCV 2023

Browse files

Former-commit-id: 1d9f9df3885a24e94f638ac328082683d0ceb8b8

Files changed (26) hide show

README.md +29 -5
basicsr/archs/dat_arch.py +2 -2
basicsr/train.py +215 -0
basicsr/version.py +2 -2
datasets/README.md +45 -1
experiments/README.md +1 -2
figs/Figure-2.png +3 -0
figs/Figure-3.png +3 -0
figs/Figure-4.png +3 -0
figs/Figure-5.png +3 -0
figs/Table-2.png +3 -0
options/README.md +0 -2
options/Test/test_DAT_2_x2.yml +93 -0
options/Test/test_DAT_2_x3.yml +92 -0
options/Test/test_DAT_2_x4.yml +93 -0
options/Test/test_DAT_S_x2.yml +2 -2
options/Test/{test_DAT_S_x3.yml.yml → test_DAT_S_x3.yml} +2 -2
options/Test/test_DAT_S_x4.yml +2 -2
options/Test/test_DAT_x2.yml +1 -1
options/Test/test_DAT_x3.yml +1 -1
options/Test/test_DAT_x4.yml +1 -1
options/Train/train_DAT_2_x2.yml +106 -0
options/Train/train_DAT_2_x3.yml +109 -0
options/Train/train_DAT_2_x4.yml +110 -0
options/Train/{train_DAT_S_x3.yml.yml → train_DAT_S_x3.yml} +0 -0
options/Train/train_DAT_x4.yml +2 -2

README.md CHANGED Viewed

@@ -58,10 +58,11 @@ Download training and testing datasets and put them into the corresponding folde
 | Method | Params (M) | FLOPs (G) | Dataset  | PSNR (dB) |  SSIM  |                          Model Zoo                           |                        Visual Results                        |
 | :----- | :--------: | :-------: | :------: | :-------: | :----: | :----------------------------------------------------------: | :----------------------------------------------------------: |
-| DAT-S  |   11.21    |   203.3   | Urban100 |   27.68   | 0.8300 | [Google Drive](https://drive.google.com/drive/folders/1hb77nOTpCo9iU_jmg_izHOPRvPJujRiL?usp=drive_link) | [Google Drive](https://drive.google.com/file/d/1W-CeN2Z0e1r0rOdc3t-GcGrRV-qTGdub/view?usp=drive_link) |
-| DAT    |   14.80    |   275.8   | Urban100 |   27.87   | 0.8343 | [Google Drive](https://drive.google.com/drive/folders/1eZqgQEBQ69Vzf8afrPkvL27JHubW6o0t?usp=drive_link) | [Google Drive](https://drive.google.com/file/d/1B4zJsZaiVsu009ilTh81BV7-8Hr98BI2/view?usp=drive_link) |
-The performance is reported on Urban100 (x4, SR). The test input size of FLOPs is 128 x 128.
 ## Training
@@ -79,6 +80,11 @@ The performance is reported on Urban100 (x4, SR). The test input size of FLOPs i
   python -m torch.distributed.launch --nproc_per_node=4 --master_port=4321 basicsr/train.py -opt options/Train/train_DAT_x2.yml --launcher pytorch
   python -m torch.distributed.launch --nproc_per_node=4 --master_port=4321 basicsr/train.py -opt options/Train/train_DAT_x3.yml --launcher pytorch
   python -m torch.distributed.launch --nproc_per_node=4 --master_port=4321 basicsr/train.py -opt options/Train/train_DAT_x4.yml --launcher pytorch
   ```
 - The training experiment is in `experiments/`.
@@ -87,9 +93,9 @@ The performance is reported on Urban100 (x4, SR). The test input size of FLOPs i
 - Download the pre-trained [models](https://drive.google.com/drive/folders/1iBdf_-LVZuz_PAbFtuxSKd_11RL1YKxM?usp=drive_link) and place them in `experiments/pretrained_models/`.
-  We provide pre-trained models for image SR: DAT-S and DAT (x2, x3, x4).
-- Download [testing](https://ufile.io/6ek67nf8) (Set5, Set14, BSD100, Urban100, Manga109) datasets, place them in `datasets/`.
 - Run the following scripts. The testing configuration is in `options/test/`.
@@ -104,6 +110,11 @@ The performance is reported on Urban100 (x4, SR). The test input size of FLOPs i
   python basicsr/test.py -opt options/Test/test_DAT_x2.yml
   python basicsr/test.py -opt options/Test/test_DAT_x3.yml
   python basicsr/test.py -opt options/Test/test_DAT_x4.yml
   ```
 - The output is in `results/`.
@@ -120,13 +131,26 @@ We achieved state-of-the-art performance. Detailed results can be found in the p
 <p align="center">
   <img width="900" src="figs/Table-1.png">
 </p>
 - visual comparison (x4) in the main paper
 <p align="center">
   <img width="900" src="figs/Figure-1.png">
 </p>
 - </details>

 | Method | Params (M) | FLOPs (G) | Dataset  | PSNR (dB) |  SSIM  |                          Model Zoo                           |                        Visual Results                        |
 | :----- | :--------: | :-------: | :------: | :-------: | :----: | :----------------------------------------------------------: | :----------------------------------------------------------: |
+| DAT-S  |   11.21    |   203.3   | Urban100 |   27.68   | 0.8300 | [Google Drive](https://drive.google.com/drive/folders/1hM0v3fUg5u6GjkI7dduxShyGgGfEwQXO?usp=drive_link) | [Google Drive](https://drive.google.com/file/d/1x1ixMswxw5w-zeZ_Rap5Nk4Tr46MIjAw/view?usp=drive_link) |
+| DAT    |   14.80    |   275.8   | Urban100 |   27.87   | 0.8343 | [Google Drive](https://drive.google.com/drive/folders/14VG5mw5ie8RrR4jjypeHynXDZYWL8w-r?usp=drive_link) | [Google Drive](https://drive.google.com/file/d/1K43CTsXpoX5St5fed4kEW9gu2KMR6hLu/view?usp=drive_link) |
+| DAT-2  |   11.21    |  216.93   | Urban100 |   27.86   | 0.8341 | [Google Drive](https://drive.google.com/drive/folders/1yV9LMhr2tYM_eHEIVY4Jw9X3bWGgorbD?usp=drive_link) | [Google Drive](https://drive.google.com/file/d/1TQRZIg8at5HX87OCu3GYytZhYGperkuN/view?usp=drive_link) |
+The performance is reported on Urban100 (x4). The test input size of FLOPs is 128 x 128.
 ## Training
   python -m torch.distributed.launch --nproc_per_node=4 --master_port=4321 basicsr/train.py -opt options/Train/train_DAT_x2.yml --launcher pytorch
   python -m torch.distributed.launch --nproc_per_node=4 --master_port=4321 basicsr/train.py -opt options/Train/train_DAT_x3.yml --launcher pytorch
   python -m torch.distributed.launch --nproc_per_node=4 --master_port=4321 basicsr/train.py -opt options/Train/train_DAT_x4.yml --launcher pytorch
+  # DAT-2, input=64x64, 4 GPUs
+  python -m torch.distributed.launch --nproc_per_node=4 --master_port=4321 basicsr/train.py -opt options/Train/train_DAT_2_x2.yml --launcher pytorch
+  python -m torch.distributed.launch --nproc_per_node=4 --master_port=4321 basicsr/train.py -opt options/Train/train_DAT_2_x3.yml --launcher pytorch
+  python -m torch.distributed.launch --nproc_per_node=4 --master_port=4321 basicsr/train.py -opt options/Train/train_DAT_2_x4.yml --launcher pytorch
   ```
 - The training experiment is in `experiments/`.
 - Download the pre-trained [models](https://drive.google.com/drive/folders/1iBdf_-LVZuz_PAbFtuxSKd_11RL1YKxM?usp=drive_link) and place them in `experiments/pretrained_models/`.
+  We provide pre-trained models for image SR: DAT-S, DAT, and DAT-2 (x2, x3, x4).
+- Download [testing](https://drive.google.com/file/d/1yMbItvFKVaCT93yPWmlP3883XtJ-wSee/view?usp=sharing) (Set5, Set14, BSD100, Urban100, Manga109) datasets, place them in `datasets/`.
 - Run the following scripts. The testing configuration is in `options/test/`.
   python basicsr/test.py -opt options/Test/test_DAT_x2.yml
   python basicsr/test.py -opt options/Test/test_DAT_x3.yml
   python basicsr/test.py -opt options/Test/test_DAT_x4.yml
+  # DAT-2, reproduces results in Table 1 of the supplementary material
+  python basicsr/test.py -opt options/Test/test_DAT_2_x2.yml
+  python basicsr/test.py -opt options/Test/test_DAT_2_x3.yml
+  python basicsr/test.py -opt options/Test/test_DAT_2_x4.yml
   ```
 - The output is in `results/`.
 <p align="center">
   <img width="900" src="figs/Table-1.png">
 </p>
+- results in Table 1 of the supplementary material
+<p align="center">
+  <img width="900" src="figs/Table-2.png">
+</p>
 - visual comparison (x4) in the main paper
 <p align="center">
   <img width="900" src="figs/Figure-1.png">
 </p>
+- visual comparison (x4) in the supplementary material
+<p align="center">
+  <img width="900" src="figs/Figure-2.png">
+  <img width="900" src="figs/Figure-3.png">
+  <img width="900" src="figs/Figure-4.png">
+  <img width="900" src="figs/Figure-5.png">
+</p>
 - </details>

basicsr/archs/dat_arch.py CHANGED Viewed

@@ -297,7 +297,6 @@ class Axial_Spatial_Attention(nn.Module):
             self.register_buffer("attn_mask_0", None)
             self.register_buffer("attn_mask_1", None)
-        # Adaptive Interaction Module
         self.dwconv = nn.Sequential(
             nn.Conv2d(dim, dim, kernel_size=3, stride=1, padding=1,groups=dim),
             nn.BatchNorm2d(dim),
@@ -419,6 +418,7 @@ class Axial_Spatial_Attention(nn.Module):
         # convolution output
         conv_x = self.dwconv(v)
         # C-Map (before sigmoid)
         channel_map = self.channel_interaction(conv_x).permute(0, 2, 3, 1).contiguous().view(B, 1, C)
         # S-Map (before sigmoid)
@@ -460,7 +460,6 @@ class Axial_Channel_Attention(nn.Module):
         self.proj = nn.Linear(dim, dim)
         self.proj_drop = nn.Dropout(proj_drop)
-        # Adaptive Interaction Module
         self.dwconv = nn.Sequential(
             nn.Conv2d(dim, dim, kernel_size=3, stride=1, padding=1,groups=dim),
             nn.BatchNorm2d(dim),
@@ -509,6 +508,7 @@ class Axial_Channel_Attention(nn.Module):
         # convolution output
         conv_x = self.dwconv(v_)
         # C-Map (before sigmoid)
         attention_reshape = attened_x.transpose(-2,-1).contiguous().view(B, C, H, W)
         channel_map = self.channel_interaction(attention_reshape)

             self.register_buffer("attn_mask_0", None)
             self.register_buffer("attn_mask_1", None)
         self.dwconv = nn.Sequential(
             nn.Conv2d(dim, dim, kernel_size=3, stride=1, padding=1,groups=dim),
             nn.BatchNorm2d(dim),
         # convolution output
         conv_x = self.dwconv(v)
+        # Adaptive Interaction Module (AIM)
         # C-Map (before sigmoid)
         channel_map = self.channel_interaction(conv_x).permute(0, 2, 3, 1).contiguous().view(B, 1, C)
         # S-Map (before sigmoid)
         self.proj = nn.Linear(dim, dim)
         self.proj_drop = nn.Dropout(proj_drop)
         self.dwconv = nn.Sequential(
             nn.Conv2d(dim, dim, kernel_size=3, stride=1, padding=1,groups=dim),
             nn.BatchNorm2d(dim),
         # convolution output
         conv_x = self.dwconv(v_)
+        # Adaptive Interaction Module (AIM)
         # C-Map (before sigmoid)
         attention_reshape = attened_x.transpose(-2,-1).contiguous().view(B, C, H, W)
         channel_map = self.channel_interaction(attention_reshape)

basicsr/train.py ADDED Viewed

	@@ -0,0 +1,215 @@

+import datetime
+import logging
+import math
+import time
+import torch
+from os import path as osp
+from basicsr.data import build_dataloader, build_dataset
+from basicsr.data.data_sampler import EnlargedSampler
+from basicsr.data.prefetch_dataloader import CPUPrefetcher, CUDAPrefetcher
+from basicsr.models import build_model
+from basicsr.utils import (AvgTimer, MessageLogger, check_resume, get_env_info, get_root_logger, get_time_str,
+                           init_tb_logger, init_wandb_logger, make_exp_dirs, mkdir_and_rename, scandir)
+from basicsr.utils.options import copy_opt_file, dict2str, parse_options
+def init_tb_loggers(opt):
+    # initialize wandb logger before tensorboard logger to allow proper sync
+    if (opt['logger'].get('wandb') is not None) and (opt['logger']['wandb'].get('project')
+                                                     is not None) and ('debug' not in opt['name']):
+        assert opt['logger'].get('use_tb_logger') is True, ('should turn on tensorboard when using wandb')
+        init_wandb_logger(opt)
+    tb_logger = None
+    if opt['logger'].get('use_tb_logger') and 'debug' not in opt['name']:
+        tb_logger = init_tb_logger(log_dir=osp.join(opt['root_path'], 'tb_logger', opt['name']))
+    return tb_logger
+def create_train_val_dataloader(opt, logger):
+    # create train and val dataloaders
+    train_loader, val_loaders = None, []
+    for phase, dataset_opt in opt['datasets'].items():
+        if phase == 'train':
+            dataset_enlarge_ratio = dataset_opt.get('dataset_enlarge_ratio', 1)
+            train_set = build_dataset(dataset_opt)
+            train_sampler = EnlargedSampler(train_set, opt['world_size'], opt['rank'], dataset_enlarge_ratio)
+            train_loader = build_dataloader(
+                train_set,
+                dataset_opt,
+                num_gpu=opt['num_gpu'],
+                dist=opt['dist'],
+                sampler=train_sampler,
+                seed=opt['manual_seed'])
+            num_iter_per_epoch = math.ceil(
+                len(train_set) * dataset_enlarge_ratio / (dataset_opt['batch_size_per_gpu'] * opt['world_size']))
+            total_iters = int(opt['train']['total_iter'])
+            total_epochs = math.ceil(total_iters / (num_iter_per_epoch))
+            logger.info('Training statistics:'
+                        f'\n\tNumber of train images: {len(train_set)}'
+                        f'\n\tDataset enlarge ratio: {dataset_enlarge_ratio}'
+                        f'\n\tBatch size per gpu: {dataset_opt["batch_size_per_gpu"]}'
+                        f'\n\tWorld size (gpu number): {opt["world_size"]}'
+                        f'\n\tRequire iter number per epoch: {num_iter_per_epoch}'
+                        f'\n\tTotal epochs: {total_epochs}; iters: {total_iters}.')
+        elif phase.split('_')[0] == 'val':
+            val_set = build_dataset(dataset_opt)
+            val_loader = build_dataloader(
+                val_set, dataset_opt, num_gpu=opt['num_gpu'], dist=opt['dist'], sampler=None, seed=opt['manual_seed'])
+            logger.info(f'Number of val images/folders in {dataset_opt["name"]}: {len(val_set)}')
+            val_loaders.append(val_loader)
+        else:
+            raise ValueError(f'Dataset phase {phase} is not recognized.')
+    return train_loader, train_sampler, val_loaders, total_epochs, total_iters
+def load_resume_state(opt):
+    resume_state_path = None
+    if opt['auto_resume']:
+        state_path = osp.join('experiments', opt['name'], 'training_states')
+        if osp.isdir(state_path):
+            states = list(scandir(state_path, suffix='state', recursive=False, full_path=False))
+            if len(states) != 0:
+                states = [float(v.split('.state')[0]) for v in states]
+                resume_state_path = osp.join(state_path, f'{max(states):.0f}.state')
+                opt['path']['resume_state'] = resume_state_path
+    else:
+        if opt['path'].get('resume_state'):
+            resume_state_path = opt['path']['resume_state']
+    if resume_state_path is None:
+        resume_state = None
+    else:
+        device_id = torch.cuda.current_device()
+        resume_state = torch.load(resume_state_path, map_location=lambda storage, loc: storage.cuda(device_id))
+        check_resume(opt, resume_state['iter'])
+    return resume_state
+def train_pipeline(root_path):
+    # parse options, set distributed setting, set ramdom seed
+    opt, args = parse_options(root_path, is_train=True)
+    opt['root_path'] = root_path
+    torch.backends.cudnn.benchmark = True
+    # torch.backends.cudnn.deterministic = True
+    # load resume states if necessary
+    resume_state = load_resume_state(opt)
+    # mkdir for experiments and logger
+    if resume_state is None:
+        make_exp_dirs(opt)
+        if opt['logger'].get('use_tb_logger') and 'debug' not in opt['name'] and opt['rank'] == 0:
+            mkdir_and_rename(osp.join(opt['root_path'], 'tb_logger', opt['name']))
+    # copy the yml file to the experiment root
+    copy_opt_file(args.opt, opt['path']['experiments_root'])
+    # WARNING: should not use get_root_logger in the above codes, including the called functions
+    # Otherwise the logger will not be properly initialized
+    log_file = osp.join(opt['path']['log'], f"train_{opt['name']}_{get_time_str()}.log")
+    logger = get_root_logger(logger_name='basicsr', log_level=logging.INFO, log_file=log_file)
+    logger.info(get_env_info())
+    logger.info(dict2str(opt))
+    # initialize wandb and tb loggers
+    tb_logger = init_tb_loggers(opt)
+    # create train and validation dataloaders
+    result = create_train_val_dataloader(opt, logger)
+    train_loader, train_sampler, val_loaders, total_epochs, total_iters = result
+    # create model
+    model = build_model(opt)
+    if resume_state:  # resume training
+        model.resume_training(resume_state)  # handle optimizers and schedulers
+        logger.info(f"Resuming training from epoch: {resume_state['epoch']}, iter: {resume_state['iter']}.")
+        start_epoch = resume_state['epoch']
+        current_iter = resume_state['iter']
+    else:
+        start_epoch = 0
+        current_iter = 0
+    # create message logger (formatted outputs)
+    msg_logger = MessageLogger(opt, current_iter, tb_logger)
+    # dataloader prefetcher
+    prefetch_mode = opt['datasets']['train'].get('prefetch_mode')
+    if prefetch_mode is None or prefetch_mode == 'cpu':
+        prefetcher = CPUPrefetcher(train_loader)
+    elif prefetch_mode == 'cuda':
+        prefetcher = CUDAPrefetcher(train_loader, opt)
+        logger.info(f'Use {prefetch_mode} prefetch dataloader')
+        if opt['datasets']['train'].get('pin_memory') is not True:
+            raise ValueError('Please set pin_memory=True for CUDAPrefetcher.')
+    else:
+        raise ValueError(f"Wrong prefetch_mode {prefetch_mode}. Supported ones are: None, 'cuda', 'cpu'.")
+    # training
+    logger.info(f'Start training from epoch: {start_epoch}, iter: {current_iter}')
+    data_timer, iter_timer = AvgTimer(), AvgTimer()
+    start_time = time.time()
+    for epoch in range(start_epoch, total_epochs + 1):
+        train_sampler.set_epoch(epoch)
+        prefetcher.reset()
+        train_data = prefetcher.next()
+        while train_data is not None:
+            data_timer.record()
+            current_iter += 1
+            if current_iter > total_iters:
+                break
+            # update learning rate
+            model.update_learning_rate(current_iter, warmup_iter=opt['train'].get('warmup_iter', -1))
+            # training
+            model.feed_data(train_data)
+            model.optimize_parameters(current_iter)
+            iter_timer.record()
+            if current_iter == 1:
+                # reset start time in msg_logger for more accurate eta_time
+                # not work in resume mode
+                msg_logger.reset_start_time()
+            # log
+            if current_iter % opt['logger']['print_freq'] == 0:
+                log_vars = {'epoch': epoch, 'iter': current_iter}
+                log_vars.update({'lrs': model.get_current_learning_rate()})
+                log_vars.update({'time': iter_timer.get_avg_time(), 'data_time': data_timer.get_avg_time()})
+                log_vars.update(model.get_current_log())
+                msg_logger(log_vars)
+            # save models and training states
+            if current_iter % opt['logger']['save_checkpoint_freq'] == 0:
+                logger.info('Saving models and training states.')
+                model.save(epoch, current_iter)
+            # validation
+            if opt.get('val') is not None and (current_iter % opt['val']['val_freq'] == 0):
+                if len(val_loaders) > 1:
+                    logger.warning('Multiple validation datasets are *only* supported by SRModel.')
+                for val_loader in val_loaders:
+                    model.validation(val_loader, current_iter, tb_logger, opt['val']['save_img'])
+            data_timer.start()
+            iter_timer.start()
+            train_data = prefetcher.next()
+        # end of iter
+    # end of epoch
+    consumed_time = str(datetime.timedelta(seconds=int(time.time() - start_time)))
+    logger.info(f'End of training. Time consumed: {consumed_time}')
+    logger.info('Save the latest model.')
+    model.save(epoch=-1, current_iter=-1)  # -1 stands for the latest
+    if opt.get('val') is not None:
+        for val_loader in val_loaders:
+            model.validation(val_loader, current_iter, tb_logger, opt['val']['save_img'])
+    if tb_logger:
+        tb_logger.close()
+if __name__ == '__main__':
+    root_path = osp.abspath(osp.join(__file__, osp.pardir, osp.pardir))
+    train_pipeline(root_path)

basicsr/version.py CHANGED Viewed

@@ -1,5 +1,5 @@
 # GENERATED VERSION FILE
-# TIME: Thu Sep 22 07:20:35 2022
 __version__ = '1.3.5'
-__gitsha__ = 'cbc9a18'
 version_info = (1, 3, 5)

 # GENERATED VERSION FILE
+# TIME: Mon Jul 17 01:59:53 2023
 __version__ = '1.3.5'
+__gitsha__ = '29e57e3'
 version_info = (1, 3, 5)

datasets/README.md CHANGED Viewed

	@@ -1,2 +1,46 @@
1	- ~~Dwonload~~ ~~the~~ [testing~~](https://ufile.io/6ek67nf8)~~ ~~datasets~~ ~~and~~ ~~place~~ ~~them~~ ~~here.~~
2

+For training and testing, the directory structure is as follows:
+```shell
+|-- datasets
+    # train
+    |-- DF2K
+        |-- HR
+        |-- LR_bicubic
+            |-- X2
+            |-- X3
+            |-- X4
+    # test
+    |-- benchmark
+        |-- Set5
+            |-- HR
+          	|-- LR_bicubic
+                |-- X2
+                |-- X3
+                |-- X4
+        |-- Set14
+            |-- HR
+            |-- LR_bicubic
+                |-- X2
+                |-- X3
+                |-- X4
+        |-- B100
+            |-- HR
+            |-- LR_bicubic
+                |-- X2
+                |-- X3
+                |-- X4
+        |-- Urban100
+            |-- HR
+            |-- LR_bicubic
+                |-- X2
+                |-- X3
+                |-- X4
+        |-- Manga109
+            |-- HR
+            |-- LR_bicubic
+                |-- X2
+                |-- X3
+                |-- X4
+```
+You can download the complete datasets we have collected.

experiments/README.md CHANGED Viewed

	@@ -1,2 +1 @@
1	- ~~Dwonload~~ ~~the~~ ~~pre-trained [~~models~~](https://ufile.io/rf58x0s9)~~ ~~and place them~~ in `pretrained_models`.
2	-


1	+ Place pretrained models in `pretrained_models`.

figs/Figure-2.png ADDED Viewed

Git LFS Details

SHA256: bca8431490641478e71d106c83458e00ee1c1cf6315ca03318d24c6ebbd48246
Pointer size: 132 Bytes
Size of remote file: 2.65 MB

figs/Figure-3.png ADDED Viewed

Git LFS Details

SHA256: 3e08d373a4a965a1da58b0b8f67273afa2e90c231f56ad2e875febb5e1c1b9d0
Pointer size: 132 Bytes
Size of remote file: 3.27 MB

figs/Figure-4.png ADDED Viewed

Git LFS Details

SHA256: 1c86421092316c09c7ff5e21a4f2d59775c501cf61b7cdd70c66546b39959971
Pointer size: 132 Bytes
Size of remote file: 3.05 MB

figs/Figure-5.png ADDED Viewed

Git LFS Details

SHA256: 18e604aca7929f426aca7ab53df0ff38b488b04f19c54c90a1c6d730c0c605ef
Pointer size: 132 Bytes
Size of remote file: 3.12 MB

figs/Table-2.png ADDED Viewed

Git LFS Details

SHA256: be734be1fe8df4ae76804df09621b1859f1f93134fb9d458b279ac2ceda8d811
Pointer size: 131 Bytes
Size of remote file: 124 kB

options/README.md DELETED Viewed

	@@ -1,2 +0,0 @@
1	- For more information about testing configuration, please refer to [Configuration](https://github.com/XPixelGroup/BasicSR/blob/master/docs/Config.md).
2	-

options/Test/test_DAT_2_x2.yml ADDED Viewed

	@@ -0,0 +1,93 @@

+# general settings
+name: test_DAT_2_x2
+model_type: SRModel
+scale: 2
+num_gpu: 1
+manual_seed: 10
+datasets:
+  test_1:  # the 1st test dataset
+    task: SR
+    name: Set5
+    type: PairedImageDataset
+    dataroot_gt: datasets/benchmark/Set5/HR
+    dataroot_lq: datasets/benchmark/Set5/LR_bicubic/X2
+    filename_tmpl: '{}x2'
+    io_backend:
+      type: disk
+  test_2:  # the 2st test dataset
+    task: SR
+    name: Set14
+    type: PairedImageDataset
+    dataroot_gt: datasets/benchmark/Set14/HR
+    dataroot_lq: datasets/benchmark/Set14/LR_bicubic/X2
+    filename_tmpl: '{}x2'
+    io_backend:
+      type: disk
+  test_3:  # the 3st test dataset
+    task: SR
+    name: B100
+    type: PairedImageDataset
+    dataroot_gt: datasets/benchmark/B100/HR
+    dataroot_lq: datasets/benchmark/B100/LR_bicubic/X2
+    filename_tmpl: '{}x2'
+    io_backend:
+      type: disk
+  test_4:  # the 4st test dataset
+    task: SR
+    name: Urban100
+    type: PairedImageDataset
+    dataroot_gt: datasets/benchmark/Urban100/HR
+    dataroot_lq: datasets/benchmark/Urban100/LR_bicubic/X2
+    filename_tmpl: '{}x2'
+    io_backend:
+      type: disk
+  test_5:  # the 5st test dataset
+    task: SR
+    name: Manga109
+    type: PairedImageDataset
+    dataroot_gt: datasets/benchmark/Manga109/HR
+    dataroot_lq: datasets/benchmark/Manga109/LR_bicubic/X2
+    filename_tmpl: '{}_LRBI_x2'
+    io_backend:
+      type: disk
+# network structures
+network_g:
+  type: DAT
+  upscale: 2
+  in_chans: 3
+  img_size: 64
+  img_range: 1.
+  split_size: [8,32]
+  depth: [6,6,6,6,6,6]
+  embed_dim: 180
+  num_heads: [6,6,6,6,6,6]
+  expansion_factor: 2
+  resi_connection: '1conv'
+# path
+path:
+  pretrain_network_g: experiments/pretrained_models/DAT-2/DAT_2_x2.pth
+  strict_load_g: True
+# validation settings
+val:
+  save_img: True
+  suffix: ~  # add suffix to saved images, if None, use exp name
+  use_chop: False
+  metrics:
+    psnr: # metric name, can be arbitrary
+      type: calculate_psnr
+      crop_border: 2
+      test_y_channel: True
+    ssim:
+      type: calculate_ssim
+      crop_border: 2
+      test_y_channel: True

options/Test/test_DAT_2_x3.yml ADDED Viewed

	@@ -0,0 +1,92 @@

+# general settings
+name: test_DAT_2_x3
+model_type: SRModel
+scale: 3
+num_gpu: 1
+manual_seed: 10
+datasets:
+  test_1:  # the 1st test dataset
+    task: SR
+    name: Set5
+    type: PairedImageDataset
+    dataroot_gt: datasets/benchmark/Set5/HR
+    dataroot_lq: datasets/benchmark/Set5/LR_bicubic/X3
+    filename_tmpl: '{}x3'
+    io_backend:
+      type: disk
+  test_2:  # the 2st test dataset
+    task: SR
+    name: Set14
+    type: PairedImageDataset
+    dataroot_gt: datasets/benchmark/Set14/HR
+    dataroot_lq: datasets/benchmark/Set14/LR_bicubic/X3
+    filename_tmpl: '{}x3'
+    io_backend:
+      type: disk
+  test_3:  # the 3st test dataset
+    task: SR
+    name: B100
+    type: PairedImageDataset
+    dataroot_gt: datasets/benchmark/B100/HR
+    dataroot_lq: datasets/benchmark/B100/LR_bicubic/X3
+    filename_tmpl: '{}x3'
+    io_backend:
+      type: disk
+  test_4:  # the 4st test dataset
+    task: SR
+    name: Urban100
+    type: PairedImageDataset
+    dataroot_gt: datasets/benchmark/Urban100/HR
+    dataroot_lq: datasets/benchmark/Urban100/LR_bicubic/X3
+    filename_tmpl: '{}x3'
+    io_backend:
+      type: disk
+  test_5:  # the 5st test dataset
+    task: SR
+    name: Manga109
+    type: PairedImageDataset
+    dataroot_gt: datasets/benchmark/Manga109/HR
+    dataroot_lq: datasets/benchmark/Manga109/LR_bicubic/X3
+    filename_tmpl: '{}_LRBI_x3'
+    io_backend:
+      type: disk
+# network structures
+network_g:
+  type: DAT
+  upscale: 3
+  in_chans: 3
+  img_size: 64
+  img_range: 1.
+  split_size: [8,32]
+  depth: [6,6,6,6,6,6]
+  embed_dim: 180
+  num_heads: [6,6,6,6,6,6]
+  expansion_factor: 2
+  resi_connection: '1conv'
+# path
+path:
+  pretrain_network_g: experiments/pretrained_models/DAT-2/DAT_2_x3.pth
+  strict_load_g: True
+# validation settings
+val:
+  save_img: True
+  suffix: ~  # add suffix to saved images, if None, use exp name
+  use_chop: False
+  metrics:
+    psnr: # metric name, can be arbitrary
+      type: calculate_psnr
+      crop_border: 3
+      test_y_channel: True
+    ssim:
+      type: calculate_ssim
+      crop_border: 3
+      test_y_channel: True

options/Test/test_DAT_2_x4.yml ADDED Viewed

	@@ -0,0 +1,93 @@

+# general settings
+name: test_DAT_2_x4
+model_type: SRModel
+scale: 4
+num_gpu: 1
+manual_seed: 10
+datasets:
+  test_1:  # the 1st test dataset
+    task: SR
+    name: Set5
+    type: PairedImageDataset
+    dataroot_gt: datasets/benchmark/Set5/HR
+    dataroot_lq: datasets/benchmark/Set5/LR_bicubic/X4
+    filename_tmpl: '{}x4'
+    io_backend:
+      type: disk
+  test_2:  # the 2st test dataset
+    task: SR
+    name: Set14
+    type: PairedImageDataset
+    dataroot_gt: datasets/benchmark/Set14/HR
+    dataroot_lq: datasets/benchmark/Set14/LR_bicubic/X4
+    filename_tmpl: '{}x4'
+    io_backend:
+      type: disk
+  test_3:  # the 3st test dataset
+    task: SR
+    name: B100
+    type: PairedImageDataset
+    dataroot_gt: datasets/benchmark/B100/HR
+    dataroot_lq: datasets/benchmark/B100/LR_bicubic/X4
+    filename_tmpl: '{}x4'
+    io_backend:
+      type: disk
+  test_4:  # the 4st test dataset
+    task: SR
+    name: Urban100
+    type: PairedImageDataset
+    dataroot_gt: datasets/benchmark/Urban100/HR
+    dataroot_lq: datasets/benchmark/Urban100/LR_bicubic/X4
+    filename_tmpl: '{}x4'
+    io_backend:
+      type: disk
+  test_5:  # the 5st test dataset
+    task: SR
+    name: Manga109
+    type: PairedImageDataset
+    dataroot_gt: datasets/benchmark/Manga109/HR
+    dataroot_lq: datasets/benchmark/Manga109/LR_bicubic/X4
+    filename_tmpl: '{}_LRBI_x4'
+    io_backend:
+      type: disk
+# network structures
+network_g:
+  type: DAT
+  upscale: 4
+  in_chans: 3
+  img_size: 64
+  img_range: 1.
+  split_size: [8,32]
+  depth: [6,6,6,6,6,6]
+  embed_dim: 180
+  num_heads: [6,6,6,6,6,6]
+  expansion_factor: 2
+  resi_connection: '1conv'
+# path
+path:
+  pretrain_network_g: experiments/pretrained_models/DAT-2/DAT_2_x4.pth
+  strict_load_g: True
+# validation settings
+val:
+  save_img: True
+  suffix: ~  # add suffix to saved images, if None, use exp name
+  use_chop: False
+  metrics:
+    psnr: # metric name, can be arbitrary
+      type: calculate_psnr
+      crop_border: 4
+      test_y_channel: True
+    ssim:
+      type: calculate_ssim
+      crop_border: 4
+      test_y_channel: True

options/Test/test_DAT_S_x2.yml CHANGED Viewed

@@ -73,12 +73,12 @@ network_g:
 # path
 path:
-  pretrain_network_g: experiments/pretrained_models/DAT/DAT_S_x2.pth
   strict_load_g: True
 # validation settings
 val:
-  save_img: False
   suffix: ~  # add suffix to saved images, if None, use exp name
   use_chop: False

 # path
 path:
+  pretrain_network_g: experiments/pretrained_models/DAT-S/DAT_S_x2.pth
   strict_load_g: True
 # validation settings
 val:
+  save_img: True
   suffix: ~  # add suffix to saved images, if None, use exp name
   use_chop: False

options/Test/{test_DAT_S_x3.yml.yml → test_DAT_S_x3.yml} RENAMED Viewed

@@ -72,12 +72,12 @@ network_g:
 # path
 path:
-  pretrain_network_g: experiments/pretrained_models/DAT/DAT_S_x3.pth
   strict_load_g: True
 # validation settings
 val:
-  save_img: False
   suffix: ~  # add suffix to saved images, if None, use exp name
   use_chop: False

 # path
 path:
+  pretrain_network_g: experiments/pretrained_models/DAT-S/DAT_S_x3.pth
   strict_load_g: True
 # validation settings
 val:
+  save_img: True
   suffix: ~  # add suffix to saved images, if None, use exp name
   use_chop: False

options/Test/test_DAT_S_x4.yml CHANGED Viewed

@@ -73,12 +73,12 @@ network_g:
 # path
 path:
-  pretrain_network_g: experiments/pretrained_models/DAT/DAT_S_x4.pth
   strict_load_g: True
 # validation settings
 val:
-  save_img: False
   suffix: ~  # add suffix to saved images, if None, use exp name
   use_chop: False

 # path
 path:
+  pretrain_network_g: experiments/pretrained_models/DAT-S/DAT_S_x4.pth
   strict_load_g: True
 # validation settings
 val:
+  save_img: True
   suffix: ~  # add suffix to saved images, if None, use exp name
   use_chop: False

options/Test/test_DAT_x2.yml CHANGED Viewed

@@ -78,7 +78,7 @@ path:
 # validation settings
 val:
-  save_img: False
   suffix: ~  # add suffix to saved images, if None, use exp name
   use_chop: False

 # validation settings
 val:
+  save_img: True
   suffix: ~  # add suffix to saved images, if None, use exp name
   use_chop: False

options/Test/test_DAT_x3.yml CHANGED Viewed

@@ -77,7 +77,7 @@ path:
 # validation settings
 val:
-  save_img: False
   suffix: ~  # add suffix to saved images, if None, use exp name
   use_chop: False

 # validation settings
 val:
+  save_img: True
   suffix: ~  # add suffix to saved images, if None, use exp name
   use_chop: False

options/Test/test_DAT_x4.yml CHANGED Viewed

@@ -78,7 +78,7 @@ path:
 # validation settings
 val:
-  save_img: False
   suffix: ~  # add suffix to saved images, if None, use exp name
   use_chop: False

 # validation settings
 val:
+  save_img: True
   suffix: ~  # add suffix to saved images, if None, use exp name
   use_chop: False

options/Train/train_DAT_2_x2.yml ADDED Viewed

	@@ -0,0 +1,106 @@

+# general settings
+name: train_DAT_2_x2
+model_type: SRModel
+scale: 2
+num_gpu: auto
+manual_seed: 10
+# dataset and data loader settings
+datasets:
+  train:
+    task: SR
+    name: DF2K
+    type: PairedImageDataset
+    dataroot_gt: datasets/DF2K/HR
+    dataroot_lq: datasets/DF2K/LR_bicubic/X2
+    filename_tmpl: '{}x2'
+    io_backend:
+      type: disk
+    gt_size: 128
+    use_hflip: True
+    use_rot: True
+    # data loader
+    use_shuffle: True
+    num_worker_per_gpu: 12
+    batch_size_per_gpu: 8
+    dataset_enlarge_ratio: 100
+    prefetch_mode: ~
+  val:
+    task: SR
+    name: Set5
+    type: PairedImageDataset
+    dataroot_gt: datasets/benchmark/Set5/HR
+    dataroot_lq: datasets/benchmark/Set5/LR_bicubic/X2
+    filename_tmpl: '{}x2'
+    io_backend:
+      type: disk
+# network structures
+network_g:
+  type: DAT
+  upscale: 2
+  in_chans: 3
+  img_size: 64
+  img_range: 1.
+  split_size: [8,32]
+  depth: [6,6,6,6,6,6]
+  embed_dim: 180
+  num_heads: [6,6,6,6,6,6]
+  expansion_factor: 2
+  resi_connection: '1conv'
+# path
+path:
+  pretrain_network_g: ~
+  strict_load_g: True
+  resume_state: ~
+# training settings
+train:
+  optim_g:
+    type: Adam
+    lr: !!float 2e-4
+    weight_decay: 0
+    betas: [0.9, 0.99]
+  scheduler:
+    type: MultiStepLR
+    milestones: [250000, 400000, 450000, 475000]
+    gamma: 0.5
+  total_iter: 500000
+  warmup_iter: -1  # no warm up
+  # losses
+  pixel_opt:
+    type: L1Loss
+    loss_weight: 1.0
+    reduction: mean
+# validation settings
+val:
+  val_freq: !!float 5e3
+  save_img: False
+  metrics:
+    psnr: # metric name, can be arbitrary
+      type: calculate_psnr
+      crop_border: 2
+      test_y_channel: True
+# logging settings
+logger:
+  print_freq: 200
+  save_checkpoint_freq: !!float 5e3
+  use_tb_logger: True
+  wandb:
+    project: ~
+    resume_id: ~
+# dist training settings
+dist_params:
+  backend: nccl
+  port: 29500

options/Train/train_DAT_2_x3.yml ADDED Viewed

	@@ -0,0 +1,109 @@

+# general settings
+name: train_DAT_2_x3
+model_type: SRModel
+scale: 3
+num_gpu: auto
+manual_seed: 10
+# dataset and data loader settings
+datasets:
+  train:
+    task: SR
+    name: DF2K
+    type: PairedImageDataset
+    dataroot_gt: datasets/DF2K/HR
+    dataroot_lq: datasets/DF2K/LR_bicubic/X3
+    filename_tmpl: '{}x3'
+    io_backend:
+      type: disk
+    gt_size: 192
+    use_hflip: True
+    use_rot: True
+    # data loader
+    use_shuffle: True
+    num_worker_per_gpu: 12
+    batch_size_per_gpu: 8
+    dataset_enlarge_ratio: 100
+    prefetch_mode: ~
+  val:
+    task: SR
+    name: Set5
+    type: PairedImageDataset
+    dataroot_gt: datasets/benchmark/Set5/HR
+    dataroot_lq: datasets/benchmark/Set5/LR_bicubic/X3
+    filename_tmpl: '{}x3'
+    io_backend:
+      type: disk
+# network structures
+network_g:
+  type: DAT
+  upscale: 2
+  in_chans: 3
+  img_size: 64
+  img_range: 1.
+  split_size: [8,32]
+  depth: [6,6,6,6,6,6]
+  embed_dim: 180
+  num_heads: [6,6,6,6,6,6]
+  expansion_factor: 2
+  resi_connection: '1conv'
+# path
+path:
+  pretrain_network_g: experiments/pretrained_models/DAT-2/DAT_2_x2.pth # save half of training time if we finetune from x2 and halve initial lr.
+  strict_load_g: False
+  resume_state: ~
+# training settings
+train:
+  optim_g:
+    type: Adam
+    # lr: !!float 2e-4
+    lr: !!float 1e-4
+    weight_decay: 0
+    betas: [0.9, 0.99]
+  scheduler:
+    type: MultiStepLR
+    # milestones: [ 250000, 400000, 450000, 475000 ]
+    milestones: [ 125000, 200000, 225000, 237500 ]
+    gamma: 0.5
+  # total_iter: 500000
+  total_iter: 250000
+  warmup_iter: -1  # no warm up
+  # losses
+  pixel_opt:
+    type: L1Loss
+    loss_weight: 1.0
+    reduction: mean
+# validation settings
+val:
+  val_freq: !!float 5e3
+  save_img: False
+  metrics:
+    psnr: # metric name, can be arbitrary
+      type: calculate_psnr
+      crop_border: 4
+      test_y_channel: True
+# logging settings
+logger:
+  print_freq: 200
+  save_checkpoint_freq: !!float 5e3
+  use_tb_logger: True
+  wandb:
+    project: ~
+    resume_id: ~
+# dist training settings
+dist_params:
+  backend: nccl
+  port: 29500

options/Train/train_DAT_2_x4.yml ADDED Viewed

	@@ -0,0 +1,110 @@

+# general settings
+name: test_DAT_2_x4
+model_type: SRModel
+scale: 4
+num_gpu: auto
+manual_seed: 10
+# dataset and data loader settings
+datasets:
+  train:
+    task: SR
+    name: DF2K
+    type: PairedImageDataset
+    dataroot_gt: datasets/DF2K/HR
+    dataroot_lq: datasets/DF2K/LR_bicubic/X4
+    filename_tmpl: '{}x4'
+    io_backend:
+      type: disk
+    gt_size: 256
+    use_hflip: true
+    use_rot: true
+    # data loader
+    use_shuffle: True
+    num_worker_per_gpu: 12
+    batch_size_per_gpu: 8
+    dataset_enlarge_ratio: 100
+    prefetch_mode: ~
+  val:
+    task: SR
+    name: Set5
+    type: PairedImageDataset
+    dataroot_gt: datasets/benchmark/Set5/HR
+    dataroot_lq: datasets/benchmark/Set5/LR_bicubic/X4
+    filename_tmpl: '{}x4'
+    io_backend:
+      type: disk
+# network structures
+network_g:
+  type: DAT
+  upscale: 4
+  in_chans: 3
+  img_size: 64
+  img_range: 1.
+  split_size: [8,32]
+  depth: [6,6,6,6,6,6]
+  embed_dim: 180
+  num_heads: [6,6,6,6,6,6]
+  expansion_factor: 2
+  resi_connection: '1conv'
+# path
+path:
+  pretrain_network_g: experiments/pretrained_models/DAT-2/DAT_2_x2.pth # save half of training time if we finetune from x2 and halve initial lr.
+  strict_load_g: False
+  resume_state: ~
+# training settings
+train:
+  optim_g:
+    type: Adam
+    # lr: !!float 2e-4
+    lr: !!float 1e-4
+    weight_decay: 0
+    betas: [0.9, 0.99]
+  scheduler:
+    type: MultiStepLR
+    # milestones: [ 250000, 400000, 450000, 475000 ]
+    milestones: [ 125000, 200000, 225000, 237500 ]
+    gamma: 0.5
+  # total_iter: 500000
+  total_iter: 250000
+  warmup_iter: -1  # no warm up
+  # losses
+  pixel_opt:
+    type: L1Loss
+    loss_weight: 1.0
+    reduction: mean
+# validation settings
+val:
+  val_freq: !!float 5e3
+  save_img: False
+  metrics:
+    psnr: # metric name, can be arbitrary
+      type: calculate_psnr
+      crop_border: 4
+      test_y_channel: True
+# logging settings
+logger:
+  print_freq: 200
+  save_checkpoint_freq: !!float 5e3
+  use_tb_logger: True
+  wandb:
+    project: ~
+    resume_id: ~
+# dist training settings
+dist_params:
+  backend: nccl
+  port: 29500

options/Train/{train_DAT_S_x3.yml.yml → train_DAT_S_x3.yml} RENAMED Viewed

File without changes

options/Train/train_DAT_x4.yml CHANGED Viewed

@@ -1,5 +1,5 @@
 # general settings
-name: test_DAT_S_x4
 model_type: SRModel
 scale: 4
 num_gpu: auto
@@ -55,7 +55,7 @@ network_g:
 # path
 path:
-  pretrain_network_g: experiments/pretrained_models/DAT-S/DAT_S_x2.pth # save half of training time if we finetune from x2 and halve initial lr.
   strict_load_g: False
   resume_state: ~

 # general settings
+name: test_DAT_x4
 model_type: SRModel
 scale: 4
 num_gpu: auto
 # path
 path:
+  pretrain_network_g: experiments/pretrained_models/DAT/DAT_x2.pth # save half of training time if we finetune from x2 and halve initial lr.
   strict_load_g: False
   resume_state: ~