For each video in ultra-wide (UW), wide-angle (W), and telephoto (T) video triplets concurrently recorded from iPhone 12 Pro Max, we extract video frames and save them in PNG format.
We divide the video frames into training, test, and validation sets, and for each of the divided sets, we provide video frames in original (HR), 2x (LRx2), and 4x (LRx4) downsampled resolutions.
Note that for downsampling operation, we used imresize
function (bicubic) provided in MATLAB.
The following diagram shows the detailed file structure of the RealMCVSR dataset.
RealMCVSR
├── train # a training set
│ ├── HR # videos in original resolution
│ │ ├── T # telephoto videos
│ │ │ ├── 0002 # a video clip
│ │ │ │ ├── 0000.png # a video frame
│ │ │ │ └── ...
│ │ │ └── ...
│ │ ├── UW # ultra-wide-angle videos
│ │ └── W # wide-angle videos
│ ├── LRx2 # 2x downsampled videos
│ └── LRx4 # 4x downsampled videos
├── test # a testing set
└── valid # a validation set