October 2019 Layout Flipping Full - page 766

View and GeoEye satellites were taken to construct simulated

and real data experiments, respectively. Besides, several dif-

ferent types of terrains were selected to test the performance

of this method in different terrain conditions. Experiments

verified the feasibility and practicality of this kind of appli-

cation, and a subpixel level

DSM

with higher-fidelity can be

obtained compared to directly

DSM

interpolation.

The rest of this paper is organized as follows: Section “Re-

lated Work” gives a brief overview on

CNN

-based

methods

and the applications of

in remote sensing fields. Section

“Application to Subpixel Level

DSM

Generation” describes

the main application details and tells the evaluation metrics

of the proposed approach. Section “Experiments” confirms

the effectiveness and practicability of this kind of applica-

tion with simulated experiments and real data experiments.

Finally, the “Conclusion” section concludes this paper.

Related Work

In this section, we will introduce and analyze related methods

on image super resolution and give a brief overview on some

relevant applications of

technology in remote sensing fields.

Image SR

Image

aims to restore a high-resolution (

) image from

one or more low-resolution (

) images. Numerous methods

have been proposed to deal with the image

problems in

recent years, which can be mainly classified into reconstruc-

tion-based

and learning-based

(Zeng

et al.

2017).

The reconstruction-based

methods assume that the

images are from the observations of their related

images

after transformation, deformation, and noise disturbance. As

is an ill-posed task, reconstruction-based methods attempt

to optimize the problem by imposing some extra assumptions,

such as smoothness and limited bands

(

Sun

et al.

2008

;

Tai

al.

2010; Zhang

et al.

2010). However, the added assumptions

will heavily limit the performance of this kind of method

since they are always unsuitable in most image situations.

Different from the reconstruction-based methods, the

learning-based

methods learn the high frequency infor-

mation difference through constantly training high/low-

resolution image pairs, and then build a robust model. After

that, given a low-resolution image into the learned model, its

corresponding

image can be restored. S

pecifically, accord-

ing to the different strategy of learning, th

e learning-based

method can be further divided into sparse

-coding-based

methods(Yang

et al.

2008;Yang

et al.

2010)

, manifold-learn-

ing-based methods(Chang

et al.

2004; Lu

et al.

2013) and

CNN

-based methods(Dong

et al.

2016; Kim

et al.

2016). Since

more high-frequency details can be learnt from the numerous

image samples,

CNN

-based methods visibly outperform other

methods. A pioneer work of

CNN

-based

methods is called

super-resolution convolution neural network (

SRCNN

) (Dong

et al.

2016), which learns an end-to-end nonlinear mapping

between

and

image pairs via a three-hidden-layers

CNN

architecture. However, this method cannot extract satisfactory

image feature for

image reconstruction due to the shallow

architecture. Considering this, Kim

et al.

(2016) developed a

much deeper

model

VDSR

, of which the number of convolu-

tion layers is increased to 20. As the difference of

image

pairs are mainly located in high frequency part, Kim

et al.

(2015) proposed another network Deeply-Recursive Convo-

lutional Network, which utilized residual learning (He

et al.

2016) as optimization strategy to accelerate the convergence

speed of their proposed network. Bicubic down-sampling is

commonly used in these

CNN

-based methods to generate

images from their

counterparts, which results in the un-

satisfactory performance on real data. Therefore, Zhang

et al.

(2018) proposed an effective model super-resolution network

for multiple degradations (

SRMD

) to address this problem

to some extent, which established a degradation formula

between the

HR/LR

image pairs and took the

images and the

degradation maps as input to reconstruct the corresponding

results.

SR in Remote Sensing Fields

As an effective technology for improving image spatial resolu-

tion, image super resolution is highly related to remote sens-

ing fields. And some classical

methods in computer vision

fields were transferred into remote sensing fields to better

solve those ill-posed problems. Luo

et al.

(2017) refined

VDSR

to deal with image compression problem that occurred during

video satellite image transmission process, Song et al (2018)

applied

SRCNN

to image fusion tasks as a transitional image

generation tool, and He

et al.

(2018) adopted cascaded deep

network and multiple receptive fields for infrared image super

resolution. Besides, considering the special characteristic of

remote sensing images, Huang et al (2017) developed remote

sensing deep residual-learning specifically for remote sensing

images super-resolution, while Lei

et al.

(2017) used local-

global combined network (

LGCNet

) to learn multilevel repre-

sentations of remote sensing images. Moreover, Jeon

et al.

(2018) utilized a parallax prior in stereo images to reconstruct

images for subpixel registration accuracy, and Song

et al.

(2016) extended the success of deep convolutional neural

network to depth image super resolution.

To the best of our knowledge, even though

methods are

widely applicated into remote sensing fields and Dai

et al.

(2016) have already verified the effectiveness and usefulness

technology in several vision applications, there is not a

work to discuss whether it is feasible to bring the

method

into traditional dense matching procedures for obtaining

subpixel level

DSM

Since the main purpose of this paper is to explore the

feasibility and practicality of this kind of application and to show

the effect of bringing different

models into the framework, for

fair comparison and better explaining the experimental results,

we refined several

models to show the performance difference

and analyze the practicality of the proposed method in this

paper, rather than designed a new

model.

Application to Subpixel Level DSM Generation

As illustrated in Figure 1, the workflow of this application

can be mainly divided i

nto two stages. Separately, in step I,

the image quality and sp

atial resolution is improved by

CNN

based

module. The

module is adopted to enhance the

object texture and edge detection, which helps extract more

reliable matching points for subsequent

DSM

generation pro-

cess. Then, with the obtained

images from step I as input,

a subpixel level

DSM

with high fidelity can be obtained from a

standard

DSM

generation procedure in step II. In this section,

the details of this application will be explained in the follow-

ing aspects: refinement details of

models,

DSM

generation

process, and quality assessment metrics.

Refinement of SR Models

In this subsection, we will introduce how to reconstruct a

image from its counterpart

image. For the ease of repre-

sentation, the

image is denoted as

, its corresponding

image is denoted as

, and

′

is the reconstructed result of

image through

CNN

architecture. Specifically,

is obtained

from

according to down-sampling or a special degradation

formula. Therefore, as illustrated in Figure 2, a robust archi-

tecture can be obtained by repeatedly minimizing the differ-

ence between

image

and

′

. To measure the difference

between real

image

and predicted

image

′

, mean

squared error (

MSE

), is usually used as Equation 1,

MSE

width height

−

y y

′

(1)

766

October 2019

PHOTOGRAMMETRIC ENGINEERING & REMOTE SENSING

SEO Version

Warning.

You are currently viewing the SEO version of !text.
It has a number of design and functionality limitations.

We recommend viewing the Flash version or the basic HTML version of this publication.

699...,756,757,758,759,760,761,762,763,764,765 767,768,769,770,771,772,773,774,775,776,...778