
The MRF assumes that an individual pixel in an image is only affected by a subset, or clique, of the pixels in the image. If these pixels are directionally organized, the relationship can be expressed by an N-dimensional Markov chain, where N is the number of orientations used in the subset.

$$P(f_i \mid f_{j \neq i}) = P(f_i \mid f_k,\ k \in N_i) \tag{3.12}$$

Here, $N_i$ is the subset, normally chosen as the adjacent spatial neighborhood of the pixel $f_i$, and usually homogeneous, meaning that all pixels have the same neighborhood structure regardless of their positions.

For simplicity, a 4- or 8-adjacent neighbor structure (Figure 3.1) is utilized in 2D image analysis.

Figure 3.1: A point with four and eight neighbors.
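As an illustration, the 4- and 8-neighborhoods of Figure 3.1 can be enumerated with a small helper (a sketch; the function name and the boundary handling are my own choices, not part of the thesis):

```python
def neighbors(i, j, height, width, connectivity=4):
    """Return the in-bounds 4- or 8-neighbors of pixel (i, j)."""
    if connectivity == 4:
        offsets = [(-1, 0), (1, 0), (0, -1), (0, 1)]
    elif connectivity == 8:
        offsets = [(di, dj) for di in (-1, 0, 1) for dj in (-1, 0, 1)
                   if (di, dj) != (0, 0)]
    else:
        raise ValueError("connectivity must be 4 or 8")
    # Clip against the image border so corner/edge pixels get fewer neighbors.
    return [(i + di, j + dj) for di, dj in offsets
            if 0 <= i + di < height and 0 <= j + dj < width]
```

Interior pixels have 4 or 8 neighbors; border and corner pixels have fewer, which is the usual convention when the neighborhood is clipped at the image boundary.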

3.3.1 Gibbs Random Fields

A Gibbs distribution takes the form

$$P(f) = \frac{1}{Z} \exp\left(-\frac{U(f)}{T}\right) \tag{3.13}$$

where $Z$ is a normalizing factor called the partition function, $T$ is a constant called the temperature, normally assumed to be 1, and $U(f) = \sum_{C \in \mathcal{C}} V_C(f)$ is the energy function, where $C$ is one pixel clique and $\mathcal{C}$ is the set of all cliques in the image.

$V_C$ is called the potential function, which defines how the pixels are related. The potential function is often chosen to be pairwise, meaning that $V_C(f_i) = V(f_i, f_j)$.

If all the $f$ obey a Gibbs distribution, then they are called a Gibbs Random Field (GRF).
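To make the definitions concrete, here is a minimal sketch of evaluating a Gibbs energy over pairwise cliques (only horizontal and vertical neighbor pairs; the partition function Z is left as a placeholder argument, since computing it exactly is generally intractable):

```python
import math

def gibbs_energy(f, potential):
    """Sum pairwise potentials V(f_i, f_j) over horizontal and vertical
    neighbor cliques of a 2D field f given as a list of rows."""
    h, w = len(f), len(f[0])
    u = 0.0
    for i in range(h):
        for j in range(w):
            if i + 1 < h:                      # vertical clique
                u += potential(f[i][j], f[i + 1][j])
            if j + 1 < w:                      # horizontal clique
                u += potential(f[i][j], f[i][j + 1])
    return u

def gibbs_prob(f, potential, T=1.0, Z=1.0):
    """Gibbs probability exp(-U(f)/T) / Z; Z is a placeholder here."""
    return math.exp(-gibbs_energy(f, potential) / T) / Z
```

With a quadratic potential $V(a, b) = (a - b)^2$, a constant field has zero energy and therefore maximal (unnormalized) probability, which is exactly the smoothness preference the prior encodes.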


The Hammersley–Clifford theorem states that $F$ is an MRF on $S$ with respect to $N$ if and only if $F$ is a GRF on $S$ with respect to $N$. A proof can be found in [16].

The MRF describes the local property, while the GRF focuses on the global one. Both of them can be used to provide an easy interpretation of the prior.

3.3.2 Some Common Priors

The best prior contains as much accurate information about the desired result as possible, so designing a prior suited to the problem at hand is the best choice. However, designing a specific prior is not always easy.

Some priors have proved useful in most circumstances and are widely accepted. Among them are:

• Gaussian Model

When the Gibbs distribution is a multivariate Gaussian function, the prior is called a Gaussian MRF (GMRF). In this model,

$$V(C_x) = \gamma d_x^2, \quad V(C_y) = \gamma d_y^2, \quad V(C_{xy}) = \gamma d_{xy}^2, \quad V(C_{yx}) = \gamma d_{yx}^2$$

where $\gamma$ is a constant controlling the smoothness. For the first derivative,

$$d_x = f_{x+1,y} - f_{x,y}, \qquad d_y = f_{x,y+1} - f_{x,y}$$

$$d_{xy} = \tfrac{1}{\sqrt{2}}(f_{x+1,y+1} - f_{x,y}), \qquad d_{yx} = \tfrac{1}{\sqrt{2}}(f_{x+1,y-1} - f_{x,y})$$

The second derivative of this model could be

$$d_x^2 = f_{x-1,y} - 2f_{x,y} + f_{x+1,y}, \qquad d_y^2 = f_{x,y-1} - 2f_{x,y} + f_{x,y+1}$$

$$d_{xy}^2 = \tfrac{1}{2}f_{x-1,y-1} - f_{x,y} + \tfrac{1}{2}f_{x+1,y+1}, \qquad d_{yx}^2 = \tfrac{1}{2}f_{x-1,y+1} - f_{x,y} + \tfrac{1}{2}f_{x+1,y-1}$$

The GMRF encourages a smoother result.
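The first-difference potentials above can be evaluated directly; the sketch below sums the four squared directional differences $d_x, d_y, d_{xy}, d_{yx}$ over a whole image (an illustrative implementation, with the image passed as a list of rows):

```python
import math

def gmrf_energy(f, gamma=1.0):
    """Sum of the squared first differences d_x, d_y, d_xy, d_yx of the
    GMRF, weighted by the smoothness constant gamma."""
    h, w = len(f), len(f[0])
    u = 0.0
    for x in range(h):
        for y in range(w):
            if x + 1 < h:                      # d_x = f[x+1][y] - f[x][y]
                u += (f[x + 1][y] - f[x][y]) ** 2
            if y + 1 < w:                      # d_y = f[x][y+1] - f[x][y]
                u += (f[x][y + 1] - f[x][y]) ** 2
            if x + 1 < h and y + 1 < w:        # diagonal, scaled by 1/sqrt(2)
                u += ((f[x + 1][y + 1] - f[x][y]) / math.sqrt(2)) ** 2
            if x + 1 < h and y - 1 >= 0:       # anti-diagonal, same scaling
                u += ((f[x + 1][y - 1] - f[x][y]) / math.sqrt(2)) ** 2
    return gamma * u
```

A perfectly flat image has zero energy; any variation raises it, which is why minimizing this energy smooths the result.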

• Generalized Gaussian Model

The Generalized Gaussian MRF (GGMRF) introduces a factor $p$ into the GMRF to make it more flexible. The Gibbs distribution of the GGMRF is

$$P(x) = \frac{1}{Z}\exp\left(-\frac{|x|^p}{p}\right) \tag{3.14}$$

The GGMRF has a heavier tail than the GMRF when $1 < p < 2$.
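The tail behavior can be checked numerically with a small sketch (the function name `ggmrf_potential` and the single-site form $|d|^p/p$ are illustrative):

```python
def ggmrf_potential(d, p):
    """GGMRF potential |d|^p / p; p = 2 recovers the Gaussian (GMRF) case."""
    return abs(d) ** p / p
```

For a large difference such as $d = 10$, the potential at $p = 1.2$ is far smaller than at $p = 2$, so large jumps (edges) are penalized less severely than under the Gaussian model.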

• Huber Model

The Huber MRF (HMRF) uses the Huber function $\rho_\alpha$, so the Gibbs distribution has the form

$$P(x) = \frac{1}{Z}\exp(-\rho_\alpha(x)), \qquad \rho_\alpha(x) = \begin{cases} \frac{x^2}{2} & |x| \le \alpha \\ \alpha|x| - \frac{\alpha^2}{2} & |x| > \alpha \end{cases} \tag{3.15}$$

The HMRF encourages Gaussian smoothness when the pixel differences are within $|\alpha|$, while penalizing large differences less so as to preserve edges. For this reason, the HMRF is also called the edge-preserving MRF.
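The Huber penalty itself is easy to sketch; the branch at $|d| = \alpha$ is where the quadratic (smoothing) regime hands over to the linear (edge-preserving) regime:

```python
def huber(d, alpha=1.0):
    """Standard Huber function: quadratic for |d| <= alpha, linear beyond,
    so large (edge) differences are penalized less than a Gaussian would."""
    if abs(d) <= alpha:
        return 0.5 * d * d
    return alpha * abs(d) - 0.5 * alpha * alpha
```

Note the two branches agree at $|d| = \alpha$ (both give $\alpha^2/2$), so the penalty is continuous, and beyond the breakpoint it grows only linearly instead of quadratically.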

• Auto-Model

The Auto-MRF (AMRF) provides more flexibility to control the smoothness along any specific direction.

$$P(f_i \mid f_{N_i}) = \frac{1}{Z} \exp\Big(f_i G_i(f_i) + \sum_{i' \in N_i} \beta_{i'} f_i f_{i'}\Big) \tag{3.16}$$

where $G_i$ is an arbitrary function and the orientation smoothness is controlled by the parameters $\beta_{i'}$. If $f_i \in \{0,1\}$ or $f_i \in \{-1,1\}$, the auto-model is said to be an auto-logistic model. Furthermore, if the four-neighborhood structure in Figure 3.1 is selected, the model reduces to the Ising Model.

3.3.3 Model Optimization Method

The optimal pixel value is found by maximizing the posterior function (3.11). If the function has a quadratic form, it is efficient to use the Conjugate Gradient Ascent method [26]. Otherwise, a general gradient ascent method may be used. It is also possible to use Iterated Conditional Modes (ICM) [3] or the Simulated Annealing (SA) method.
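As an example of ICM, here is a minimal sketch for a discrete label field with a Potts-style smoothness term (the data cost, the value of β, and the sweep count are illustrative choices, not the setup used in this thesis):

```python
def icm(f, labels, data_cost, beta=1.0, sweeps=5):
    """Iterated Conditional Modes: greedily set each pixel to the label
    minimizing its local energy, data_cost(i, j, l) plus a penalty of
    beta for each disagreeing 4-neighbor."""
    h, w = len(f), len(f[0])
    for _ in range(sweeps):
        for i in range(h):
            for j in range(w):
                nbrs = [f[i + di][j + dj]
                        for di, dj in ((-1, 0), (1, 0), (0, -1), (0, 1))
                        if 0 <= i + di < h and 0 <= j + dj < w]
                f[i][j] = min(labels, key=lambda l: data_cost(i, j, l)
                              + beta * sum(l != n for n in nbrs))
    return f
```

ICM converges quickly but only to a local optimum, which is why Simulated Annealing is sometimes preferred when the energy landscape has many local minima.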

Chapter 4

Range Image Restoration via MRF

Due to the limitations of the hardware, the depth image is relatively low resolution and noisy. On the other hand, the intensity image has high contrast and contains some depth information (see Figure 4.1).

Figure 4.1: The depth (left) and intensity (right) images. The values have been scaled.

Furthermore, the SwissRanger has a very fast frame rate, so it is possible to acquire multiple images within a very short time interval. Both of these facts can be utilized to increase the resolution of a single depth image.

There has been very little research on multiple-view super-resolution of depth images: although we know the 3D objects are related by a 3D rigid transform and are projected onto the 2D plane, it is very difficult to find a transform between two 2D depth images. This prevents us from writing a formula like 3.4.

In reference [9], the authors proposed a method that utilizes a normal high-resolution photo as a prior to produce a single super-resolution depth image. The method is based on designing a new MRF. In this chapter, I will show the structure of this MRF and how to use it to restore our depth image.

4.1 Forming MRF

The idea in reference [9] is to exploit the information in a high-resolution image to restore a low-resolution depth image. The assumption is that depth discontinuities will also be reflected in this high-resolution image. The authors use conventional 2D camera photos because a depth difference often causes the brightness to change. The log-likelihood function, called the depth measurement potential, is:

$$\Psi = \sum_{i \in L} k(y_i - z_i)^2 \tag{4.1}$$

where $y$ is the restored image that we want to estimate, $z$ is the original depth measurement, $k$ is a positive constant weight, and $L$ is the set of all depth measurements.

The log depth smoothness prior is of the form

$$\Phi = \sum_i \sum_{j \in N(i)} w_{ij}(y_i - y_j)^2 \tag{4.2}$$

where $N$ is the neighbor clique and $w$ is the weight connecting the high-resolution information:

$$w_{ij} = \exp(-c\,u_{ij}) \tag{4.3}$$

where $c$ is a positive constant and

$$u_{ij} = \|x_i - x_j\|_2^2 \tag{4.4}$$

Here $x$ is the high-resolution image, so a small difference in $x$ results in a large $w$ and a smooth estimate, whereas a large difference gives a small $w$, which decreases the influence of the prior.
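Equations (4.3)–(4.4) can be sketched directly, treating each high-resolution pixel $x_i$ as a vector of channel values (the function name is my own):

```python
import math

def smoothness_weight(x_i, x_j, c=1.0):
    """w_ij = exp(-c * ||x_i - x_j||_2^2): near-identical intensities give
    a weight close to 1 (strong smoothing), a large jump gives a weight
    close to 0 (the prior is switched off across the edge)."""
    u_ij = sum((a - b) ** 2 for a, b in zip(x_i, x_j))
    return math.exp(-c * u_ij)
```

The constant $c$ controls how sharply the weight falls off: a larger $c$ makes the prior more sensitive to intensity edges.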

Then the normalized posterior probability is

$$p(y \mid x, z) = \frac{1}{Z}\exp\left(-\frac{1}{2}(\Psi + \Phi)\right) \tag{4.5}$$
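Since $\Psi + \Phi$ is quadratic in $y$, the MAP estimate can be found by simple gradient descent. Below is a one-dimensional sketch for a single scan line, with precomputed weights $w$ between adjacent pixels; the values of $k$, the step size, and the iteration count are illustrative choices:

```python
def restore_line(z, w, k=1.0, step=0.1, iters=500):
    """Minimize (Psi + Phi) / 2 for a 1-D depth signal by gradient descent:
    Psi = sum_i k (y_i - z_i)^2,  Phi = sum_i w_i (y_i - y_{i+1})^2."""
    y = list(z)
    n = len(y)
    for _ in range(iters):
        g = [k * (y_i - z_i) for y_i, z_i in zip(y, z)]  # data-term gradient
        for i in range(n - 1):                            # smoothness gradient
            d = w[i] * (y[i] - y[i + 1])
            g[i] += d
            g[i + 1] -= d
        y = [y_i - step * g_i for y_i, g_i in zip(y, g)]
    return y
```

With $w = 0$ the prior vanishes and the result equals the measurement $z$; a large $w$ pulls neighboring depths together, except across intensity edges, where Eq. (4.3) makes $w$ small and so leaves the depth discontinuity intact.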
