Abstract
Face recognition is one of the challenging problems which suffer from practical issues like pose, expression, and illumination changes, and/or aging. Plastic surgery is one among the issues that poses great difficulty in recognizing the faces. The literature has been reported with traditional features and classifiers for recognizing the faces after plastic surgery. In order to reduce the computational complexity of high-dimensional feature descriptor and improve the accuracy of recognition algorithm, the paper proposes Volume based SIFT (V-SIFT) for accurate face recognition after the plastic surgery. The corresponding feature extracts the key points and volume of the scale-space structure for which the information rate is determined. This provides least effect on uncertain variations in the face since the volume is the higher order statistical feature. The corresponding V-SIFT features are applied to the Support vector machine for classification. The normal SIFT feature extracts the key points based on the contrast of the image and the
V- SIFT feature extracts the key points based on the volume of the structure. Thus V-SIFT provide better performance when compared with PCA and normal based feature extraction. The effectiveness of the algorithm is verified by experiments on the ORL face image database, which demonstrates good stability and robustness especially under the conditions of some confounding factors such as different facial expressions, postures and so on.
Save your time!
We can take care of your essay
- Proper editing and formatting
- Free revision, title page, and bibliography
- Flexible prices and money-back guarantee
Place an order
INTRODUCTION
In plastic surgery, rather than changing the global features, local features are often changed since they are more prominent for plastic face image. SIFT descriptors are invariant to uniform orientation, scaling and illumination changes, and hence it is even more robust to identify the objects. Moreover, SIFT descriptors are more effective over other contemporary local descriptors on structured as well as textured image.
In the proposed methodology, the testing image from the plastic surgery face database is initially processed under pre-processing for image enhancement, where it crops and resized. Next to this, the features are extracted from the pre-processed image with the aid of V-SIFT. Then the classification process is done using SVM classifier and results the recognized image.
SIFT DESCRIPTOR
The comprised computations of the SIFT descriptor is as follows:
(i) Detection of Scale-space extrema: Here, a scale space is constructed for the detection of the image’s blob structures, in which the interest points named key points are detected. Moreover, the true scale invariance is acquired from the along with factor. Hence, the selection of scale is done automatically by convolution with Laplacian function.
if there is any closer relation between scale of the image and value of Laplacian, the result is formulated by convolving the corresponding image with , and it will be considered as extremum. Hence for the blob detection and optimal scale representation, the points those are extrema in scale as well as spatial spaces are chosen.
(ii) Removal of unreliable key points: This is the second step; here the formulation of at all the key points is done. Once the formulation is completed, it is checked for whether the value is below the threshold. If so, then it means that the attained structure has less contrast (which is more sensitive to noise), then the corresponding key point will be detached. Further, for weak peaks in the normalized Laplacian of Gaussian, the formulation of the primary curvature ratio of every candidate is done and checks whether the ratio is below the threshold, if so, then the key point remains.
(iii) Assigning Orientation: Here, the key point is assigned in one or more orientations with the basis of gradient directions of local image.
(iv) Key point descriptor: Basically, around the key point location, all the gradient magnitude and orientations of the image are sampled and utilizes the key point’s scale for choosing the Gaussian blur’s level. Then the formulation of the feature descriptor is carried out as a set of histogram of size 16ˣ16 pixels are round the key points. 8 bins are present in every histogram and 4ˣ4 array of histograms are compiled in each descriptor, and that are around the key point. Hence the feature vector of each key point will be 4 × 4 × 8 = 128 dimension.
3.3 V-SIFT DESCRIPTOR
Generally, the Difference of Gaussian (DOG) in SIFT features has varied gains. When the noise is supplemented to the image or if the image is blurred, then the stability of the features that are extracted is not so effective. Thus to get the perfect matching, it is essential to extract the stable feature points. DOG is utilized to capture the scale invariant features and the establishment of DOG scale space is done by scale space function.
In this DOG scale space, the entire sample points are compared with 26 neighbours for the identification of 2D image space along with intense points. For any image, the target point must be compared to 8 and 18 neighbours in both the below and above scale. The formulation of true scale is carried out from with . Subsequently, the convolution is attained with the normalized Laplacian function and the scale selection is accomplished. In this algorithm, the operator named scale normalized Laplacian operator is utilized for the selection of scale. Let us consider a model configuration: A Gaussian function is a 2D blob model that characteristics length in any coordinates direction.
Further, the scale space demonstration of is followed from the Gaussian property, which is termed as semi group property, and the representation.
After certain algebraic influences, it could be revealed that for , there is a distinctive over scale, this supreme over scales is defined. Therefore, the scale that lies in the scale space supreme shows the characteristic size of the particular blob, which is the rationale behindhand, the detection of the scale space. It is well known that a second order Laplacian operator is highly sensitive to noise and hence the unreliable key points are eliminated by the next step, ‘unreliable key point removal’.
Here, initially, the formulation of is done for all the location of every candidate and if the obtained value is below the threshold, then the key point is detached. For instance, let the Gaussian blob structures be and with varied scales. Once the convolution is done with and results in 1D outputs.
Further, the key points that represent the blobs are distinguished at and extrema. If both and extrema are same, then in the algorithm, both and are similarly treated. Both of them will be either kept or removed.
However, if both and are varied structures with diverse scales, then they have not be considered as useful or useless equally. If is greater than , then it is determined that is fewer sensitive to noise but is more sensitive to noise. The key points that represents must be kept and at the same time the key points which denotes must be removed. Hence VSIFT is the contrast approach that has the module of eliminating irrelevant key points, which are based on volume of the structure.
The value of is formulated in the locality of every candidate key point, and if the value goes below the threshold given, then the key point must be removed or else it should be remained.
SVM RECOGNITION SYSTEM
The basic concept of SVM shows the hyperplane that separates the classes and has the largest margin between support vectors. SVM is a machine learning structure from data or information. It is important to learn the mapping strategy: , where is some object and is denoting the class label. For instance, consider the 2-class classification. Therefore, , , where denotes the radius and defines the number of pixels in the image.
- Training sets with prediction modalities: let us consider the input set and output set . The training set for the input and output sets be . By giving , the suitable must be found, which means the classifier must be learned: , where denote the parameter of the function .
For instance, if the modality is selected from the set of hyperlanes in: then the function of SVM classifier is defined, where denotes the bias that given to the classifier and represents the weight vector.
- Empirical and True risk: is learned by selecting the appropriate function, which outperforms on the training data, the empirical risk is defined where represents the zero-one loss function, , if and 0 otherwise. By learning this, the overall risk is reduced, where refers the indefinite joint distribution function of and .
- Hyperlane’s capacity: The hyperlane is considered, where is normalized in terms of set of points , such that , and is minimized to have low capacity, the reduction is correspondent to the gaining of large margin classifier.
- Linear Support Vector Machine: The function must be find to minimize the objective such as Training error and complexity, it can be defined. Consider a set of hyperplane to minimize the objective.
In our work, the obtained V-SIFT feature descriptor of the plastic surgery is given to SVM classifier, which aims to find the decision exterior that has the supreme distance to the point of varied classes.
Statistical Analysis
The statistical analysis of the plastic surgery face recognition describes the comparison of features such as Principle Component Analysis (PCA), SIFT and the proposed method Volume SIFT. The performance measures are analysed for all these features. The ranking of each measure is mentioned in bracket. The definitions all the measures are based on the Wikipedia source.
In the source, accuracy determines the degree of correctly classified face. Sensitivity is the measure of the method to correctly identify the positive samples while the specificity is the measure of the method to correctly identify the negative samples. Precision can give the ratio of positive against all the
positive results. FPR, FNR, NPV and FDR can correctly predict the incorrect identification and correct identification. The correctness of the classification algorithm and the efficacy of binary class classification can be determined by F1_Score and MCC.
But here all the measures are better for the V-SIFT feature. The ranking of all the measures is calculated and the final rank is best for EV SIFT feature when compared to the other feature extraction methods in linear SVM and quadratic SVM. So by examining the overall analysis, it is clear that the V-SIFT feature extraction is better for the plastic surgery face recognition purpose.
CONCLUSION
This paper has presented a face recognition technique that uses derived features based on V-SIFT approach. The corresponding system was evaluated using plastic surgery image database of 15 subjects where it contains each image of pre-surgery and post surgery faces. The proposed V-SIFT approach has obtained the volume of the structure and the contrast of the image and also it has removed all the unwanted key points effectively. The extracted features were applied to the SVM classifier for the recognition purpose. The performance measures were analyzed in different kernel of SVM with different existing features. Here V-SIFT feature was effective for producing the best performance. The parameters of SVM classifier such as radius and enlarge factor was varied. From the analysis it was clear that, the performance was better for varied values of radius and EF and it was not fixed. So here a proper tuning was needed for obtaining the fixed value. In future work, the analysis based on the tuning process will be done to have the accurate recognition of plastic surgery face.