[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

CN103310450B - A kind of image partition method merging direct-connected commensurability bundle - Google Patents

A kind of image partition method merging direct-connected commensurability bundle Download PDF

Info

Publication number
CN103310450B
CN103310450B CN201310237708.4A CN201310237708A CN103310450B CN 103310450 B CN103310450 B CN 103310450B CN 201310237708 A CN201310237708 A CN 201310237708A CN 103310450 B CN103310450 B CN 103310450B
Authority
CN
China
Prior art keywords
background
prospect
pixel
direct
region
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201310237708.4A
Other languages
Chinese (zh)
Other versions
CN103310450A (en
Inventor
马伟
刘倞
段立娟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing University of Technology
Original Assignee
Beijing University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing University of Technology filed Critical Beijing University of Technology
Priority to CN201310237708.4A priority Critical patent/CN103310450B/en
Publication of CN103310450A publication Critical patent/CN103310450A/en
Application granted granted Critical
Publication of CN103310450B publication Critical patent/CN103310450B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Image Analysis (AREA)
  • Image Generation (AREA)

Abstract

The invention discloses a kind of image partition method merging direct-connected commensurability bundle, belong to the crossing domains such as computer vision, computer graphics, image procossing.First pass through the user interface of application program, before interactive specified portions, background.Before then setting up specified portions, the color model of background.It is configured to figure G and the energy function of segmentation.Energy function comprises color constraint and gradient constraint, and direct-connected commensurability bundle proposed by the invention.Finally, employing figure cuts Algorithm for Solving function minimum, obtains segmentation result.If user can add front background clue again to segmentation result is dissatisfied, flow process repetitive cycling performs, until obtaining satisfied segmentation effect.Present invention firstly provides the dividing method merging direct-connected commensurability bundle, compare only color and the traditional method of gradient constraint, on the premise of identical interactive quantity, the present invention is when processing from UNICOM's Object Segmentation, and segmentation effect is more preferable.

Description

A kind of image partition method merging direct-connected commensurability bundle
Technical field
The invention belongs to the crossing domains such as computer vision, computer graphics, image procossing, relate to a kind of fusion direct-connected The interactive image segmentation method of commensurability bundle.
Background technology
Interactive image segmentation be by before the mutual specified portions of a small amount of user, background, then utilize algorithm realize will sense The process that the object of interest splits from image.Image segmentation can be applicable to image synthesis, production of film and TV, and medical science The fields such as graphical analysis.Used by interactive image segmentation, optimized algorithm type is various.Wherein, energy can be obtained owing to figure cuts algorithm The optimal solution of flow function and obtain extensive concern.
It is currently based on figure and cuts the interactive image segmentation method of algorithm, need user's interactive mode to specify a small amount of fore/background picture Element or super-pixel, in order to set up the distribution of color of fore/background.Then, set up energy function, comprise: 1) color constraint, 2) gradient Constraint.Mostly be from UNICOM in view of object to be split, and color and gradient constraint not from terms of UNICOM in addition about Bundle.Retrain from UNICOM if can add, it will improve effect and the efficiency of partitioning algorithm.Work on hand also has by connectedness about Bundle incorporates in energy function.But, these connection constraints are excessively complicated, had a strong impact on calculating speed.
Summary of the invention
It is an object of the invention to explore simply but effectively connect constraint, and merge the dividing method of connection constraint, carry The efficiency of high cutting procedure.
For achieving the above object, the present invention adopts the following technical scheme that: user's user interface by application program, alternately Before formula specified portions, background.Then, before setting up specified portions, the color model of background.It is configured to figure G and the energy of segmentation Flow function.Energy function comprises color constraint and gradient constraint, and direct-connected commensurability bundle proposed by the invention.Finally, use Figure cuts Algorithm for Solving function minimum, obtains segmentation result.If user can add front background again to segmentation result is dissatisfied Clue, flow process repetitive cycling performs, until obtaining satisfied segmentation effect.
Compared with prior art, the innovation of the present invention is: propose to merge the dividing method of direct-connected commensurability bundle first. Comparing only color and the traditional method of gradient constraint, on the premise of identical interactive quantity, the present invention is processing from UNICOM's object During segmentation, segmentation effect is more preferable.
Accompanying drawing explanation
Fig. 1 is method flow diagram involved in the present invention;
Fig. 2 is that straight connected region involved in the present invention calculates schematic diagram;
Fig. 3 is multizone direct-connected commensurability bundle schematic diagram;
Fig. 4 is application example experimental result of the present invention: (a) is classified as the image of input and the inventive method and control methods Before Suo Yong, background clue, dark color is prospect, and light color is background;B () is classified as the segmentation result of the present invention;C () row are to analogy The segmentation result of method.
Detailed description of the invention
The present invention will be further described with detailed description of the invention below in conjunction with the accompanying drawings.
The flow process of the present invention is as it is shown in figure 1, specifically include following steps:
Step one, before interpolation, background clue.
After reading in piece image, user is by background pixel before the specified portions of interface.The present invention uses Yin Li et al. The side used in the paper " lazy snapping " delivered on " ACM Transactions on Graphics " for 2004 Formula, i.e. by input equipments such as mouse, touch screen or writing pencils, delineates the lines of different colours on image, carrys out specifying part Before Fen, background pixel.As shown in Fig. 4 (a) arranges, the pixel that dark strokes covers belongs to prospect, and the pixel that light color lines cover belongs to In background.
Step 2, structural map and energy function.
Image can be expressed as a non-directed graph G=<ν, ε>.ν is the node set in figure G, and ε is the set on limit.In figure G Each node p (p ∈ v), the super-pixel after a pixel of correspondence image or over-segmentation.Assuming that before the interactive part demarcated Scene element belongs to set F, and background pixel belongs to set B, and rest of pixels belongs to set U, then image segmentation can be considered a binary Labelling problem, each node one the unique labelling l of distribution being in set Up, lp{ 1,0} represents prospect and the back of the body to ∈ respectively Scape.Above-mentioned labelling assignment problem can solve by minimizing following energy function:
E ( L ) = &Sigma; p &Element; v E 1 ( l p ) + &lambda; &Sigma; ( p , q ) &Element; &epsiv; , l p &NotEqual; l q E 2 ( l p , l q ) + &Sigma; p &Element; &alpha; E 3 ( l p )
In formula, E1(lp) and E3(lp) it is unit item, represent and node p (p ∈ ν) is labeled as lp∈ { cost when 1,0}, E2 (lp,lq) it is binary item, for representing cost when neighbor takes not isolabeling respectively;λ is weight.
Functionally, E1(lp)、E2(lp,lq) and E3(lp) it is called again color bound term, gradient constraint item and straight Connection geometrical constraint item.According to E1(lp) definition, the color of node p is the biggest with the color similarity of the foreground part specified, then lpThe probability taking prospect labelling is the biggest, i.e. lp=1;Vice versa.According to E2(lp,lq) definition, the gradient between adjacent node Being worth the biggest, the probability as partitioning boundary is the biggest.According to E3(lp) definition, the node running counter to direct-connected commensurability bundle will directly be marked It is designated as background.Due to E3(lp) it is to obligate, the size of weight λ does not produce impact to it.Therefore, λ is only used for regulating E2(lp, lq) relative to E1(lp) importance.The present invention tests setting constant λ=1000.By E1(lp) and E2(lp,lq) it is referred to as non-geometric Bound term, E3(lp) it is referred to as geometrical constraint item.
(1) definition non-geometric bound term
For E1(lp), the present invention uses Yin Li et al. in 2004 at " ACM Transactions on Graphics " on definition mode in the paper " lazy snapping " delivered: first, by the pixel in F and B is carried out K-means clusters, and obtains 64 prospect classes bunchWith 64 background classes bunchThen, by E1(lp) fixed Justice is:
E 1 ( l p = 1 ) = 0 E 1 ( l p = 0 ) = &infin; &ForAll; p &Element; F E 1 ( l p = 1 ) = &infin; E 1 ( l p = 0 ) = 0 &ForAll; p &Element; B E 1 ( l p = 1 ) = f 1 ( D p F , D p B ) E 1 ( l p = 0 ) = f 0 ( D p F , D p B ) &ForAll; p &Element; U
f 1 ( D p F , D p B ) = D p F D p F + D p B
f 0 ( D p F , D p B ) = D p B D p F + D p B
In formula,Representing the distance that pixel p is distributed to front, background color respectively, its expression formula is respectively as follows:
D p F = min k = 1 , . . . , 64 | | C p - C k F | |
D p B = min k = 1 , . . . , 64 | | C p - C k B | |
Gradient constraint item E2(lp,lq) it is defined as:
E 2 ( l p , l q ) = 1 1 + D p , q 2
In formula, Dp,q=| | Cp-Cq| | represent the colour-difference between node p and q.
(2) direct-connected logical geometrical constraint item is defined
The definition of direct-connected logical geometrical constraint item is also based on the lines of user's input, and its ultimate principle is: if pixel p quilt Background line covers, and the central point for any prospect line is invisible, then it is assumed that p runs counter to direct-connected commensurability bundle, and p is directly labeled as the back of the body Scape, i.e. lp=0.
E3(lp) it is defined as:
E3(lp=1)=∞, E3(lp=0)=0,
This bound term specifies that the pixel being positioned at α region is judged to background pixel by rigid.Region alpha is defined as the district that is blocked Territory, i.e. for any prospect lines central point, the node within region alpha is the most invisible.
As in figure 2 it is shown, in the case of only background line and a prospect line, by the prospect line of user's labelling Heart point is expressed as O, the beginning and end of background line is designated as X and Y, ray OX and OY respectively and defines a piece of fan expanded outwardly Shape region, sector region has been further separated into two parts α and β by straight line XY.For any one pixel p, it is judged that whether it The method belonging to α region is as follows:
1) if as in figure 2 it is shown, ∠ XOY > ∠ pOX and ∠ XOY > ∠ pOY, then p ∈ α ∪ β, i.e. ray OX enclose with ray OY The sector region become.
2) if p is ∈ α ∪ β, and ∠ pYO > ∠ XYO, then p ∈ α, i.e. pixel p belong to α region (such as the light gray in Fig. 2 Region).
On same piece image, often have a plurality of prospect lines and background lines, for M bar prospect line and N bar background Line (as it is shown on figure 3, M=1, N=2), the α region after superposition is represented by:
α=(α11∪α12∪...∪α1n∪...∪α1N)∪(α21∪α22∪...∪α2n∪...∪α2N)
∪...∪(αM1∪αM2∪...∪αmn∪...∪αMN)
In formula, αmnRepresent the region that is blocked that the m article prospect lines and nth bar background lines are formed.
Step 3, employing figure cuts algorithm and seeks the optimal solution of energy function.
The present invention uses Yuri Boykov et al. to deliver on " IEEE Transaction on PAMI " in 2004 Paper " An Experimental Comparison of Min-Cut/Max-Flow Algorithms for Energy Minimization in Vision " proposed in figure cut algorithm, seek the optimal solution of energy function, obtain segmentation result.With Family, as being unsatisfied with segmentation effect, can return to step one, before continuing to add, background clue.Often add lines, be either Specified portions prospect or background, all will trigger the cutting procedure of Single cell fusion direct-connected commensurability bundle.
An application example of the present invention is given below.
This experiment with the dividing method without direct-connected commensurability bundle, i.e. Yin Li et al. in 2004 at " ACM Transactions on Graphics " on the method that proposes in the paper " lazy snapping " delivered be contrast object.Note Meaning, figure is all defined in pixel scale by this experiment.Fig. 4 is segmentation effect contrast.Fig. 4 (a) is the image and this inputted Before used by inventive method and control methods, background clue, dark color is prospect, and light color is background;Fig. 4 (b) is dividing of the present invention Cut result;Fig. 4 (c) is the segmentation result of control methods.It is seen that on the premise of identical interactive quantity, the present invention's Method can obtain complete cutting object.And a part of error flag of background area is often prospect by control methods, A part such as the background Folium Alangii in 4 (c) is labeled as prospect by mistake.Large stretch of background area in little STOWAGE PLAN is also by mistake terrestrial reference It is designated as prospect.

Claims (1)

1. the image partition method merging direct-connected commensurability bundle, it is characterised in that comprise the following steps:
Step one, before interpolation, background clue;
After reading in piece image, use the input equipments such as mouse, touch screen or writing pencil, by delineating different face on image Before the lines specified portions of color, background pixel;
Step 2, structural map and energy function;
Image can be expressed as a non-directed graph G=<ν, ε>, ν is the node set in figure G, and ε is the set on limit;Figure G in every Super-pixel after one pixel of individual node p (p ∈ v) correspondence image or over-segmentation;Assuming that the interactive part prospect picture demarcated Element belongs to set F, and background pixel belongs to set B, and rest of pixels belongs to set U, then image segmentation can be considered a binary flag Problem, each node one the unique labelling l of distribution being in set Up∈ { 1,0}, lp=1 represents prospect, lp=0 represents Background;Above-mentioned labelling assignment problem can solve by minimizing following energy function:
E ( L ) = &Sigma; p &Element; v E 1 ( l p ) + &lambda; &Sigma; ( p , q ) &Element; &epsiv; , l p &NotEqual; l q E 2 ( l p , l q ) + &Sigma; p &Element; &alpha; E 3 ( l p ) - - - ( 1 )
In formula, E1(lp) and E3(lp) it is unit item, represent and node p (p ∈ ν) is labeled as lp∈ { cost when 1,0}, E2(lp, lq) it is binary item, for representing cost when neighbor takes not isolabeling respectively;λ is weight;
E1(lp)、E2(lp,lq) and E3(lp) it is called again color bound term, gradient constraint item and direct-connected logical geometrical constraint item;Press According to E1(lp) definition, the color of node p is the biggest with the color similarity of the foreground part specified, lpTake the possibility of prospect labelling Property is the biggest, i.e. lp=1;Vice versa;According to E2(lp,lq) definition, the Grad between adjacent node is the biggest, as segmentation side The probability on boundary is the biggest;According to E3(lp) definition, the node running counter to direct-connected commensurability bundle will be directly labeled as background;E3(lp) be Obligating, the size of weight λ does not produce impact to it, and λ is only used for regulating E2(lp,lq) relative to E1(lp) importance;
(1) definition color bound term, gradient constraint item E1(lp) and E2(lp,lq);
For E1(lp), first, by the pixel in F and B is carried out K-means cluster, obtain 64 prospect classes bunchWith 64 background classes bunchThen, by E1(lp) it is defined as:
E 1 ( l p = 1 ) = 0 E 1 ( l p = 0 ) = &infin; &ForAll; p &Element; F E 1 ( l p = 1 ) = &infin; E 1 ( l p = 0 ) = 0 &ForAll; p &Element; B E 1 ( l p = 1 ) = f 1 ( D p F , D p B ) E 1 ( l p = 0 ) = f 0 ( D p F , D p B ) &ForAll; p &Element; U
f 1 ( D p F , D p B ) = D p F D p F + D p B
f 0 ( D p F , D p B ) = D p B D p F + D p B
In formula,Representing the distance that pixel p is distributed to front, background color respectively, its expression formula is respectively as follows:
D p F = m i n k = 1 , .. , 64 | | C p - C k F | |
D p B = m i n k = 1 , .. , 64 | | C p - C k B | |
E2(lp,lq) it is defined as:
E 2 ( l p , l q ) = 1 1 + D p , q 2
In formula, Dp,q=| | Cp-Cq| | represent the colour-difference between node p and q;Cp、CqRepresent the color value of node p and q respectively;Represent the color value of kth class bunch in prospect class bunch and background classes bunch respectively;
(2) direct-connected logical geometrical constraint item E is defined3(lp)
The definition of direct-connected logical geometrical constraint item is also based on the lines of user's input, and its ultimate principle is: if pixel p is by background Line covers, and the central point for any prospect line is invisible, then it is assumed that p runs counter to direct-connected commensurability bundle, and p is directly labeled as background, I.e. lp=0;
E3(lp) it is defined as:
E 3 ( l p = 1 ) = &infin; , E 3 ( l p = 0 ) = 0 , &ForAll; p &Element; &alpha;
This bound term specifies that the pixel being positioned at α region is judged to background pixel by rigid;Region alpha is defined as the region that is blocked, I.e. for any prospect lines central point, the node within region alpha is the most invisible;
In the case of only background line and a prospect line, the central point of the prospect line of labelling is expressed as O, by background The beginning and end of line is designated as X and Y, ray OX and OY respectively and defines a piece of sector region expanded outwardly, and straight line XY will fan Shape region has been further separated into two parts α and β;For any one pixel p, it is judged that whether it belongs to the method in α region such as Under:
1) if ∠ XOY > ∠ pOX and ∠ XOY > ∠ pOY, then p ∈ α ∪ β, the sector region that i.e. ray OX surrounds with ray OY;
2) if p is ∈ α ∪ β, and ∠ pYO > ∠ XYO, then p ∈ α, i.e. pixel p belong to α region;
On same piece image, for M bar prospect line and N bar background line, the α region after superposition is represented by:
α=(α11∪α12∪...∪α1n∪...∪α1N)∪(α21∪α22∪...∪α2n∪...∪α2N)
∪...∪(αM1∪αM2∪...∪αmn∪...∪αMN)
In formula, αmnRepresent the region that is blocked that the m article prospect lines and nth bar background lines are formed;
Step 3, employing figure cuts algorithm and seeks the optimal solution of energy function;
Use Y figure to cut algorithm and seek the optimal solution of energy function (1), obtain segmentation result;As segmentation effect is unsatisfied with, can return to Step one, continue add before, background clue;Often add lines, either for specified portions prospect or background, all incite somebody to action Trigger the cutting procedure of Single cell fusion direct-connected commensurability bundle.
CN201310237708.4A 2013-06-17 2013-06-17 A kind of image partition method merging direct-connected commensurability bundle Expired - Fee Related CN103310450B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310237708.4A CN103310450B (en) 2013-06-17 2013-06-17 A kind of image partition method merging direct-connected commensurability bundle

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310237708.4A CN103310450B (en) 2013-06-17 2013-06-17 A kind of image partition method merging direct-connected commensurability bundle

Publications (2)

Publication Number Publication Date
CN103310450A CN103310450A (en) 2013-09-18
CN103310450B true CN103310450B (en) 2016-12-28

Family

ID=49135627

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310237708.4A Expired - Fee Related CN103310450B (en) 2013-06-17 2013-06-17 A kind of image partition method merging direct-connected commensurability bundle

Country Status (1)

Country Link
CN (1) CN103310450B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103942785A (en) * 2014-04-09 2014-07-23 苏州大学 Lung tumor segmentation method based on PET and CT images of image segmentation

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1612733A2 (en) * 2004-06-28 2006-01-04 Microsoft Corporation Color segmentation-based stereo 3D reconstruction system and process
CN102332097A (en) * 2011-10-21 2012-01-25 中国科学院自动化研究所 Method for segmenting complex background text images based on image segmentation
CN102360494A (en) * 2011-10-18 2012-02-22 中国科学院自动化研究所 Interactive image segmentation method for multiple foreground targets

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7400757B2 (en) * 2001-10-04 2008-07-15 Siemens Medical Solutions Usa, Inc. System and method for segmenting the left ventricle in a cardiac image

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1612733A2 (en) * 2004-06-28 2006-01-04 Microsoft Corporation Color segmentation-based stereo 3D reconstruction system and process
CN102360494A (en) * 2011-10-18 2012-02-22 中国科学院自动化研究所 Interactive image segmentation method for multiple foreground targets
CN102332097A (en) * 2011-10-21 2012-01-25 中国科学院自动化研究所 Method for segmenting complex background text images based on image segmentation

Also Published As

Publication number Publication date
CN103310450A (en) 2013-09-18

Similar Documents

Publication Publication Date Title
Wang et al. Associatively segmenting instances and semantics in point clouds
Gilani et al. Deep, dense and accurate 3D face correspondence for generating population specific deformable models
CN107168527B (en) The first visual angle gesture identification and exchange method based on region convolutional neural networks
Jänicke et al. Brushing of attribute clouds for the visualization of multivariate data
CN109740686A (en) A kind of deep learning image multiple labeling classification method based on pool area and Fusion Features
CN103246891B (en) A kind of Chinese Sign Language recognition methods based on Kinect
CN105976378A (en) Graph model based saliency target detection method
CN103337072B (en) A kind of room objects analytic method based on texture and geometric attribute conjunctive model
CN103530633B (en) Semantic mapping method of local invariant feature of image and semantic mapping system
CN103400386A (en) Interactive image processing method used for video
CN109272467A (en) A kind of stratification image partition method based on multi-scale edge clue
CN105354593B (en) A kind of threedimensional model sorting technique based on NMF
CN105046689B (en) A kind of interactive stereo-picture fast partition method based on multi-level graph structure
Chen et al. 3-d instance segmentation of mvs buildings
CN104166988B (en) A kind of stereo sync dividing method for incorporating sparse match information
CN104167013A (en) Volume rendering method for highlighting target area in volume data
CN110222712B (en) Multi-special-item target detection algorithm based on deep learning
Ma et al. Location-aware box reasoning for anchor-based single-shot object detection
CN107330907B (en) A kind of MRF image partition methods of combination deep learning shape prior
CN103310450B (en) A kind of image partition method merging direct-connected commensurability bundle
Wang et al. Semantic annotation for complex video street views based on 2D–3D multi-feature fusion and aggregated boosting decision forests
Shyr et al. Supervised hierarchical Pitman-Yor process for natural scene segmentation
Zhong et al. Robust image segmentation against complex color distribution
CN109919057B (en) Multi-mode fusion gesture recognition method based on efficient convolutional neural network
Li et al. Exploring 3D human action recognition: From offline to online

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20161228

Termination date: 20190617