CN103310450B - A kind of image partition method merging direct-connected commensurability bundle - Google Patents
A kind of image partition method merging direct-connected commensurability bundle Download PDFInfo
- Publication number
- CN103310450B CN103310450B CN201310237708.4A CN201310237708A CN103310450B CN 103310450 B CN103310450 B CN 103310450B CN 201310237708 A CN201310237708 A CN 201310237708A CN 103310450 B CN103310450 B CN 103310450B
- Authority
- CN
- China
- Prior art keywords
- background
- prospect
- pixel
- direct
- region
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Landscapes
- Image Analysis (AREA)
- Image Generation (AREA)
Abstract
The invention discloses a kind of image partition method merging direct-connected commensurability bundle, belong to the crossing domains such as computer vision, computer graphics, image procossing.First pass through the user interface of application program, before interactive specified portions, background.Before then setting up specified portions, the color model of background.It is configured to figure G and the energy function of segmentation.Energy function comprises color constraint and gradient constraint, and direct-connected commensurability bundle proposed by the invention.Finally, employing figure cuts Algorithm for Solving function minimum, obtains segmentation result.If user can add front background clue again to segmentation result is dissatisfied, flow process repetitive cycling performs, until obtaining satisfied segmentation effect.Present invention firstly provides the dividing method merging direct-connected commensurability bundle, compare only color and the traditional method of gradient constraint, on the premise of identical interactive quantity, the present invention is when processing from UNICOM's Object Segmentation, and segmentation effect is more preferable.
Description
Technical field
The invention belongs to the crossing domains such as computer vision, computer graphics, image procossing, relate to a kind of fusion direct-connected
The interactive image segmentation method of commensurability bundle.
Background technology
Interactive image segmentation be by before the mutual specified portions of a small amount of user, background, then utilize algorithm realize will sense
The process that the object of interest splits from image.Image segmentation can be applicable to image synthesis, production of film and TV, and medical science
The fields such as graphical analysis.Used by interactive image segmentation, optimized algorithm type is various.Wherein, energy can be obtained owing to figure cuts algorithm
The optimal solution of flow function and obtain extensive concern.
It is currently based on figure and cuts the interactive image segmentation method of algorithm, need user's interactive mode to specify a small amount of fore/background picture
Element or super-pixel, in order to set up the distribution of color of fore/background.Then, set up energy function, comprise: 1) color constraint, 2) gradient
Constraint.Mostly be from UNICOM in view of object to be split, and color and gradient constraint not from terms of UNICOM in addition about
Bundle.Retrain from UNICOM if can add, it will improve effect and the efficiency of partitioning algorithm.Work on hand also has by connectedness about
Bundle incorporates in energy function.But, these connection constraints are excessively complicated, had a strong impact on calculating speed.
Summary of the invention
It is an object of the invention to explore simply but effectively connect constraint, and merge the dividing method of connection constraint, carry
The efficiency of high cutting procedure.
For achieving the above object, the present invention adopts the following technical scheme that: user's user interface by application program, alternately
Before formula specified portions, background.Then, before setting up specified portions, the color model of background.It is configured to figure G and the energy of segmentation
Flow function.Energy function comprises color constraint and gradient constraint, and direct-connected commensurability bundle proposed by the invention.Finally, use
Figure cuts Algorithm for Solving function minimum, obtains segmentation result.If user can add front background again to segmentation result is dissatisfied
Clue, flow process repetitive cycling performs, until obtaining satisfied segmentation effect.
Compared with prior art, the innovation of the present invention is: propose to merge the dividing method of direct-connected commensurability bundle first.
Comparing only color and the traditional method of gradient constraint, on the premise of identical interactive quantity, the present invention is processing from UNICOM's object
During segmentation, segmentation effect is more preferable.
Accompanying drawing explanation
Fig. 1 is method flow diagram involved in the present invention;
Fig. 2 is that straight connected region involved in the present invention calculates schematic diagram;
Fig. 3 is multizone direct-connected commensurability bundle schematic diagram;
Fig. 4 is application example experimental result of the present invention: (a) is classified as the image of input and the inventive method and control methods
Before Suo Yong, background clue, dark color is prospect, and light color is background;B () is classified as the segmentation result of the present invention;C () row are to analogy
The segmentation result of method.
Detailed description of the invention
The present invention will be further described with detailed description of the invention below in conjunction with the accompanying drawings.
The flow process of the present invention is as it is shown in figure 1, specifically include following steps:
Step one, before interpolation, background clue.
After reading in piece image, user is by background pixel before the specified portions of interface.The present invention uses Yin Li et al.
The side used in the paper " lazy snapping " delivered on " ACM Transactions on Graphics " for 2004
Formula, i.e. by input equipments such as mouse, touch screen or writing pencils, delineates the lines of different colours on image, carrys out specifying part
Before Fen, background pixel.As shown in Fig. 4 (a) arranges, the pixel that dark strokes covers belongs to prospect, and the pixel that light color lines cover belongs to
In background.
Step 2, structural map and energy function.
Image can be expressed as a non-directed graph G=<ν, ε>.ν is the node set in figure G, and ε is the set on limit.In figure G
Each node p (p ∈ v), the super-pixel after a pixel of correspondence image or over-segmentation.Assuming that before the interactive part demarcated
Scene element belongs to set F, and background pixel belongs to set B, and rest of pixels belongs to set U, then image segmentation can be considered a binary
Labelling problem, each node one the unique labelling l of distribution being in set Up, lp{ 1,0} represents prospect and the back of the body to ∈ respectively
Scape.Above-mentioned labelling assignment problem can solve by minimizing following energy function:
In formula, E1(lp) and E3(lp) it is unit item, represent and node p (p ∈ ν) is labeled as lp∈ { cost when 1,0}, E2
(lp,lq) it is binary item, for representing cost when neighbor takes not isolabeling respectively;λ is weight.
Functionally, E1(lp)、E2(lp,lq) and E3(lp) it is called again color bound term, gradient constraint item and straight
Connection geometrical constraint item.According to E1(lp) definition, the color of node p is the biggest with the color similarity of the foreground part specified, then
lpThe probability taking prospect labelling is the biggest, i.e. lp=1;Vice versa.According to E2(lp,lq) definition, the gradient between adjacent node
Being worth the biggest, the probability as partitioning boundary is the biggest.According to E3(lp) definition, the node running counter to direct-connected commensurability bundle will directly be marked
It is designated as background.Due to E3(lp) it is to obligate, the size of weight λ does not produce impact to it.Therefore, λ is only used for regulating E2(lp,
lq) relative to E1(lp) importance.The present invention tests setting constant λ=1000.By E1(lp) and E2(lp,lq) it is referred to as non-geometric
Bound term, E3(lp) it is referred to as geometrical constraint item.
(1) definition non-geometric bound term
For E1(lp), the present invention uses Yin Li et al. in 2004 at " ACM Transactions on
Graphics " on definition mode in the paper " lazy snapping " delivered: first, by the pixel in F and B is carried out
K-means clusters, and obtains 64 prospect classes bunchWith 64 background classes bunchThen, by E1(lp) fixed
Justice is:
In formula,Representing the distance that pixel p is distributed to front, background color respectively, its expression formula is respectively as follows:
Gradient constraint item E2(lp,lq) it is defined as:
In formula, Dp,q=| | Cp-Cq| | represent the colour-difference between node p and q.
(2) direct-connected logical geometrical constraint item is defined
The definition of direct-connected logical geometrical constraint item is also based on the lines of user's input, and its ultimate principle is: if pixel p quilt
Background line covers, and the central point for any prospect line is invisible, then it is assumed that p runs counter to direct-connected commensurability bundle, and p is directly labeled as the back of the body
Scape, i.e. lp=0.
E3(lp) it is defined as:
E3(lp=1)=∞, E3(lp=0)=0,
This bound term specifies that the pixel being positioned at α region is judged to background pixel by rigid.Region alpha is defined as the district that is blocked
Territory, i.e. for any prospect lines central point, the node within region alpha is the most invisible.
As in figure 2 it is shown, in the case of only background line and a prospect line, by the prospect line of user's labelling
Heart point is expressed as O, the beginning and end of background line is designated as X and Y, ray OX and OY respectively and defines a piece of fan expanded outwardly
Shape region, sector region has been further separated into two parts α and β by straight line XY.For any one pixel p, it is judged that whether it
The method belonging to α region is as follows:
1) if as in figure 2 it is shown, ∠ XOY > ∠ pOX and ∠ XOY > ∠ pOY, then p ∈ α ∪ β, i.e. ray OX enclose with ray OY
The sector region become.
2) if p is ∈ α ∪ β, and ∠ pYO > ∠ XYO, then p ∈ α, i.e. pixel p belong to α region (such as the light gray in Fig. 2
Region).
On same piece image, often have a plurality of prospect lines and background lines, for M bar prospect line and N bar background
Line (as it is shown on figure 3, M=1, N=2), the α region after superposition is represented by:
α=(α11∪α12∪...∪α1n∪...∪α1N)∪(α21∪α22∪...∪α2n∪...∪α2N)
∪...∪(αM1∪αM2∪...∪αmn∪...∪αMN)
In formula, αmnRepresent the region that is blocked that the m article prospect lines and nth bar background lines are formed.
Step 3, employing figure cuts algorithm and seeks the optimal solution of energy function.
The present invention uses Yuri Boykov et al. to deliver on " IEEE Transaction on PAMI " in 2004
Paper " An Experimental Comparison of Min-Cut/Max-Flow Algorithms for Energy
Minimization in Vision " proposed in figure cut algorithm, seek the optimal solution of energy function, obtain segmentation result.With
Family, as being unsatisfied with segmentation effect, can return to step one, before continuing to add, background clue.Often add lines, be either
Specified portions prospect or background, all will trigger the cutting procedure of Single cell fusion direct-connected commensurability bundle.
An application example of the present invention is given below.
This experiment with the dividing method without direct-connected commensurability bundle, i.e. Yin Li et al. in 2004 at " ACM
Transactions on Graphics " on the method that proposes in the paper " lazy snapping " delivered be contrast object.Note
Meaning, figure is all defined in pixel scale by this experiment.Fig. 4 is segmentation effect contrast.Fig. 4 (a) is the image and this inputted
Before used by inventive method and control methods, background clue, dark color is prospect, and light color is background;Fig. 4 (b) is dividing of the present invention
Cut result;Fig. 4 (c) is the segmentation result of control methods.It is seen that on the premise of identical interactive quantity, the present invention's
Method can obtain complete cutting object.And a part of error flag of background area is often prospect by control methods,
A part such as the background Folium Alangii in 4 (c) is labeled as prospect by mistake.Large stretch of background area in little STOWAGE PLAN is also by mistake terrestrial reference
It is designated as prospect.
Claims (1)
1. the image partition method merging direct-connected commensurability bundle, it is characterised in that comprise the following steps:
Step one, before interpolation, background clue;
After reading in piece image, use the input equipments such as mouse, touch screen or writing pencil, by delineating different face on image
Before the lines specified portions of color, background pixel;
Step 2, structural map and energy function;
Image can be expressed as a non-directed graph G=<ν, ε>, ν is the node set in figure G, and ε is the set on limit;Figure G in every
Super-pixel after one pixel of individual node p (p ∈ v) correspondence image or over-segmentation;Assuming that the interactive part prospect picture demarcated
Element belongs to set F, and background pixel belongs to set B, and rest of pixels belongs to set U, then image segmentation can be considered a binary flag
Problem, each node one the unique labelling l of distribution being in set Up∈ { 1,0}, lp=1 represents prospect, lp=0 represents
Background;Above-mentioned labelling assignment problem can solve by minimizing following energy function:
In formula, E1(lp) and E3(lp) it is unit item, represent and node p (p ∈ ν) is labeled as lp∈ { cost when 1,0}, E2(lp,
lq) it is binary item, for representing cost when neighbor takes not isolabeling respectively;λ is weight;
E1(lp)、E2(lp,lq) and E3(lp) it is called again color bound term, gradient constraint item and direct-connected logical geometrical constraint item;Press
According to E1(lp) definition, the color of node p is the biggest with the color similarity of the foreground part specified, lpTake the possibility of prospect labelling
Property is the biggest, i.e. lp=1;Vice versa;According to E2(lp,lq) definition, the Grad between adjacent node is the biggest, as segmentation side
The probability on boundary is the biggest;According to E3(lp) definition, the node running counter to direct-connected commensurability bundle will be directly labeled as background;E3(lp) be
Obligating, the size of weight λ does not produce impact to it, and λ is only used for regulating E2(lp,lq) relative to E1(lp) importance;
(1) definition color bound term, gradient constraint item E1(lp) and E2(lp,lq);
For E1(lp), first, by the pixel in F and B is carried out K-means cluster, obtain 64 prospect classes bunchWith 64 background classes bunchThen, by E1(lp) it is defined as:
In formula,Representing the distance that pixel p is distributed to front, background color respectively, its expression formula is respectively as follows:
E2(lp,lq) it is defined as:
In formula, Dp,q=| | Cp-Cq| | represent the colour-difference between node p and q;Cp、CqRepresent the color value of node p and q respectively;Represent the color value of kth class bunch in prospect class bunch and background classes bunch respectively;
(2) direct-connected logical geometrical constraint item E is defined3(lp)
The definition of direct-connected logical geometrical constraint item is also based on the lines of user's input, and its ultimate principle is: if pixel p is by background
Line covers, and the central point for any prospect line is invisible, then it is assumed that p runs counter to direct-connected commensurability bundle, and p is directly labeled as background,
I.e. lp=0;
E3(lp) it is defined as:
This bound term specifies that the pixel being positioned at α region is judged to background pixel by rigid;Region alpha is defined as the region that is blocked,
I.e. for any prospect lines central point, the node within region alpha is the most invisible;
In the case of only background line and a prospect line, the central point of the prospect line of labelling is expressed as O, by background
The beginning and end of line is designated as X and Y, ray OX and OY respectively and defines a piece of sector region expanded outwardly, and straight line XY will fan
Shape region has been further separated into two parts α and β;For any one pixel p, it is judged that whether it belongs to the method in α region such as
Under:
1) if ∠ XOY > ∠ pOX and ∠ XOY > ∠ pOY, then p ∈ α ∪ β, the sector region that i.e. ray OX surrounds with ray OY;
2) if p is ∈ α ∪ β, and ∠ pYO > ∠ XYO, then p ∈ α, i.e. pixel p belong to α region;
On same piece image, for M bar prospect line and N bar background line, the α region after superposition is represented by:
α=(α11∪α12∪...∪α1n∪...∪α1N)∪(α21∪α22∪...∪α2n∪...∪α2N)
∪...∪(αM1∪αM2∪...∪αmn∪...∪αMN)
In formula, αmnRepresent the region that is blocked that the m article prospect lines and nth bar background lines are formed;
Step 3, employing figure cuts algorithm and seeks the optimal solution of energy function;
Use Y figure to cut algorithm and seek the optimal solution of energy function (1), obtain segmentation result;As segmentation effect is unsatisfied with, can return to
Step one, continue add before, background clue;Often add lines, either for specified portions prospect or background, all incite somebody to action
Trigger the cutting procedure of Single cell fusion direct-connected commensurability bundle.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310237708.4A CN103310450B (en) | 2013-06-17 | 2013-06-17 | A kind of image partition method merging direct-connected commensurability bundle |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310237708.4A CN103310450B (en) | 2013-06-17 | 2013-06-17 | A kind of image partition method merging direct-connected commensurability bundle |
Publications (2)
Publication Number | Publication Date |
---|---|
CN103310450A CN103310450A (en) | 2013-09-18 |
CN103310450B true CN103310450B (en) | 2016-12-28 |
Family
ID=49135627
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201310237708.4A Expired - Fee Related CN103310450B (en) | 2013-06-17 | 2013-06-17 | A kind of image partition method merging direct-connected commensurability bundle |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103310450B (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103942785A (en) * | 2014-04-09 | 2014-07-23 | 苏州大学 | Lung tumor segmentation method based on PET and CT images of image segmentation |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1612733A2 (en) * | 2004-06-28 | 2006-01-04 | Microsoft Corporation | Color segmentation-based stereo 3D reconstruction system and process |
CN102332097A (en) * | 2011-10-21 | 2012-01-25 | 中国科学院自动化研究所 | Method for segmenting complex background text images based on image segmentation |
CN102360494A (en) * | 2011-10-18 | 2012-02-22 | 中国科学院自动化研究所 | Interactive image segmentation method for multiple foreground targets |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7400757B2 (en) * | 2001-10-04 | 2008-07-15 | Siemens Medical Solutions Usa, Inc. | System and method for segmenting the left ventricle in a cardiac image |
-
2013
- 2013-06-17 CN CN201310237708.4A patent/CN103310450B/en not_active Expired - Fee Related
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1612733A2 (en) * | 2004-06-28 | 2006-01-04 | Microsoft Corporation | Color segmentation-based stereo 3D reconstruction system and process |
CN102360494A (en) * | 2011-10-18 | 2012-02-22 | 中国科学院自动化研究所 | Interactive image segmentation method for multiple foreground targets |
CN102332097A (en) * | 2011-10-21 | 2012-01-25 | 中国科学院自动化研究所 | Method for segmenting complex background text images based on image segmentation |
Also Published As
Publication number | Publication date |
---|---|
CN103310450A (en) | 2013-09-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Wang et al. | Associatively segmenting instances and semantics in point clouds | |
Gilani et al. | Deep, dense and accurate 3D face correspondence for generating population specific deformable models | |
CN107168527B (en) | The first visual angle gesture identification and exchange method based on region convolutional neural networks | |
Jänicke et al. | Brushing of attribute clouds for the visualization of multivariate data | |
CN109740686A (en) | A kind of deep learning image multiple labeling classification method based on pool area and Fusion Features | |
CN103246891B (en) | A kind of Chinese Sign Language recognition methods based on Kinect | |
CN105976378A (en) | Graph model based saliency target detection method | |
CN103337072B (en) | A kind of room objects analytic method based on texture and geometric attribute conjunctive model | |
CN103530633B (en) | Semantic mapping method of local invariant feature of image and semantic mapping system | |
CN103400386A (en) | Interactive image processing method used for video | |
CN109272467A (en) | A kind of stratification image partition method based on multi-scale edge clue | |
CN105354593B (en) | A kind of threedimensional model sorting technique based on NMF | |
CN105046689B (en) | A kind of interactive stereo-picture fast partition method based on multi-level graph structure | |
Chen et al. | 3-d instance segmentation of mvs buildings | |
CN104166988B (en) | A kind of stereo sync dividing method for incorporating sparse match information | |
CN104167013A (en) | Volume rendering method for highlighting target area in volume data | |
CN110222712B (en) | Multi-special-item target detection algorithm based on deep learning | |
Ma et al. | Location-aware box reasoning for anchor-based single-shot object detection | |
CN107330907B (en) | A kind of MRF image partition methods of combination deep learning shape prior | |
CN103310450B (en) | A kind of image partition method merging direct-connected commensurability bundle | |
Wang et al. | Semantic annotation for complex video street views based on 2D–3D multi-feature fusion and aggregated boosting decision forests | |
Shyr et al. | Supervised hierarchical Pitman-Yor process for natural scene segmentation | |
Zhong et al. | Robust image segmentation against complex color distribution | |
CN109919057B (en) | Multi-mode fusion gesture recognition method based on efficient convolutional neural network | |
Li et al. | Exploring 3D human action recognition: From offline to online |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20161228 Termination date: 20190617 |