Vanishing point detection based on an artificial bee colony algorithm

Lei Han; Tanghuai Fan; Shengnan Zheng; Chenrong Huang

doi:10.1117/1.JEI.24.3.033024

23 June 2015 Vanishing point detection based on an artificial bee colony algorithm

Lei Han, Tanghuai Fan, Shengnan Zheng, Chenrong Huang

Author Affiliations +

Journal of Electronic Imaging, Vol. 24, Issue 3, 033024 (June 2015). https://doi.org/10.1117/1.JEI.24.3.033024

Abstract

Vanishing points (VPs) are crucial for inferring the three-dimensional structure of a scene and can be exploited in various computer vision applications. Previous VP detection algorithms have been proven effective but generally cannot guarantee a strong performance in both accuracy and computational time. We propose an artificial bee colony algorithm called dynamic clustering artificial bee colony (DCABC) that accurately and efficiently detects VPs in the image plane. The task is regarded as a dynamic line-clustering problem, and the line clusters are initialized by their orientation information. Inspired by the foraging behavior of bees, DCABC selects the clustering center and reclassifies the line segments based on a distance criterion until the terminating condition is met. The optimal line clusters determine the estimated VP. The dissimilarity among solutions is measured by the Hamming distance between two binary vectors, which simplifies the new solution construction. The performances of the proposed and existing algorithms are evaluated on the York Urban database. The results verify the efficiency and accuracy of our proposed algorithm.

1. Introduction

Vanishing point (VP) is defined as the convergence point of projected parallel lines in an image plane.¹ When a set of parallel lines in three-dimensional (3-D) space is projected onto a set of parallel or nonparallel lines in the two-dimensional (2-D) plane, the VP lies at infinity or at some finite distance, respectively. The VP is an invariant feature that provides important cues for inferring the 3-D structure of a real scene. Therefore, it is widely relevant to robotic navigation, visual measurement, camera calibration,² 3-D reconstruction, augmented reality, image understanding, and similar fields. Naturally, VP detection has been an important topic in the computer vision research community.

In this work, we propose a bioinspired scheme for detecting VPs. Our solution directly detects all possible nonorthogonal VPs in the image plane, without reliance on camera calibration parameters. Our algorithm offers several advantages over previous algorithms:

• The number of VPs need not be preconfigured. VP detection is considered as a dynamic clustering of line segments. The set of input segments is divided into initial clusters or marked as outliers, depending on prior information of their inclination angles. Specifically, segments in an image corresponding to a set of parallel lines in real space are clustered into groups of similarly inclined angles.
• The algorithm is both accurate and efficient. The line-clustering algorithm is solved by a dynamic clustering artificial bee colony (DCABC) algorithm. The searching scheme of the scout bee in the swarm can avoid local optima trapping and guarantees accurate solutions. To simplify the construction of new solutions and accelerate convergence of the DCABC algorithm, we apply the Hamming distance and introduce some heuristic rules.

This paper is organized as follows. Section 2 discusses related works in VP detection and artificial bee colony (ABC) algorithms. Section 3 states the VP detection problem from a clustering theory perspective. Section 4 details our proposed algorithm, and Sec. 5 presents and discusses the experimental results. The paper concludes with Sec. 6.

2. Related Works

2.1.

Vanishing Point Detection

Finding the VPs in 2-D perspective projections has received much attention since Barnard’s methodology was proposed in 1983.³ Each line group is often processed by the three elements of line extraction, line classification and VP estimation.

Many present studies focus on the second element. With regard to the number of detected VPs, these algorithms aim at estimating three estimate three orthogonal VPs,⁴^,⁵ or any present nonorthogonal VP.³^,⁶^,⁷ Based on their estimation strategy, line classification algorithms are roughly divided into two categories; algorithms that specify accumulator spaces and those that perform the clustering directly on the image plane.

In the former category, an open image plane is mapped into a bounded accumulator space. These algorithms aim for simple operation and do not discriminate between finite and infinite VPs.³^,⁴^,⁸ Each cell in the space accumulates the lines that pass through its corresponding image point. Cells that have accumulated the most lines will produce the candidate VPs. The performances of these algorithms are mainly determined by the selected accumulator space.

In the latter category, the workspace is the actual image plane.⁶^,⁹ Line clustering is generally decided by computations, such as the distances among points and lines. Typical computational methods are random sample consensus (RANSAC) and its variants (such as multi-RANSAC and J-Linkage).⁷^,¹⁰^–¹² These methods iteratively select the minimal sampling set of image features for computing a candidate VP, and retrieve the feature set consistent with that minimal sampling set. Another traditional approach is the EM approach,⁵^,¹³ which alternates between expectation and maximization steps (denoted as E and M steps, respectively). The E step estimates the line clustering from the given current or hypothesized VPs; the M step computes the VPs using the data clusters estimated in the E step.

Although accumulator space–based algorithms require the intrinsic camera parameters, clustering in the image plane operates in an uncalibrated setting. However, RANSAC and its variants do not guarantee an optimal solution, whereas the success of the EM approach depends on the initially estimated VPs. Our present study attempts to overcome the inherent limitations of conventional image plane clustering algorithms, and to detect possible nonorthogonal VPs.

2.2.

Artificial Bee Colony

An ABC algorithm, proposed by Karaboga,¹⁴ simulates the intelligent foraging behavior of honey bee swarms. Originally designed for solving multidimensional and multimodal optimization problems, the ABC algorithm has since been extended to other problems. For example, Akay and Karaboga¹⁵ concluded that ABC efficiently solves integer programming problems. The ABC variant DisABC, introduced by Kashan et al.,¹⁶ was designed for binary optimization.

In the ABC algorithm, the possible solutions to the optimization problem are represented as food sources; the nectar amount in each food source indicates the quality (fitness) of the potential solution. A swarm of bees investigates the optimum solution.¹⁷^–¹⁹ Each bee assumes one of the following three roles:

• Scouts, who explore the search space to find new candidate solutions in case the swarm fails to further exploit the current candidate solutions;
• Worker or employed bees, who exploit current candidate solutions;
• Onlooker bees, recruited by employed bees to further exploit candidate solutions.

The ABC algorithm randomly creates a set of candidate solutions, and each solution is assigned to one employed bee. It then iteratively executes three important steps to find new solutions.

In the first step, each employed bee searches for a new solution within the neighborhood of its current solution. Let $X_{i} = (x_{i 1}, \dots, x_{i d}, \dots, x_{i D})$ be the position of the $i$ ’th food source (the $i$ ’th solution to the problem), $D$ be the dimension of the problem, and $f (X_{i})$ be the amount of nectar (quality of the solution). The position of a new food source ( $X_{i}^{'} = (x_{i 1}^{'}, \dots, x_{i d}^{'}, \dots, x_{i D}^{'})$ ) is calculated as follows:

Eq. (1)

x_{i d}^{'} = x_{i d} + φ_{i d} (x_{i d} - x_{k d}),

where

φ_{i d}

is a randomly generated number in the interval

[- 1,1]

and

k

is a randomly generated index with

i, k \in {1,2, \dots, SN}

and

i \neq k

.

If the nectar amount $f (X_{i}^{'})$ is greater than $f (X_{i})$ , the artificial bee memorizes $X_{i}^{'}$ and shares her information with onlooker bees. The position of the $i$ ’th food source becomes $X_{i}^{'}$ . If $f (X_{i}^{'})$ does not exceed $f (X_{i})$ , the food source remains at $X_{i}$ .

In the second step, based on the information provided by the employed bees, each onlooker bee probabilistically chooses a candidate solution to further update. The probability is decided by the roulette wheel rule

Eq. (2)

p_{i} = \frac{{fit}_{i}}{\sum_{j = 1}^{SN} {fit}_{j}},

where

{fit}_{i}

is the fitness value of the

i

’th solution, obtained as follows:

Eq. (3)

{fit}_{i} = {\begin{cases} \frac{1}{1 + f (X_{i})} & f (X_{i}) \geq 0 \\ 1 + abs (f (X_{i})) & f (X_{i}) < 0 . \end{cases}

In the third step, employed bees whose solutions never improve after a predetermined number of trials (called the limit) become scouts and their solutions are abandoned. The scouts embark on random searches for new solutions. The new random position chosen by the scout is calculated as follows:

Eq. (4)

x_{i d}^{'} = x_{\min}^{d} + rand (0,1) (x_{\max}^{d} - x_{\min}^{d}) .

In Eq. (4), $x_{\min}^{d}$ and $x_{\max}^{d}$ are the lower and upper bounds of the food source position, respectively, in dimension $d$ .

3. Problem Statement

3.1.

Modeling the Problem of Vanishing Point Detection Using Artificial Bee Colony

VP detection is a typical chicken-and-egg problem: if the line clustering is known, then the VPs can be computed reciprocally; if the VPs are known, the line clustering can be retrieved. Therefore, given a set of features describing the linear structure in an image, VP estimation strategies typically proceed by clustering and estimation. The lines are first classified into groups with common VPs. The approximate VP is then located from the line cluster. Like other algorithms, our proposed algorithm alternately iterates the clustering and estimation steps. Especially, the line classification is modeled as an unsupervised clustering problem solved by a novel ABC, and its optimal result is processed by the estimation step.

One of the key issues in designing a successful algorithm for VP detection is to suitably assign the line classification solutions to the food sources in the ABC algorithm. In this study, a food source is defined as a binary vector representing a partition of the line set. Binary vector encoding proceeds as follows. Based on some heuristic information, the set of line segments in an image is initially partitioned into line clusters. VPs computed by those line clusters are then collected into a set. A food source is finally encoded as a binary vector whose dimensions are equal to the size of the VP set. Each vector component, either 1 or 0, indicates whether the VP at the corresponding set position is selected to repartition line clusters through the latter optimal steps. As shown in Fig. 1, 200 line segments ${ξ_{1}, ξ_{2}, \dots, ξ_{200}}$ extracted from an image are initially divided into seven subsets. A VP set ${v_{1}, v_{2}, \dots, v_{7}}$ is built from each subset producing one VP. A binary vector ${0,1, 1,0, 1,1, 1}$ is randomly generated as a food source (solution) to indicate the selected VP set ${v_{2}, v_{3}, v_{5}, v_{6}, v_{7}}$ . The set ${ξ_{1}, ξ_{2}, \dots, ξ_{200}}$ is then repartitioned so that each line segment is associated with one VP of the set ${v_{2}, v_{3}, v_{5}, v_{6}, v_{7}}$ or is marked as an outlier.

Fig. 1

Example showing the encoding procedures of a solution vector.

Another problem that arises when modeling VP detection using the ABC algorithm is how to evaluate and update food sources (solutions).¹⁷^,¹⁸ In the present study, the fitness of a solution is evaluated by validating the line cluster, thus determining whether the cluster is cohesive with low coupling. In particular, a distance function is introduced to measure the deviation between each VP and the corresponding line segment. Moreover, the solutions are updated by the three types of bees (employed bees, onlooker bees, and scouts), as described in Sec. 4.

3.2.

A Priori Information of Line Segments

The angle of two line segments in an image plane, which corresponds to the perspective view of two parallel lines in an object plane, is related to the shooting angle and the distances between the parallel lines and between the image and the object plane. The shooting angle is usually presented by the angle between the image and the object plane. More specifically, the angle of two segments widens as the shooting angle becomes larger or as the parallel lines become more separated, but decreases with an increasing distance of the two planes. In practice, the distances between parallel lines in a man-made environment are distributed within a certain range, whereas the scene images have often been shot from a general shooting angle at a medium-to-long distance. Therefore, the angle between two arbitrary line segments, corresponding to a family of parallel lines in 3-D space, is constrained within a small interval. Furthermore, we know a priori that parallel lines are often similarly inclined in medium-to-long range scenes. For example, consider the building image in Fig. 2(a). We first compute the inclination angle of each line segment in the image; next, we construct a histogram of the inclined angles [see Fig. 2(b)]. The inclined angles of the line segments clearly concentrate within several subintervals of the histogram.

Fig. 2

Example showing the features of line segments. (a) Image with detected line segments. (b) Histogram of inclined angles of segments in the image.

3.3.

Preliminary Processing of the Initial Cluster

The input to our algorithm is a set of line segments $E$ , which are obtained from the image ( $I$ ) using a line segment detector (LSD). Each line segment $ξ_{k} \in E$ is represented by its two endpoints expressed as homogenous coordinates $[\begin{matrix} p_{1 k} & p_{2 k} \end{matrix}]$ . From the input matrix, we can derive other attributes of $ξ_{k}$ such as the length ( ${len}_{k}$ ) and inclined angle ( $θ_{k}$ ). Among the many line segments typically contained in $E$ , very short segments contribute little information for the VP detection. Therefore, line segments shorter than 5% of the image height are filtered from set $E$ . Sequentially, outliers with anomalous inclined angles are also removed. To initialize the proposed algorithm, the set of remaining line segments is then partitioned into several groups. The preprocessing of set $E$ is detailed in the Secs. 3.3.1 and 3.3.2.

3.3.1.

Initialize the line clusters

Based on the priori information, the line segments are segregated by their inclined angles.

The interval of the inclined angle of line segments, $[0, π]$ , is first divided into several subintervals. The number and scale of a subinterval, denoted as $N_{A}$ and $α$ , respectively, are determined from the real situation. Empirically, we set $N_{A}$ to 36 and $α$ to $π / 36$ . Then based on their inclined angles, the line segments are split into $N_{A}$ subsets called angular domains, denoted as $A_{j}$ ( $j = 1,2, \dots, N_{A}$ ). In this way, line segments whose inclined angles occupy the same subinterval are collected into the same angular domain.

3.3.2.

Initializing the solution vectors

The number of line segments in each angular domain $A_{j}$ is denoted ${NA}_{j}$ ( $j = 1,2, \dots, N_{A}$ ). $A_{j}$ is marked as an invalid angular domain if ${NA}_{j}$ equals 0 or 1. If ${NA}_{j}$ is 1, we check whether the left and right neighborhoods of $A_{j}$ are both invalid angle domains. If true, the line segment is removed from $A_{j}$ ; otherwise, it is removed from $A_{j}$ and placed in the neighborhood containing more line segments than the other neighborhood.

We denote the number of valid angular domains as $D$ , and renumber each valid angular domain as ${AL}_{d}$ , where $d = 1,2, \dots, D$ . In each valid angular domain ${AL}_{d} \forall d = 1,2, \dots, D$ , an initial VP ( $v_{d}$ ) is determined as the cross-product between the two lines ( $l_{m}, l_{n}$ ) coincident with two segments ( $ξ_{m}, ξ_{n}$ ), which are randomly selected from ${AL}_{d}$ . The VPs of all valid angular domains are collected into a set $V$ . Next, SN binary vectors with $D$ dimensions corresponding to the set $V$ are randomly generated for defining the initial positions of food sources in the ABC algorithm. Each component of each binary vector indicates whether the corresponding VP in $V$ is selected as a reference point in the line segment classification.

Note that the new positions of the scout bees are also randomly generated by the above initialization procedure.

4. Artificial Bee Colony Algorithm for Vanishing Point Detection

4.1.

Notation

A novel ABC algorithm called DCABC is presented in this section. For this purpose, we first define some essential symbols:

$l_{k}$ is the line passing along segment $ξ_{k}$ , represented in homogeneous coordinate form.

$D$ is the maximum number of line classifications, determined by the number of valid angular domains.

${AL}_{d}$ is a valid angular domain indexed by $d$ with $d = 1,2, \dots, D$ .

$v_{d}$ is a VP initialized by the dth valid angular domain ( ${AL}_{d}$ ).

$V = {v_{d} \in R | d = 1, \dots, D}$ is the set of $D$ VPs, where $v_{d}$ is a VP corresponding to ${AL}_{d}$ .

$C_{d}$ is a line cluster of segments closer to $v_{d}$ than to any other VP in $V$ .

$X_{i} = (x_{i 1}, \dots, x_{i d}, \dots, x_{i D})$ is a binary vector of food source positions in the DCABC. If $x_{i d} = 1$ , the corresponding $v_{d}$ in $V$ has been selected as part of the solution proposed by $X_{i}$ . If $x_{i d} = 0$ , $v_{d}$ is excluded from the solution.

$X_{β}$ is the position of the best current food source.

$V_{i}$ is a subset of VPs, represented by a position vector $X_{i}$ , such that $V_{i} = (v_{d}) \forall d : x_{i d} = 1$ with $V_{i} \subseteq V$ .

$V_{r}$ is the set of VPs in $V$ that have not been selected by $X_{β}$ , such that $V_{r} = (v_{d}) \forall d : x_{β d} = 0$ with $V_{r} \subseteq V$ .

$d (ξ_{k}, v_{d})$ denotes the distance between the VP $v_{d}$ and the line segment $ξ_{k}$ .

4.2.

Partition Strategy and Validity Index

Essentially, this paper regards the line classification step in the VP detection as a clustering procedure. The clustering process, which separates the objects into groups, is realized by unsupervised or supervised learning. In unsupervised clustering (also known as dynamic clustering), the number of classes need not be specified in the training data. In supervised clustering, the number of classes must be predetermined. Our algorithm belongs to the former category.

4.2.1.

Distance function

In the DCABC, the position vector ( $X_{i}$ ) of a food source chooses some VPs as clustering centroids from the set ( $V$ ). The algorithm then assigns each segment $ξ_{k}$ to the cluster $C_{d}$ with the closest pseudocentroid $v_{d}$ , based on the distance function $d (ξ_{k}, v_{d})$ . We also specify a distance threshold (TH) that assigns a segment to the outliers if the segment is separated by more than TH from all selected VPs. Here, the deviation between a line segment and a VP is measured by the distance function. In a previous measurement method,²⁰ the orientation error was considered as the workspace. Thus, we define the distance [ $d (ξ_{k}, v_{d})$ ] between a VP ( $v_{d}$ ) and a segment ( $ξ_{k}$ ) as the absolute value of the sine of the angle between a line ( $l_{k}$ ) coincident with the segment and another line ( ${\hat{l}}_{k}$ ) connecting ( $v_{d}$ ) to the center point of the segment ( $ξ_{k}$ ). Both lines are defined in homogeneous coordinates as $l_{k} = {(e_{1}, e_{2}, e_{3})}^{T}$ and ${\hat{l}}_{k} = {({\hat{e}}_{1}, {\hat{e}}_{2}, {\hat{e}}_{3})}^{T}$ . The homogeneous coordinates of the two endpoints of $ξ_{k}$ are recorded in $p_{1 k}$ and $p_{2 k}$ , respectively, and $l_{k}$ , ${\hat{l}}_{k}$ and $d (ξ_{k}, {vp}_{d})$ are computed as follows:

Eq. (5)

l_{k} = p_{1 k} \times p_{2 k},

Eq. (6)

{\hat{l}}_{k} = {vp}_{d} \times \frac{1}{2} (p_{1 k} + p_{2 k}),

Eq. (7)

d (ξ_{k}, {vp}_{d}) = | \frac{- e_{2} {\hat{e}}_{1} + e_{1} {\hat{e}}_{2}}{\sqrt{e_{1}^{2} + e_{2}^{2}} \sqrt{{\hat{e}}_{1}^{2} + {\hat{e}}_{2}^{2}}} | .

In the above definition of the distance function, we adopt the absolute value because we are interested in the relative deviation between the orientations of the two lines, not the sign of this deviation. Moreover, the sine function is a suitable choice because $\sin θ$ approximately equals $θ$ when the deviation angle $θ$ is small.

4.2.2.

Line classification and evaluation

Based on the distance criterion, the set of line segments can be partitioned into several clusters. The quality of the partitioning can be sufficiently assessed by the compactness and separation measures. High compactness indicates that line segments in the same cluster share a high degree of similarity, while high separation means that line segments occupying different clusters are very dissimilar. The validity index of the clustering is given by

Eq. (8)

f_{i} = \frac{1}{D} \sum_{d = 1}^{D} \sum_{ξ_{k} \in C_{d}} d (ξ_{k}, v p_{d}) .

Since our model seeks the optimal minimum, the solution ( $X_{i}$ ) is evaluated by the following equation:

Eq. (9)

f (X_{i}) = \frac{1}{1 + f_{i}} .

4.3.

Generating a New Partition

The ABC algorithm was initially designed for solving numerical optimization problems, which are continuous problems, and has proven successful at this task. However, the present line classification task is treated as a dynamic clustering problem with binary optimization. To adapt the basic ABC algorithm to dynamic clustering problems, we introduce two changes. First, as mentioned previously, the position ( $X_{i}$ ) of a food source is represented as a binary vector indicating the selected clustering centroids. Second, when generating a new solution by Eq. (1), we substitute the “−” operator with a dissimilarity measure of the binary vectors. This substitution is inspired from the DisABC algorithm,¹⁶ but here we modify both the dissimilarity measure and solution reconstruction to ensure low complexity and fast convergence.

Before progressing further, we introduce some new notations. Suppose that $X_{i} = (x_{i 1}, x_{i 2}, \dots, x_{i D})$ and $X_{j} = (x_{j 1}, x_{j 2}, \dots, x_{j D})$ are binary vectors with $D$ dimensions. Let

Eq. (10)

H (X_{i}, X_{j}) = \sum_{d = 1}^{D} | x_{i d} - x_{j d} |,

be the Hamming distance between

X_{i}

and

X_{j}

, and specify the number of positions at which the corresponding bits of the two vectors differ. The indices of these positions are recorded in

Eq. (11)

S_{Index} (X_{i}, X_{j}) = {d | x_{i d} \neq x_{j d}} .

Reshaping Eq. (1) as $X_{i}^{'} {- X}_{i} = φ_{i} (X_{i} {- X}_{k})$ and replacing the “−” operator by the Hamming distance between the two binary vectors, the new position is expressed as

Eq. (12)

H (X_{i}^{'}, X_{i}) = round [φ_{i} \cdot H (X_{i}, X_{k})],

where the

round ()

function rounds the input value to an integer, and

φ_{i}

is a random number distributed within the interval [0,1]. We also define

T = round [φ_{i} \cdot H (X_{i}, X_{k})]

, and assume that

T

has been determined by Eq. (10).

In this work, we construct a new solution vector $X_{i}^{'}$ that equalizes $H (X_{i}^{'}, X_{i})$ and $T$ . In other words, the new vector $X_{i}^{'}$ contains $T$ bits whose values differ from those of the corresponding bits in $X_{i}$ . Therefore, $X_{i}^{'}$ is obtained by flipping $T$ bits of $X_{i}$ from 1 to 0, or from 0 to 1, employing problem-dependent heuristics to instruct the bit selection. Let us recap the problem of VPs detection.

As previously mentioned, a position vector $X_{i}$ indicates the VPs selected from the set $V$ , which partition the line segments into line clusters. Obviously, the more line segments in a line cluster, the better the VP estimation. Therefore, the number of line segments in each line cluster can be utilized as a heuristic for generating new solutions.

A component $x_{i d}$ of a solution vector $X_{i} = (x_{i 1}, x_{i 2}, \dots, x_{i D})$ corresponds to a line cluster $C_{i d}$ , and the number of line segments in $C_{i d}$ is denoted ${NL}_{i d}$ . Therefore, if component $x_{i d}$ is 0, $C_{i d}$ will be an empty set and ${NL}_{i d}$ will also be 0. We collect the current solution $X_{i}$ and its neighbor $X_{k}$ into an index set $SI = S_{Index} (X_{i}, X_{k})$ , and create two additional sets ${NLS}_{i}$ and ${NLS}_{k}$ that record the number of line segments in the line clusters indexed by the elements in SI. Specifically, for each index $d$ in SI, if $x_{i d}$ equals 1, ${NL}_{i d}$ is added to the set ${NLS}_{i}$ ; otherwise, ${NL}_{k d}$ is added to the set ${NLS}_{k}$ .

The new solution vector $X_{i}^{'}$ can be simply constructed from the obtained $T$ , ${NLS}_{i}$ , and ${NLS}_{k}$ . The candidate solution $X_{i}^{'}$ is initialized with a copy of $X_{i}$ and is computed in three steps: (1) select the bit within $X_{i}^{'}$ for which the number of line segments of the corresponding cluster is minimized in ${NLS}_{i}$ and switch it from 1 to 0; (2) select the bit within $X_{i}^{'}$ for which the number of line segments of the corresponding cluster is minimized in ${NLS}_{k}$ and switch it from 0 to 1; (3) delete the minimal value of ${NLS}_{i}$ and the maximal value of ${NLS}_{k}$ . These three steps are repeated until $T$ bits are flipped.

4.4.

Estimating a Vanishing Point for Segment Clustering

Once the line segments have been grouped into line clusters based on their estimated VPs, we refine the related VP locations for each line cluster $C_{d}$ . To this end, we compute the point in the image plane that minimizes the sum of the distances to the lines sharing a common VP. Using the definition of the distance between a VP and a line segment [Eq. (7)], we estimate the VP ${\hat{v}}_{d}$ of line cluster $C_{d}$ as

Eq. (13)

{\hat{v}}_{d} = \arg \min_{p} \sum_{ξ_{k} \in C_{d}} d (ξ_{k}, p) .

In practice, Eq. (13) is solved by a least-squares algorithm.

4.5.

Description of the Dynamic Clustering Artificial Bee Colony Algorithm

Based on the above analysis, the DCABC algorithm flow is presented in Fig. 3. The main steps of the algorithm include the following stages.

1. The initial solutions are determined and the parameters of the DCABC algorithm are initialized (Initiation phase, Fig. 4). The line clusters are initially constructed based on the similarity of the line segments’ inclined angles. The valid angular domains are then chosen from the initial clusters. For each valid angular domain, a VP can be computed as the cross-product between the two lines selected randomly from the angular domain. The next step is to randomly produce binary vectors which represent the initial solutions (see Sec. 3). These initial solutions are evaluated using Eq. (9). Finally, other DCABC parameters are empirically set, such as the colony size, the value of “limit,” the line clustering termination condition (Criterion 1), and the VP detection termination condition (Criterion 2).
2. Employed bees search new solutions within the adjacent space (Fig. 5). Eq. (1) is transformed to Eq. (12) in order to adapt to the binary searching problem, where the Hamming distance is introduced in Eq. (10) to construct new solutions. The fitness value of the new solution is computed from Eq. (9). If the fitness of the new solution is improved, then an employed bee would replace the old solution in its memory with the new solution.
3. The probability of $p_{i}$ is calculated using Eqs. (2) and (3).
4. Onlooker bees select food sources depending on $p_{i}$ values. Then they produce new solutions with Eq. (12), and they update the solutions using the greedy selection approach.
5. Abandoned solutions are identified and replaced with new randomly generated solutions by the scout bee.
6. Stages 2 to 6 are repeated until the maximum number of cycles is reached (Criterion 1).
7. The VP set is reconstructed according to the optimal solution. VPs that are not selected are reinitialized using the method described in stage 1 (Fig. 6). Then all current solutions are reevaluated by Eq. (9).
8. Stages 2 to 8 are repeated until the maximum number of cycles is reached (Criterion 2).
9. VPs selected by the optimal solution are re-estimated by Eq. (13) based on the corresponding line clusters.

Fig. 3

Dynamic clustering artificial bee colony (DCABC) algorithm flowchart.

Fig. 4

Subflowchart of initialization for the DCABC algorithm.

Fig. 5

Subflowchart of the searching procedure for an employed (or onlooker) bee.

Fig. 6

Subflowchart for reconstructing vanishing point (VP) set.

In Stage 1, certain parameters have to be preset for the DCABC algorithm. Generally, onlooker bees form 50% of the colony, employed bees form 50%, and the number of scout bees is, at most, one for each cycle. Here, the number of onlooker bees is selected to be equal to the number of employed bees so that the algorithm has fewer controlling parameters. According to previous research,²¹ as the colony size increases, the algorithm produces better results. However, after a sufficient colony size is achieved, any further size increase does not improve the algorithm performance significantly. Here, the colony size is empirically set to 30 at Stage 1. The control parameter “limit” determines the “scout bee” production (see Sec. 2.2). The “limit” value is inversely proportional to the scout production frequency; an increase in the number of scouts encourages exploration, whereas increasing the onlookers on one food source encourages its exploitation. A “limit” value of 60 is considered appropriate in the present study, representing a quarter of the product of the colony size with the dimensions of the problem. Criteria 1 and 2 determine the termination conditions for the two cycles, where both are based on the maximum number of iterations, and two variables ( $t_{1}$ , $t_{2}$ ) serve as the counters. Here, the upper limits of $t_{1}$ and $t_{2}$ are empirically set to 3 and 35, respectively.

5. Experiments

Experiments were conducted on the York Urban database provided by Denis et al.²² This database holds 102 indoor or outdoor images in man-made environments and also provides the camera intrinsic parameters and the VPs computed with hand-detected segments. For each image in the York Urban database, we compiled two distinct line sets. The first line set (Lineset1) was compiled from the database holding the labeled ground-truth lines with few outliers. The second line set (Lineset2) was extracted from the image by a LSD,²³ and includes many lines corresponding to non-Manhattan directions. For example, Fig. 7(a) shows an image stored in the York Urban database, and Figs. 7(b) and 7(c) show the Lineset1 and Lineset2 extracted from this image, respectively. Both line sets of each image were processed by the following algorithms. All experiments were implemented in MATLAB®, running on a Core i7 920 Intel CPU.

Fig. 7

Example of a line set: (a) image from the York Urban database, (b) Lineset1 of the database image, and (c) Lineset2 of the image detected by the line segment detector algorithm.

DCABC: DCABC is used for VP detection.

GS: GS, proposed by Barnard (1983),³ assumes a Gaussian sphere as the accumulator space.

RANSAC: the algorithm proposed by Pflugfelder (2008),¹¹ using RANSAC algorithm to cluster lines.

JL: The JL algorithm, proposed by Tardif (2009),⁷ clusters lines by a J-Linkage algorithm.

EM: The EM algorithm is a more recent line clustering algorithm proposed by Nieto (2011).¹³

The algorithms were evaluated by two performance indicators. The first indicator is the accuracy of the estimated focal length using the VPs. As is well known, two orthogonal VPs $v$ and $v_{⊥}$ satisfy

Eq. (14)

v^{T} ω v_{⊥} = 0,

where

ω

is the image of the absolute conic given by

K^{- T} K^{- 1}

, and

K

is the matrix of intrinsic camera parameters provided by the York Urban database.

From the estimated set of VPs, we selected a triplet of VPs ( $v_{o 1}$ , $v_{o 2}$ , $v_{o 3}$ ) that minimized the sum of squares of the constraint: ${(v_{o 1}^{T} ω v_{o 2})}^{2} + {(v_{o 1}^{T} ω v_{o 3})}^{2} + {(v_{o 2}^{T} ω v_{o 3})}^{2}$ . We then estimated the focal length and compared it with that provided in the York Urban database. Specially, each term of the above constraint was set to 0, and three focal lengths were separately solved by Eq. (14). The estimated focal lengths, calculated as the mean of the three focal lengths, are presented in Fig. 8. Figures 8(a) and 8(b) show the cumulative histograms of the focal length errors in Lineset1 and Lineset2, respectively, for each image in the database.

Fig. 8

Cumulative histograms of the focal length errors in the York Urban database images. A point $(x, y)$ represents the fraction $y$ of the images in the database with focal length error below $x$ . (a) Focal length error in Lineset1 and (b) focal length error in Lineset2.

In this test, the five evaluated algorithms demonstrated comparable accuracy performance, although DCABC was slightly more accurate than its four competitors [Fig. 8(a)]. The focal length error of DCABC was less than 78 pixels for 90 out of 102 images in the database, and below 150 pixels for all images.

The accuracies of all algorithms, but especially that of EM, decreased when processing Lineset2. Overall, the results were similar to those of the ground-truth lines (Lineset1). Again, DCABC performed slightly better than the other algorithms [see Fig. 8(b)].

The second performance indicator is the computational time of detecting the VP. In this evaluation, we measured the time spent on line clustering and VP estimation, and excluded the line segment detection and selection of the three orthogonal VPs.

Figure 9 presents the minimal, average, and maximum computational times required to process both line sets. We observe that EM and DCABC are the fastest and second-fastest of the tested algorithms, respectively. The JL and GS algorithms calculate $O (n^{2})$ intersections, which is time-intensive when $n$ is large. However, in DCABC and RANSAC, only a small fraction of the intersections are calculated from a few randomly chosen lines. In the EM algorithm, lines are directly clustered with no intersection calculations. Consequently, the EM and DCABC algorithms are time-economical. The mean computation time of DCABC was only 0.023 s for Lineset1 [Fig. 9(a)] and 0.058 s for Lineset2 [Fig. 9(b)].

Fig. 9

Computational time of detecting the VPs in the York Urban database images. Algorithms were programmed in MATLAB®. (a) Time of processing Lineset1 and (b) time of processing Lineset2.

In summary, the five tested algorithms detected the VP with similar accuracy in the lower-noise case. The accuracy of all algorithms declined when some outliers were added. The decline was especially noticed in the EM algorithm. However, EM is the most efficient of the five algorithms, with DCABC a close second. Clearly, our proposed algorithm achieves a good balance between accuracy and efficiency.

6. Conclusions

Given a set of lines extracted from uncalibrated images, we proposed a means of directly detecting all possible nonorthogonal VPs in the image plane. Treating the task as a dynamic problem of clustering line segments, we partitioned the line clusters by our proposed DCABC algorithm. Initially, the line segments were clustered based on the similarity of their orientations. The clustering was then refined by DCABC using optimization technology in binary space. Finally, a VP was estimated in each of the optimized line clusters. In DCABC, the set of candidate VPs was encoded as a binary vector whose elements indicated the selection status of the corresponding VP. To simplify the new solution construction and accelerate the algorithm convergence, we computed the Hamming distance between two binary vectors and adopted the number of segments in each cluster. Our approach provides a promising bioinspired solution for practical VP detection.

Acknowledgments

This work was supported partly by the National Natural Science Foundation of China (Nos. 61401195 and 61263029), the Natural Science Foundation of the Jiangsu Higher Education Institutions of China (No. 13KJB520009), and the Key Project of the Young Foundation of Nanjing Institute of Technology, China (Nos. QKJA201204 and QKJA201305).

References

1.

A. Minagawa et al., “Line clustering with vanishing point and vanishing line,” in Proc. Int. Conf. on 1999 Image Analysis and Processing, (1999). Google Scholar

2.

X. Ding et al., “Stereo depth estimation under different camera calibration and alignment errors,” Appl. Opt., 50 (10), 1289 –1301 (2011). http://dx.doi.org/10.1364/AO.50.001289 APOPAI 0003-6935 Google Scholar

3.

S. T. Barnard, “Interpreting perspective images,” Artif. Intell., 21 (4), 435 –462 (1983). http://dx.doi.org/10.1016/S0004-3702(83)80021-6 AINTBB 0004-3702 Google Scholar

4.

K. S. Seo, J. H. Lee and H. M. Choi, “An efficient detection of vanishing points using inverted coordinates image space,” Pattern Recognit. Lett., 27 (2), 102 –108 (2006). http://dx.doi.org/10.1016/j.patrec.2005.07.011 PRLEDG 0167-8655 Google Scholar

5.

J. Košecká and W. Zhang, “Video compass,” in Computer Vision—ECCV 2002, 476 –490 (2002). Google Scholar

6.

F. A. Andaló, G. Taubin and S. Goldenstein, “Vanishing point detection by segment clustering on the projective space,” Trends and Topics in Computer Vision, 6554 324 –337 Springer, Berlin Heidelberg (2012). Google Scholar

7.

J. P. Tardif, “Non-iterative approach for fast and accurate vanishing point detection,” in IEEE 12th Int. Conf. on 2009 Computer Vision, (2009). Google Scholar

8.

B. Li et al., “Vanishing point detection using cascaded 1D Hough transform from single images,” Pattern Recognit. Lett., 33 (1), 1 –8 (2012). http://dx.doi.org/10.1016/j.patrec.2011.09.027 PRLEDG 0167-8655 Google Scholar

9.

A. Almansa, A. Desolneux and S. Vamech, “Vanishing point detection without any a priori information,” IEEE Trans. Pattern Anal. Mach. Intell., 25 (4), 502 –507 (2003). http://dx.doi.org/10.1109/TPAMI.2003.1190575 Google Scholar

10.

C. Rother, “A new approach to vanishing point detection in architectural environments,” Image Vision Comput., 20 (9), 647 –655 (2002). http://dx.doi.org/10.1016/S0262-8856(02)00054-9 IVCODK 0262-8856 Google Scholar

11.

R. Pflugfelder, “Self-calibrating cameras in video surveillance,” University of Reading, UK, (2008). Google Scholar

12.

M. Kalantari, F. Jung and J. Guedon, “Precise, automatic and fast method for vanishing point detection,” Photogramm. Rec., 24 (127), 246 –263 (2009). http://dx.doi.org/10.1111/j.1477-9730.2009.00542.x Google Scholar

13.

M. Nieto and L. Salgado, “Simultaneous estimation of vanishing points and their converging lines using the EM algorithm,” Pattern Recognit. Lett., 32 (14), 1691 –1700 (2011). http://dx.doi.org/10.1016/j.patrec.2011.07.018 PRLEDG 0167-8655 Google Scholar

14.

D. Karaboga, “An idea based on honey bee swarm for numerical optimization,” (2005). Google Scholar

15.

B. Akay and D. Karaboga, “Solving integer programming problems by using artificial bee colony algorithm,” AI(ASTERISK)IA 2009: Emergent Perspectives in Artificial Intelligence, 355 –364 Springer, Berlin, Heidelberg (2009). Google Scholar

16.

M. H. Kashan, N. Nahavandi and A. H. Kashan, “DisABC: a new artificial bee colony algorithm for binary optimization,” Appl. Soft Comput., 12 (1), 342 –352 (2012). http://dx.doi.org/10.1016/j.asoc.2011.08.038 Google Scholar

17.

D. Karaboga and C. Ozturk, “A novel clustering approach: Artificial Bee Colony (ABC) algorithm,” Appl. Soft Comput., 11 (1), 652 –657 (2011). http://dx.doi.org/10.1016/j.asoc.2009.12.025 Google Scholar

18.

L. Li et al., “A discrete artificial bee colony algorithm for TSP problem,” Bio-Inspired Computing and Applications, 6840 566 –573 Springer, Berlin, Heidelberg (2012). Google Scholar

19.

N. Karaboga and M. B. Cetinkaya, “A novel and efficient algorithm for adaptive filtering: artificial bee colony algorithm,” Turk. J. Electr. Eng. Comput. Sci., 19 (1), 175 –190 (2011). http://dx.doi.org/10.3906/elk-0912-344 Google Scholar

20.

M. Nieto and L. Salgado, “Non-linear optimization for robust estimation of vanishing points,” in 17th IEEE Int. Conf. on 2010 Image Processing (ICIP), (2010). Google Scholar

21.

D. Karaboga and B. Basturk, “On the performance of artificial bee colony (ABC) algorithm,” Appl. Soft Comput., 8 (1), 687 –697 (2008). Google Scholar

22.

P. Denis, J. H. Elder and F. J. Estrada, “Efficient edge-based methods for estimating Manhattan frames in urban imagery,” in European Conf. on Computer Vision, 197 –210 (2008). Google Scholar

23.

R. G. Von Gioi et al., “LSD: a fast line segment detector with a false detection control,” IEEE Trans. Pattern Anal. Mach. Intell., 32 (4), 722 –732 (2010). http://dx.doi.org/10.1109/TPAMI.2008.300 ITPIDJ 0162-8828 Google Scholar

Biography

Lei Han is a lecturer at the School of Computer Engineering, Nanjing Institute of Technology. He received his BE and ME degrees in computer science and technology from China University of Mining and Technology in 2004 and 2007, respectively. Currently, he is pursuing his PhD at Hohai University, Nanjing, China. His current research interests include image processing, computer vision, and visual measurement.

Chenrong Huang is a professor at the School of Computer Engineering, Nanjing Institute of Technology. She received her PhD from Nanjing University of Science and Technology in 2005. Her research interests cover image processing, virtual reality, and computer vision.

Tanghuai Fan is a professor at the School of Information Engineering, Nanchang Institute of Technology. He received his PhD from Hohai University in 2010. His research interests cover image processing, telemetry and telecontrol system.

Shengnan Zheng received her BS degree in Electronic Information Science and Technology from Nanjing Institute of Technology, in 2006, and MS degree in digital image processing from HoHai University, Nanjing, China, in 2010. From 2010, she held a laboratory technician position in Nanjing Institute of Technology. She is currently pursuing the PhD in hydro informatics from HoHai University. Her current research interests include image processing, image segmentation and analysis, object detection and recognition.

CC BY: © The Authors. Published by SPIE under a Creative Commons Attribution 4.0 Unported License. Distribution or reproduction of this work in whole or in part requires full attribution of the original publication, including its DOI.

Citation Download Citation

Lei Han, Tanghuai Fan, Shengnan Zheng, and Chenrong Huang "Vanishing point detection based on an artificial bee colony algorithm," Journal of Electronic Imaging 24(3), 033024 (23 June 2015). https://doi.org/10.1117/1.JEI.24.3.033024

Published: 23 June 2015

Access the abstract

JOURNAL ARTICLE
9 PAGES

DOWNLOAD PAPER SAVE TO MY LIBRARY

GET CITATION

CITATIONS

Cited by 2 scholarly publications.

Explore citations on Lens.org

KEYWORDS

Image segmentation

Virtual point source

Detection and tracking algorithms

Binary data

Expectation maximization algorithms

Databases

Image processing algorithms and systems

1.

Introduction

2.

Related Works

2.1.

Vanishing Point Detection

2.2.

Artificial Bee Colony

Eq. (1)

Eq. (2)

Eq. (3)

Eq. (4)

3.

Problem Statement

3.1.

Modeling the Problem of Vanishing Point Detection Using Artificial Bee Colony

Fig. 1

3.2.

A Priori Information of Line Segments

Fig. 2

3.3.

Preliminary Processing of the Initial Cluster

3.3.1.

Initialize the line clusters

3.3.2.

Initializing the solution vectors

4.

Artificial Bee Colony Algorithm for Vanishing Point Detection

4.1.

Notation

4.2.

Partition Strategy and Validity Index

4.2.1.

Distance function

Eq. (5)

Eq. (6)

Eq. (7)

4.2.2.

Line classification and evaluation

Eq. (8)

Eq. (9)

4.3.

Generating a New Partition

Eq. (10)

Eq. (11)

Eq. (12)

4.4.

Estimating a Vanishing Point for Segment Clustering

Eq. (13)

4.5.

Description of the Dynamic Clustering Artificial Bee Colony Algorithm

Fig. 3

Fig. 4

Fig. 5

Fig. 6

5.

Experiments

Fig. 7

Eq. (14)

Fig. 8

Fig. 9

6.

Conclusions

Acknowledgments

References

Biography

Show All Keywords

Keywords/Phrases

Search In:

Publication Years