Research Statement
Overview
I am mostly interested in solving optimization problems using population-based
stochastic techniques such as Evolutionary Algorithms (EAs). The term of
“optimization” refers to the process of searching for the optimal solution to
the problem of interest from a potentially huge set of candidates, which has
found broad applications in manufacturing, engineering, finance, bioinformatics
and logistics.
Although optimization problems may come in a variety of forms, they can all be formulated as an objective function controlled by a set of parameters. The goal of optimization is to find the best valid parameter settings in order to achieve optimal function values (finding the global maxima/minima). In practice, it may be to accomplish a task in the most efficient way or in the highest quality or to produce maximum yields given limited resources.
My research experience in the above area covers a wide range
of topics at difference levels, including theoretical modeling of algorithm
dynamics, empirical studies of algorithms, experimental methodology and
real-world applications. I also have strong interest in combining the field of
global optimization with advanced techniques from other areas such as Machine
Learning and Statistics.
Postdoctoral Work
My Postdoctoral work was focused on the optimal robot routing problem in
wireless sensor networks. Generally speaking, given a set of sparsely
distributed sensors where it may be difficult for them to communicate with each
other directly, a mobile robot is employed to travel to each sensor to download
its data. The effective range of each sensor is typically specified by a disk
(2D) or a sphere (3D) and the robot must at least reach its boundary to
initiate the communication.
The primary objective of this routing problem is to design a path for the robot so that it can collect all the data and return to its base while the overall traveling distance is minimized. In its generic form, this problem can be regarded as a special case of a class of problems referred to as Traveling Salesman Problem with Neighborhoods (TSPN), which is an extension of the classical TSP. Since both TSP and TSPN are NP-hard, it is usually impractical to apply exact algorithms that can guarantee optimality, especially on large scale problems.
A new heuristic algorithm framework was proposed that can handle problems with hundreds of sensors and produce significantly improved results compared to current approximation algorithms. The first step is to design a heuristic that can effectively decompose each TSPN instance into a combinatorial optimization problem (the sequence of sensors to be visited) to be solved by an external TSP routine and a continuous optimization problem (the locations of hitting points). Next, the search space of the continuous problem is further reduced based on the spatial relationship among sensors. Finally, a competent optimization algorithm is applied to find the accurate solution to the continuous optimization problem, which is also the solution to the original TSPN instance.
Another project was on the data mining problem in the smart shopping centre, which is built on advanced techniques such as wireless networks and RFID. In this scenario, a smart trolley is equipped with wireless devices that can communicate with the smart shelves to record the path information of customers (represented by the sequence of sectors visited). Given a collection of such path records, a set of algorithms were proposed to conduct various interesting queries such as finding the most frequent common path with a fixed length and finding the longest common path with a given support.
PhD Work
My PhD work was concentrated on a key research question: How to conduct better
experimental research on EAs? Due to the challenges in the theoretical analysis
of EAs, empirical studies are still the primary methods to understand and
compare their performance. Although empirical studies of EAs are often seemingly
simple, the results can be significantly influenced by a variety of factors such
as the test problems and the parameter settings. In fact, designing experiments
with high scientific values is actually a much more complicated task than many
practitioners may have realized.
Landscape Generator
(web)
A continuous landscape (test problem) generator based on multivariate Gaussian components (LG-MVG) was proposed, which is capable of generating a large number of random problem instances with predefined structure in accordance with the specific objective of experimental studies. The major advantage of LG-MVG is that it can generate a variety of problems in terms of landscape structure and is parameterized by a flexible set of parameters, which have an intuitive influence on the structure of the problems generated. It can also overcome a number of well-known issues of many existing benchmark problems.
Case studies on a set of continuous Estimation of
Distribution Algorithms (EDAs) have shown that a much wider range of interesting
experiments can be conducted with LG-MVG than using traditional benchmark
problems. In the meantime, some of the important properties of these EDAs have
also been revealed that would otherwise be difficult to observe.
Statistical Racing Techniques
A major practical difficulty in conducting large scale experimental studies of EAs lies in the high computational cost incurred correspondingly. In order to ease this challenge, a statistical Racing technique originally proposed for solving model selection problems in Machine Learning was extended and adapted to the comparative studies of EAs for the purpose of reducing the computational cost while still maintaining high reliability of the results.
The typical task where Racing is mostly applicable is to find out the best performing algorithm from a set of candidate algorithms, which may include the same type of algorithm with different parameter settings or completely different algorithms. Instead of sequentially testing each algorithm on the test problem for a fixed number of trials, all algorithms are tested in parallel for a single trial at each step. For each algorithm, the results from all trials that have been conducted are maintained as the samples from its true performance distribution and used in some statistical test.
If there is enough evidence, as indicated by the result of the test, that one algorithm is significantly worse than another one, it will be removed from the experiment and undergo no further testing because it is very unlikely to be the best algorithm. By doing so, algorithms are forced to race against each other to compete for computational resources and only promising algorithms survive through to the next round of test. As a result, the overall cost can be significantly reduced by avoiding running unnecessary trials on inferior algorithms.
Mathematical Modeling of EDAs
A principled modeling technique was proposed to investigate
the detailed dynamics of a simple EDA based on Gaussian distributions, which is
one of the first attempts in this area. Unlike some existing work on the global
convergence behavior of EDAs, this modeling technique is aimed at describing the
specific behavior of the EDA at each generation under the influence of different
algorithm and problem factors. Case studies were conducted with different
algorithm parameter values on both unimodal and multimodal problems. It has been
shown that the proposed technique can not only accurately predict the dynamics
of the EDA from various aspects but also demonstrate some of its very
interesting behaviors that were not known from previous experimental studies.
MRI Magnet Design (Poster)
Applications of continuous EDAs on large scale real-world
problems have been relatively rare. In this part of work, two EDAs were compared
on the magnet design task in MRI systems, to highlight the advantage of
capturing dependences in solving practical problems. Another key contribution is
an efficient statistical technique for estimating the number of optima in the
search space, which could be potentially very useful for choosing the right
optimization algorithm. Other important issues such as parallel computation and
incorporating problem-specific knowledge were also discussed.
Other Work
In the past, I have conducted some research on evolving neural networks using
EAs. One example is to design feedforward neural networks whose structure and
weights were to be evolved simultaneously. A number of unusual neural networks
were found in the experiments, which were dramatically different from the
classical three-layer structure. Another example is to evolve a recurrent neural
network with complex dynamics to learn a simple context-free language where the
neural network was trained to count the occurrences of each language character.
Furthermore, I have participated various projects on image processing, fractal
image compression, pattern recognition and medical X-ray machine calibration.