|
|
|
Truyen Tran (PhD)
News:
Research interests Collaborators Prof. Svetha Venkatesh
(Deakin) A.Prof. Dinh Q. Phung
(Deakin) Dr. Hung H. Bui (Nuance NLU Lab) Dr. Son D. Pham
(Curtin)
|
||||||||||||||||
|
|
Research Projects
Modern hospitals and medical centres have
collected huge amount of clinical data for hundreds of millions of
patients over the past decades. However, how to make the best out of
the data for improving clinincal services still remains the major
question. This research aims at characterising the data using
probabilistic models and applying the state-of-the-art machine learning
techniques for representation, clustering and prediction both at the
patient and the cohort levels. Contributions: Patient
profiling: [Published in PAKDD'13]. Suicide
risk prediction. [Under review]. Representation
learning: Modelling complex and mixed-data types Raw
data may lie on hidden manifolds and contain noise and thus it may not
be appropriate for tasks at hand. The goal of representation learning
is to discover latent factors in the data which are invariant to small
changes and insentitive of noise. These factors then can be fed into
standard machine learning techniques. The hope is that the learning
curve will be much easier (e.g., better linearity and
pre-conditioning) and the final performance will be improved
(e.g., due to noise reduction and invariance promotion). Real-world data are heterogeneous: They come from
multiple domains, sources, and are represented in different ways. Put
in other ways, they are full of data types: real-valued, binary, ordinal, multiple label, label
ranking, preference graph and matrix-variates. Fusion
of these types to make informed decision is inherently
important. In machine learning, data types are often treated
separately. In statistics, traditional mixed type handling is often
inadequate and does not scale well. Here we explore scalable
methods to learn from and infer with multiple data types. Contributions: Modelling
ordinal matrices: these are popular
in multiuser ratings of common items, such as those in collaborative
filtering. [Published in: UAI'09, AAAI'12,
ACML'12]
Probabilistic
models of ordered partitions: we address
ranking of subsets instead of individuals, where the subsets are
themselves unknown in advance. This leads to an explosion
in state-space. Here we introduce high-order Markov chain over
partitions and MCMC methods for learning from ordered
partitionings. [Published in SDM'11,
ACML'12] Modelling
mixed-data types using RBM: intergrating
multiple data types can be a non-trivial task for
many real-world problems. Here we offer an unified way for
converting mixed-data types into
the numerical vectorial spaces. [Publised in
ACML'11] Thurstonian
Boltzmann Machines: A multivariate utility model for mixed-data types.
For better intepretation, here we model the process of data generation
for multiple types using random utilities, and at the same time
enable information fusion and discovery of latent data
aspects. [Published in FUSION'12,
ICML'13]. Learning probabilistic metrics on
learned representation. [Published in ICME'12, ICME'13] Statistical
Relational Learning Much
of real-world data is relational. While this offers stronger
statistical properties through compactness, it also proves to be very
challenging due to the complextiy and scales. Problems of interest
include: feature aggregation, concept/predicate invention, clustering
and latent structures discovery, collective classification/regression,
exploiting symmetries, large-scale learning and
inference. Contributions: The
goal is to deliver right services to right users. Most existing work is
rather ad-hoc and ignores complex nature of the
data. Research topics include discovering
hidden patterns, incorporating contexts and side-information, social
networks, multiple-domains, product hierarchies, as well as
correlations between actors and items. Contributions: Preference
networks: integrating user profile, item
content and ratings into a single probabilistic database using
Relational Markov Network. Supports rating prediction,
item-ranking (given an user) and user-ranking (given an item).
[Published in AusDM'07]. Ordinal
Boltzmann machines: treating rating matrix as
a whole unit and discovering hidden aspects of the data. Also includes
treatment of ordinal nature of the data, and Preference networks as a
subcomponent. [Published in UAI'09, ACML'12]. Sequential
ordinal matrix factorisation: investigating
the generative mechanism of the rating matrix. We adopt the notion that
an ordinal output is a result of a sequential process: Given that an
item has a utility with respect to an user, we start from the lowest
level, and stop at the optimal level where the utility falls below its
threshold. The utility is a combination of different aspects: the
general value of the item, the general easiness of the user, the
compatibility between the user and the item, and the relations between
the item and other items. [Published in AAAI'12]. Collaborative
ranking: addressing the mismatch bewteen the
data representation (ordinal ratings) and the recommendation goal (rank
list of items or list of subsets), we embrace the notion of
learning to rank collaboratively, borrowing ideas from learning to rank
in information retrieval and mixed-effect models in statistical fields.
[Published in SDM'11,
AMCL'12]. Learning preferences
over sets. moving beyond the current paradigm
on preference over items, here we explore the space of sets directly.
[Published in SDM'11,
AMCL'12]. Discriminative sequential models This is indeed
very rich type of data which we encounter everyday. Issues include
feature discovery, segmentation, permutation, collocation and n-grams.
One the the extreme is the problem of statistical machine translation,
where the output space is theoretically infinite (the target language
space). Fast
learning in intractable graphical models Learning in
generic graphical models is needed for both network structure and
parameter. However, structure learning is extremely difficult because
the structure space is exponentially large. Even when the structure is
known, parameter learning is also very challenging since quantities needed for learning are often
intractable to estimate. For parameter learning, fast methods like
pseudo-likelihood are sub-optimal with finite
data and often over-estimate the interaction between
variables. Message-passing methods can be used for the inner inference
loops but their poor convergence guarantee may stop learning progress
too early. The goal of this
project is to investigate options for fast and effective learning. Contributions: AdaBoost.MRF
- learning to select and weight trees from a tree ensemble:
not only this supports parameter learning, it also
discovers strong interactions among variables, and thus
enables structure learning. [CVPR'06] Activity
recognition is important in assisting tasks or
surveillance. Several problems need to be addressed:
First, sensory information is often unreliable and there must
be a way to fuse many sensor readings to make informed decision.
Second, activity can be quite complex, consisting of many
sub-activities. And third, training labels are often sparse since it
would be incovenient for users to provide labels frequently. Contributions: Selection
of sensory inputs for structured activities under missing
labels. we offer a fast beam search to select
very small number of sensor inputs even when the labels are partially
missing. [Published in ISSNIP'05]. Conditional
models for daily routine labelling under sparse labels:
we investigate conditional models (Maximum Entropy Markov Models and
Conditional Random Fields) when labels are sparse. [Published
in PRICAI'08]. Joint
modelling of multilevel activity abstraction:
often activities are performed to achieve some
more abstract goals. Here we propose a
general framework to model multiple levels
of activity abstraction. [Published in CVPR'06]. Hierarchical
nested activities: when activities are nested
in time (higher level of activity is declared completed only
if their child activities have been
terminated), learning and inference can be performed in
polynomial time. [Published in NIPS'08]. Contributions:
2013
2012
2011
2009
2005-2008
Working papers
PhD thesis
|
|||||||||||||||||