portrait

Experimental Lab   

Projects   

Publication   

Software/Demo   

Let data blog   

CV   

Scholar profile   

 

 

 


 

Truyen Tran (PhD)

 


PraDa, School of Information Technology
Deakin University
Locked Bag 20000
Geelong VIC 3220

truyen.tran {_at_} deakin.edu.au
IMPCA, Department of Computing
Curtin University of Technology
Kent St, Bentley, WA 6102, Australia

News:


Research interests

  • Statistical Machine Learning
  • Probabilistic Graphical Models
  • Medical Data Analysis
  • Others: Recommender Systems, Multimedia, Natural Language Processing, Computer Vision,  Bioinformatics, Social Media 
               

Collaborators

 

Research projects

Software/Demo

  • CRF-SL: a generic implementation of CRFs for sequential labelling. It supports any data types but requires the raw features extracted from data as the input.
  • viAccent: A Perl-based Accent restoration service for Vietnamese. Use CRFs.
  • viCat: A simple Vietnamese text classifier.

 

 

 


Research Projects

Medical Data Analysis

Modern hospitals and medical centres have collected huge amount of clinical data for hundreds of millions of patients over the past decades. However, how to make the best out of the data for improving clinincal services still remains the major question. This research aims at characterising the data using probabilistic models and applying the state-of-the-art machine learning techniques for representation, clustering and prediction both at the patient and the cohort levels.

Contributions:

  • Patient profiling: [Published in PAKDD'13].

  • Suicide risk prediction. [Under review].

Representation learning: Modelling complex and mixed-data types

Raw data may lie on hidden manifolds and contain noise and thus it may not be appropriate for tasks at hand. The goal of representation learning is to discover latent factors in the data which are invariant to small changes and insentitive of noise. These factors then can be fed into standard machine learning techniques. The hope is that the learning curve will be much easier (e.g., better linearity and pre-conditioning) and the final performance will be improved (e.g., due to noise reduction and invariance promotion).

Real-world data are heterogeneous: They come from multiple domains, sources, and are represented in different ways. Put in other ways, they are full of data types: real-valued, binary, ordinal, multiple label, label ranking, preference graph and matrix-variates. Fusion of these types to make informed decision is inherently important. 

In machine learning, data types are often treated separately. In statistics, traditional mixed type handling is often inadequate and does not scale well. Here we explore scalable methods to learn from and infer with multiple data types.

Contributions:

  • Modelling ordinal matrices: these are popular in multiuser ratings of common items, such as those in collaborative filtering. [Published in: UAI'09, AAAI'12, ACML'12]

  • Probabilistic models of ordered partitions: we address ranking of subsets instead of individuals, where the subsets are themselves unknown in advance. This leads to an explosion in state-space. Here we introduce high-order Markov chain over partitions and MCMC methods for learning from ordered partitionings. [Published in SDM'11, ACML'12]

  • Modelling mixed-data types using RBM: intergrating multiple data types can be a non-trivial task for many real-world problems. Here we offer an unified way for converting mixed-data types into the numerical vectorial spaces. [Publised in ACML'11]

  • Thurstonian Boltzmann Machines: A multivariate utility model for mixed-data types. For better intepretation, here we model the process of data generation for multiple types using random utilities, and at the same time enable information fusion and discovery of latent data aspects. [Published in FUSION'12, ICML'13].

  • Learning probabilistic metrics on learned representation. [Published in ICME'12, ICME'13]

Statistical Relational Learning

Much of real-world data is relational. While this offers stronger statistical properties through compactness, it also proves to be very challenging due to the complextiy and scales. Problems of interest include: feature aggregation, concept/predicate invention, clustering and latent structures discovery, collective classification/regression, exploiting symmetries, large-scale learning and inference.

Contributions:

  • Entities modelling from dyadic relations: [Published in AusDM'07].

  • Discovering latent profiles from dyadic relations. [Published in UAI'09, ACML'12].

Recommender systems

The goal is to deliver right services to right users. Most existing work is rather ad-hoc and ignores complex nature of the data. Research topics include discovering hidden patterns, incorporating contexts and side-information, social networks, multiple-domains, product hierarchies, as well as correlations between actors and items.

Contributions:

  • Preference networks: integrating user profile, item content and ratings into a single probabilistic database using Relational Markov Network. Supports rating prediction, item-ranking (given an user) and user-ranking (given an item). [Published in AusDM'07].

  • Ordinal Boltzmann machines: treating rating matrix as a whole unit and discovering hidden aspects of the data. Also includes treatment of ordinal nature of the data, and Preference networks as a subcomponent. [Published in UAI'09, ACML'12].

  • Sequential ordinal matrix factorisation: investigating the generative mechanism of the rating matrix. We adopt the notion that an ordinal output is a result of a sequential process: Given that an item has a utility with respect to an user, we start from the lowest level, and stop at the optimal level where the utility falls below its threshold. The utility is a combination of different aspects: the general value of the item, the general easiness of the user, the compatibility between the user and the item, and the relations between the item and other items. [Published in AAAI'12].

  • Collaborative ranking: addressing the mismatch bewteen the data representation (ordinal ratings) and the recommendation goal (rank list of items or list of subsets), we embrace the notion of learning to rank collaboratively, borrowing ideas from learning to rank in information retrieval and mixed-effect models in statistical fields. [Published in SDM'11, AMCL'12].

  • Learning preferences over sets. moving beyond the current paradigm on preference over items, here we explore the space of sets directly. [Published in SDM'11, AMCL'12].

Discriminative sequential models

This is indeed very rich type of data which we encounter everyday. Issues include feature discovery, segmentation, permutation, collocation and n-grams. One the the extreme is the problem of statistical machine translation, where the output space is theoretically infinite (the target language space).

  • Learning local and global models with partial labelling schemes [PRICAI'08].

  • Feature selection with exponential loss under partial labelling scheme [ISSNIP'05].

  • Learning a highly constrained sequential model.  [PRICAI'08]

  • Nested sequential models. [Publication: NIPS'08, DLSRRA'09]

 

Fast learning in intractable graphical models

Learning in generic graphical models is needed for both network structure and parameter. However, structure learning is extremely difficult because the structure space is exponentially large. Even when the structure is known, parameter learning is also very challenging since quantities needed for learning are often intractable to estimate. For parameter learning, fast methods like pseudo-likelihood are sub-optimal with finite data and often over-estimate the interaction between variables. Message-passing methods can be used for the inner inference loops but their poor convergence guarantee may stop learning progress too early.

The goal of this project is to investigate options for fast and effective learning.

Contributions:

  • AdaBoost.MRF - learning to select and weight trees from a tree ensemble: not only this supports parameter learning, it also discovers strong interactions among variables, and thus enables structure learning. [CVPR'06]

Activity recognition

Activity recognition is important in assisting tasks or surveillance. Several problems need to be addressed: First,  sensory information is often unreliable and there must be a way to fuse many sensor readings to make informed decision. Second, activity can be quite complex, consisting of many sub-activities. And third, training labels are often sparse since it would be incovenient for users to provide labels frequently.

Contributions:

  • Selection of sensory inputs for structured activities under missing labels. we offer a fast beam search to select very small number of sensor inputs even when the labels are partially missing.  [Published in ISSNIP'05].

  • Conditional models for daily routine labelling under sparse labels: we investigate conditional models (Maximum Entropy Markov Models and Conditional Random Fields) when labels are sparse. [Published in PRICAI'08].

  • Joint modelling of multilevel activity abstraction: often activities are performed to achieve some more abstract goals. Here we propose a general framework to model multiple levels of activity abstraction. [Published in CVPR'06].

  • Hierarchical nested activities: when activities are nested in time (higher level of activity is declared completed only if their child activities have been terminated), learning and inference can be performed in polynomial time. [Published in NIPS'08].

Lexical disambiguation

Contributions:

  • Accent restoration in Vietnamese: Given an accentless sentence, we need to restore the lost accents (or tonal marks) [Publication; PRICAI'08] [Demo].


Publications

2013

  • Thurstonian Boltzmann machines: Learning from multiple inequalities, Truyen Tran, D. Phung, and S. Venkatesh, In Proc. of 30th International Conference in Machine Learning (ICML’13), Atlanta, USA, June, 2013.
  • Latent patient profile modelling and applications with Mixed-Variate Restricted Boltzmann Machine, Tu D. Nguyen, Truyen Tran, D. Phung, and S. Venkatesh,  In Proc. of 17th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD’13), Gold Coast, Australia, April 2013.
  • Learning sparse latent representation and distance metric for image retrieval, Tu D. Nguyen, Truyen Tran, D. Phung, and S. Venkatesh, In Proc. of IEEE International Conference on Multimedia and Expo (ICME), San Jose, California, USA, July 2013.

2012

2011

2010

2009

2005-2008

  • Boosted Markov networks for activity recognition, Truyen Tran, Hung Hai Bui and Svetha Venkatesh, In Proc. of International Conference on Intelligent Sensors, Sensor Networks and Information Processing (ISSNIP2005), 5-8 Dec, Melbourne, Australia.

Working papers

Tutorials and Notes

PhD thesis