Big Microbiome Data Analysis
發布時間 🏓:2017-05-03  閱讀次數 :3988

報告題目:Big Microbiome Data Analysis

報  告 人:Prof. Xiaohua Tony Hu, College of Computing and Informatics, Drexel University

報告時間:5月15日 上午10:00

報告地點:閔行校區生物藥學樓2-116

聯  系 人:魏冬青 34204573, dqwei@sjtu.edu.cn
徐   沁 34204573, xuqin523@sjtu.edu.cn
 

摘要:We know little about the microbial world. Microbiome sequencing (i.e. metagenome, 16s rRNA) extracts DNA directly from a microbial environment without culturing any species. Recently, huge amount of data are generated from many micorbiome projects such as Human Microbiome Project (HMP), Metagenomics of the Human Intestinal Tract (MetaHIT), et al. Analyzing these data will help us to better understand the function and structure of microbial community of human body, earth and other environmental eco-systems. However, the huge data volume, the complexity of microbial community and the intricate data properties have created a lot of opportunities and challenges for data analysis and mining.  For example, it is estimate that in the microbial eco-system of human gut, there are about 1000 kinds of bacteria with 10 billion bacteria and more than 4 million genes in more than 6000 orthologous gene family. The challenges are due to the complex properties of microbiome: large-scale, complicated, diversity, correlation, composition, hierarchy, incompleteness etc. Current microbiomes data analysis methods seldom consider these data properties and often make some assumptions such as linear, Euclidean space, metric-space, continue data type, which conflict with the true data properties. For example, some similarities are non-metric because the prevalent existence of some species; and the interactions among species and environment are complex in high order. Thus it is urgent to develop novel computational methods to overcome these assumptions and consider the microbiome data properties in the analysis procedure. In this talk, we will discuss some computational methods to analyze and visualize microbiome big data. Our studies are focusing on 1) novel machine learning and computational technologies for dimension reduction and visualization of microbiome data based on non-Euclidean spaces (manifold learning) to discover nonlinear intrinsic features and patterns in these data to overcome the linear assumptions, 2) novel statistical methods for variable selection in microbiome data by integrating group information among variables.

EON体育4平台专业提供:EON体育4平台EON体育4EON体育4登录等服务,提供最新官网平台、地址、注册、登陆、登录、入口、全站、网站、网页、网址、娱乐、手机版、app、下载、欧洲杯、欧冠、nba、世界杯、英超等,界面美观优质完美,安全稳定,服务一流🖖,EON体育4平台欢迎您。 EON体育4平台官網xml地圖