关键信息
关于内容
How do we infer which genes orchestrate various processes in the cell? How did humans migrate out of Africa and spread around the world? In this class, we will see that these two seemingly different questions can be addressed using similar algorithmic and machine learning techniques arising from the general problem of dividing data points into distinct clusters. In the first half of the course, we will introduce algorithms for clustering a group of objects into a collection of clusters based on their similarity, a classic problem in data science, and see how these algorithms can be applied to gene expression data. In the second half of the course, we will introduce another classic tool in data science called principal components analysis that can be used to preprocess multidimensional data before clustering in an effort to greatly reduce the number dimensions without losing much of the "signal" in the data. Finally, you will learn how to apply popular bioinformatics software tools to solve a real problem in clustering.
课程大纲
How Did Yeast Become a Wine Maker? (Clustering Algorithms)
- An Evolutionary History of Wine Making
- Identifying Genes Responsible for the Diauxic Shift
- Introduction to Clustering
- k-Means Clustering
- The Lloyd Algorithm
- Clustering Genes Implicated in the Diauxic Shift
- Limitations of k-Means Clustering
- From Coin Flipping to k-Means Clustering
- Making Soft Decisions in Coin Flipping
- Soft k-Means Clustering
- Hierarchical Clustering
- Epilogue: Clustering Tumor Samples
What Genetic Characteristics Do Human Populations Share? (Principal Components Analysis)
- Specific Content TBA
Bioinformatics Application Challenge: Clustering Biological Big Data (RNA-seq)
教师
Pavel Pevzner
Professor
Department of Computer Science and Engineering
Phillip Compeau
Visiting Researcher
Department of Computer Science & Engineering
内容设计师

加州大学圣地亚哥分校是位于加利福尼亚州圣地亚哥的一所公立赠地研究型大学。加州大学圣地亚哥分校成立于 1960 年,位于斯克里普斯海洋学研究所附近,是加州大学十个校区中最南端的一个,提供 200 多个本科和研究生学位课程,在校本科生 33,096 人,研究生 9,872 人。
加州大学圣地亚哥分校被认为是世界上最好的大学之一。多份出版物将加州大学圣地亚哥分校的生物科学系和计算机科学系评为世界前十名。
平台

Coursera是一家数字公司,提供由位于加利福尼亚州山景城的计算机教师Andrew Ng和达芙妮科勒斯坦福大学创建的大型开放式在线课程。
Coursera与顶尖大学和组织合作,在线提供一些课程,并提供许多科目的课程,包括:物理,工程,人文,医学,生物学,社会科学,数学,商业,计算机科学,数字营销,数据科学 和其他科目。