Note of Week 1
本项目是徐英泽创建的 生物信息学(2026春) 第一周课程笔记。
返回引导页
Bioinfo Introduction
1. 4 steps of Bioinformatics
- Question
- Information: Biological or Medical Data
- Analysis: Data Clean and Feature Extraction
- Modeling: Probabilistic Model & Computational Algorithm
2. Type of NGS Next Generation Sequencing)
- DNA-seq
- RNA-seq
- Epigenetics
- DNAase
- Methylation
- Histone modifications: ChIP-seq
- Interaction
- Protein-DNA: ChIP-seq
- Protein-RNA: CLIP-seq
- DNA-RNA: Grid-seq
3. Question/Hypothesis-driven Science & Big Data-driven Science
- Question/Hypothesis-driven Science: Question - Information - Analysis - Modeling
- Big Data-driven Science: Information - Analysis - Modeling - Question
- * Now Big Data-driven Science is the Fourth Paradigm*.
4. Model & Algorithm
||Algorithm|Modelm| |:—:|:—:|:—:| |定义|算法是一组明确的步骤或规则,用来解决问题或完成某个特定任务。|模型是算法在特定数据上学习得到的表示。在机器学习中,模型是通过算法从数据中学习得到的,它能够对新的数据进行预测或分类。模型通常包括参数和结构两部分,参数是模型在学习过程中调整的变量,结构则是模型的框架,定义了参数如何组合和相互作用。| |举例|动态规划算法|线性回归模型、神经网络模型|
总之算法是训练模型的方法和规则,模型是算法加上数据训练后的产物。
Getting Started
1. Document my work
- Github and Markdown
2. Backup my work
- Cloud storage
- Time Machine (Mac)
- Github and Github Desktop (what I use)
3. Text editor
- Vim
- Visual Studio Code
- Jupyter Notebook
- RStudio
4. Docker
- 获得Linux运行环境
- 封装软件,提高软件运行环境的可移植性