【佳学基因人工智能】RNA测序数据的信息分析——基因解码信息源的准备
【佳學(xué)基因人工智能】RNA測序數(shù)據(jù)的信息分析——基因解碼信息源的準(zhǔn)備
人的基因信息解碼策略
人的基因信息解碼有兩種策略,一是數(shù)據(jù)庫比對策略,二是基因解碼策略。數(shù)據(jù)庫比對策略只能用數(shù)據(jù)庫中記錄過的案例。由于人的特殊性,常規(guī)的基因檢測無法發(fā)現(xiàn)臨床中遇到的病人,所以查找基因病的致病原因,基因解碼策略要優(yōu)與數(shù)據(jù)庫比對的基因檢測策略。RNAseq是基因解碼策略中獲取原始信息并進(jìn)行后續(xù)的重要一步。
分析工具的選擇
佳學(xué)基因出與培訓(xùn)基因檢測公司的目的,將使用小鼠參考基因組的一小部分(染色體1)來演示如何使用R進(jìn)行高通量測序數(shù)據(jù)的比對和計數(shù)。將測序數(shù)據(jù)映射到基因組是一項非常重要的任務(wù),并且有許多不同的比對工具可用,例如bowtie,topHat、STAR和Rsubread。根據(jù)佳學(xué)基因基因信息分析中心的實際測定,Rsubread是唯一可以在R中運(yùn)行的基因信息比對分析工具。大多數(shù)對齊工具都是在linux環(huán)境中運(yùn)行的,并且計算量非常大。大多數(shù)比對任務(wù)需要比普通筆記本電腦更大的計算機(jī),因此通常在類似linux的環(huán)境中的服務(wù)器上完成原始數(shù)據(jù)的讀取和比對。在這里,佳學(xué)基因的生物信息培訓(xùn)學(xué)員只將從智能分析老師準(zhǔn)備的小鼠泌乳數(shù)據(jù)集中的每個樣本抽取1000個數(shù)據(jù),學(xué)習(xí)過程將比對1號染色體。因為佳學(xué)基因的主要目的是為了讓基因檢測機(jī)構(gòu)的技術(shù)人員可以嘗試使用RStudio筆記本電腦方便的進(jìn)行數(shù)據(jù)分析。
軟件包的安裝:
不能從R中直接安裝:會出現(xiàn)下面的結(jié)果:
install.packages(“Rsubread”)
WARNING: Rtools is required to build R packages but is not currently installed. Please download and install the appropriate version of Rtools before proceeding:
https://cran.rstudio.com/bin/windows/Rtools/
將程序包安裝入‘C:/Users/yunli/Documents/R/win-library/4.1’
(因為‘lib’沒有被指定)
Warning in install.packages :
package ‘Rsubread’ is not available for this version of R
A version of this package for your version of R might be available elsewhere,
see the ideas at
https://cran.r-project.org/doc/manuals/r-patched/R-admin.html#Installing-packages
我們對Markdown
而應(yīng)當(dāng)運(yùn)行如下代碼:
if (!requireNamespace("BiocManager", quietly = TRUE))install.packages("BiocManager")BiocManager::install("Rsubread")出現(xiàn)如下信息:
https://cran.rstudio.com/bin/windows/Rtools/
將程序包安裝入‘C:/Users/yunli/Documents/R/win-library/4.1’
(因為‘lib’沒有被指定)
trying URL ‘https://cran.rstudio.com/bin/windows/contrib/4.1/BiocManager_1.30.16.zip’
Content type ‘a(chǎn)pplication/zip’ length 328795 bytes (321 KB)
downloaded 321 KB
package ‘BiocManager’ successfully unpacked and MD5 sums checked
The downloaded binary packages are in
C:\Users\yunli\AppData\Local\Temp\Rtmpuaqk8c\downloaded_packages
要求更新部分功能
The downloaded binary packages are in
C:\Users\yunli\AppData\Local\Temp\Rtmpuaqk8c\downloaded_packages
Installation paths not writeable, unable to update packages
path: C:/Program Files/R/R-4.1.2/library
packages:
class, foreign, MASS, Matrix, nlme, nnet, spatial
Old packages: ‘broom’, ‘DBI’, ‘fansi’, ‘openssl’
Update all/some/none? [a/s/n]:
更新:鍵盤上敲入:a
有二進(jìn)制版本的,但源代碼版本是后來的:
binary source needs_compilation
fansi 0.5.0 1.0.0 TRUE
Binaries will be installed
trying URL ‘https://cran.rstudio.com/bin/windows/contrib/4.1/broom_0.7.11.zip’
Content type ‘a(chǎn)pplication/zip’ length 1814717 bytes (1.7 MB)
downloaded 1.7 MB
trying URL ‘https://cran.rstudio.com/bin/windows/contrib/4.1/DBI_1.1.2.zip’
Content type ‘a(chǎn)pplication/zip’ length 741837 bytes (724 KB)
downloaded 724 KB
trying URL ‘https://cran.rstudio.com/bin/windows/contrib/4.1/fansi_0.5.0.zip’
Content type ‘a(chǎn)pplication/zip’ length 248710 bytes (242 KB)
downloaded 242 KB
trying URL ‘https://cran.rstudio.com/bin/windows/contrib/4.1/openssl_1.4.6.zip’
Content type ‘a(chǎn)pplication/zip’ length 3987697 bytes (3.8 MB)
downloaded 3.8 MB
package ‘broom’ successfully unpacked and MD5 sums checked
package ‘DBI’ successfully unpacked and MD5 sums checked
package ‘fansi’ successfully unpacked and MD5 sums checked
package ‘openssl’ successfully unpacked and MD5 sums checked
The downloaded binary packages are in
C:\Users\yunli\AppData\Local\Temp\Rtmpuaqk8c\downloaded_packages
軟件包安裝成功
檢查是否可以正常調(diào)用Rsubread
library(Rsubread)總結(jié)
以上是生活随笔為你收集整理的【佳学基因人工智能】RNA测序数据的信息分析——基因解码信息源的准备的全部內(nèi)容,希望文章能夠幫你解決所遇到的問題。
- 上一篇: perp系列之七:perp手册
- 下一篇: python点阵显示数字_Python实