Data File Formats-UCSC-GFF,PSL
2009-10-10 14:34
GFF3&GFF&GFF2PS
GFF3: http://song.sourceforge.net/gff3.shtml
GFF: http://www.sanger.ac.uk/Software/formats/GFF/
GFF2PS: http://genome.imim.es/software/gfftools/GFF2PS.html
DATA FORMAT
PSL format
PSL lines represent alignments, and are typically taken from files generated by BLAT or psLayout. See the BLAT documentation for more details. All of the following fields are required on each data line within a PSL file:
- matches: Number of bases that match that aren't repeats
- misMatches: Number of bases that don't match
- repMatches: Number of bases that match but are part of repeats
- nCount: Number of 'N' bases
- qNumInsert: Number of inserts in query
- qBaseInsert: Number of bases inserted in query
- tNumInsert: Number of inserts in target
- tBaseInsert: Number of bases inserted in target
- strand: '+' or '-' for query strand. For translated alignments, second '+' or '-' is for genomic strand
- qName: Query sequence name
- qSize: Query sequence size
- qStart: Alignment start position in query
- qEnd: Alignment end position in query
- tName: Target sequence name
- tSize: Target sequence size
- tStart: Alignment start position in target
- tEnd: Alignment end position in target
- blockCount: Number of blocks in the alignment (a block contains no gaps)
- blockSizes: Comma-separated list of sizes of each block
- qStarts: Comma-separated list of starting positions of each block in query
- tStarts: Comma-separated list of starting positions of each block in target

Example: Here is an example of an annotation track in PSL format. Note that line breaks have been inserted into the PSL lines in this example for documentation display purposes.
track name=fishBlats description="Fish BLAT" useScore=1
59 9 0 0 1 823 1 96 +- FS_CONTIG_48080_1 1955 171 1062 chr22
    47748585 13073589 13073753 2 48,20, 171,1042, 34674832,34674976,
59 7 0 0 1 55 1 55 +- FS_CONTIG_26780_1 2825 2456 2577 chr22
    47748585 13073626 13073747 2 21,45, 2456,2532, 34674838,34674914,
59 7 0 0 1 55 1 55 -+ FS_CONTIG_26780_1 2825 2455 2676 chr22
    47748585 13073727 13073848 2 45,21, 249,349, 13073727,13073827,

Be aware that the coordinates for a negative strand in a PSL line are handled in a special way. In the qStart and qEnd fields, the coordinates indicate the position where the query matches from the point of view of the forward strand, even when the match is on the reverse strand. However, in the qStarts list, the coordinates are reversed.

Example: Here is a 31-base query (positions 0 through 30) containing 2 blocks that align on the minus strand and 2 blocks that align on the plus strand (this can sometimes happen in response to assembly errors):

    0         1         2         3  tens position in query
    0123456789012345678901234567890  ones position in query
                ++++          +++++  plus strand alignment on query
        --------    ----------       minus strand alignment on query

Plus strand: qStart=12 qEnd=31 blockSizes=4,5 qStarts=12,26
Minus strand: qStart=4 qEnd=26 blockSizes=10,8 qStarts=5,19

Essentially, the minus strand blockSizes and qStarts are what you would get if you reverse-complemented the query. However, the qStart and qEnd are not reversed. To convert one to the other:

    qStart = qSize - revQEnd
    qEnd = qSize - revQStart
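The conversion above can be checked against the minus-strand numbers in the example. The following is only a sketch: the function name `minus_blocks_to_forward` is made up for illustration, the identity is applied block by block (an assumption that goes slightly beyond the two formulas given), ends are treated as exclusive, and qSize is taken to be 31 because the example query covers positions 0 through 30.

```python
def minus_blocks_to_forward(q_size, block_sizes, q_starts):
    """Map minus-strand blocks (given in reverse-complement coordinates)
    to forward-strand (start, end) intervals with exclusive ends.

    Hypothetical helper, not part of BLAT or any UCSC tool.
    """
    intervals = []
    for size, rev_start in zip(block_sizes, q_starts):
        rev_end = rev_start + size
        # Same identity as in the text, applied to a single block:
        #   start = qSize - revEnd, end = qSize - revStart
        intervals.append((q_size - rev_end, q_size - rev_start))
    return sorted(intervals)


# Minus-strand values from the example: blockSizes=10,8 qStarts=5,19
forward = minus_blocks_to_forward(31, [10, 8], [5, 19])
q_start = min(start for start, _ in forward)
q_end = max(end for _, end in forward)
print(forward, q_start, q_end)
```

With these inputs the recovered forward-strand span is 4 to 26, which agrees with the qStart and qEnd listed for the minus strand.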
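The 21 required PSL fields map naturally onto a small record type. Below is a minimal sketch of reading one data line into a dictionary. The helper name `parse_psl_line` and the choice of a plain dict are assumptions made for illustration, not part of any UCSC tool. It splits on any whitespace, which assumes sequence names contain no spaces, and it does not handle `track` lines or header rows.

```python
PSL_FIELDS = [
    "matches", "misMatches", "repMatches", "nCount",
    "qNumInsert", "qBaseInsert", "tNumInsert", "tBaseInsert",
    "strand", "qName", "qSize", "qStart", "qEnd",
    "tName", "tSize", "tStart", "tEnd",
    "blockCount", "blockSizes", "qStarts", "tStarts",
]
TEXT_FIELDS = {"strand", "qName", "tName"}
LIST_FIELDS = {"blockSizes", "qStarts", "tStarts"}


def parse_psl_line(line):
    """Parse one PSL data line into a dict keyed by field name (hypothetical helper)."""
    cols = line.split()
    if len(cols) != len(PSL_FIELDS):
        raise ValueError(f"expected {len(PSL_FIELDS)} columns, got {len(cols)}")
    record = {}
    for name, value in zip(PSL_FIELDS, cols):
        if name in TEXT_FIELDS:
            record[name] = value
        elif name in LIST_FIELDS:
            # The lists end with a trailing comma, so skip empty pieces.
            record[name] = [int(x) for x in value.split(",") if x]
        else:
            record[name] = int(value)
    return record


# First record of the fishBlats example, joined back onto a single line.
line = ("59 9 0 0 1 823 1 96 +- FS_CONTIG_48080_1 1955 171 1062 chr22 "
        "47748585 13073589 13073753 2 48,20, 171,1042, 34674832,34674976,")
rec = parse_psl_line(line)
print(rec["qName"], rec["strand"], rec["blockSizes"], rec["tStarts"])
```

A simple sanity check after parsing is that each of the three lists has exactly `blockCount` entries.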