Human Gene Functions

Human Gene Functions

Time Limit: 1000MS Memory Limit: 10000K

Total Submissions: 18053 Accepted: 10046

Description

It is well known that a human gene can be considered as a sequence, consisting of four nucleotides, which are simply denoted by four letters, A, C, G, and T. Biologists have been interested in identifying human genes and determining their functions, because these can be used to diagnose human diseases and to design new drugs for them.

A human gene can be identified through a series of time-consuming biological experiments, often with the help of computer programs. Once a sequence of a gene is obtained, the next job is to determine its function.

One of the methods for biologists to use in determining the function of a new gene sequence that they have just identified is to search a database with the new gene as a query. The database to be searched stores many gene sequences and their functions – many researchers have been submitting their genes and functions to the database and the database is freely accessible through the Internet.

A database search will return a list of gene sequences from the database that are similar to the query gene.

Biologists assume that sequence similarity often implies functional similarity. So, the function of the new gene might be one of the functions that the genes from the list have. To exactly determine which one is the right one another series of biological experiments will be needed.

Your job is to make a program that compares two genes and determines their similarity as explained below. Your program may be used as a part of the database search if you can provide an efficient one.

Given two genes AGTGATG and GTTAG, how similar are they? One of the methods to measure the similarity

of two genes is called alignment. In an alignment, spaces are inserted, if necessary, in appropriate positions of

the genes to make them equally long and score the resulting genes according to a scoring matrix.

For example, one space is inserted into AGTGATG to result in AGTGAT-G, and three spaces are inserted into GTTAG to result in –GT–TAG. A space is denoted by a minus sign (-). The two genes are now of equal

length. These two strings are aligned:

AGTGAT-G

-GT–TAG

In this alignment, there are four matches, namely, G in the second position, T in the third, T in the sixth, and G in the eighth. Each pair of aligned characters is assigned a score according to the following scoring matrix.

denotes that a space-space match is not allowed. The score of the alignment above is (-3)+5+5+(-2)+(-3)+5+(-3)+5=9.

Of course, many other alignments are possible. One is shown below (a different number of spaces are inserted into different positions):

AGTGATG

-GTTA-G

This alignment gives a score of (-3)+5+5+(-2)+5+(-1) +5=14. So, this one is better than the previous one. As a matter of fact, this one is optimal since no other alignment can have a higher score. So, it is said that the

similarity of the two genes is 14.

Input

The input consists of T test cases. The number of test cases ) (T is given in the first line of the input file. Each test case consists of two lines: each line contains an integer, the length of a gene, followed by a gene sequence. The length of each gene sequence is at least one and does not exceed 100.

Output

The output should print the similarity of each test case, one per line.

Sample Input

2

7 AGTGATG

5 GTTAG

7 AGCTATT

9 AGCTTTAAA

Sample Output

14

21

Source

Taejon 2001

求DNA匹配度,类似最长公共子序列

#include <iostream>

#include <cstdio>

#include <cstring>

#include <cmath>

#include <queue>

#include <map>

#include <algorithm>

using namespace std;

typedef long long LL;

typedef pair<int,int>p;

const int INF = 0x3f3f3f3f;

int value[][5]={{5,-1,-2,-1,-3},

                {-1,5,-3,-2,-4},

                {-2,-3,5,-2,-2},

                {-1,-2,-2,5,-1},

                {-3,-4,-2,-1,0}};

map<char ,int >Dir;

int lens,lenc;

int Dp[110][110];

int main()

{

    int n;

    char s[110];

    char c[110];

    Dir['A']=0;

    Dir['C']=1;

    Dir['G']=2;

    Dir['T']=3;

    Dir['-']=4;

    while(~scanf("%d",&n))

    {

        while(n--)

        {

            scanf("%d %s",&lens,s+1);

            scanf("%d %s",&lenc,c+1);

            Dp[0][0]=0;

            for(int i=1;i<=lens;i++)

            {

                Dp[i][0]=Dp[i-1][0]+value[Dir[s[i]]][Dir['-']];//如果都不匹配的情况

            }

            for(int i=1;i<=lenc;i++)

            {

                Dp[0][i]=Dp[0][i-1]+value[Dir[c[i]]][Dir['-']];

            }

            for(int i=1;i<=lens;i++)

            {

                for(int j=1;j<=lenc;j++)

                {

                    Dp[i][j]=Dp[i-1][j-1]+value[Dir[s[i]]][Dir[c[j]]];//如果两个字符要匹配,则Dp[i][j]由Dp[i-1][j-1]推出.

                    Dp[i][j]=max(Dp[i][j],Dp[i-1][j]+value[Dir[s[i]]][Dir['-']]);//前面的字符与这个已经匹配(不管用什么方式匹配的),他只能与'-'匹配

                    Dp[i][j]=max(Dp[i][j],Dp[i][j-1]+value[Dir['-']][Dir[c[j]]]);//如果这个字符已经与前面匹配,则c[j]与'-'匹配

                }

            }

            printf("%d\n",Dp[lens][lenc]);

        }

    }

    return 0;

}

Human Gene Functions的更多相关文章

hdu1080 Human Gene Functions() 2016-05-24 14:43 65人阅读评论(0) 收藏
Human Gene Functions Time Limit: 2000/1000 MS (Java/Others) Memory Limit: 65536/32768 K (Java/Oth ...
poj 1080 ——Human Gene Functions——————【最长公共子序列变型题】
Human Gene Functions Time Limit: 1000MS Memory Limit: 10000K Total Submissions: 17805 Accepted: ...
【POJ 1080】 Human Gene Functions
[POJ 1080] Human Gene Functions 相似于最长公共子序列的做法 dp[i][j]表示 str1[i]相应str2[j]时的最大得分转移方程为 dp[i][j]=max(d ...
poj 1080 Human Gene Functions（lcs，较难）
Human Gene Functions Time Limit: 1000MS Memory Limit: 10000K Total Submissions: 19573 Accepted: ...
POJ 1080：Human Gene Functions LCS经典DP
Human Gene Functions Time Limit: 1000MS Memory Limit: 10000K Total Submissions: 18007 Accepted: ...
POJ 1080 Human Gene Functions -- 动态规划(最长公共子序列)
题目地址:http://poj.org/problem?id=1080 Description It is well known that a human gene can be considered ...
杭电20题 Human Gene Functions
Problem Description It is well known that a human gene can be considered as a sequence, consisting o ...
刷题总结——Human Gene Functions（hdu1080）
题目: Problem Description It is well known that a human gene can be considered as a sequence, consisti ...
Human Gene Functions POJ 1080 最长公共子序列变形
Description It is well known that a human gene can be considered as a sequence, consisting of four n ...

随机推荐

php基础语法学习汇总
常量学习: <?php # function demo function sum($x,$y){ $z=$x+$y; return $z; } echo sum(1,2); #define de ...
通用窗口类 Inventory Pro 2.1.2 Demo1（中）
本篇想总结的是Inventory Pro中通用窗口的具体实现,但还是要强调下该插件的重点还是装备系统而不是通用窗口系统,所以这里提到的通用窗口类其实是通用装备窗口类(其实该插件中也有非装备窗口比如No ...
Swift游戏实战-跑酷熊猫 07 平台的移动
这节内容我们来实现平台是怎么产生移动动画的. 要点 1 利用数组存放平台 var platforms=[Platform]() 2 有新的平台产生存放进数组 platforms.append(plat ...
ADO.net 扩展属性
扩展属性处理:有外键关系时将代号化信息处理成原始文字,让用户可看懂的(粗略解释) 利用扩展属性如:users表中的民族列显示的是民族代号处理成Nation表中的民族名称需要在users类里面扩展 ...
Codeforce Round #217 Div2
e,妈蛋,第二题被hack了没理解清题意,- -居然也把pretest过了,- -# A: 呵呵! B:包含任意一个子集的输出NO!,其他输出YES! C:贪心额,类似上次的Topcoder的500 ...
zoj The 12th Zhejiang Provincial Collegiate Programming Contest May Day Holiday
http://acm.zju.edu.cn/onlinejudge/showContestProblem.do?problemId=5500 The 12th Zhejiang Provincial ...
[原创]java WEB学习笔记90：Hibernate学习之路-- -HQL检索方式，分页查询，命名查询语句，投影查询，报表查询
本博客的目的:①总结自己的学习过程,相当于学习笔记 ②将自己的经验分享给大家,相互学习,互相交流,不可商用内容难免出现问题,欢迎指正,交流,探讨,可以留言,也可以通过以下方式联系. 本人互联网技术爱 ...
ArrayList和LinkList区别
ArrayList和LinkList区别前者是数组的数据结构,后者是链表的数据结构前者应用于排序和查找,后者应用于插入删除
list和map的区别
list和map的区别 list-->list是对象集合,允许对象重复 map-->map是键值对的集合,不允许key重复
Android中使用ShareSDK实现分享
1,在http://www.mob.com官网上去创建一个后台应用(如果没有账号的同学要去先注册一下),主要后去的是你新创建的应用的appKey值 2,这里我们分享新浪微博为例,在新浪微博的sdk中创 ...

Human Gene Functions

Human Gene Functions的更多相关文章

随机推荐

热门专题