UVA11107 Life Forms SA模板

Life Forms

Time Limit: 5000MS		Memory Limit: 65536K
Total Submissions: 16827		Accepted: 4943

Description

You may have wondered why most extraterrestrial life forms resemble humans, differing by superficial traits such as height, colour, wrinkles, ears, eyebrows and the like. A few bear no human resemblance; these typically have geometric or amorphous shapes like cubes, oil slicks or clouds of dust.

The answer is given in the 146th episode of Star Trek - The Next Generation, titled The Chase. It turns out that in the vast majority of the quadrant's life forms ended up with a large fragment of common DNA.

Given the DNA sequences of several life forms represented as strings of letters, you are to find the longest substring that is shared by more than half of them.

Input

Standard input contains several test cases. Each test case begins with 1 ≤ n ≤ 100, the number of life forms. n lines follow; each contains a string of lower case letters representing the DNA sequence of a life form. Each DNA sequence contains at least one and not more than 1000 letters. A line containing 0 follows the last test case.

Output

For each test case, output the longest string or strings shared by more than half of the life forms. If there are many, output all of them in alphabetical order. If there is no solution with at least one letter, output "?". Leave an empty line between test cases.

Sample Input

3

abcdefg

bcdefgh

cdefghi

3

xxx

yyy

zzz

0

Sample Output

bcdefg

cdefgh

?

Source

【题解】

UVA炸了，在POJ上交的

把所有的串连起来，串尾加分隔符，一来识别串尾，二来防止跟后面的串拼起来一块与其他串组成LCP，分组时会出现错误

二分答案分组，看每个组里是否有大于n/2个串即可

细节很多，关键是要输出方案。。。还要做很鬼畜的标记，各种判断标识符。。

 #include <iostream>

 #include <cstdio>

 #include <cstring>

 #include <cstdlib>

 #include <algorithm>

 #include <queue>

 #include <vector>

 #include <cmath>

 #define min(a, b) ((a) < (b) ? (a) : (b))

 #define max(a, b) ((a) > (b) ? (a) : (b))

 #define abs(a) ((a) < 0 ? (-1 * (a)) : (a))

 template <class T>

 inline void swap(T& a, T& b)

 {

     T tmp = a;a = b;b = tmp;

 }

 inline void read(int &x)

 {

     x = ;char ch = getchar(), c = ch;

     while(ch < '' || ch > '') c = ch, ch = getchar();

     while(ch <= '' && ch >= '') x = x *  + ch - '', ch = getchar();

     if(c == '-') x = -x;

 }

 const int INF = 0x3f3f3f3f;

 const int MAXN =  + ;

 struct SuffixArray

 {

     char s[MAXN];int sa[MAXN], t1[MAXN], t2[MAXN], rank[MAXN], height[MAXN], c[MAXN], n;

     void clear(){n = ;memset(sa, , sizeof(sa));}

     void build_sa(int m)

     {

         int i, *x = t1, *y = t2;

         for(i = ;i <= m;++ i) c[i] = ;

         for(i = ;i <= n;++ i) ++ c[x[i] = s[i]];

         for(i = ;i <= m;++ i) c[i] += c[i - ];

         for(i = n;i >= ;-- i) sa[c[x[i]] --] = i;

         for(int k = ;k <= n;k <<= )

         {

             int p = ;

             for(i = n - k + ;i <= n;++ i) y[++ p] = i;

             for(i = ;i <= n;++ i) if(sa[i] > k) y[++ p] = sa[i] - k;

             for(i = ;i <= m;++ i) c[i] = ;

             for(i = ;i <= n;++ i) ++ c[x[y[i]]];

             for(i = ;i <= m;++ i) c[i] += c[i - ];

             for(i = n;i >= ;-- i) sa[c[x[y[i]]] --] = y[i];

             swap(x, y);p = ,x[sa[]] = ++ p;

             for(i = ;i <= n;++ i) x[sa[i]] = sa[i] + k <= n && sa[i - ] + k <= n && y[sa[i]] == y[sa[i - ]] && y[sa[i] + k] == y[sa[i - ] + k] ? p : ++ p;

             if(p >= n) break;m = p;

         }

     }

     void build_height()

     {

         int i,j,k = ;

         for(i = ;i <= n;++ i) rank[sa[i]] = i;

         for(i = ;i <= n;++ i)

         {

             if(k) -- k; if(rank[i] == ) continue;

             j = sa[rank[i] - ];

             while(i + k <= n && j + k <= n && s[i + k] == s[j + k]) ++ k;

             height[rank[i]] = k;

         }

     }

 }A;

 int cnt, n, tmp, ans, num[MAXN], vis[MAXN], tt = ;

 std::vector<int> node;

 int check(int x)

 {

     int num = , flag = , t = , f = ;

     for(int i = ;i <= n;++ i)

     {

         if(A.s[A.sa[i]] == 'z' +  || A.s[A.sa[i - ]] == 'z' +  || ::num[A.sa[i] + x] != ::num[A.sa[i]] || ::num[A.sa[i - ] + x] != ::num[A.sa[i - ]])

         {

             num = , ++ tt, t = , f = ;

             continue;

         }

         if(A.height[i] >= x)

         {

             if(t)

             {

                 t = ;

                 vis[::num[A.sa[i - ]]] = tt;

                 ++ num;

             }

             if(vis[::num[A.sa[i]]] == tt) continue;

             ++ num, vis[::num[A.sa[i]]] = tt;

         }

         else num = , ++ tt, t = , f = ;

         if(num >= cnt/ +  && f)

         {

             if(!flag) node.clear();

             node.push_back(i);

             f = ;

             flag =;

         }

     }

     return flag;

 }

 void put()

 {

     for(int i = ;i < node.size();++ i)

     {

         for(int j = A.sa[node[i]], k = ;k <= ans;++ k, ++ j)

             printf("%c", A.s[j]);

         putchar('\n');

     }

 }

 int main()

 {

     while(scanf("%d", &cnt) != EOF && cnt)

     {

         if(cnt == )

         {

             scanf("%s", A.s + );

             printf("%s\n\n", A.s + );

             continue;

         }

         A.clear();n = ;tmp = ;

         for(int i = ;i <= cnt;++ i)

         {

             scanf("%s", A.s + n + );

             int t = n + ;

             tmp = max(tmp, strlen(A.s + n + ));

             n += strlen(A.s + n + );

             A.s[++ n] =  'z' + ;

             for(int j = t;j <= n;++ j) num[j] = i;

         }

         A.n = n;

         A.build_sa('z' + );

         A.build_height();

         int l = , r = tmp, mid;ans = ;

         while(l <= r)

         {

             mid = (l + r) >> ;

             if(check(mid)) l = mid + , ans = mid;

             else r = mid - ;

         }

         if(ans)    put();

         else printf("?\n");

         putchar('\n');

     }

     return ;

 }

UVA11107

UVA11107 Life Forms SA模板的更多相关文章

UVA11107 Life Forms --- 后缀数组
UVA11107 Life Forms 题目描述: 求出出现在一半以上的字符串内的最长字符串. 数据范围: \(\sum len(string) <= 10^{5}\) 非常坑的题目. 思路非常 ...
UVA-11107 Life Forms（后缀数组）
题目大意:给出n个字符串,找出所有最长的在超过一半的字符串中出现的子串. 题目分析:将所有的字符串连成一个,二分枚举长度,每次用O(n)的时间复杂度判断.连接字符串的时候中间添一个没有出现过的字符. ...
UVA11107 Life Forms
思路后缀数组先都拼在一起二分+height分段按照小于x的为分界,判断是否有一个分段中包含超过n/2个串代码 #include <cstdio> #include <cst ...
【POJ3294】 Life Forms(SA)
...又是TLE,对于单组数据肯定TLE不了,问题是多组的时候就呵呵了... 按height分组去搞,然后判一下是否不属于同一个串... ; var x,y,rank,sa,c,col,h,rec:. ...
SA模板
#include<cstdio> #include<algorithm> #include<cstring> using namespace std; ; char ...
UVA-11107 Life Forms(求出现K次的子串，后缀数组+二分答案)
题解: 题意: 输入n个DNA序列,你的任务是求出一个长度最大的字符串,使得它在超过一半的DNA序列中出现.如果有多解,按照字典序从小到大输入所有解. 把n个DNA序列拼在一起,中间用没有出现过的字符 ...
洛谷3809 SA模板后缀数组学习笔记（复习）
其实SA这个东西很久之前就听过qwq 但是基本已经忘的差不多了嘤嘤嘤 QWQ感觉自己不是很理解啊所以写不出来那种博客 QWQ只能安利一些别人的博客了小老板真的是讲的非常好不要在意名字 orz ...
Visual Studio项目模板与向导开发
在[Xamarin+Prism开发详解系列]里面经常使用到[Prism unity app]的模板创建Prism.Forms项目: 备注:由于Unity社区已经不怎么活跃,下一个版本将会有Autofa ...
[BZOJ4650][NOI2016]优秀的拆分(SAM构建SA)
关于解法这个讲的很清楚了,主要用了设关键点的巧妙思想. 主要想说的是一个刚学的方法:通过后缀自动机建立后缀树,再转成后缀数组. 后缀数组功能强大,但是最令人头疼的地方是模板太难背容易写错.用这个方法, ...

随机推荐

Python接口测试框架实战与自动化进阶✍✍✍
Python接口测试框架实战与自动化进阶整个课程都看完了,这个课程的分享可以往下看,下面有链接,之前做java开发也做了一些年头,也分享下自己看这个视频的感受,单论单个知识点课程本身没问题,大家看 ...
Visual Studio 2019安装教程
一.下载网址:https://visualstudio.microsoft.com/zh-hans/vs/ 下载后是一个.exe文件二.安装双击打开下载的.exe文件,进入文件的提取提取完成后 ...
python基础语法（变量与数据类型）
python基础语法(变量与数据类型) 一.python变量 python中的变量不需要声明.每个变量在使用钱都需要赋值,变量赋值以后,该变量才会被创建在python中,变量就是变量,它没有类型,我 ...
ES6和常用特性归纳
ECMAScript 6(以下简称ES6)是JavaScript语言的下一代标准,已经在2015年6月正式发布了.Mozilla公司将在这个标准的基础上,推出JavaScript 2.0. ECMAS ...
【颓废篇】人生苦短，我用python（一)
谁渴望来一场华(ang)丽(zang)的python交易! 最近突然产生了系统学习python的想法. 其实自从上次luogu冬日绘板dalao们都在写脚本就有这种想法了. 最近被计算几何势力干翻的我 ...
css3 随记
1 让子元素对其的方式 box-pack 2 -webkit-text-size-adjust 解决字体大小失效问题http://www.frontopen.com/273.html 3 disp ...
IPMI 远程配置
#重启ipmi服务 #重启ipmi服务 #将 channel 1 设置为静态 IP #设置 IP #设置 channel 1 掩码 #设置 channel 1 网关 #查看用户名及 ID #设置ID号 ...
在双重for循环内部使用async异步请求axios中遇到的问题
在methods中的方法 async getPro () { let _this = this let newArr = [] await axios.get(`api/v1/dailyProTbms ...
Linux时间和时区设定
一.时区设定由于安装系统时采用了UTC,那么什么是UTC呢,简单的说UTC就是0时区的时间,是国际标准,而中国处于UTC+8时区. 使用tzselect命令,过程如下: 可以看到此环境变量已设置,将 ...
BZOJ 2281 消失之物
ftiasch 有 N 个物品, 体积分别是 W1, W2, -, WN. 由于她的疏忽, 第 i 个物品丢失了. "要使用剩下的 N – 1 物品装满容积为 x 的背包,有几种方法呢?&q ...

UVA11107 Life Forms SA模板

UVA11107 Life Forms SA模板的更多相关文章

随机推荐

热门专题