【LeetCode】004. Median of Two Sorted Arrays
题目:
There are two sorted arrays nums1 and nums2 of size m and n respectively.
Find the median of the two sorted arrays. The overall run time complexity should be O(log (m+n)).
Example 1:
nums1 = [1, 3]
nums2 = [2] The median is 2.0
Example 2:
nums1 = [1, 2]
nums2 = [3, 4] The median is (2 + 3)/2 = 2.5
题解:
第一次解Hard级的题,果然不是现在的我能做得了的。暴力解肯定不行(即用分治法将两个数组排序,然后计算中位数),题目规定了时间复杂度,只能从网上看大神的了。一是将问题扩大成寻找第K大的元素,先简单的实现,不考虑题目规定的时间复杂度。(Two Pointers思想)
Solution 1(46ms)
class Solution {
public:
int findKth(vector<int> nums1, vector<int> nums2, int k) {
int m = nums1.size(), n = nums2.size();
int p = , q = ;
for(int i=; i<k-; i++) {
if(p>=m && q<n) q++;
else if(q>=n && p<m) p++;
else if(nums1[p]>nums2[q]) q++;
else p++;
}
if(p>=m) return nums2[q];
else if(q>=n) return nums1[p];
else return min(nums1[p],nums2[q]);
}
int findKth2(vector<int> nums1, vector<int> nums2, int k) {
int m = nums1.size(), n = nums2.size();
int i = , j = , cur = ;
while(i<m&&j<n) {
if(nums1[i]<nums2[j]) {
cur++;
if(cur==k) return nums1[i];
i++;
}
else {
cur++;
if(cur==k) return nums2[j];
j++;
}
}
while(i<m) {
cur++;
if(cur==k) return nums1[i];
i++;
}
while(j<n) {
cur++;
if(cur==k) return nums2[j];
j++;
}
}
double findMedianSortedArrays(vector<int> nums1, vector<int> nums2) {
int total = nums1.size() + nums2.size();
if(total % ==){
return (findKth(nums1,nums2,total/)+findKth(nums1,nums2,total/+))/2.0;
} else {
return findKth(nums1,nums2,total/+);
}
}
};
时间复杂度为O(k)=O(m+n),原因在于findKth函数。那么怎么进行优化呢?实际上与O(logn)有关的算法第一个想到的就是二分思想。
如果想要时间复杂度将为O(log(m+n))。我们可以考虑从K入手。如果我们每次能够删除一个一定在第K个元素之前的元素,那么我们需要进行K次,但是如果每次我们都删除一半呢?由于两个数组都是有序的,我们应该充分利用这个信息。
假设A B 两数组的元素都大于K/2,我们将A B两数组的第K/2个元素进行比较。比较的结果有三种情况。
A[K/2] == B[K/2]; A[K/2] > B[K/2]; A[K/2] <= B[K/2]
如果 A[K/2] < B[K/2] 意味着 A[0] 到 A[K/2] 肯定在A∪B的前k个元素中。因此我们可以放心删除A数组的这个k/2个元素。同理A[K/2] > B[K/2]。
如果 A[K/2] == B[K/2] 说明已经找到了第K个元素,直接返回A[K/2]或者B[K/2]。(from here)
另外,需要处理边界条件:
如果A或者B为空,则直接返回B[k-1]或者A[k-1];如果k为1,我们只需要返回A[0]和B[0]中的较小值;如果A[k/2-1]=B[k/2-1],返回其中一个;(from here)
上述的描述不太准确,知道这个思想就好了。
Solution 2 ()
class Solution {
public:
double findKth(vector<int> nums1, vector<int> nums2, int k) {
int m = nums1.size(), n = nums2.size();
if(m <= ) return nums2[k-];
if(n <= ) return nums1[k-];
if(k <= ) return min(nums1[], nums2[]);
if(nums2[n/] >= nums1[m/]) {
if((m/ + + n/) >= k) {
return findKth(nums1, vector<int> (nums2.begin(), nums2.begin()+n/), k);
}
else
return findKth(vector<int> (nums1.begin()+m/+, nums1.end()), nums2, k-(m/+));
}
else {
if((m/ + + n/) >= k) {
return findKth(vector<int> (nums1.begin(), nums1.begin()+m/), nums2, k);
}
else
return findKth(nums1, vector<int> (nums2.begin()+n/+, nums2.end()),k-(n/+));
}
}
double findMedianSortedArrays(vector<int> nums1, vector<int> nums2) {
int m = nums1.size(), n = nums2.size();
if((m + n) % == )
return (findKth(nums1, nums2, (m+n)/) + findKth(nums1, nums2, (m+n)/+))/2.0;
else
return findKth(nums1, nums2, (m+n)/+);
}
};
Solution 2 代码简化
Solution 3 (82ms)
class Solution {
public:
int findKth(vector<int> nums1, vector<int> nums2, int k) {
int m = nums1.size(), n = nums2.size();
//确保m<=n
if (m > n)
return findKth(nums2, nums1, k);
if (m == )
return nums2[k - ];
if (k == )
return min(nums1[], nums2[]);
int i = min(m, k / ), j = k-i;
if (nums1[i - ] > nums2[j - ])
return findKth(nums1, vector<int>(nums2.begin() + j, nums2.end()), k - j);
else
return findKth(vector<int>(nums1.begin() + i, nums1.end()), nums2, k - i); }
double findMedianSortedArrays(vector<int> nums1, vector<int> nums2) {
int total = nums1.size() + nums2.size();
if(total % ==){
return (findKth(nums1,nums2,total/)+findKth(nums1,nums2,total/+))/2.0;
} else {
return findKth(nums1,nums2,total/+);
}
}
};
减少vector数组的复制开销,
Solution 4
class Solution {
public:
double findMedianSortedArrays(vector<int>& nums1, vector<int>& nums2) {
int m = nums1.size(), n = nums2.size();
int total = m + n;
if (total % == ) {
return (findKth(nums1, , m, nums2, , n, total / ) + findKth(nums1, , m, nums2, , n, total / + )) / 2.0;
} else {
return (findKth(nums1, , m, nums2, , n, total / + ));
}
} double findKth(vector<int>& nums1, int begin1, int end1, vector<int>& nums2, int begin2, int end2, int K) {
int len1 = end1 - begin1, len2 = end2 - begin2;
if (len1 > len2)
return findKth(nums2, begin2, end2, nums1, begin1, end1, K);
if (len1 < ) return nums2[begin2 + K - ];
if (len2 < ) return nums1[begin1 + K - ];
if (K < ) return min(nums1[begin1], nums2[begin2]); int mid1 = min(len1, K / ), mid2 = K - mid1;
if (nums1[begin1 + mid1 - ] > nums2[begin2 + mid2 - ])
return findKth(nums1, begin1, end1, nums2, begin2 + mid2, end2, K - mid2);
else
return findKth(nums1, begin1 + mid1, end1, nums2, begin2, end2, K - mid1); }
};
Solution 5 (76ms)
这个就比较牛了,具体细节在这里。
This problem is notoriously hard to implement due to all the corner cases. Most implementations consider odd-lengthed and even-lengthed arrays as two different cases and treat them separately. As a matter of fact, with a little mind twist. These two cases can be combined as one, leading to a very simple solution where (almost) no special treatment is needed. First, let's see the concept of 'MEDIAN' in a slightly unconventional way. That is: "if we cut the sorted array to two halves of EQUAL LENGTHS, then
median is the AVERAGE OF Max(lower_half) and Min(upper_half), i.e. the
two numbers immediately next to the cut".
For example, for [ ], we make the cut between and : [ / ]
then the median = (+)/. Note that I'll use '/' to represent a cut, and (number / number) to represent a cut made through a number in this article. for [ ], we make the cut right through like this: [ (/) ] Since we split into two halves, we say now both the lower and upper subarray contain . This notion also leads to the correct answer: ( + ) / = ; For convenience, let's use L to represent the number immediately left to the cut, and R the right counterpart. In [2 3 5 7], for instance, we have L = 3 and R = 5, respectively. We observe the index of L and R have the following relationship with the length of the array N: N Index of L / R
/
/
/
/
/
/
/
/
It is not hard to conclude that index of L = (N-)/, and R is at N/. Thus, the median can be represented as (L + R)/ = (A[(N-)/] + A[N/])/
To get ready for the two array situation, let's add a few imaginary 'positions' (represented as #'s) in between numbers, and treat numbers as 'positions' as well. [ ] -> [# # # # #] (N = )
position index (N_Position = ) [ ]-> [# # # # # #] (N = )
position index (N_Position = )
As you can see, there are always exactly *N+ 'positions' regardless of length N. Therefore, the middle cut should always be made on the Nth position (-based). Since index(L) = (N-)/ and index(R) = N/ in this situation, we can infer that index(L) = (CutPosition-)/, index(R) = (CutPosition)/. Now for the two-array case: A1: [# # # # # #] (N1 = , N1_positions = ) A2: [# # # # #] (N2 = , N2_positions = )
Similar to the one-array problem, we need to find a cut that divides the two arrays each into two halves such that "any number in the two left halves" <= "any number in the two right
halves".
We can also make the following observations: There are 2N1 + 2N2 + position altogether. Therefore, there must be exactly N1 + N2 positions on each side of the cut, and positions directly on the cut. Therefore, when we cut at position C2 = K in A2, then the cut position in A1 must be C1 = N1 + N2 - k. For instance, if C2 = , then we must have C1 = + - C2 = . [# # # # (/) # #] [# / # # #]
When the cuts are made, we'd have two L's and two R's. They are L1 = A1[(C1-)/]; R1 = A1[C1/];
L2 = A2[(C2-)/]; R2 = A2[C2/];
In the above example, L1 = A1[(-)/] = A1[] = ; R1 = A1[/] = A1[] = ;
L2 = A2[(-)/] = A2[] = ; R2 = A1[/] = A1[] = ;
Now how do we decide if this cut is the cut we want? Because L1, L2 are the greatest numbers on the left halves and R1, R2 are the smallest numbers on the right, we only need L1 <= R1 && L1 <= R2 && L2 <= R1 && L2 <= R2
to make sure that any number in lower halves <= any number in upper halves. As a matter of fact, since
L1 <= R1 and L2 <= R2 are naturally guaranteed because A1 and A2 are sorted, we only need to make sure: L1 <= R2 and L2 <= R1. Now we can use simple binary search to find out the result. If we have L1 > R1, it means there are too many large numbers on the left half of A1, then we must move C1 to the left (i.e. move C2 to the right);
If L2 > R1, then there are too many large numbers on the left half of A2, and we must move C2 to the left.
Otherwise, this cut is the right one.
After we find the cut, the medium can be computed as (max(L1, L2) + min(R1, R2)) / ;
Two side notes: A. since C1 and C2 can be mutually determined from each other, we might as well select the shorter array (say A2) and only move C2 around, and calculate C1 accordingly. That way we can achieve a run-time complexity of O(log(min(N1, N2))) B. The only edge case is when a cut falls on the 0th(first) or the 2Nth(last) position. For instance, if C2 = 2N2, then R2 = A2[*N2/] = A2[N2], which exceeds the boundary of the array. To solve this problem, we can imagine that both A1 and A2 actually have two extra elements, INT_MAX at A[-] and INT_MAX at A[N]. These additions don't change the result, but make the implementation easier: If any L falls out of the left boundary of the array, then L = INT_MIN, and if any R falls out of the right boundary, then R = INT_MAX.
class Solution {
public:
double findMedianSortedArrays(vector<int> nums1, vector<int> nums2) {
int m = nums1.size();
int n = nums2.size();
if (m < n) return findMedianSortedArrays(nums2, nums1);
if (n == ) return ((double)nums1[(m-)/] + (double)nums1[m/])/;
int i = , j = n * ;
while (i <= j) {
int mid2 = (i + j) / ;
int mid1 = m + n - mid2;
double L1 = (mid1 == ) ? INT_MIN : nums1[(mid1-)/];
double L2 = (mid2 == ) ? INT_MIN : nums2[(mid2-)/];
double R1 = (mid1 == m * ) ? INT_MAX : nums1[(mid1)/];
double R2 = (mid2 == n * ) ? INT_MAX : nums2[(mid2)/];
if (L1 > R2) i = mid2 + ;
else if (L2 > R1) j = mid2 - ;
else return (max(L1,L2) + min(R1, R2)) / ;
}
return -;
}
};
【LeetCode】004. Median of Two Sorted Arrays的更多相关文章
- 【LeetCode】4. Median of Two Sorted Arrays (2 solutions)
Median of Two Sorted Arrays There are two sorted arrays A and B of size m and n respectively. Find t ...
- 【LeetCode】4. Median of Two Sorted Arrays 寻找两个正序数组的中位数
作者: 负雪明烛 id: fuxuemingzhu 个人博客: http://fuxuemingzhu.cn/ 公众号:负雪明烛 本文关键词:数组,中位数,题解,leetcode, 力扣,python ...
- 【leetcode】4. Median of Two Sorted Arrays
题目描述: There are two sorted arrays nums1 and nums2 of size m and n respectively. Find the median of t ...
- 【LeetCode】4.Median of Two Sorted Arrays 两个有序数组中位数
题目: There are two sorted arrays nums1 and nums2 of size m and n respectively. Find the median of the ...
- 【LeetCode】4. Median of Two Sorted Arrays(思维)
[题意] 给两个有序数组,寻找两个数组组成后的中位数,要求时间复杂度为O(log(n+m)). [题解] 感觉这道题想法非常妙!! 假定原数组为a,b,数组长度为lena,lenb. 那么中位数一定是 ...
- 【一天一道LeetCode】#4 Median of Two Sorted Arrays
一天一道LeetCode (一)题目 There are two sorted arrays nums1 and nums2 of size m and n respectively. Find th ...
- 【leetcode】1213.Intersection of Three Sorted Arrays
题目如下: Given three integer arrays arr1, arr2 and arr3 sorted in strictly increasing order, return a s ...
- 【LeeetCode】4. Median of Two Sorted Arrays
There are two sorted arrays nums1 and nums2 of size m and n respectively. Find the median of the two ...
- 【medium】4. Median of Two Sorted Arrays 两个有序数组中第k小的数
There are two sorted arrays nums1 and nums2 of size m and n respectively. Find the median of the two ...
随机推荐
- 在树莓派上用Python控制LED
所需材料 一个已经安装配置好了的树莓派 连接控制树莓派所用的其他必须设备 200Ω电阻 x 8 led x 8 面包板及连接线若干 电路连接 电路图 按照电路图所示,在面包板上进行连接. 编写程序 安 ...
- 【HackerRank】Utopian tree
The Utopian tree goes through 2 cycles of growth every year. The first growth cycle of the tree occu ...
- php数组函数-array_map()
array_map()函数返回用户自定义函数作用后的数组.回调函数接受的参数 数目应该和传递给array_map()函数的数组数目一直. array_map(function,array1,array ...
- 斯坦福机器学习视频笔记 Week9 异常检测和高斯混合模型 Anomaly Detection
异常检测,广泛用于欺诈检测(例如“此信用卡被盗?”). 给定大量的数据点,我们有时可能想要找出哪些与平均值有显着差异. 例如,在制造中,我们可能想要检测缺陷或异常. 我们展示了如何使用高斯分布来建模数 ...
- Golang 连接Kafka
Kafka介绍 Kafka是Apache软件基金会开发的一个开源流处理平台,由Java和Scala编写:Kafka是一种高吞吐.分布式.基于订阅发布的消息系统. Kafka名称解释 Producer: ...
- Go 模板语法
Sprig the useful template functions for Go templates (http://masterminds.github.io/sprig/) Github - ...
- Python 数值类型
1.数值类型分为整形(二进制(0b),八进制(0o),十进制,十六进制(0x) ),浮点型,long,complex(复合行) 当我们说十进制数的时候,是逢10进1,就是说到达10的时候就要向前一位进 ...
- MapReduce-二进制输入
Hadoop的MapReduce不只是可以处理文本信息,它还可以处理二进制格式的数据1. 关于SequenceFileInputFormat类Hadoop的顺序文件格式存储二进制的键/值对的序列.由于 ...
- Proxy动态代理
Proxy动态代理 package com.test.dynamicproxy; public interface Subject { public void request(); } package ...
- IOS 发布被拒 PLA 1.2问题 整个过程介绍 01
公司前端时间发布IOS APP ,但是遇到一些很麻烦的问题,就是一个让人摸不着头脑的问题. 问题: The Seller and Artist names associated with your a ...