Given n arrays, find n number

Microsoft Interview Question for Software Engineer / Developers

0

of 0 votes

60
Answers
Given n arrays, find n number such that sum of their differences is minimum. For e.g. if there are three arrays

A = {4, 10, 15, 20}
B = {1, 13, 29}
C = {5, 14, 28}

find three numbers a, b, c such that |a-b| + |b-c| + |c-a| is minimum. Here the answer is a = 15, b = 13, and c = 14
- A June 30, 2011 | Report Duplicate | Flag | PURGE
Microsoft Software Engineer / Developer Algorithm

Email me when people comment.

An error occurred in subscribing you.

Email me when people comment.

An error occurred in subscribing you.

Comment hidden because of low score. Click to expand.

of 2 vote

I guess the arrays are sorted. In that case, the problem is same as selecting one element from each of n arrays such that (max element - min element) is minimized. If total # of element in n arrays is m, then using a min heap it can be solved in O(m logn).

- buried.shopno June 30, 2011 | Flag Reply

Comment hidden because of low score. Click to expand.

of 2 vote

y is there so much unnecessary discussion for a simple problem.
Below code is easy to trace so not presenting algo.
The code assumes that array are sorted.

#include <stdio.h>
#define ABSDIFF(a,b) (((a)>(b))?((a)-(b)):((b)-(a)))
void findMinDst (int* pArray1, int* pArray2, int* pArray3, int size1, int size2, int size3)
{
	int min=(1<<30),curMin=0;
	int i=0,j=0,k=0;
	int index1=0,index2=0,index3=0;
	while((i<size3)&&(i<size2)&&(k<size3))
	{
		curMin=ABSDIFF(pArray1[i],pArray2[j])+
			   ABSDIFF(pArray2[j],pArray3[k])+
			   ABSDIFF(pArray3[k],pArray1[i]);
		if(min>curMin)
		{
			min=curMin;
			index1=i;
			index2=j;
			index3=k;
		}
		if((pArray1[i]<pArray2[j])&&(pArray1[i]<pArray3[k]))
		{
			i++;
		}
		else if((pArray2[j]<pArray1[i])&&(pArray2[j]<pArray3[k]))
		{
			j++;
		}
		else
		{
			k++;
		}
	}
	printf("Min Dist:[%d]\n", min);
	printf("Elements:[%d][%d][%d]", pArray1[index1],pArray2[index2],pArray3[index3]);
}
int main()
{
	int A[] = {4, 10, 15, 20};
	int B[] = {1, 13, 29};
	int C[] = {5, 14, 28};
	findMinDst(A,B,C,sizeof(A)/sizeof(int),sizeof(A)/sizeof(int),sizeof(A)/sizeof(int));
	return 0;
}

- Anonymous July 02, 2011 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 votes

nice
but i think there was a typo, i corrected below

while((i<size3)&&(j<size2)&&(k<size3))

- omnipresent July 02, 2011 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

:) i also typed wrong. it should be like

while((i<size1)&&(j<size2)&&(k<size3))

- omnipresent July 02, 2011 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

But, I wonder if you read the problem statement carefully. Scroll up the page, and read it. It asks for N arrays. So, your assumption of 3 arrays is invalid.

- anon July 02, 2011 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

And exponential speed degrade should also be taken into account. Try running your example with 50 arrays and let us know the timing :)

- Denis July 02, 2011 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

Do u people need spoon feeding??
I did read the question carefully.
Solution can be generalized by using min heap.
Also some optimization can be done in computing target value for each iteration do u want me tell u what optimization??
:)

- @Denis and anon July 02, 2011 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

While your code might work, I would call it brute force solution as its running time is exponential and it doesn't really scales with the grows of the N.

- Dan July 02, 2011 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

This code may be right if generalized:
S[N][M] is N sorted arrays
p[N] is current point

each time
val=2*max{s[i][p[i]}-min{s[i][p[i]]},compare it with the min val.
and add the minimal pointer p[i]

- vabc3h September 12, 2012 | Flag

Comment hidden because of low score. Click to expand.

of 1 vote

Let we've 4 arrays, and the solution is {a,b,c,d}. Let, a >= b >= c >= d. 
Now, the expression for sum of the difference = abs(a-b) + abs(b-c) + abs(c-d) + abs(d-a). 
We can write it as = (a-b) + (b-c) + (c-d) + (a-d). 
First 3 components equates to (a-d). 
Hence, the expression becomes 2*(a-d).

Hope this helps to understand how the two problems are related.

- buried.shopno July 01, 2011 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 votes

I assume the solution to the transformed problem has been discussed many times here. It's the same technique that is used for k-way merge (discussed in Cormen's Algorithm book). It needs maintaining a min-heap of current smallest elements of each array. At each step, the smallest one is popped, and the next larger one of that array is pushed on heap.

- buried.shopno July 01, 2011 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

@buried.shopno: Your thinking is not correct.
lets a=2, b=4, c=8, d=6
so |a-b|+|b-c|+|c-d|+|d-a|
= |2-4|+|4-8|+|8-6|+|6-2|
= 2+4+2+4 = 12
but 2*|a-d|
2*|2-6|= 2*4 = 8

- Anonymous July 01, 2011 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

You did a mistake here. I've mentioned that a >= b >= c >= d, so the way of your mapping is wrong. My solution gives 2*(max elem - min elem) = 2 *(8-2) = 12.

- buried.shopno July 01, 2011 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

I think wat u should minimise is abs(a-b) + abs(b-c) +abs(c-d) + abs(d-a)+abs(a-c) + abs(b-d) and not just what u mentioned...i mean nC2 sets

- Anonymous July 01, 2011 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

We only need to consider nC2 differences i.e. only abs(a-b) + abs(b-c) + abs(c-d) + abs(d-a)

- Original Poster July 02, 2011 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

@OP: Your statement is contradicting. You are saying to consider nC2 difference. For n = 3, it's 3; for n = 4, it's 6. However, you said to consider 4 differences - as per my initial understanding.

- buried.shopno July 02, 2011 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

I am sorry, I meant N differences. We only need to consider N differences.

- Original Poster July 02, 2011 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

@buried.shopno: consider 4 numbers as a=0, b=2, c=1 and d=3 from 4 arrays respectively (smallest one among non-seen ones from each array).
How come you can sort these four values because that changes the result in this case?

1.abs(a-b) + abs(b-c) + abs(c-d) + abs(d-a) = 2+1+2+3 = 8
2.How to use your funda (a >= b >= c >= d) here?

- monish.gupta1 November 06, 2011 | Flag

Comment hidden because of low score. Click to expand.

of 0 vote

Is the arrays have to be sorted ?

- Anonymous June 30, 2011 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 vote

@buried.shopno: your comment doesnt really help anything. If possible, can you please elaborate more?

- xx July 01, 2011 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 vote

@buried.shopno: your comment doesnt really help anything. If possible, can you please elaborate more?

- xx July 01, 2011 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 vote

Well, I think it needs clarification (from original poster) if we really needs NC2 differences, or N differences. I'd work, and let know if find some efficient way for the NC2 differences.

- buried.shopno July 01, 2011 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 votes

@buried.shopno: Could you please let me know which book to refer to understand Karp's NP-completeness

- A July 01, 2011 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

Jeff Erickson (UIUC professor)'s lecture note is a good one. You might find other universities sites, specially on MIT OCW resources.

- buried.shopno July 01, 2011 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

Thank you buried

- A July 01, 2011 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

@buried.shopno - How do you make sure that a > b > c > d. For e.g., in my sample input, a > b > c doesn't hold.

- Original Poster July 02, 2011 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

@OP: you need not to maintain any ordering of elements. You only need to know the max and min of selected n items (one from each array). Your goal is to minimize the difference between max and min element. To understand why this is same to original problem, carefully follow my previous responses.

- buried.shopno July 02, 2011 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

@buried.shopno - Okay, so if I understood correctly, you are suggesting to find 4 pairs such that abs(a-b), abs(b-c), abs(c-d) and abs(d-a) is minimum. Now in those 4 pairs, we should find the pair such that the first value in the pair is smaller than the other values in the pairs and the second value is greater than other values in the pairs. For e.g. if we find (a,b) to be the pair complying with the above condition, then a < c < d < b or a < d < c < b

Please comment if this is the solution that you are suggesting.

- OP July 02, 2011 | Flag

Comment hidden because of low score. Click to expand.

of 0 vote

what if?
Let's merge all arrays into single one, with marking items. Merge sort will do that.

The next step would be to run thru big array finding minimal window (not sure about correct problem name). This is actually is O(sum(lengths)) hard.

The proof: suppose we found a solution with min = m0, lets prove that we need to move left pointer - by examining movement of right pointer instead we'd increase max(differences) => we need bla bla bla... seems pretty clear i think.

- Anonymous July 01, 2011 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 vote

So, what about the problem is exactly? Seems it has 2 variations - one answered by buried, how about the other variation? Which question was asked in the interview?

- anon July 01, 2011 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 vote

@Anonymous Finding window with minimum sum would not work as there's no assumption as to what elements come from what array. I.e. your window may be the whole array.

If we cannot make an assumption about a >= b >= c >= d, and we must find minimum elements for each pair of arrays as stated, I can think of something like:

Assuming arrays are sorted:
1. for each required pair do the merge keeping track of which element came from which array (simple bitmap would do that). This is O(n+m)
2. In one pass compare adjacent elements, making sure they are from different arrays, if not -> next, while comparing keep track of minimum and element indexes. This step is O(n)

3. Repeat for each pair.

Another idea is to use BST, but it is somewhat more complicated...

- Dan July 01, 2011 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 votes

I realized this is incomplete solution ... as comparing a and b and c may return different elements in b. Thinking of something complete... :)

- Dan July 01, 2011 | Flag

Comment hidden because of low score. Click to expand.

of 0 vote

As the problem statement does not say "sum of pairwise differences", rather mentioned "sum of their differences" - it implies buried.shopno's approach is right here.

- anonymous July 01, 2011 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 votes

Lol what do u understand by sum of their differences??i dont get any

- Anonymous July 01, 2011 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

Well, it has been explained using an example in the question.

- A July 02, 2011 | Flag

Comment hidden because of low score. Click to expand.

of 0 vote

You could model the problem as a graph and use Djikstra's algorithm, but the fact that you have to go back to the same number in A means that you have to replicate the nodes of B and C |A| times.. which actually wouldn't be difficult with some light bookkeeping. You would only have to know which edge is outgoing from the selected C node, given the choice of node in A..

- magritte128 July 01, 2011 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 votes

Never mind, I assumed we had to abide by the order the sets were given in..

- magritte128 July 01, 2011 | Flag

Comment hidden because of low score. Click to expand.

of 0 vote

Here's the solution. It assumes that:
if
|a-b| + |b-c| + |c-a| ==> min and |x| >= 0,
then
|a-b| is min of possible |A[i]-B[j]|, |b-c| is min of possible |B[i]-C[j]|, |c-a| is min of possible |C[i]-A[j]|

Solution itself:
Sort arrays if they aren't. Find best |a1[i]-a2[j]| value for each pair of arrays. In the original example they would be:
A (15)-->B (13)
B (13)-->C (14) AND B(29)-->C(28)
C (14)-->A (15)

We can verify if we found a "chain" of values, but for clarity let's omit this (it needs research whether it would save time, but it can) step.

Now we have 4 chains starts:
Chain 1) A(15)->B(13) which we should continue, finding appropriate minimum distance from B(13) till array C - will be 14 and then from C(14) till A automatically will be 15 because chain starts at it.
Chain 2) B(13)->C(14). Continue: C(14)->A(15) and then automatically to chain start at B(13)
Chain 3) B(29)-->C(28). COntinue: C(28)->A(20) and then automatically to chain start at B(29)
Chain 4) C(14)->A(15). Coninute: A(15)->B(13) and then automatically to chain start at C(14)

Now calculate results for 3 chains:
Chains 1), 2) and 4) result is 3
Chain 3) result is 18

Now choose the best chains (there can be several with the same sum) and filter out duplicates if you didn't do this after finding best distances between array pairs.

If I estimated correctly, algorithm is of O(n^2) speed, where n is max array length (if number of arrays is relatively small).

- Denis July 02, 2011 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 votes

Here's the solution. It assumes that:
if
|a-b| + |b-c| + |c-a| ==> min and |x| >= 0,
then
|a-b| is min of possible |A[i]-B[j]|, |b-c| is min of possible |B[i]-C[j]|, |c-a| is min of possible |C[i]-A[j]|

I don't think you can assume that. If you assume that, the problem becomes much much simpler and it should be solvable in O(n) time given the arrays are already sorted, which they are.

- A July 02, 2011 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

- Denis July 02, 2011 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

A,
You've probably missed that we don't simply find minimum between first two arrays and just go down by the chain. What is done due to assumption is that we're searching for a minimum distance between EACH pair of arrays and build resulting chains after that for each pair. Final step is to find the best chain between those m chains. m = quantity of arrays.

- Denis July 02, 2011 | Flag

Comment hidden because of low score. Click to expand.

of 0 vote

Actually my initial algorithm is wrong (though the assumption regarding f(x,y,z) is correct). Correct one is (it is also of O(n^2)):
For each value in A:
1) Find best entry in B.
2) Build chain B->C...->A finding best entry in each next array and automatically making closure to A arrays in the last one
3) Choose best chain

In our example:
A = {4, 10, 15, 20}
B = {1, 13, 29}
C = {5, 14, 28}

For values in A find best entries in B:
1) 4-1-5-4, value 8
2) 10-13-14-10, value 8
3) 15-13-14-15, value 3
4) 20-29-28-20, value 18
Choose the best variant: 3)

- Denis July 02, 2011 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 votes

Please ignore this post. Original solution is correct :) Find best distances between arrays pairs: A-B, B-C, C-A and build best distance chains from end-nodes of these best-distance-pairs.

- Denis July 02, 2011 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

I do not think that this is right.

For example:
4 10 15
1 13 29
5 20 28

what you will get is
15-13
29-28
5 - 4

However, the three number should be 4 1 5 while 1 will not come up.

- Anonymous July 10, 2011 | Flag

Comment hidden because of low score. Click to expand.

of 0 vote

@anonymous: you are simply an idiot! Who had asked you to post code here for others? Learn first how to solve a problem "accurately" and "efficiently". There are many smart people here on careercup, who don't come here to spoon feed others; rather enrich themselves & help others sharing and getting knowledge. you stupid asshole just keep your mouth shut up. And next time, before posting a solution, read (and again read) the problem. Solve the problem for yourself first. If you can't solve it, ask for help. If you can (i doubt), give your idea. Everyone should be able to code - when the idea is properly grasped.

- anon July 02, 2011 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 votes

hahahahahaha, f_u_c_k u :)

- Anonymous July 02, 2011 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

lolz...... you fuck me... and i fuck ur mum and ur dad is watching this... it's an awesome threesome! do u wanna join dude to fuck ur mum?

- anon July 02, 2011 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

hahaha, man when i will fuck u after that u wont be able to do anything else :)
this is what happened to your entire family 7 generation ago. I fucked your ancestors and now u can c your entire family need not to fuck each other for infants, my sperm is still contributing automatically :) MY SON .... MY BABY ....
realized the scaleability of my fucking???
ok, try to find of complexity of my efforts in N where N is the number of members in your family till now, including u :)
hahahahahahaha............

- Anonymous July 03, 2011 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

please show some maturity and stop this rubbish.. don't spoil this portal, plzzz

- CC user July 03, 2011 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

What's happening on such professional site? CC should impose every user to be registered to post. It should also monitor any abusive behavior. I expected it to be a portal like stackoverflow and wu riddle.

- Hatts Off! July 03, 2011 | Flag

Comment hidden because of low score. Click to expand.

of 0 vote

is it nC2 pairs or n pairs??
can any of above 41 posts tell me??

- Anonymous July 02, 2011 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 votes

Its only N pairs.

- Original Poster July 02, 2011 | Flag

Comment hidden because of low score. Click to expand.

of 0 vote

My IDEA is that better we can write a program like:

min = find_min(Array a,b) + find_min(Array b,c) + find_array(c,a)

Write the function like:
find_min(a,sizeofA,b,sizeofB)
{
here find the min of a and b;
retrun it;
}

I hope this helps :)

- Rajesh Manem July 18, 2011 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 votes

Of course dear this will surely help...but will you plz write the function of find_min();..????
and how will you return the array elements which are responsible for min(a,b) from there..?

- abh007 July 26, 2012 | Flag

Comment hidden because of low score. Click to expand.

of 0 vote

#include<stdio.h>
#include<math.h>
void minsum(int a[],int b[],int c[],int x,int y,int z){
int i,j,k,i1,j1,k1,max,max1;
max=((abs(a[0]-b[0]))+(abs(b[0]-c[0]))+(abs(c[0]-a[0])));
for(i=0;i<x;i++)
{
for(j=0;j<y;j++)
{
for(k=0;k<z;k++)
{
max1=((abs(a[i]-b[j]))+(abs(b[j]-c[k]))+(abs(c[k]-a[i])));
if(max1<max)
{
max=max1;
i1=i;
j1=j;
k1=k;
}
}
}
}
printf("%d %d %d",a[i1],b[j1],c[k1]);
}
void main(){
int a[]={4,10,15,20};
int b[]={1,13,29};
int c[]={5,14,28};
minsum(a,b,c,4,3,3);
}

- saurav July 21, 2011 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 votes

@Saurav, what abt n arrays?? this is Brute force approach...for n arrays it wil b O(n^n) ..nt feasible

- Prince July 22, 2011 | Flag

Comment hidden because of low score. Click to expand.

of 0 vote

Assuming arrays are sorted
1) Initialize i,j an k to 0. i, j and k are indices for the three arrays
2) find the sum of differences of elements at indices i, j and k. If this is smaller than current minimum, update current minimum
3) Find the smallest of the elements present at index i, j, k and increment that index if you are not going out of bound
4) Goto step 2, until all three arrays are exhausted
5) Return stored minimum

- IntwPrep.MS December 30, 2013 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 vote

public class CoverWithMinimumWindowSize {

	protected static class Element implements Comparator<Element>{
		int value;
		int listIndex;
		int positionInList;
		
		Element(){
			
		}
		
		Element(int value, int listIndex,int positionInList){
			this.value = value;
			this.listIndex = listIndex;
			this.positionInList = positionInList;
		}
		
		public int compare(Element e1, Element e2){
			return e1.value-e2.value;
		}
	}
	
	public static int findCoverWithMinimumWindowSize(List<int[]> lists) throws Exception{
		int numLists = lists.size();
		
		if(numLists == 1){
			throw new Exception("Only one list");
		}
		
		int index[] = new int[numLists];
		int listSize[] = new int[numLists];
		for(int i = 0; i < numLists; i++){
			int[] listi = lists.get(i);
			Arrays.sort(listi);
			listSize[i] = listi.length;
			if(listSize[i] == 0){
				throw new Exception("Empty list " + i);
			}
		}
		
		boolean done=false;
		Arrays.fill(index, 0);
		
		int windowStart,windowEnd;
		PriorityQueue<Element> minHeap = new PriorityQueue<Element>(numLists,new Element(0,0,0));
		PriorityQueue<Element> maxHeap = new PriorityQueue<Element>(numLists,Collections.reverseOrder(new Element(0,0,0)));
		for(int i = 0; i < numLists; i++){
			int[] listi = lists.get(i);
			Element elem = new Element(listi[0],i,0);
			minHeap.add(elem);
			maxHeap.add(elem);
		}
		
		Element minElement = minHeap.peek();
		Element maxElement = maxHeap.peek();
		int maxWindowSize = -1;
		
		while(!done){
			minElement = minHeap.peek();
			windowStart = minElement.value;
			int minElementSource = minElement.listIndex;
			int positionInList = minElement.positionInList;
			
			maxElement = maxHeap.peek();
			windowEnd = maxElement.value;
			int windowSize=windowEnd-windowStart;
			if(maxWindowSize == -1){
				maxWindowSize = windowSize;
			} else if(windowSize < maxWindowSize){
				maxWindowSize = windowSize;
			}
			
			minHeap.poll();
			maxHeap.remove(minElement);
			
			if(positionInList+1 < listSize[minElementSource]){
				int[] listi = lists.get(minElementSource);
				int value = listi[positionInList+1];
				Element elem = new Element(value,minElementSource,positionInList+1);
				minHeap.add(elem);
				maxHeap.add(elem);
				positionInList++;
			} else {
				done = true;
			}
		}

		return maxWindowSize;
	}
	
	public static void main(String[] args) throws Exception{
		int list1[] = {4, 10, 15, 20};
		int list2[] = {1, 13, 29};
		int list3[] = {5, 14, 28};

		List<int[]> testcase = new LinkedList<int[]>();
		testcase.add(list1);
		testcase.add(list2);
		testcase.add(list3);
				
		int result = findCoverWithMinimumWindowSize(testcase);
		
		System.out.println(result);
	}
}

- just_do_it August 05, 2014 | Flag Reply

Comment hidden because of low score. Click to expand.

-1

of 1 vote

1) Find the avg of combined array(i.e (4+10+15+20+1+13..+29)/10 = 13.9)
2) Using binary search find two elements from each array,1st element larger than avg ,2nd element smaller than average.
3) so at max we ll need to find the "min" of combination of the above 2+2+2 numbers
So complexity ll be O(n) i.e for finding avg.

Correct me if am wrong .

- msankith October 26, 2011 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 votes

its a wrong approach...try with different data set...
we can't just take the avg. and find the possible elements from array..

- abh007 July 26, 2012 | Flag

CareerCup

Microsoft Interview Question for Software Engineer / Developers

Books

Videos

Resume Review

Mock Interviews