Given an array of m+n elements

Amazon Interview Question for Software Engineer / Developers

0

of 0 votes

43
Answers
Given an array of m+n elements in the range 1 to n ie(m elements are repeated). Find the duplicate elements.
Time Complexity: O(m+n)
Space Complexity: O(m)
- amazingQ August 11, 2009 | Report Duplicate | Flag | PURGE
Amazon Software Engineer / Developer

Email me when people comment.

An error occurred in subscribing you.

Email me when people comment.

An error occurred in subscribing you.

Comment hidden because of low score. Click to expand.

of 0 vote

WE CAN have an array of n elements and then check

func(int Original[])
{
int arr[n];
for (i=0;i<n+m;i++)
{
if (arr[Original[i]]>=0)
printf ("\n Repeated element is %d",Original[i])
else
arr[Original[i]]=1;
}

Comments and more efficient codes are welcome

- CUNOMAD August 11, 2009 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 vote

Your code Space Complexity is: O(n) not O(m)

- cutepig August 11, 2009 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 vote

Since we know there are 1-n elements, we can sort the array in O(n). The first n elements will be the unique values and n to m will be the duplicates. This is O(n+m) && no extra space required.

for (int i = 0; i < array.Length; i++)
                {
                    if (array[i] != i + 1)
                    {
                        int t = array[array[i]-1];
                        array[array[i]-1] = array[i];
                        array[i] = t;
                    }
                }

- Anonymous August 11, 2009 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 votes

I think you should do a while loop instead of just an if

- Anonymous August 11, 2009 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

No we dont need a while, all we need to do is put the elements at its corresponding index and the duplicates will be pushed to the end...So the first 'n' will the unique values and the next 'm' will be the duplicates.

- Anonymous August 13, 2009 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

This looks right to me.

Here is an attempted proof:

1) Notice that if some element is in the right place, it will still hold the same value.

2) If a[k] != k+1 for some k, then there is some smallest j such that a[j] = k+1. When the loop hits the processing for j, it will do a[k] = k+1, putting k+1 in the right place.

Perhaps the problem was to treat the array as read-only?
(Hence the non constant space requirement).

- LOLer August 19, 2009 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

2,1,3,4,5,5,4,7,10,8,6,5,9,2
this will not work

- Anonymous May 26, 2010 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

Either you use an extra array, which is against the O(m) space constraint, or you will have to overwrite the original array, which the interviewer may not allow.

- Sunny December 22, 2012 | Flag

Comment hidden because of low score. Click to expand.

of 0 vote

here is the corrected version

func(int Original[])
{
int arr[m];
for (i=0;i<n+m;i++)
{
if (arr[Original[i]]>=0)
printf ("\n Repeated element is %d",Original[i])
else
arr[Original[i]]=1;
}

- CUNOMAD August 11, 2009 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 votes

You declared arr[m] in the function and accessed it without any operation on it.. This would straight away crash the code..

- Anonymous September 08, 2009 | Flag

Comment hidden because of low score. Click to expand.

of 0 vote

since n > m, so, for some Original[i] = n -1, arr[n-1] would be out of arr! wouldn't work.

- Anonymous August 12, 2009 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 vote

none of above solutions will work according to me or else if they are working could you please explain how?

- Anonymous August 12, 2009 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 vote

I try it! java ~

public fun(int[] a){
		
		BitSet bs = new BitSet(100);
		for(int i = 0;i<a.length;i++){
			if(bs.get(x[i])){
				System.out.println(x[i]);
			}
			bs.set(x[i], true);
		}
             }

- Anonymous August 12, 2009 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 vote

public fun(int[] a){
		
		BitSet bs = new BitSet(100);
		for(int i = 0;i<a.length;i++){
			if(bs.get(a[i])){
				System.out.println(x[i]);
			}
			bs.set(a[i], true);
		}
             }

- Anonymous August 12, 2009 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 vote

int[] findDuplicates(int[] a , int n, int m){ //size of a is n+m
int dup[] = new int[m];
int i =0 , j= 0;
for(;i<n+m;i++){

if(a[a[i]-1] > (n+1)){ //a[i] is occurred already i.e duplicate
dup[j++] = a[i]
}else{ //add (n+1) when u encounter an element first time;
a[a[i]-1] += (n+1);
}
}
return dup;
}

- Ravi August 14, 2009 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 vote

Please people, do write a word or two about what the approach is rather than just dumping the code !
Case if space complexity of O(n) is straight fwd. Only ....

- knap August 14, 2009 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 vote

If this array is a sorted one, it is possible.
If Not, a hash table (max m items) can be used, or use the m space in the array to simulate a hashtable operations.

- junma August 14, 2009 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 vote

3rd Post by Anonymous is the correct one.

All we need to do is put the elements at its corresponding index and the duplicates will be pushed to the end...So the first 'n' will the unique values and the next 'm' will be the duplicates.

- pk August 17, 2009 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 votes

space complexity is just to create confusion
OR
may be to copy the last m elements of original (m+n) array into a new array.

- pk August 17, 2009 | Flag

Comment hidden because of low score. Click to expand.

of 0 vote

the solution/approach by Anonymous is the correct and rest Junk!

- donkeyMan August 18, 2009 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 vote

Algorithm:
1. Move Left to Right, Swap element if not at right position.
2. Move Right to Left, Swap element if not at right position.
3. Take last m position as result.

Let Input is 5 3 1 2 2 4 5
Array
Left to Right
5 3 1 2 2 4 5
2 3 1 2 5 4 5
2 1 3 2 5 4 5
2 2 3 1 5 4 5
2 2 3 1 5 4 5
2 2 3 4 5 1 5
2 2 3 4 5 1 5

Let Input is 2 2 3 4 5 1 5
Right to Left
2 2 3 4 5 1 5
2 2 3 4 5 1 5
1 2 3 4 5 2 5
1 2 3 4 5 2 5
1 2 3 4 5 2 5
1 2 3 4 5 2 5
1 2 3 4 5 2 5
1 2 3 4 5 2 5

Result 2, 5

- Anonymous August 18, 2009 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 vote

- Deepak Agarwal August 18, 2009 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 vote

- Deepak Agarwal August 18, 2009 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 votes

Verg good solution. Seems perfect!!!!

- abcde August 19, 2009 | Flag

Comment hidden because of low score. Click to expand.

of 0 vote

any idea abt deepaks solution it seems correct to me

- CUNOMAD August 19, 2009 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 vote

deepak's solution is OK.
in the first shuffle [left to right] you partially sort the numbers in increasing order and the second step is the crucial one[right to left], in this step you actually put the numbers 1-n in their actual position pushing the duplicates to the end and that gives the required result.

- maddy August 30, 2009 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 vote

This can be done in O(n+m) time without using any space .
here is the pseudo code :-

for i->1 to m+n {
  a[a[i]%n]+=n
 } 

for i->1 to n {
  print "number i occurs "  a[i]/n "times
}

for example :-
take n=4
input :-
2,1,3,3,4,1,2

when i=1
2,5,3,3,4,1,2
i=2
6,5,3,3,4,1,2
so on final array :-
10,9,11,7,4,1,2

hence 1 occurs 10/4 = 2 times
'' 2 occurs 9/2=2 times
'' 3 occurs 11/4=2 times
4 occurs 7/4 = 1 times

- wolverine September 14, 2009 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 vote

I think the following algorithm is incorrect

Algorithm:
1. Move Left to Right, Swap element if not at right position.
2. Move Right to Left, Swap element if not at right position.
3. Take last m position as result.

Counter Example:

Left->Right

5 4 6 6 2 1 3
2 4 6 6 5 1 3
2 6 6 4 5 1 3
2 6 1 4 5 6 3
2 6 3 4 5 6 1

Right->Left
1 6 3 4 5 6 2

The output is 2, which is wrong.

- xslpeter October 06, 2009 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 votes

lemme know if this is right

Suppose we have the following numbers

524568917184

I start at index 1....

a[1] has 5 so i swap a[1] and a[5]
Repeat till a[1] has 1..then move forward....
Keep doing this till u reach n

all the remaining elements will be the m repeated ones...

Time Complexity O(N + M)
Space = 1

Is this Ok....

- Kiran Digumarti October 09, 2009 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

@xslpeter, excellent counter example. That approach does not work.

- rrs February 01, 2011 | Flag

Comment hidden because of low score. Click to expand.

of 0 vote

524638917184
324658917184
423658917184
623458917184
823456917184
123456987184
123456789184

1, 8 , 4 are repeated...:)
Looks good to me..

- Kiran Digumarti October 09, 2009 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 vote

Sorry dowsnt work....

- Kiran Digumarti October 09, 2009 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 vote

512464389817
612454389817
412456389817
4 and 4 ....Mov ahead
142456389817
again 4 and 4 move ahead
124456389817
again 4 and 4 move ahead..twice
123456489817
123456789814 814 are repeated....

will come up with an algp but this works

- Kiran Digumarti October 09, 2009 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 vote

public static void getRepeatedVariables(int [] array, int n) {
int m = array.length - n;
int i = 0;
while(i < array.length) {

swap(array, i, array[i]);
if(array[i] == i)
i++;

if(array[i] == array[array[i]])
i++;

printArray(array);
}

}

- kiran Digumarti October 09, 2009 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 vote

int i = 1;
while (i <= N + M)
{
if (arr[arr[i]]!=arr[i]) swap(arr[arr[i]],arr[i]);
else ++i;
}

- Anonymous November 21, 2009 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 votes

the last M numbers is the duplicated numbers

- Anonymous November 21, 2009 | Flag

Comment hidden because of low score. Click to expand.

of 0 vote

I start my solution by virtually partitioning the array into n followed by m elements. The last m elements will give the duplicates, while the first n elements will contain partially sorted, in place, first n elements.

for(i=m; i<m+n; i++)
{
curr = a[i];

/* traverse as long as we have no duplicate setting each element n to its position at a[n]*/
while(a[curr] != curr)
{
temp = a[curr];
a[curr] = curr;
curr = temp;
}

/* set the duplicate found to a[i] and move on */
a[i] = curr;
}

At the end, many of the first n elements will be sorted in their correct spot such that a[n]==n. Some will remain unsorted, but they will still be no duplicates, just shuffled away from their proper location because they were never hit by the bubbling process. The m elements at the end are constructed to be duplicates.

Note, the complexity is O(m+n). Even though it appears that for each of the m elements we could do n operations, however, it is an amortized analysis because we only do at most a complete pass across the n partition while processing all the m duplicates.

Thanks!

- catalin4ever November 27, 2009 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 vote

We know each number between 1-n must appear once, with one number appearing m times. We know the sum of the numbers between 1-n sum(n) = ((n+1)(n))/2. The calculated sum will be real_sum(arr) = ((n+1)(n))/2 + (m-1)(x) for some x repeated m-1 times. We can therefore calculate the value that is repeated through simple math. num = (sum - real)/(m - 1). n will be the maximum value in the array and sum will be the sum of the elements in the array, both done in one loop O(n+m). Calculation is O(1) and we're done.

int n = 0;
		int s = 0;
		for (int i = 0; i < arr.length; i++) {
            n = (arr[i] > n ? arr[i] : n);
            s += arr[i];
        }
		
		int e = ((n+1)*n)/2;
		int num = (s - e) / (arr.length - n);

- Prashanth November 30, 2009 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 vote

Given an array A that is assumed to start with index 1.

Consider A[1]. Swap A[1] with A[A[1]] if A[A[1] != A[1],
Only when you can cannot swap which is when A[A[1] = A[1], you proceed to next index in the array A[2] and so on and so forth. This means you have found the duplicate.

Once you are done with the array, you would be left with A[1..n] unique elements and A[n+1..m] duplicates.

- geek January 07, 2010 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 votes

Can u verify your solution works in O(m+n) time?

- V February 13, 2010 | Flag

Comment hidden because of low score. Click to expand.

of 0 vote

#include<stdio.h>

int main ()
{
int n = 9,m = 12;
int arr[12] = {9, 9, 9, 6, 5, 4, 3, 2, 1, 7, 8, 9};
int i,j,temp,curr;

for(i=0;i<m;i++) {
printf("%d : ",arr[i]); }

for(i=0; i<m; i++) 
{
curr = arr[i]; 
if((i+1) != curr)
{
temp = arr[curr-1];
arr[curr-1] = arr[i];
arr[i] = temp;
}
}

printf("\n");
for(i=n;i<m;i++)
printf("%d : ",arr[i]);

return 0;
}

- HMMM March 20, 2010 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 vote

Above alg failed at this case
int n = 9,m = 10;
int arr[10] = {6, 9, 5, 1, 4, 8, 3, 2, 7, 7};

6 : 9 : 5 : 1 : 4 : 8 : 3 : 2 : 7 : 7 :
4 :

- Anonymous February 20, 2011 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 vote

How come no one mentioned the "negate" approach? For each a[i], which we know is between 1 and n, check if a[a[i]] is negative. If not, set it to negative to indicate that we just seen a[i]. If it is negative, we know a[i] is a repeated number. Use a separate array to keep the m repeated elements, hence the O(m) space constraint. At the end, traverse the array and undo the negations to restore the array.

- Sunny December 22, 2012 | Flag Reply

CareerCup

Amazon Interview Question for Software Engineer / Developers

Books

Videos

Resume Review

Mock Interviews