Find the character that has ma

Amazon Interview Question for Software Engineer / Developers

0

of 0 votes

25
Answers
Find the character that has maximum frequency in an array of characters. If n is small and dealing with unicode char-space, extra space for hashtable is overhead, can you avoid?
- rahulm January 06, 2012 in United States for RCX | Report Duplicate | Flag | PURGE
Amazon Software Engineer / Developer

Email me when people comment.

An error occurred in subscribing you.

Team: RCX
Country: United States
Interview Type: In-Person

More Questions from This Interview

Email me when people comment.

An error occurred in subscribing you.

Comment hidden because of low score. Click to expand.

of 1 vote

1) Create a new array with length 26. Initialize all the elements with 0. Let's call it int[] countArray
2) Iterate the given char array and if char is 'a' then increment the value at index 0, if it is 'b' then increment the value at index '1' and so on... When reading the given array is finished iterate the countArray and print the character corresponding to highest index value

- Msingh January 06, 2012 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 votes

agree w/ this answer.

- guest January 06, 2012 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

but unicode character may consist of not only english alphabets!!
Do you think an array of length 26 will work??

- Anonymous January 07, 2012 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

Of course it won't work. You need an array of length 2^16-1. Whether or not this is acceptable depends on the situation.

- eugene.yarovoi January 08, 2012 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

This is a hash table solution...

- blackfedora January 11, 2012 | Flag

Comment hidden because of low score. Click to expand.

of 1 vote

1) Initially sort the array
After sorting the following code can solve this

main()
{
char a[]={a,a,a,a,b,b,c,c,d,d,e,e,e}; // Can be sorted in O(nlogn)
int max=0,curr=1,i;
char f;
for(i=0;i<9;i++)
{
if(a[i]==a[i+1])
{
curr++;
}
else
{
if(curr>max)
{
max=curr;
curr=1;
f=a[i];
}
}
}
printf("The max occuring character is %c",f);
}

- Pankaj January 07, 2012 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 votes

This assumes you can destroy the original array, which you should ask the interviewer about.

- eugene.yarovoi January 08, 2012 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

Or you can just make another array. We were told n is small; 2n space is probably not an issue.

- blackfedora January 11, 2012 | Flag

Comment hidden because of low score. Click to expand.

of 0 vote

1. Sort the array - O(n lg n)

2. Do a linear scan over the array. For each character, store the frequency in the array itself. So for example:

a,b,b,b,c,c,c,c,c,d,e,e,f,f,f will result in 1, b,3, c,5, e,2, f,3

3. Now scan the array once up to n-1 and:

a. If arr[i] is a number, i+1     // this means that this character had a frequency of 1
          b. If arr[i] is a char && arr[i+1] > maxFrequency, update maxFrequency, i+2
          c. else i+1;

Code assumes that at least one character repeats more than once.

- Anonymous January 06, 2012 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 votes

Small correction. In the else case

else i+2

}

- Anonymous January 06, 2012 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

Also, if n is known to be small, we can do Counting Sort and make the whole program run in O(n) time

- Anonymous January 06, 2012 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

Excellent stuff dude. Might I point out that when you store maxFrequency, you will also need to store the character with that maxFrequency

- Anonymous January 06, 2012 | Flag

Comment hidden because of low score. Click to expand.

of 0 vote

1) Sort the array - o(nlogn)
2) Do a linear scan , keep a max count and max index variable and record the max occurring character and its index.

- Gloryhunter January 06, 2012 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 vote

Sorting the array has overhead: time for sure + space, if you are be allowed to mess up the original array!

For most "fast" sorts O(n log n) has a big constant multiplier for small Ns. So it might or might not be feasible. For small Ns a simple (e.g. insertion) sort with worse asymptotic behaviour could be better.

- Selmeczy, Péter January 06, 2012 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 votes

question says uni-code character (not alphabetical) and no use of hash table

- S January 06, 2012 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

question says uni-code character (not alphabetical) and no use of hash table

- S January 06, 2012 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

Agreed, if N is small, insertion sort should probably be used. Depends on how small we're talking about.

- eugene.yarovoi January 08, 2012 | Flag

Comment hidden because of low score. Click to expand.

of 0 vote

Can we use first part of the distribution counting here?
We can create a distribution table for the unicode characters i.e. D[0..255] and store the frequency of each character in D. Then, find the largest number in D. I agree this will be similar to Hash table, but what if n is very small?? This would be most efficient in such cases since its efficiency is linear.

- Santosh January 17, 2012 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 vote

Iterate array for the element and compare.. whenever comparison done , shift the values to left so as to avoid comparison for other disticnt element:
Forexample: a, b, a, c, a, b
1st iteration: comparing a , will change the array as: a ,b,c,b . Store the max(a) count to be in some temp variable.
similar for b iteration: a, b, c . Now check the max of count and set the variables as required.
By this we can avoid many comaprison. However , in worst case, such as a, b ,c ,d, e
complexity will be O(n2)

- Anonymous January 17, 2012 | Flag Reply

Comment hidden because of low score. Click to expand.

-1

of 1 vote

1. Make a sorted copy of the array
2. Run longest common sub-sequence to get the most repeated character

That's one way I can think of avoiding hash tables, but then sorted copy of array will take O(n) space.

- Anonymous January 06, 2012 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 votes

This is actually a very unique way of finding max character! However, it will take up extra space, which kinda defeats the purpose of the question.

- Anonymous January 06, 2012 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

This wouldn't work because the longest common subsequence could include more than 1 type of character.

- eugene.yarovoi January 08, 2012 | Flag

Comment hidden because of low score. Click to expand.

-1

of 1 vote

As this is dealing with uni code character set, it will have 16 bits for each character, so can we store the count in 8 bits and character in 8 bits?

- Anonymous January 06, 2012 | Flag Reply

Comment hidden because of low score. Click to expand.

-1

of 1 vote

@rahulm, [Out of topic] how many rounds of interview you faced?

- Anonymous January 06, 2012 | Flag Reply

Comment hidden because of low score. Click to expand.

-2

of 2 vote

Hash tables are quite generic stuff. You can use a hash table but the hash function should be a bit more sophisticated than the character value (with Unicode this is 16 bits, resulting in a potentially huge hash table). A hash functions to add up/XORing the two bytes of the unicode char will do, size of hash table is reasonable (256 items). This case the items are linked lists of (char, count) pairs.

- Selmeczy, Péter January 06, 2012 | Flag Reply

CareerCup

Amazon Interview Question for Software Engineer / Developers

Books

Videos

Resume Review

Mock Interviews