June 28, 2013

Beautiful, unique strings

There may come a time in your career when you will have to solve the issue of finding the most beautiful and unique string from an input, or else!…OK, I admit the probability of this is very low, but if you are interested in solving a brain-teasing problem, here is you’re chance.

Problem:
1. String s is called unique if all the characters of s are different.
2. String s2 is producible from string s1, if we can remove some characters of s1 to obtain s2.
3. String s1 is more beautiful than string s2 if the length of s1 is greater than the length of s2 or, if they have equal length, if s1 is lexicographically greater than s2.
Given a string s, you have to find the most beautiful and unique string that is producible from s.

Solution:
Here are the main ideas behind the solution:

Let the input and the output strings be represented as “I[]” and “O[]”.
For each character “c” from the input string I[], do the following validations:
1. If c is not found in O[], append c to O[]
2. If c already exists in O[]:
  1. Let “p” be the position of the character c in O[], and “ip” be the position of the character in I[]
  2. Find the first character in O[], placed after the position p, that is lexicographically greater than c

Here is the implementation in C#: sources

Example:
I = “ccabdcab”
O = “”

c = ‘c’
Character not found in O
O = “c”
c = ‘c’
Character already in O.
p = 0;
No greater character is found in O after p => do nothing.
O = “c”
c = ‘a’
Character not found in O
O = “ca”
c = ‘b’
Character not found in O.
O = “cab”
c = ‘d’
Character not found in O.
O = “cabd”
c = ‘c’
Character already in O.
p = 0;
ip = 5;
A greater character (‘d’) is found in O at position gp = 3.
All the characters between p and gp (‘a’,’b’) are found in I[] after the position ip = 5. This means that these characters will be validated and moved later in the process, and therefore the character from the position p can be safely removed from O.
O =”abdc”
c = ‘a’
Character already in O.
p=0;
ip=6
p = 0;
A greater character (‘b’) is found in O at position gp = 1. Since there is no smaller character between ‘a’ and ‘b’ in the output string, the character ‘a’ can be safely removed from the position p=0, and appended at the end of O.
O = “bdca”
c = ‘b’
Character already in O.
p = 0;
A greater character (‘d’) is found in O at position gp = 1. Since there is no smaller character between ‘b’ and ‘d’ in the output string, the character ‘b’ can be safely removed from the position p=0, and appended at the end of O.
O = “dcab”

And since everybody is going bananas associating the following two examples with this problem, here is the generated output in those cases, following this solution:

Input: babab
Output: ba

Input: nlhthgrfdnnlprjtecpdrthigjoqdejsfkasoctjijaoebqlrgaiakfsbljmpibkidjsrtkgrdnqsknbarpabgokbsrfhmeklrle
Output: tsocrpkijgdqnbafhmle

2 comments

July 20, 2013 - 8:56 pm kapil

could you explain the logic on how this algo actually works

- July 20, 2013 - 9:27 pm Rasemanju
  
  Sure. At what step are you having troubles?
  The main idea is to create the result in an almost greedy manner. You take one letter at a time from the input string and you compare it to the characters you have in the result output.
  The more tricky part of this algo comes when you already have the character in the output. We now start wondering about the lexicographically part of the problem. If you find a lexicographically greater char after it in the output, it triggers the possibility of finding a lexicographically greater result. The condition is that all the characters between these two characters (which are obviously lexicographically smaller) are to be founded after the character which is lex. greater, in the input string. This means that all of this chars will be treated in the next iterations and will pe replaced in the output string, after the char which is lex. greater. If one char is not founded in the input string, it will remain at it’s current position, and the output string will be lexicographically smaller instead.
  Please let me know if this explanation answered your questions.

SkillDrill

Beautiful, unique strings

2 comments

Leave a comment Cancel reply

Share this:

Related

2 comments

Leave a comment Cancel reply