Tagged: beautiful

June 28, 2013

Beautiful, unique strings

There may come a time in your career when you will have to solve the issue of finding the most beautiful and unique string from an input, or else!…OK, I admit the probability of this is very low, but if you are interested in solving a brain-teasing problem, here is you’re chance.

Problem:
1. String s is called unique if all the characters of s are different.
2. String s2 is producible from string s1, if we can remove some characters of s1 to obtain s2.
3. String s1 is more beautiful than string s2 if the length of s1 is greater than the length of s2 or, if they have equal length, if s1 is lexicographically greater than s2.
Given a string s, you have to find the most beautiful and unique string that is producible from s.

Solution:
Here are the main ideas behind the solution:

Let the input and the output strings be represented as “I[]” and “O[]”.
For each character “c” from the input string I[], do the following validations:
1. If c is not found in O[], append c to O[]
2. If c already exists in O[]:
  1. Let “p” be the position of the character c in O[], and “ip” be the position of the character in I[]
  2. Find the first character in O[], placed after the position p, that is lexicographically greater than c

Here is the implementation in C#: sources

Example:
I = “ccabdcab”
O = “”

c = ‘c’
Character not found in O
O = “c”
c = ‘c’
Character already in O.
p = 0;
No greater character is found in O after p => do nothing.
O = “c”
c = ‘a’
Character not found in O
O = “ca”
c = ‘b’
Character not found in O.
O = “cab”
c = ‘d’
Character not found in O.
O = “cabd”
c = ‘c’
Character already in O.
p = 0;
ip = 5;
A greater character (‘d’) is found in O at position gp = 3.
All the characters between p and gp (‘a’,’b’) are found in I[] after the position ip = 5. This means that these characters will be validated and moved later in the process, and therefore the character from the position p can be safely removed from O.
O =”abdc”
c = ‘a’
Character already in O.
p=0;
ip=6
p = 0;
A greater character (‘b’) is found in O at position gp = 1. Since there is no smaller character between ‘a’ and ‘b’ in the output string, the character ‘a’ can be safely removed from the position p=0, and appended at the end of O.
O = “bdca”
c = ‘b’
Character already in O.
p = 0;
A greater character (‘d’) is found in O at position gp = 1. Since there is no smaller character between ‘b’ and ‘d’ in the output string, the character ‘b’ can be safely removed from the position p=0, and appended at the end of O.
O = “dcab”

And since everybody is going bananas associating the following two examples with this problem, here is the generated output in those cases, following this solution:

Input: babab
Output: ba

Input: nlhthgrfdnnlprjtecpdrthigjoqdejsfkasoctjijaoebqlrgaiakfsbljmpibkidjsrtkgrdnqsknbarpabgokbsrfhmeklrle
Output: tsocrpkijgdqnbafhmle