Longest word chain from a list of words











up vote
30
down vote

favorite
4












So, this is a part of a function I'm trying to make.



I don't want the code to be too complicated.



I have a list of words, e.g.



words = ['giraffe', 'elephant', 'ant', 'tiger', 'racoon', 'cat', 'hedgehog', 'mouse']


The idea of the word chain sequence is for the next word to begin with the letter that the last word ended in.



(Edit: Each word cannot be used more than once. Other than that there are no other constraints.)



I want the output to give the longest word chain sequence, which in this case is:



['hedgehog', 'giraffe', 'elephant', 'tiger', 'racoon']


I'm not really sure how to do it, I had different attempts at trying this. One of them...



This code finds the word chain correctly if we start with a specific word from the list, e.g. words[0] (so 'giraffe'):



words = ['giraffe', 'elephant', 'ant', 'tiger', 'racoon', 'cat', 'hedgehog', 'mouse']

word_chain =

word_chain.append(words[0])

for word in words:
for char in word[0]:

if char == word_chain[-1][-1]:
word_chain.append(word)

print(word_chain)


Output:



['giraffe', 'elephant', 'tiger', 'racoon']


BUT, I want to find the longest possible chain of words (explained above).



My method: So, I tried to use the above working code that I wrote and loop through, using each word from the list as the starting point and finding the word chain for each word[0], word[1], word[2] etc. Then I tried to find the longest word chain by using an if statement and compare the length to the previous longest chain, but I can't get it done properly and I don't really know where this is going.



words = ['giraffe', 'elephant', 'ant', 'tiger', 'racoon', 'cat', 'hedgehog', 'mouse']

word_chain =
max_length = 0
for starting_word_index in range(len(words) - 1):

word_chain.append(words[starting_word_index])

for word in words:
for char in word[0]:

if char == word_chain[-1][-1]:
word_chain.append(word)

# Not sure

if len(word_chain) > max_length:
final_word_chain = word_chain
longest = len(word_chain)
word_chain.clear()

print(final_word_chain)


This is my nth attempt, I think this one prints an empty list, I had different attempts before this that failed to clear the word_chain list properly and ended up repeating words over again.



Any help much appreciated. Hopefully I didn't make this too teedious or confusing... Thanks!










share|improve this question
























  • @dataLeo Hi, each word cannot be used more than once (so elephant can only be used once), no other constraints apart from that. Aim is to find the longest word chain sequence.
    – Mandingo
    Nov 26 at 16:16








  • 1




    Do you need an added precaution in case someone slips in a name that start and ends with the same letter? (… not as if I can come up with one ...)
    – usr2564301
    Nov 26 at 16:30






  • 1




    This would be a great contribution to codegolf.stackexchange.com
    – Frozenthia
    Nov 26 at 17:57








  • 2




    This problem is equivalent to finding the longest path where edges are traversed at most once in a directed, cyclic graph.
    – Mateen Ulhaq
    Nov 27 at 4:35








  • 1




    The general name of this problem: stackoverflow.com/questions/29522351/…
    – nhahtdh
    Nov 27 at 7:42

















up vote
30
down vote

favorite
4












So, this is a part of a function I'm trying to make.



I don't want the code to be too complicated.



I have a list of words, e.g.



words = ['giraffe', 'elephant', 'ant', 'tiger', 'racoon', 'cat', 'hedgehog', 'mouse']


The idea of the word chain sequence is for the next word to begin with the letter that the last word ended in.



(Edit: Each word cannot be used more than once. Other than that there are no other constraints.)



I want the output to give the longest word chain sequence, which in this case is:



['hedgehog', 'giraffe', 'elephant', 'tiger', 'racoon']


I'm not really sure how to do it, I had different attempts at trying this. One of them...



This code finds the word chain correctly if we start with a specific word from the list, e.g. words[0] (so 'giraffe'):



words = ['giraffe', 'elephant', 'ant', 'tiger', 'racoon', 'cat', 'hedgehog', 'mouse']

word_chain =

word_chain.append(words[0])

for word in words:
for char in word[0]:

if char == word_chain[-1][-1]:
word_chain.append(word)

print(word_chain)


Output:



['giraffe', 'elephant', 'tiger', 'racoon']


BUT, I want to find the longest possible chain of words (explained above).



My method: So, I tried to use the above working code that I wrote and loop through, using each word from the list as the starting point and finding the word chain for each word[0], word[1], word[2] etc. Then I tried to find the longest word chain by using an if statement and compare the length to the previous longest chain, but I can't get it done properly and I don't really know where this is going.



words = ['giraffe', 'elephant', 'ant', 'tiger', 'racoon', 'cat', 'hedgehog', 'mouse']

word_chain =
max_length = 0
for starting_word_index in range(len(words) - 1):

word_chain.append(words[starting_word_index])

for word in words:
for char in word[0]:

if char == word_chain[-1][-1]:
word_chain.append(word)

# Not sure

if len(word_chain) > max_length:
final_word_chain = word_chain
longest = len(word_chain)
word_chain.clear()

print(final_word_chain)


This is my nth attempt, I think this one prints an empty list, I had different attempts before this that failed to clear the word_chain list properly and ended up repeating words over again.



Any help much appreciated. Hopefully I didn't make this too teedious or confusing... Thanks!










share|improve this question
























  • @dataLeo Hi, each word cannot be used more than once (so elephant can only be used once), no other constraints apart from that. Aim is to find the longest word chain sequence.
    – Mandingo
    Nov 26 at 16:16








  • 1




    Do you need an added precaution in case someone slips in a name that start and ends with the same letter? (… not as if I can come up with one ...)
    – usr2564301
    Nov 26 at 16:30






  • 1




    This would be a great contribution to codegolf.stackexchange.com
    – Frozenthia
    Nov 26 at 17:57








  • 2




    This problem is equivalent to finding the longest path where edges are traversed at most once in a directed, cyclic graph.
    – Mateen Ulhaq
    Nov 27 at 4:35








  • 1




    The general name of this problem: stackoverflow.com/questions/29522351/…
    – nhahtdh
    Nov 27 at 7:42















up vote
30
down vote

favorite
4









up vote
30
down vote

favorite
4






4





So, this is a part of a function I'm trying to make.



I don't want the code to be too complicated.



I have a list of words, e.g.



words = ['giraffe', 'elephant', 'ant', 'tiger', 'racoon', 'cat', 'hedgehog', 'mouse']


The idea of the word chain sequence is for the next word to begin with the letter that the last word ended in.



(Edit: Each word cannot be used more than once. Other than that there are no other constraints.)



I want the output to give the longest word chain sequence, which in this case is:



['hedgehog', 'giraffe', 'elephant', 'tiger', 'racoon']


I'm not really sure how to do it, I had different attempts at trying this. One of them...



This code finds the word chain correctly if we start with a specific word from the list, e.g. words[0] (so 'giraffe'):



words = ['giraffe', 'elephant', 'ant', 'tiger', 'racoon', 'cat', 'hedgehog', 'mouse']

word_chain =

word_chain.append(words[0])

for word in words:
for char in word[0]:

if char == word_chain[-1][-1]:
word_chain.append(word)

print(word_chain)


Output:



['giraffe', 'elephant', 'tiger', 'racoon']


BUT, I want to find the longest possible chain of words (explained above).



My method: So, I tried to use the above working code that I wrote and loop through, using each word from the list as the starting point and finding the word chain for each word[0], word[1], word[2] etc. Then I tried to find the longest word chain by using an if statement and compare the length to the previous longest chain, but I can't get it done properly and I don't really know where this is going.



words = ['giraffe', 'elephant', 'ant', 'tiger', 'racoon', 'cat', 'hedgehog', 'mouse']

word_chain =
max_length = 0
for starting_word_index in range(len(words) - 1):

word_chain.append(words[starting_word_index])

for word in words:
for char in word[0]:

if char == word_chain[-1][-1]:
word_chain.append(word)

# Not sure

if len(word_chain) > max_length:
final_word_chain = word_chain
longest = len(word_chain)
word_chain.clear()

print(final_word_chain)


This is my nth attempt, I think this one prints an empty list, I had different attempts before this that failed to clear the word_chain list properly and ended up repeating words over again.



Any help much appreciated. Hopefully I didn't make this too teedious or confusing... Thanks!










share|improve this question















So, this is a part of a function I'm trying to make.



I don't want the code to be too complicated.



I have a list of words, e.g.



words = ['giraffe', 'elephant', 'ant', 'tiger', 'racoon', 'cat', 'hedgehog', 'mouse']


The idea of the word chain sequence is for the next word to begin with the letter that the last word ended in.



(Edit: Each word cannot be used more than once. Other than that there are no other constraints.)



I want the output to give the longest word chain sequence, which in this case is:



['hedgehog', 'giraffe', 'elephant', 'tiger', 'racoon']


I'm not really sure how to do it, I had different attempts at trying this. One of them...



This code finds the word chain correctly if we start with a specific word from the list, e.g. words[0] (so 'giraffe'):



words = ['giraffe', 'elephant', 'ant', 'tiger', 'racoon', 'cat', 'hedgehog', 'mouse']

word_chain =

word_chain.append(words[0])

for word in words:
for char in word[0]:

if char == word_chain[-1][-1]:
word_chain.append(word)

print(word_chain)


Output:



['giraffe', 'elephant', 'tiger', 'racoon']


BUT, I want to find the longest possible chain of words (explained above).



My method: So, I tried to use the above working code that I wrote and loop through, using each word from the list as the starting point and finding the word chain for each word[0], word[1], word[2] etc. Then I tried to find the longest word chain by using an if statement and compare the length to the previous longest chain, but I can't get it done properly and I don't really know where this is going.



words = ['giraffe', 'elephant', 'ant', 'tiger', 'racoon', 'cat', 'hedgehog', 'mouse']

word_chain =
max_length = 0
for starting_word_index in range(len(words) - 1):

word_chain.append(words[starting_word_index])

for word in words:
for char in word[0]:

if char == word_chain[-1][-1]:
word_chain.append(word)

# Not sure

if len(word_chain) > max_length:
final_word_chain = word_chain
longest = len(word_chain)
word_chain.clear()

print(final_word_chain)


This is my nth attempt, I think this one prints an empty list, I had different attempts before this that failed to clear the word_chain list properly and ended up repeating words over again.



Any help much appreciated. Hopefully I didn't make this too teedious or confusing... Thanks!







python recursion graph path-finding






share|improve this question















share|improve this question













share|improve this question




share|improve this question








edited Nov 27 at 21:00









Ajax1234

39.1k42452




39.1k42452










asked Nov 26 at 16:11









Mandingo

30719




30719












  • @dataLeo Hi, each word cannot be used more than once (so elephant can only be used once), no other constraints apart from that. Aim is to find the longest word chain sequence.
    – Mandingo
    Nov 26 at 16:16








  • 1




    Do you need an added precaution in case someone slips in a name that start and ends with the same letter? (… not as if I can come up with one ...)
    – usr2564301
    Nov 26 at 16:30






  • 1




    This would be a great contribution to codegolf.stackexchange.com
    – Frozenthia
    Nov 26 at 17:57








  • 2




    This problem is equivalent to finding the longest path where edges are traversed at most once in a directed, cyclic graph.
    – Mateen Ulhaq
    Nov 27 at 4:35








  • 1




    The general name of this problem: stackoverflow.com/questions/29522351/…
    – nhahtdh
    Nov 27 at 7:42




















  • @dataLeo Hi, each word cannot be used more than once (so elephant can only be used once), no other constraints apart from that. Aim is to find the longest word chain sequence.
    – Mandingo
    Nov 26 at 16:16








  • 1




    Do you need an added precaution in case someone slips in a name that start and ends with the same letter? (… not as if I can come up with one ...)
    – usr2564301
    Nov 26 at 16:30






  • 1




    This would be a great contribution to codegolf.stackexchange.com
    – Frozenthia
    Nov 26 at 17:57








  • 2




    This problem is equivalent to finding the longest path where edges are traversed at most once in a directed, cyclic graph.
    – Mateen Ulhaq
    Nov 27 at 4:35








  • 1




    The general name of this problem: stackoverflow.com/questions/29522351/…
    – nhahtdh
    Nov 27 at 7:42


















@dataLeo Hi, each word cannot be used more than once (so elephant can only be used once), no other constraints apart from that. Aim is to find the longest word chain sequence.
– Mandingo
Nov 26 at 16:16






@dataLeo Hi, each word cannot be used more than once (so elephant can only be used once), no other constraints apart from that. Aim is to find the longest word chain sequence.
– Mandingo
Nov 26 at 16:16






1




1




Do you need an added precaution in case someone slips in a name that start and ends with the same letter? (… not as if I can come up with one ...)
– usr2564301
Nov 26 at 16:30




Do you need an added precaution in case someone slips in a name that start and ends with the same letter? (… not as if I can come up with one ...)
– usr2564301
Nov 26 at 16:30




1




1




This would be a great contribution to codegolf.stackexchange.com
– Frozenthia
Nov 26 at 17:57






This would be a great contribution to codegolf.stackexchange.com
– Frozenthia
Nov 26 at 17:57






2




2




This problem is equivalent to finding the longest path where edges are traversed at most once in a directed, cyclic graph.
– Mateen Ulhaq
Nov 27 at 4:35






This problem is equivalent to finding the longest path where edges are traversed at most once in a directed, cyclic graph.
– Mateen Ulhaq
Nov 27 at 4:35






1




1




The general name of this problem: stackoverflow.com/questions/29522351/…
– nhahtdh
Nov 27 at 7:42






The general name of this problem: stackoverflow.com/questions/29522351/…
– nhahtdh
Nov 27 at 7:42














9 Answers
9






active

oldest

votes

















up vote
23
down vote



accepted










You can use recursion to explore every "branch" that emerges when every possible letter containing the proper initial character is added to a running list:



words = ['giraffe', 'elephant', 'ant', 'tiger', 'racoon', 'cat', 'hedgehog', 'mouse']
def get_results(_start, _current, _seen):
if all(c in _seen for c in words if c[0] == _start[-1]):
yield _current
else:
for i in words:
if i[0] == _start[-1]:
yield from get_results(i, _current+[i], _seen+[i])


new_d = [list(get_results(i, [i], ))[0] for i in words]
final_d = max([i for i in new_d if len(i) == len(set(i))], key=len)


Output:



['hedgehog', 'giraffe', 'elephant', 'tiger', 'racoon']


This solution works similar to the breadth-first search, as the function get_resuls will continue to iterate over the entire list as long as the current value has not been called on before. Values that have been seen by the function are added to the _seen list, ultimately ceasing the stream of recursive calls.



This solution will also ignore results with duplicates:



words = ['giraffe', 'elephant', 'ant', 'ning', 'tiger', 'racoon', 'cat', 'hedgehog', 'mouse',]
new_d = [list(get_results(i, [i], ))[0] for i in words]
final_d = max([i for i in new_d if len(i) == len(set(i))], key=len)


Output:



['ant', 'tiger', 'racoon', 'ning', 'giraffe', 'elephant']





share|improve this answer






























    up vote
    14
    down vote













    I have a new idea, as the figure shows:



    enter image description here



    We can construct a directed graph by word[0] == word[-1], then the problem is converted to find the maximum length path.






    share|improve this answer






























      up vote
      11
      down vote













      As mentioned by others, the problem is to find the longest path in a directed acyclic graph.



      For anything graph related in Python, networkx is your friend.



      You just need to initialize the graph, add the nodes, add the edges and launch dag_longest_path:



      import networkx as nx
      import matplotlib.pyplot as plt

      words = ['giraffe', 'elephant', 'ant', 'tiger', 'racoon', 'cat',
      'hedgehog', 'mouse']

      G = nx.DiGraph()
      G.add_nodes_from(words)

      for word1 in words:
      for word2 in words:
      if word1 != word2 and word1[-1] == word2[0]:
      G.add_edge(word1, word2)
      nx.draw_networkx(G)
      plt.show()
      print(nx.algorithms.dag.dag_longest_path(G))


      enter image description here



      It outputs:



      ['hedgehog', 'giraffe', 'elephant', 'tiger', 'racoon']


      Note : this algorithm only works if there are no cycles (loops) in the graph. It means it will fail with ['ab', 'ba'] because there would be a path of infinite length: ['ab', 'ba', 'ab', 'ba', 'ab', 'ba', ...]






      share|improve this answer



















      • 1




        Acyclic is a big assumption. And with a cycle the problem is NP-Hard.
        – slider
        Nov 27 at 16:34






      • 1




        @slider: Indeed. I'll investigate to see if it's possible to solve the general problem with networkx.
        – Eric Duminil
        Nov 27 at 17:34


















      up vote
      4
      down vote













      In the spirit of brute force solutions, you can check all permutations of the words list and choose the best continuous starting sequence:



      from itertools import permutations

      def continuous_starting_sequence(words):
      chain = [words[0]]
      for i in range(1, len(words)):
      if not words[i].startswith(words[i - 1][-1]):
      break
      chain.append(words[i])
      return chain

      words = ['giraffe', 'elephant', 'ant', 'tiger', 'racoon', 'cat', 'hedgehog', 'mouse']
      best = max((continuous_starting_sequence(seq) for seq in permutations(words)), key=len)

      print(best)
      # ['hedgehog', 'giraffe', 'elephant', 'tiger', 'racoon']


      Since we're considering all permutations, we know that there must be a permutation that starts with the largest word chain.



      This, of course, has O(n n!) time complexity :D






      share|improve this answer






























        up vote
        3
        down vote













        This function creates a type of iterator called a generator (see: What does the "yield" keyword do?). It recursively creates further instances of the same generator to explore all possible tail sequences:



        words = ['giraffe', 'elephant', 'ant', 'tiger', 'racoon', 'cat', 'hedgehog', 'mouse']

        def chains(words, previous_word=None):
        # Consider an empty sequence to be valid (as a "tail" or on its own):
        yield
        # Remove the previous word, if any, from consideration, both here and in any subcalls:
        words = [word for word in words if word != previous_word]
        # Take each remaining word...
        for each_word in words:
        # ...provided it obeys the chaining rule
        if not previous_word or each_word.startswith(previous_word[-1]):
        # and recurse to consider all possible tail sequences that can follow this particular word:
        for tail in chains(words, previous_word=each_word):
        # Concatenate the word we're considering with each possible tail:
        yield [each_word] + tail

        all_legal_sequences = list(chains(words)) # convert the output (an iterator) to a list
        all_legal_sequences.sort(key=len) # sort the list of chains in increasing order of chain length
        for seq in all_legal_sequences: print(seq)
        # The last line (and hence longest chain) prints as follows:
        # ['hedgehog', 'giraffe', 'elephant', 'tiger', 'racoon']


        Or, to get straight to the longest chain more efficiently:



        print(max(chains(words), key=len)


        Finally, here is an alternate version that allows repeated words in the input (i.e. if you include a word N times, you may use it up to N times in the chain):



        def chains(words, previous_word_index=None):
        yield
        if previous_word_index is not None:
        previous_letter = words[previous_word_index][-1]
        words = words[:previous_word_index] + words[previous_word_index + 1:]
        for i, each_word in enumerate( words ):
        if previous_word_index is not None and not each_word.startswith(previous_letter): continue
        for tail in chains(words, previous_word_index=i):
        yield [each_word] + tail





        share|improve this answer






























          up vote
          2
          down vote













          Here is a working recursive brute-force approach:



          def brute_force(pool, last=None, so_far=None):
          so_far = so_far or
          if not pool:
          return so_far
          candidates =
          for w in pool:
          if not last or w.startswith(last):
          c_so_far, c_pool = list(so_far) + [w], set(pool) - set([w])
          candidates.append(brute_force(c_pool, w[-1], c_so_far))
          return max(candidates, key=len, default=so_far)

          >>> brute_force(words)
          ['hedgehog', 'giraffe', 'elephant', 'tiger', 'racoon']


          At every recursive call, this tries to continue the chain with every eligible word form the remaining pool. It then chooses the longest such continuation.






          share|improve this answer






























            up vote
            2
            down vote













            I have a tree-based approach for this question which might be faster. I am still working on implementation of the code but here is what I would do:



                1. Form a tree with the root node as first word. 
            2. Form the branches if there is any word or words that starts
            with the alphabet with which this current word ends.
            3. Exhaust the entire given list based on the ending alphabet
            of current word and form the entire tree.
            4. Now just find the longest path of this tree and store it.
            5. Repeat steps 1 to 4 for each of the words given in the list
            and print the longest path among the longest paths we got above.


            I hope this might give a better solution in case there is a large list of words given. I will update this with the actual code implementation.






            share|improve this answer




























              up vote
              1
              down vote













              Another answer using a recursive approach:



              def word_list(w_list, remaining_list):
              max_result_len=0
              res = w_list
              for word_index in range(len(remaining_list)):
              # if the last letter of the word list is equal to the first letter of the word
              if w_list[-1][-1] == remaining_list[word_index][0]:
              # make copies of the lists to not alter it in the caller function
              w_list_copy = w_list.copy()
              remaining_list_copy = remaining_list.copy()
              # removes the used word from the remaining list
              remaining_list_copy.pop(word_index)
              # append the matching word to the new word list
              w_list_copy.append(remaining_list[word_index])
              res_aux = word_list(w_list_copy, remaining_list_copy)
              # Keep only the longest list
              res = res_aux if len(res_aux) > max_result_len else res
              return res

              words = ['giraffe', 'elephant', 'ant', 'tiger', 'racoon', 'cat', 'hedgehog', 'mouse']
              word_list(['dog'], words)


              output:



              ['dog', 'giraffe', 'elephant', 'tiger', 'racoon']





              share|improve this answer




























                up vote
                1
                down vote













                Hopefully, a more intuitive way of doing it without recursion. Iterate through the list and let Python's sort and list comprehension do the work for you:



                words = ['giraffe', 'elephant', 'ant', 'tiger', 'racoon', 'cat', 'hedgehog', 'mouse']

                def chain_longest(pivot, words):
                new_words =
                new_words.append(pivot)
                for word in words:
                potential_words = [i for i in words if i.startswith(pivot[-1]) and i not in new_words]
                if potential_words:
                next_word = sorted(potential_words, key = lambda x: len)[0]
                new_words.append(next_word)
                pivot = next_word
                else:
                pass
                return new_words

                max([chain_longest(i, words) for i in words], key = len)
                >>
                ['hedgehog', 'giraffe', 'elephant', 'tiger', 'racoon']


                Set a pivot and check for potential_words if they start with your pivot word and do not appear in your new list of words. If found then just sort them by length and take the first element.



                The list comprehension goes through every word as a pivot and returns you the longest chain.






                share|improve this answer























                • The expected output and longest word chain is: ['hedgehog', 'giraffe', 'elephant', 'tiger', 'racoon'], your one gives ['giraffe', 'elephant', 'tiger', 'racoon'] or am I missing something?
                  – Mandingo
                  Nov 26 at 16:31






                • 1




                  You can replace key = lambda x: len(x) with key=len.
                  – Keyur Potdar
                  Nov 26 at 16:32






                • 1




                  @mandingo sorry I got confused with the output. Let me change that.
                  – BernardL
                  Nov 26 at 16:32










                protected by Ajax1234 Nov 28 at 2:43



                Thank you for your interest in this question.
                Because it has attracted low-quality or spam answers that had to be removed, posting an answer now requires 10 reputation on this site (the association bonus does not count).



                Would you like to answer one of these unanswered questions instead?














                9 Answers
                9






                active

                oldest

                votes








                9 Answers
                9






                active

                oldest

                votes









                active

                oldest

                votes






                active

                oldest

                votes








                up vote
                23
                down vote



                accepted










                You can use recursion to explore every "branch" that emerges when every possible letter containing the proper initial character is added to a running list:



                words = ['giraffe', 'elephant', 'ant', 'tiger', 'racoon', 'cat', 'hedgehog', 'mouse']
                def get_results(_start, _current, _seen):
                if all(c in _seen for c in words if c[0] == _start[-1]):
                yield _current
                else:
                for i in words:
                if i[0] == _start[-1]:
                yield from get_results(i, _current+[i], _seen+[i])


                new_d = [list(get_results(i, [i], ))[0] for i in words]
                final_d = max([i for i in new_d if len(i) == len(set(i))], key=len)


                Output:



                ['hedgehog', 'giraffe', 'elephant', 'tiger', 'racoon']


                This solution works similar to the breadth-first search, as the function get_resuls will continue to iterate over the entire list as long as the current value has not been called on before. Values that have been seen by the function are added to the _seen list, ultimately ceasing the stream of recursive calls.



                This solution will also ignore results with duplicates:



                words = ['giraffe', 'elephant', 'ant', 'ning', 'tiger', 'racoon', 'cat', 'hedgehog', 'mouse',]
                new_d = [list(get_results(i, [i], ))[0] for i in words]
                final_d = max([i for i in new_d if len(i) == len(set(i))], key=len)


                Output:



                ['ant', 'tiger', 'racoon', 'ning', 'giraffe', 'elephant']





                share|improve this answer



























                  up vote
                  23
                  down vote



                  accepted










                  You can use recursion to explore every "branch" that emerges when every possible letter containing the proper initial character is added to a running list:



                  words = ['giraffe', 'elephant', 'ant', 'tiger', 'racoon', 'cat', 'hedgehog', 'mouse']
                  def get_results(_start, _current, _seen):
                  if all(c in _seen for c in words if c[0] == _start[-1]):
                  yield _current
                  else:
                  for i in words:
                  if i[0] == _start[-1]:
                  yield from get_results(i, _current+[i], _seen+[i])


                  new_d = [list(get_results(i, [i], ))[0] for i in words]
                  final_d = max([i for i in new_d if len(i) == len(set(i))], key=len)


                  Output:



                  ['hedgehog', 'giraffe', 'elephant', 'tiger', 'racoon']


                  This solution works similar to the breadth-first search, as the function get_resuls will continue to iterate over the entire list as long as the current value has not been called on before. Values that have been seen by the function are added to the _seen list, ultimately ceasing the stream of recursive calls.



                  This solution will also ignore results with duplicates:



                  words = ['giraffe', 'elephant', 'ant', 'ning', 'tiger', 'racoon', 'cat', 'hedgehog', 'mouse',]
                  new_d = [list(get_results(i, [i], ))[0] for i in words]
                  final_d = max([i for i in new_d if len(i) == len(set(i))], key=len)


                  Output:



                  ['ant', 'tiger', 'racoon', 'ning', 'giraffe', 'elephant']





                  share|improve this answer

























                    up vote
                    23
                    down vote



                    accepted







                    up vote
                    23
                    down vote



                    accepted






                    You can use recursion to explore every "branch" that emerges when every possible letter containing the proper initial character is added to a running list:



                    words = ['giraffe', 'elephant', 'ant', 'tiger', 'racoon', 'cat', 'hedgehog', 'mouse']
                    def get_results(_start, _current, _seen):
                    if all(c in _seen for c in words if c[0] == _start[-1]):
                    yield _current
                    else:
                    for i in words:
                    if i[0] == _start[-1]:
                    yield from get_results(i, _current+[i], _seen+[i])


                    new_d = [list(get_results(i, [i], ))[0] for i in words]
                    final_d = max([i for i in new_d if len(i) == len(set(i))], key=len)


                    Output:



                    ['hedgehog', 'giraffe', 'elephant', 'tiger', 'racoon']


                    This solution works similar to the breadth-first search, as the function get_resuls will continue to iterate over the entire list as long as the current value has not been called on before. Values that have been seen by the function are added to the _seen list, ultimately ceasing the stream of recursive calls.



                    This solution will also ignore results with duplicates:



                    words = ['giraffe', 'elephant', 'ant', 'ning', 'tiger', 'racoon', 'cat', 'hedgehog', 'mouse',]
                    new_d = [list(get_results(i, [i], ))[0] for i in words]
                    final_d = max([i for i in new_d if len(i) == len(set(i))], key=len)


                    Output:



                    ['ant', 'tiger', 'racoon', 'ning', 'giraffe', 'elephant']





                    share|improve this answer














                    You can use recursion to explore every "branch" that emerges when every possible letter containing the proper initial character is added to a running list:



                    words = ['giraffe', 'elephant', 'ant', 'tiger', 'racoon', 'cat', 'hedgehog', 'mouse']
                    def get_results(_start, _current, _seen):
                    if all(c in _seen for c in words if c[0] == _start[-1]):
                    yield _current
                    else:
                    for i in words:
                    if i[0] == _start[-1]:
                    yield from get_results(i, _current+[i], _seen+[i])


                    new_d = [list(get_results(i, [i], ))[0] for i in words]
                    final_d = max([i for i in new_d if len(i) == len(set(i))], key=len)


                    Output:



                    ['hedgehog', 'giraffe', 'elephant', 'tiger', 'racoon']


                    This solution works similar to the breadth-first search, as the function get_resuls will continue to iterate over the entire list as long as the current value has not been called on before. Values that have been seen by the function are added to the _seen list, ultimately ceasing the stream of recursive calls.



                    This solution will also ignore results with duplicates:



                    words = ['giraffe', 'elephant', 'ant', 'ning', 'tiger', 'racoon', 'cat', 'hedgehog', 'mouse',]
                    new_d = [list(get_results(i, [i], ))[0] for i in words]
                    final_d = max([i for i in new_d if len(i) == len(set(i))], key=len)


                    Output:



                    ['ant', 'tiger', 'racoon', 'ning', 'giraffe', 'elephant']






                    share|improve this answer














                    share|improve this answer



                    share|improve this answer








                    edited Nov 27 at 2:42

























                    answered Nov 26 at 16:23









                    Ajax1234

                    39.1k42452




                    39.1k42452
























                        up vote
                        14
                        down vote













                        I have a new idea, as the figure shows:



                        enter image description here



                        We can construct a directed graph by word[0] == word[-1], then the problem is converted to find the maximum length path.






                        share|improve this answer



























                          up vote
                          14
                          down vote













                          I have a new idea, as the figure shows:



                          enter image description here



                          We can construct a directed graph by word[0] == word[-1], then the problem is converted to find the maximum length path.






                          share|improve this answer

























                            up vote
                            14
                            down vote










                            up vote
                            14
                            down vote









                            I have a new idea, as the figure shows:



                            enter image description here



                            We can construct a directed graph by word[0] == word[-1], then the problem is converted to find the maximum length path.






                            share|improve this answer














                            I have a new idea, as the figure shows:



                            enter image description here



                            We can construct a directed graph by word[0] == word[-1], then the problem is converted to find the maximum length path.







                            share|improve this answer














                            share|improve this answer



                            share|improve this answer








                            edited Nov 27 at 8:53









                            Wilson

                            446417




                            446417










                            answered Nov 27 at 3:26









                            TimeSeam

                            1515




                            1515






















                                up vote
                                11
                                down vote













                                As mentioned by others, the problem is to find the longest path in a directed acyclic graph.



                                For anything graph related in Python, networkx is your friend.



                                You just need to initialize the graph, add the nodes, add the edges and launch dag_longest_path:



                                import networkx as nx
                                import matplotlib.pyplot as plt

                                words = ['giraffe', 'elephant', 'ant', 'tiger', 'racoon', 'cat',
                                'hedgehog', 'mouse']

                                G = nx.DiGraph()
                                G.add_nodes_from(words)

                                for word1 in words:
                                for word2 in words:
                                if word1 != word2 and word1[-1] == word2[0]:
                                G.add_edge(word1, word2)
                                nx.draw_networkx(G)
                                plt.show()
                                print(nx.algorithms.dag.dag_longest_path(G))


                                enter image description here



                                It outputs:



                                ['hedgehog', 'giraffe', 'elephant', 'tiger', 'racoon']


                                Note : this algorithm only works if there are no cycles (loops) in the graph. It means it will fail with ['ab', 'ba'] because there would be a path of infinite length: ['ab', 'ba', 'ab', 'ba', 'ab', 'ba', ...]






                                share|improve this answer



















                                • 1




                                  Acyclic is a big assumption. And with a cycle the problem is NP-Hard.
                                  – slider
                                  Nov 27 at 16:34






                                • 1




                                  @slider: Indeed. I'll investigate to see if it's possible to solve the general problem with networkx.
                                  – Eric Duminil
                                  Nov 27 at 17:34















                                up vote
                                11
                                down vote













                                As mentioned by others, the problem is to find the longest path in a directed acyclic graph.



                                For anything graph related in Python, networkx is your friend.



                                You just need to initialize the graph, add the nodes, add the edges and launch dag_longest_path:



                                import networkx as nx
                                import matplotlib.pyplot as plt

                                words = ['giraffe', 'elephant', 'ant', 'tiger', 'racoon', 'cat',
                                'hedgehog', 'mouse']

                                G = nx.DiGraph()
                                G.add_nodes_from(words)

                                for word1 in words:
                                for word2 in words:
                                if word1 != word2 and word1[-1] == word2[0]:
                                G.add_edge(word1, word2)
                                nx.draw_networkx(G)
                                plt.show()
                                print(nx.algorithms.dag.dag_longest_path(G))


                                enter image description here



                                It outputs:



                                ['hedgehog', 'giraffe', 'elephant', 'tiger', 'racoon']


                                Note : this algorithm only works if there are no cycles (loops) in the graph. It means it will fail with ['ab', 'ba'] because there would be a path of infinite length: ['ab', 'ba', 'ab', 'ba', 'ab', 'ba', ...]






                                share|improve this answer



















                                • 1




                                  Acyclic is a big assumption. And with a cycle the problem is NP-Hard.
                                  – slider
                                  Nov 27 at 16:34






                                • 1




                                  @slider: Indeed. I'll investigate to see if it's possible to solve the general problem with networkx.
                                  – Eric Duminil
                                  Nov 27 at 17:34













                                up vote
                                11
                                down vote










                                up vote
                                11
                                down vote









                                As mentioned by others, the problem is to find the longest path in a directed acyclic graph.



                                For anything graph related in Python, networkx is your friend.



                                You just need to initialize the graph, add the nodes, add the edges and launch dag_longest_path:



                                import networkx as nx
                                import matplotlib.pyplot as plt

                                words = ['giraffe', 'elephant', 'ant', 'tiger', 'racoon', 'cat',
                                'hedgehog', 'mouse']

                                G = nx.DiGraph()
                                G.add_nodes_from(words)

                                for word1 in words:
                                for word2 in words:
                                if word1 != word2 and word1[-1] == word2[0]:
                                G.add_edge(word1, word2)
                                nx.draw_networkx(G)
                                plt.show()
                                print(nx.algorithms.dag.dag_longest_path(G))


                                enter image description here



                                It outputs:



                                ['hedgehog', 'giraffe', 'elephant', 'tiger', 'racoon']


                                Note : this algorithm only works if there are no cycles (loops) in the graph. It means it will fail with ['ab', 'ba'] because there would be a path of infinite length: ['ab', 'ba', 'ab', 'ba', 'ab', 'ba', ...]






                                share|improve this answer














                                As mentioned by others, the problem is to find the longest path in a directed acyclic graph.



                                For anything graph related in Python, networkx is your friend.



                                You just need to initialize the graph, add the nodes, add the edges and launch dag_longest_path:



                                import networkx as nx
                                import matplotlib.pyplot as plt

                                words = ['giraffe', 'elephant', 'ant', 'tiger', 'racoon', 'cat',
                                'hedgehog', 'mouse']

                                G = nx.DiGraph()
                                G.add_nodes_from(words)

                                for word1 in words:
                                for word2 in words:
                                if word1 != word2 and word1[-1] == word2[0]:
                                G.add_edge(word1, word2)
                                nx.draw_networkx(G)
                                plt.show()
                                print(nx.algorithms.dag.dag_longest_path(G))


                                enter image description here



                                It outputs:



                                ['hedgehog', 'giraffe', 'elephant', 'tiger', 'racoon']


                                Note : this algorithm only works if there are no cycles (loops) in the graph. It means it will fail with ['ab', 'ba'] because there would be a path of infinite length: ['ab', 'ba', 'ab', 'ba', 'ab', 'ba', ...]







                                share|improve this answer














                                share|improve this answer



                                share|improve this answer








                                edited Nov 27 at 8:32

























                                answered Nov 27 at 8:02









                                Eric Duminil

                                39k53066




                                39k53066








                                • 1




                                  Acyclic is a big assumption. And with a cycle the problem is NP-Hard.
                                  – slider
                                  Nov 27 at 16:34






                                • 1




                                  @slider: Indeed. I'll investigate to see if it's possible to solve the general problem with networkx.
                                  – Eric Duminil
                                  Nov 27 at 17:34














                                • 1




                                  Acyclic is a big assumption. And with a cycle the problem is NP-Hard.
                                  – slider
                                  Nov 27 at 16:34






                                • 1




                                  @slider: Indeed. I'll investigate to see if it's possible to solve the general problem with networkx.
                                  – Eric Duminil
                                  Nov 27 at 17:34








                                1




                                1




                                Acyclic is a big assumption. And with a cycle the problem is NP-Hard.
                                – slider
                                Nov 27 at 16:34




                                Acyclic is a big assumption. And with a cycle the problem is NP-Hard.
                                – slider
                                Nov 27 at 16:34




                                1




                                1




                                @slider: Indeed. I'll investigate to see if it's possible to solve the general problem with networkx.
                                – Eric Duminil
                                Nov 27 at 17:34




                                @slider: Indeed. I'll investigate to see if it's possible to solve the general problem with networkx.
                                – Eric Duminil
                                Nov 27 at 17:34










                                up vote
                                4
                                down vote













                                In the spirit of brute force solutions, you can check all permutations of the words list and choose the best continuous starting sequence:



                                from itertools import permutations

                                def continuous_starting_sequence(words):
                                chain = [words[0]]
                                for i in range(1, len(words)):
                                if not words[i].startswith(words[i - 1][-1]):
                                break
                                chain.append(words[i])
                                return chain

                                words = ['giraffe', 'elephant', 'ant', 'tiger', 'racoon', 'cat', 'hedgehog', 'mouse']
                                best = max((continuous_starting_sequence(seq) for seq in permutations(words)), key=len)

                                print(best)
                                # ['hedgehog', 'giraffe', 'elephant', 'tiger', 'racoon']


                                Since we're considering all permutations, we know that there must be a permutation that starts with the largest word chain.



                                This, of course, has O(n n!) time complexity :D






                                share|improve this answer



























                                  up vote
                                  4
                                  down vote













                                  In the spirit of brute force solutions, you can check all permutations of the words list and choose the best continuous starting sequence:



                                  from itertools import permutations

                                  def continuous_starting_sequence(words):
                                  chain = [words[0]]
                                  for i in range(1, len(words)):
                                  if not words[i].startswith(words[i - 1][-1]):
                                  break
                                  chain.append(words[i])
                                  return chain

                                  words = ['giraffe', 'elephant', 'ant', 'tiger', 'racoon', 'cat', 'hedgehog', 'mouse']
                                  best = max((continuous_starting_sequence(seq) for seq in permutations(words)), key=len)

                                  print(best)
                                  # ['hedgehog', 'giraffe', 'elephant', 'tiger', 'racoon']


                                  Since we're considering all permutations, we know that there must be a permutation that starts with the largest word chain.



                                  This, of course, has O(n n!) time complexity :D






                                  share|improve this answer

























                                    up vote
                                    4
                                    down vote










                                    up vote
                                    4
                                    down vote









                                    In the spirit of brute force solutions, you can check all permutations of the words list and choose the best continuous starting sequence:



                                    from itertools import permutations

                                    def continuous_starting_sequence(words):
                                    chain = [words[0]]
                                    for i in range(1, len(words)):
                                    if not words[i].startswith(words[i - 1][-1]):
                                    break
                                    chain.append(words[i])
                                    return chain

                                    words = ['giraffe', 'elephant', 'ant', 'tiger', 'racoon', 'cat', 'hedgehog', 'mouse']
                                    best = max((continuous_starting_sequence(seq) for seq in permutations(words)), key=len)

                                    print(best)
                                    # ['hedgehog', 'giraffe', 'elephant', 'tiger', 'racoon']


                                    Since we're considering all permutations, we know that there must be a permutation that starts with the largest word chain.



                                    This, of course, has O(n n!) time complexity :D






                                    share|improve this answer














                                    In the spirit of brute force solutions, you can check all permutations of the words list and choose the best continuous starting sequence:



                                    from itertools import permutations

                                    def continuous_starting_sequence(words):
                                    chain = [words[0]]
                                    for i in range(1, len(words)):
                                    if not words[i].startswith(words[i - 1][-1]):
                                    break
                                    chain.append(words[i])
                                    return chain

                                    words = ['giraffe', 'elephant', 'ant', 'tiger', 'racoon', 'cat', 'hedgehog', 'mouse']
                                    best = max((continuous_starting_sequence(seq) for seq in permutations(words)), key=len)

                                    print(best)
                                    # ['hedgehog', 'giraffe', 'elephant', 'tiger', 'racoon']


                                    Since we're considering all permutations, we know that there must be a permutation that starts with the largest word chain.



                                    This, of course, has O(n n!) time complexity :D







                                    share|improve this answer














                                    share|improve this answer



                                    share|improve this answer








                                    edited Nov 26 at 19:30

























                                    answered Nov 26 at 17:39









                                    slider

                                    7,6601129




                                    7,6601129






















                                        up vote
                                        3
                                        down vote













                                        This function creates a type of iterator called a generator (see: What does the "yield" keyword do?). It recursively creates further instances of the same generator to explore all possible tail sequences:



                                        words = ['giraffe', 'elephant', 'ant', 'tiger', 'racoon', 'cat', 'hedgehog', 'mouse']

                                        def chains(words, previous_word=None):
                                        # Consider an empty sequence to be valid (as a "tail" or on its own):
                                        yield
                                        # Remove the previous word, if any, from consideration, both here and in any subcalls:
                                        words = [word for word in words if word != previous_word]
                                        # Take each remaining word...
                                        for each_word in words:
                                        # ...provided it obeys the chaining rule
                                        if not previous_word or each_word.startswith(previous_word[-1]):
                                        # and recurse to consider all possible tail sequences that can follow this particular word:
                                        for tail in chains(words, previous_word=each_word):
                                        # Concatenate the word we're considering with each possible tail:
                                        yield [each_word] + tail

                                        all_legal_sequences = list(chains(words)) # convert the output (an iterator) to a list
                                        all_legal_sequences.sort(key=len) # sort the list of chains in increasing order of chain length
                                        for seq in all_legal_sequences: print(seq)
                                        # The last line (and hence longest chain) prints as follows:
                                        # ['hedgehog', 'giraffe', 'elephant', 'tiger', 'racoon']


                                        Or, to get straight to the longest chain more efficiently:



                                        print(max(chains(words), key=len)


                                        Finally, here is an alternate version that allows repeated words in the input (i.e. if you include a word N times, you may use it up to N times in the chain):



                                        def chains(words, previous_word_index=None):
                                        yield
                                        if previous_word_index is not None:
                                        previous_letter = words[previous_word_index][-1]
                                        words = words[:previous_word_index] + words[previous_word_index + 1:]
                                        for i, each_word in enumerate( words ):
                                        if previous_word_index is not None and not each_word.startswith(previous_letter): continue
                                        for tail in chains(words, previous_word_index=i):
                                        yield [each_word] + tail





                                        share|improve this answer



























                                          up vote
                                          3
                                          down vote













                                          This function creates a type of iterator called a generator (see: What does the "yield" keyword do?). It recursively creates further instances of the same generator to explore all possible tail sequences:



                                          words = ['giraffe', 'elephant', 'ant', 'tiger', 'racoon', 'cat', 'hedgehog', 'mouse']

                                          def chains(words, previous_word=None):
                                          # Consider an empty sequence to be valid (as a "tail" or on its own):
                                          yield
                                          # Remove the previous word, if any, from consideration, both here and in any subcalls:
                                          words = [word for word in words if word != previous_word]
                                          # Take each remaining word...
                                          for each_word in words:
                                          # ...provided it obeys the chaining rule
                                          if not previous_word or each_word.startswith(previous_word[-1]):
                                          # and recurse to consider all possible tail sequences that can follow this particular word:
                                          for tail in chains(words, previous_word=each_word):
                                          # Concatenate the word we're considering with each possible tail:
                                          yield [each_word] + tail

                                          all_legal_sequences = list(chains(words)) # convert the output (an iterator) to a list
                                          all_legal_sequences.sort(key=len) # sort the list of chains in increasing order of chain length
                                          for seq in all_legal_sequences: print(seq)
                                          # The last line (and hence longest chain) prints as follows:
                                          # ['hedgehog', 'giraffe', 'elephant', 'tiger', 'racoon']


                                          Or, to get straight to the longest chain more efficiently:



                                          print(max(chains(words), key=len)


                                          Finally, here is an alternate version that allows repeated words in the input (i.e. if you include a word N times, you may use it up to N times in the chain):



                                          def chains(words, previous_word_index=None):
                                          yield
                                          if previous_word_index is not None:
                                          previous_letter = words[previous_word_index][-1]
                                          words = words[:previous_word_index] + words[previous_word_index + 1:]
                                          for i, each_word in enumerate( words ):
                                          if previous_word_index is not None and not each_word.startswith(previous_letter): continue
                                          for tail in chains(words, previous_word_index=i):
                                          yield [each_word] + tail





                                          share|improve this answer

























                                            up vote
                                            3
                                            down vote










                                            up vote
                                            3
                                            down vote









                                            This function creates a type of iterator called a generator (see: What does the "yield" keyword do?). It recursively creates further instances of the same generator to explore all possible tail sequences:



                                            words = ['giraffe', 'elephant', 'ant', 'tiger', 'racoon', 'cat', 'hedgehog', 'mouse']

                                            def chains(words, previous_word=None):
                                            # Consider an empty sequence to be valid (as a "tail" or on its own):
                                            yield
                                            # Remove the previous word, if any, from consideration, both here and in any subcalls:
                                            words = [word for word in words if word != previous_word]
                                            # Take each remaining word...
                                            for each_word in words:
                                            # ...provided it obeys the chaining rule
                                            if not previous_word or each_word.startswith(previous_word[-1]):
                                            # and recurse to consider all possible tail sequences that can follow this particular word:
                                            for tail in chains(words, previous_word=each_word):
                                            # Concatenate the word we're considering with each possible tail:
                                            yield [each_word] + tail

                                            all_legal_sequences = list(chains(words)) # convert the output (an iterator) to a list
                                            all_legal_sequences.sort(key=len) # sort the list of chains in increasing order of chain length
                                            for seq in all_legal_sequences: print(seq)
                                            # The last line (and hence longest chain) prints as follows:
                                            # ['hedgehog', 'giraffe', 'elephant', 'tiger', 'racoon']


                                            Or, to get straight to the longest chain more efficiently:



                                            print(max(chains(words), key=len)


                                            Finally, here is an alternate version that allows repeated words in the input (i.e. if you include a word N times, you may use it up to N times in the chain):



                                            def chains(words, previous_word_index=None):
                                            yield
                                            if previous_word_index is not None:
                                            previous_letter = words[previous_word_index][-1]
                                            words = words[:previous_word_index] + words[previous_word_index + 1:]
                                            for i, each_word in enumerate( words ):
                                            if previous_word_index is not None and not each_word.startswith(previous_letter): continue
                                            for tail in chains(words, previous_word_index=i):
                                            yield [each_word] + tail





                                            share|improve this answer














                                            This function creates a type of iterator called a generator (see: What does the "yield" keyword do?). It recursively creates further instances of the same generator to explore all possible tail sequences:



                                            words = ['giraffe', 'elephant', 'ant', 'tiger', 'racoon', 'cat', 'hedgehog', 'mouse']

                                            def chains(words, previous_word=None):
                                            # Consider an empty sequence to be valid (as a "tail" or on its own):
                                            yield
                                            # Remove the previous word, if any, from consideration, both here and in any subcalls:
                                            words = [word for word in words if word != previous_word]
                                            # Take each remaining word...
                                            for each_word in words:
                                            # ...provided it obeys the chaining rule
                                            if not previous_word or each_word.startswith(previous_word[-1]):
                                            # and recurse to consider all possible tail sequences that can follow this particular word:
                                            for tail in chains(words, previous_word=each_word):
                                            # Concatenate the word we're considering with each possible tail:
                                            yield [each_word] + tail

                                            all_legal_sequences = list(chains(words)) # convert the output (an iterator) to a list
                                            all_legal_sequences.sort(key=len) # sort the list of chains in increasing order of chain length
                                            for seq in all_legal_sequences: print(seq)
                                            # The last line (and hence longest chain) prints as follows:
                                            # ['hedgehog', 'giraffe', 'elephant', 'tiger', 'racoon']


                                            Or, to get straight to the longest chain more efficiently:



                                            print(max(chains(words), key=len)


                                            Finally, here is an alternate version that allows repeated words in the input (i.e. if you include a word N times, you may use it up to N times in the chain):



                                            def chains(words, previous_word_index=None):
                                            yield
                                            if previous_word_index is not None:
                                            previous_letter = words[previous_word_index][-1]
                                            words = words[:previous_word_index] + words[previous_word_index + 1:]
                                            for i, each_word in enumerate( words ):
                                            if previous_word_index is not None and not each_word.startswith(previous_letter): continue
                                            for tail in chains(words, previous_word_index=i):
                                            yield [each_word] + tail






                                            share|improve this answer














                                            share|improve this answer



                                            share|improve this answer








                                            edited Nov 28 at 1:17

























                                            answered Nov 26 at 16:30









                                            jez

                                            7,5501941




                                            7,5501941






















                                                up vote
                                                2
                                                down vote













                                                Here is a working recursive brute-force approach:



                                                def brute_force(pool, last=None, so_far=None):
                                                so_far = so_far or
                                                if not pool:
                                                return so_far
                                                candidates =
                                                for w in pool:
                                                if not last or w.startswith(last):
                                                c_so_far, c_pool = list(so_far) + [w], set(pool) - set([w])
                                                candidates.append(brute_force(c_pool, w[-1], c_so_far))
                                                return max(candidates, key=len, default=so_far)

                                                >>> brute_force(words)
                                                ['hedgehog', 'giraffe', 'elephant', 'tiger', 'racoon']


                                                At every recursive call, this tries to continue the chain with every eligible word form the remaining pool. It then chooses the longest such continuation.






                                                share|improve this answer



























                                                  up vote
                                                  2
                                                  down vote













                                                  Here is a working recursive brute-force approach:



                                                  def brute_force(pool, last=None, so_far=None):
                                                  so_far = so_far or
                                                  if not pool:
                                                  return so_far
                                                  candidates =
                                                  for w in pool:
                                                  if not last or w.startswith(last):
                                                  c_so_far, c_pool = list(so_far) + [w], set(pool) - set([w])
                                                  candidates.append(brute_force(c_pool, w[-1], c_so_far))
                                                  return max(candidates, key=len, default=so_far)

                                                  >>> brute_force(words)
                                                  ['hedgehog', 'giraffe', 'elephant', 'tiger', 'racoon']


                                                  At every recursive call, this tries to continue the chain with every eligible word form the remaining pool. It then chooses the longest such continuation.






                                                  share|improve this answer

























                                                    up vote
                                                    2
                                                    down vote










                                                    up vote
                                                    2
                                                    down vote









                                                    Here is a working recursive brute-force approach:



                                                    def brute_force(pool, last=None, so_far=None):
                                                    so_far = so_far or
                                                    if not pool:
                                                    return so_far
                                                    candidates =
                                                    for w in pool:
                                                    if not last or w.startswith(last):
                                                    c_so_far, c_pool = list(so_far) + [w], set(pool) - set([w])
                                                    candidates.append(brute_force(c_pool, w[-1], c_so_far))
                                                    return max(candidates, key=len, default=so_far)

                                                    >>> brute_force(words)
                                                    ['hedgehog', 'giraffe', 'elephant', 'tiger', 'racoon']


                                                    At every recursive call, this tries to continue the chain with every eligible word form the remaining pool. It then chooses the longest such continuation.






                                                    share|improve this answer














                                                    Here is a working recursive brute-force approach:



                                                    def brute_force(pool, last=None, so_far=None):
                                                    so_far = so_far or
                                                    if not pool:
                                                    return so_far
                                                    candidates =
                                                    for w in pool:
                                                    if not last or w.startswith(last):
                                                    c_so_far, c_pool = list(so_far) + [w], set(pool) - set([w])
                                                    candidates.append(brute_force(c_pool, w[-1], c_so_far))
                                                    return max(candidates, key=len, default=so_far)

                                                    >>> brute_force(words)
                                                    ['hedgehog', 'giraffe', 'elephant', 'tiger', 'racoon']


                                                    At every recursive call, this tries to continue the chain with every eligible word form the remaining pool. It then chooses the longest such continuation.







                                                    share|improve this answer














                                                    share|improve this answer



                                                    share|improve this answer








                                                    edited Nov 26 at 16:56

























                                                    answered Nov 26 at 16:31









                                                    schwobaseggl

                                                    35.2k31937




                                                    35.2k31937






















                                                        up vote
                                                        2
                                                        down vote













                                                        I have a tree-based approach for this question which might be faster. I am still working on implementation of the code but here is what I would do:



                                                            1. Form a tree with the root node as first word. 
                                                        2. Form the branches if there is any word or words that starts
                                                        with the alphabet with which this current word ends.
                                                        3. Exhaust the entire given list based on the ending alphabet
                                                        of current word and form the entire tree.
                                                        4. Now just find the longest path of this tree and store it.
                                                        5. Repeat steps 1 to 4 for each of the words given in the list
                                                        and print the longest path among the longest paths we got above.


                                                        I hope this might give a better solution in case there is a large list of words given. I will update this with the actual code implementation.






                                                        share|improve this answer

























                                                          up vote
                                                          2
                                                          down vote













                                                          I have a tree-based approach for this question which might be faster. I am still working on implementation of the code but here is what I would do:



                                                              1. Form a tree with the root node as first word. 
                                                          2. Form the branches if there is any word or words that starts
                                                          with the alphabet with which this current word ends.
                                                          3. Exhaust the entire given list based on the ending alphabet
                                                          of current word and form the entire tree.
                                                          4. Now just find the longest path of this tree and store it.
                                                          5. Repeat steps 1 to 4 for each of the words given in the list
                                                          and print the longest path among the longest paths we got above.


                                                          I hope this might give a better solution in case there is a large list of words given. I will update this with the actual code implementation.






                                                          share|improve this answer























                                                            up vote
                                                            2
                                                            down vote










                                                            up vote
                                                            2
                                                            down vote









                                                            I have a tree-based approach for this question which might be faster. I am still working on implementation of the code but here is what I would do:



                                                                1. Form a tree with the root node as first word. 
                                                            2. Form the branches if there is any word or words that starts
                                                            with the alphabet with which this current word ends.
                                                            3. Exhaust the entire given list based on the ending alphabet
                                                            of current word and form the entire tree.
                                                            4. Now just find the longest path of this tree and store it.
                                                            5. Repeat steps 1 to 4 for each of the words given in the list
                                                            and print the longest path among the longest paths we got above.


                                                            I hope this might give a better solution in case there is a large list of words given. I will update this with the actual code implementation.






                                                            share|improve this answer












                                                            I have a tree-based approach for this question which might be faster. I am still working on implementation of the code but here is what I would do:



                                                                1. Form a tree with the root node as first word. 
                                                            2. Form the branches if there is any word or words that starts
                                                            with the alphabet with which this current word ends.
                                                            3. Exhaust the entire given list based on the ending alphabet
                                                            of current word and form the entire tree.
                                                            4. Now just find the longest path of this tree and store it.
                                                            5. Repeat steps 1 to 4 for each of the words given in the list
                                                            and print the longest path among the longest paths we got above.


                                                            I hope this might give a better solution in case there is a large list of words given. I will update this with the actual code implementation.







                                                            share|improve this answer












                                                            share|improve this answer



                                                            share|improve this answer










                                                            answered Nov 26 at 22:14









                                                            CodeHunter

                                                            871325




                                                            871325






















                                                                up vote
                                                                1
                                                                down vote













                                                                Another answer using a recursive approach:



                                                                def word_list(w_list, remaining_list):
                                                                max_result_len=0
                                                                res = w_list
                                                                for word_index in range(len(remaining_list)):
                                                                # if the last letter of the word list is equal to the first letter of the word
                                                                if w_list[-1][-1] == remaining_list[word_index][0]:
                                                                # make copies of the lists to not alter it in the caller function
                                                                w_list_copy = w_list.copy()
                                                                remaining_list_copy = remaining_list.copy()
                                                                # removes the used word from the remaining list
                                                                remaining_list_copy.pop(word_index)
                                                                # append the matching word to the new word list
                                                                w_list_copy.append(remaining_list[word_index])
                                                                res_aux = word_list(w_list_copy, remaining_list_copy)
                                                                # Keep only the longest list
                                                                res = res_aux if len(res_aux) > max_result_len else res
                                                                return res

                                                                words = ['giraffe', 'elephant', 'ant', 'tiger', 'racoon', 'cat', 'hedgehog', 'mouse']
                                                                word_list(['dog'], words)


                                                                output:



                                                                ['dog', 'giraffe', 'elephant', 'tiger', 'racoon']





                                                                share|improve this answer

























                                                                  up vote
                                                                  1
                                                                  down vote













                                                                  Another answer using a recursive approach:



                                                                  def word_list(w_list, remaining_list):
                                                                  max_result_len=0
                                                                  res = w_list
                                                                  for word_index in range(len(remaining_list)):
                                                                  # if the last letter of the word list is equal to the first letter of the word
                                                                  if w_list[-1][-1] == remaining_list[word_index][0]:
                                                                  # make copies of the lists to not alter it in the caller function
                                                                  w_list_copy = w_list.copy()
                                                                  remaining_list_copy = remaining_list.copy()
                                                                  # removes the used word from the remaining list
                                                                  remaining_list_copy.pop(word_index)
                                                                  # append the matching word to the new word list
                                                                  w_list_copy.append(remaining_list[word_index])
                                                                  res_aux = word_list(w_list_copy, remaining_list_copy)
                                                                  # Keep only the longest list
                                                                  res = res_aux if len(res_aux) > max_result_len else res
                                                                  return res

                                                                  words = ['giraffe', 'elephant', 'ant', 'tiger', 'racoon', 'cat', 'hedgehog', 'mouse']
                                                                  word_list(['dog'], words)


                                                                  output:



                                                                  ['dog', 'giraffe', 'elephant', 'tiger', 'racoon']





                                                                  share|improve this answer























                                                                    up vote
                                                                    1
                                                                    down vote










                                                                    up vote
                                                                    1
                                                                    down vote









                                                                    Another answer using a recursive approach:



                                                                    def word_list(w_list, remaining_list):
                                                                    max_result_len=0
                                                                    res = w_list
                                                                    for word_index in range(len(remaining_list)):
                                                                    # if the last letter of the word list is equal to the first letter of the word
                                                                    if w_list[-1][-1] == remaining_list[word_index][0]:
                                                                    # make copies of the lists to not alter it in the caller function
                                                                    w_list_copy = w_list.copy()
                                                                    remaining_list_copy = remaining_list.copy()
                                                                    # removes the used word from the remaining list
                                                                    remaining_list_copy.pop(word_index)
                                                                    # append the matching word to the new word list
                                                                    w_list_copy.append(remaining_list[word_index])
                                                                    res_aux = word_list(w_list_copy, remaining_list_copy)
                                                                    # Keep only the longest list
                                                                    res = res_aux if len(res_aux) > max_result_len else res
                                                                    return res

                                                                    words = ['giraffe', 'elephant', 'ant', 'tiger', 'racoon', 'cat', 'hedgehog', 'mouse']
                                                                    word_list(['dog'], words)


                                                                    output:



                                                                    ['dog', 'giraffe', 'elephant', 'tiger', 'racoon']





                                                                    share|improve this answer












                                                                    Another answer using a recursive approach:



                                                                    def word_list(w_list, remaining_list):
                                                                    max_result_len=0
                                                                    res = w_list
                                                                    for word_index in range(len(remaining_list)):
                                                                    # if the last letter of the word list is equal to the first letter of the word
                                                                    if w_list[-1][-1] == remaining_list[word_index][0]:
                                                                    # make copies of the lists to not alter it in the caller function
                                                                    w_list_copy = w_list.copy()
                                                                    remaining_list_copy = remaining_list.copy()
                                                                    # removes the used word from the remaining list
                                                                    remaining_list_copy.pop(word_index)
                                                                    # append the matching word to the new word list
                                                                    w_list_copy.append(remaining_list[word_index])
                                                                    res_aux = word_list(w_list_copy, remaining_list_copy)
                                                                    # Keep only the longest list
                                                                    res = res_aux if len(res_aux) > max_result_len else res
                                                                    return res

                                                                    words = ['giraffe', 'elephant', 'ant', 'tiger', 'racoon', 'cat', 'hedgehog', 'mouse']
                                                                    word_list(['dog'], words)


                                                                    output:



                                                                    ['dog', 'giraffe', 'elephant', 'tiger', 'racoon']






                                                                    share|improve this answer












                                                                    share|improve this answer



                                                                    share|improve this answer










                                                                    answered Nov 26 at 16:46









                                                                    Pedro Torres

                                                                    658313




                                                                    658313






















                                                                        up vote
                                                                        1
                                                                        down vote













                                                                        Hopefully, a more intuitive way of doing it without recursion. Iterate through the list and let Python's sort and list comprehension do the work for you:



                                                                        words = ['giraffe', 'elephant', 'ant', 'tiger', 'racoon', 'cat', 'hedgehog', 'mouse']

                                                                        def chain_longest(pivot, words):
                                                                        new_words =
                                                                        new_words.append(pivot)
                                                                        for word in words:
                                                                        potential_words = [i for i in words if i.startswith(pivot[-1]) and i not in new_words]
                                                                        if potential_words:
                                                                        next_word = sorted(potential_words, key = lambda x: len)[0]
                                                                        new_words.append(next_word)
                                                                        pivot = next_word
                                                                        else:
                                                                        pass
                                                                        return new_words

                                                                        max([chain_longest(i, words) for i in words], key = len)
                                                                        >>
                                                                        ['hedgehog', 'giraffe', 'elephant', 'tiger', 'racoon']


                                                                        Set a pivot and check for potential_words if they start with your pivot word and do not appear in your new list of words. If found then just sort them by length and take the first element.



                                                                        The list comprehension goes through every word as a pivot and returns you the longest chain.






                                                                        share|improve this answer























                                                                        • The expected output and longest word chain is: ['hedgehog', 'giraffe', 'elephant', 'tiger', 'racoon'], your one gives ['giraffe', 'elephant', 'tiger', 'racoon'] or am I missing something?
                                                                          – Mandingo
                                                                          Nov 26 at 16:31






                                                                        • 1




                                                                          You can replace key = lambda x: len(x) with key=len.
                                                                          – Keyur Potdar
                                                                          Nov 26 at 16:32






                                                                        • 1




                                                                          @mandingo sorry I got confused with the output. Let me change that.
                                                                          – BernardL
                                                                          Nov 26 at 16:32















                                                                        up vote
                                                                        1
                                                                        down vote













                                                                        Hopefully, a more intuitive way of doing it without recursion. Iterate through the list and let Python's sort and list comprehension do the work for you:



                                                                        words = ['giraffe', 'elephant', 'ant', 'tiger', 'racoon', 'cat', 'hedgehog', 'mouse']

                                                                        def chain_longest(pivot, words):
                                                                        new_words =
                                                                        new_words.append(pivot)
                                                                        for word in words:
                                                                        potential_words = [i for i in words if i.startswith(pivot[-1]) and i not in new_words]
                                                                        if potential_words:
                                                                        next_word = sorted(potential_words, key = lambda x: len)[0]
                                                                        new_words.append(next_word)
                                                                        pivot = next_word
                                                                        else:
                                                                        pass
                                                                        return new_words

                                                                        max([chain_longest(i, words) for i in words], key = len)
                                                                        >>
                                                                        ['hedgehog', 'giraffe', 'elephant', 'tiger', 'racoon']


                                                                        Set a pivot and check for potential_words if they start with your pivot word and do not appear in your new list of words. If found then just sort them by length and take the first element.



                                                                        The list comprehension goes through every word as a pivot and returns you the longest chain.






                                                                        share|improve this answer























                                                                        • The expected output and longest word chain is: ['hedgehog', 'giraffe', 'elephant', 'tiger', 'racoon'], your one gives ['giraffe', 'elephant', 'tiger', 'racoon'] or am I missing something?
                                                                          – Mandingo
                                                                          Nov 26 at 16:31






                                                                        • 1




                                                                          You can replace key = lambda x: len(x) with key=len.
                                                                          – Keyur Potdar
                                                                          Nov 26 at 16:32






                                                                        • 1




                                                                          @mandingo sorry I got confused with the output. Let me change that.
                                                                          – BernardL
                                                                          Nov 26 at 16:32













                                                                        up vote
                                                                        1
                                                                        down vote










                                                                        up vote
                                                                        1
                                                                        down vote









                                                                        Hopefully, a more intuitive way of doing it without recursion. Iterate through the list and let Python's sort and list comprehension do the work for you:



                                                                        words = ['giraffe', 'elephant', 'ant', 'tiger', 'racoon', 'cat', 'hedgehog', 'mouse']

                                                                        def chain_longest(pivot, words):
                                                                        new_words =
                                                                        new_words.append(pivot)
                                                                        for word in words:
                                                                        potential_words = [i for i in words if i.startswith(pivot[-1]) and i not in new_words]
                                                                        if potential_words:
                                                                        next_word = sorted(potential_words, key = lambda x: len)[0]
                                                                        new_words.append(next_word)
                                                                        pivot = next_word
                                                                        else:
                                                                        pass
                                                                        return new_words

                                                                        max([chain_longest(i, words) for i in words], key = len)
                                                                        >>
                                                                        ['hedgehog', 'giraffe', 'elephant', 'tiger', 'racoon']


                                                                        Set a pivot and check for potential_words if they start with your pivot word and do not appear in your new list of words. If found then just sort them by length and take the first element.



                                                                        The list comprehension goes through every word as a pivot and returns you the longest chain.






                                                                        share|improve this answer














                                                                        Hopefully, a more intuitive way of doing it without recursion. Iterate through the list and let Python's sort and list comprehension do the work for you:



                                                                        words = ['giraffe', 'elephant', 'ant', 'tiger', 'racoon', 'cat', 'hedgehog', 'mouse']

                                                                        def chain_longest(pivot, words):
                                                                        new_words =
                                                                        new_words.append(pivot)
                                                                        for word in words:
                                                                        potential_words = [i for i in words if i.startswith(pivot[-1]) and i not in new_words]
                                                                        if potential_words:
                                                                        next_word = sorted(potential_words, key = lambda x: len)[0]
                                                                        new_words.append(next_word)
                                                                        pivot = next_word
                                                                        else:
                                                                        pass
                                                                        return new_words

                                                                        max([chain_longest(i, words) for i in words], key = len)
                                                                        >>
                                                                        ['hedgehog', 'giraffe', 'elephant', 'tiger', 'racoon']


                                                                        Set a pivot and check for potential_words if they start with your pivot word and do not appear in your new list of words. If found then just sort them by length and take the first element.



                                                                        The list comprehension goes through every word as a pivot and returns you the longest chain.







                                                                        share|improve this answer














                                                                        share|improve this answer



                                                                        share|improve this answer








                                                                        edited Nov 26 at 17:18

























                                                                        answered Nov 26 at 16:28









                                                                        BernardL

                                                                        2,3001829




                                                                        2,3001829












                                                                        • The expected output and longest word chain is: ['hedgehog', 'giraffe', 'elephant', 'tiger', 'racoon'], your one gives ['giraffe', 'elephant', 'tiger', 'racoon'] or am I missing something?
                                                                          – Mandingo
                                                                          Nov 26 at 16:31






                                                                        • 1




                                                                          You can replace key = lambda x: len(x) with key=len.
                                                                          – Keyur Potdar
                                                                          Nov 26 at 16:32






                                                                        • 1




                                                                          @mandingo sorry I got confused with the output. Let me change that.
                                                                          – BernardL
                                                                          Nov 26 at 16:32


















                                                                        • The expected output and longest word chain is: ['hedgehog', 'giraffe', 'elephant', 'tiger', 'racoon'], your one gives ['giraffe', 'elephant', 'tiger', 'racoon'] or am I missing something?
                                                                          – Mandingo
                                                                          Nov 26 at 16:31






                                                                        • 1




                                                                          You can replace key = lambda x: len(x) with key=len.
                                                                          – Keyur Potdar
                                                                          Nov 26 at 16:32






                                                                        • 1




                                                                          @mandingo sorry I got confused with the output. Let me change that.
                                                                          – BernardL
                                                                          Nov 26 at 16:32
















                                                                        The expected output and longest word chain is: ['hedgehog', 'giraffe', 'elephant', 'tiger', 'racoon'], your one gives ['giraffe', 'elephant', 'tiger', 'racoon'] or am I missing something?
                                                                        – Mandingo
                                                                        Nov 26 at 16:31




                                                                        The expected output and longest word chain is: ['hedgehog', 'giraffe', 'elephant', 'tiger', 'racoon'], your one gives ['giraffe', 'elephant', 'tiger', 'racoon'] or am I missing something?
                                                                        – Mandingo
                                                                        Nov 26 at 16:31




                                                                        1




                                                                        1




                                                                        You can replace key = lambda x: len(x) with key=len.
                                                                        – Keyur Potdar
                                                                        Nov 26 at 16:32




                                                                        You can replace key = lambda x: len(x) with key=len.
                                                                        – Keyur Potdar
                                                                        Nov 26 at 16:32




                                                                        1




                                                                        1




                                                                        @mandingo sorry I got confused with the output. Let me change that.
                                                                        – BernardL
                                                                        Nov 26 at 16:32




                                                                        @mandingo sorry I got confused with the output. Let me change that.
                                                                        – BernardL
                                                                        Nov 26 at 16:32





                                                                        protected by Ajax1234 Nov 28 at 2:43



                                                                        Thank you for your interest in this question.
                                                                        Because it has attracted low-quality or spam answers that had to be removed, posting an answer now requires 10 reputation on this site (the association bonus does not count).



                                                                        Would you like to answer one of these unanswered questions instead?



                                                                        Popular posts from this blog

                                                                        flock() on closed filehandle LOCK_FILE at /usr/bin/apt-mirror

                                                                        Mangá

                                                                        Eduardo VII do Reino Unido